This webpage is made exclusively to apply to the modeling award. the information contained here are just the answers to the key questions for good modeling project.
However, our aptamer folding software have much more characteristics than the ones explained here. If you find this content interesting, we encourage you to dive deeper in our aptamer folding software in the dedicated webpage.
1 How impressive is the modeling?
There currently exist some softwares, such as AlphaFold (only for protein folding), Mfold (which only computes the secondary structure) and Rosetta (that gives a folding for a given aptamer given, but does not select the best one). But none of them can compute the 3D modeling correctly.
2 Did the model help the team understand a part, device, or system?
This model enables the reduction of steps in the SELEX process by improving the algorithms used, so the team could better understand the SELEX process and the initial steps, how to improve them and how to apply computational tools in biological problems.
The model also aids understanding of artificial-intelligence algorithms like GAN used in aptamer folding, as well as existing software like Rosetta or ViennaRNA. The team learnt how to work the Rosetta scoring, the low-energy calculation of proteins, computer representation of DNA and RNA, the use and manipulation of formats like FASTA, PDBs or secondary structures in a computer. The team studied the application of optimization tools such as terminal use or thread use in order to improve a code and its computational time.
3 Did the team use measurements of a part, device, or system to develop the model?
We also obtained measurements of time, in order to check that our algorithm spent less time than the previous algorithms and we use, for the calculation times, Python tools that measure the times and compare these times between codes.
4 Does the modeling approach provide a good example for others?
The GAN is easier to compute on computers with Python dependencies, but only with the installation of Rosetta (or pyrosetta open-source) and ViennaRNA (open-source). The computational power needed is not so high and can be compiled in computers with low GAN memory because it only performs CNN in data type formats (no images for example), and each entry in the database is only constituted by a sequence, a matrix with the degrees of the nucleotides (5 per nucleotide) and the scoring (a number).
The uploaded archive is totally commented and each part is understandable in order to allow future programmers to change the desired parts and to improve the code.