The purpose of this project is to identify specified genes in DNA.
The first genes we'll identify are the 4R and7R variants of the drd4 dopamine receptor.
We will use a reference chromosome from the National Library of Medicine. Specifically, the Homo sapiens chromosome 1, GRCh38 reference primary assembly
- Data Exploration
- Convert txt to fasta filetype
- Identify ORFs using orfipy (Python lib for finding open read frames (genes typically follow ORFs)
- Search following codons for what matches 4R or 7R alleles
- Take over the world!... by knowing who might have ADHD!