; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g04370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g04370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr3:3256123..3272583
RNA-Seq ExpressionMoc03g04370
SyntenyMoc03g04370
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-6149.84Show/hide
Query:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------
        F+L E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE                                                  
Subjt:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------

Query:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK
                NGA+IDE SQV+FILESLP+SFL FRSNAVMNK+ YTLT LLNELQT++SL+K KGQ+GEA+VATS +KFHRGSTS  KS+ SSS +K +KK
Subjt:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK

Query:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD
        KK  G+G + +  AA T KK K   A KG C HCN +GHWKRNCPKYLAEKKKA +GKYDLL                             GISSWRQL+
Subjt:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD

Query:  AGEMTLK
         GEMT++
Subjt:  AGEMTLK

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-6149.84Show/hide
Query:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------
        F+L E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE                                                  
Subjt:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------

Query:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK
                NGA+IDE SQV+FILESLP+SFL FRSNAVMNK+ YTLT LLNELQT++SL+K KGQ+GEA+VATS +KFHRGSTS  KS+ SSS +K +KK
Subjt:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK

Query:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD
        KK  G+G + +  AA T KK K   A KG C HCN +GHWKRNCPKYLAEKKKA +GKYDLL                             GISSWRQL+
Subjt:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD

Query:  AGEMTLK
         GEMT++
Subjt:  AGEMTLK

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-6149.84Show/hide
Query:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------
        F+L E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE                                                  
Subjt:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------

Query:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK
                NGA+IDE SQV+FILESLP+SFL FRSNAVMNK+ YTLT LLNELQT++SL+K KGQ+GEA+VATS +KFHRGSTS  KS+ SSS +K +KK
Subjt:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK

Query:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD
        KK  G+G + +  AA T KK K   A KG C HCN +GHWKRNCPKYLAEKKKA +GKYDLL                             GISSWRQL+
Subjt:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD

Query:  AGEMTLK
         GEMT++
Subjt:  AGEMTLK

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-6149.84Show/hide
Query:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------
        F+L E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE                                                  
Subjt:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------

Query:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK
                NGA+IDE SQV+FILESLP+SFL FRSNAVMNK+ YTLT LLNELQT++SL+K KGQ+GEA+VATS +KFHRGSTS  KS+ SSS +K +KK
Subjt:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK

Query:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD
        KK  G+G + +  AA T KK K   A KG C HCN +GHWKRNCPKYLAEKKKA +GKYDLL                             GISSWRQL+
Subjt:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD

Query:  AGEMTLK
         GEMT++
Subjt:  AGEMTLK

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]6.4e-6149.35Show/hide
Query:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------
        F+L E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE                                                  
Subjt:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------

Query:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK
                NGA+IDE SQV+FILESLP+SFL FRSNAVMNK+ YTLT LLNELQT++SL+K KGQ+GEA+VATS +KFHRGSTS  KS+ SSS +K +KK
Subjt:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK

Query:  KKAIGKGPRPDPTAATAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLDA
        KK  G+G + +   A AK  K   A KG C HCN +GHWKRNCPKYLAEKKKA +GKYDLL                             GISSWRQL+ 
Subjt:  KKAIGKGPRPDPTAATAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLDA

Query:  GEMTLK
        GEMT++
Subjt:  GEMTLK

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.8e-6149.84Show/hide
Query:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------
        F+L E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE                                                  
Subjt:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------

Query:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK
                NGA+IDE SQV+FILESLP+SFL FRSNAVMNK+ YTLT LLNELQT++SL+K KGQ+GEA+VATS +KFHRGSTS  KS+ SSS +K +KK
Subjt:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK

Query:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD
        KK  G+G + +  AA T KK K   A KG C HCN +GHWKRNCPKYLAEKKKA +GKYDLL                             GISSWRQL+
Subjt:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD

Query:  AGEMTLK
         GEMT++
Subjt:  AGEMTLK

A0A5A7TWB9 Gag/pol protein1.8e-6149.84Show/hide
Query:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------
        F+L E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE                                                  
Subjt:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------

Query:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK
                NGA+IDE SQV+FILESLP+SFL FRSNAVMNK+ YTLT LLNELQT++SL+K KGQ+GEA+VATS +KFHRGSTS  KS+ SSS +K +KK
Subjt:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK

Query:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD
        KK  G+G + +  AA T KK K   A KG C HCN +GHWKRNCPKYLAEKKKA +GKYDLL                             GISSWRQL+
Subjt:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD

Query:  AGEMTLK
         GEMT++
Subjt:  AGEMTLK

A0A5A7TZD7 Gag/pol protein1.8e-6149.84Show/hide
Query:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------
        F+L E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE                                                  
Subjt:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------

Query:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK
                NGA+IDE SQV+FILESLP+SFL FRSNAVMNK+ YTLT LLNELQT++SL+K KGQ+GEA+VATS +KFHRGSTS  KS+ SSS +K +KK
Subjt:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK

Query:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD
        KK  G+G + +  AA T KK K   A KG C HCN +GHWKRNCPKYLAEKKKA +GKYDLL                             GISSWRQL+
Subjt:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD

Query:  AGEMTLK
         GEMT++
Subjt:  AGEMTLK

A0A5A7UGV2 Gag/pol protein1.8e-6149.84Show/hide
Query:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------
        F+L E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE                                                  
Subjt:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------

Query:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK
                NGA+IDE SQV+FILESLP+SFL FRSNAVMNK+ YTLT LLNELQT++SL+K KGQ+GEA+VATS +KFHRGSTS  KS+ SSS +K +KK
Subjt:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK

Query:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD
        KK  G+G + +  AA T KK K   A KG C HCN +GHWKRNCPKYLAEKKKA +GKYDLL                             GISSWRQL+
Subjt:  KKAIGKGPRPDPTAA-TAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLD

Query:  AGEMTLK
         GEMT++
Subjt:  AGEMTLK

A0A5D3CPJ6 Gag/pol protein3.1e-6149.35Show/hide
Query:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------
        F+L E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE                                                  
Subjt:  FILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHED-------------------------------------------------

Query:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK
                NGA+IDE SQV+FILESLP+SFL FRSNAVMNK+ YTLT LLNELQT++SL+K KGQ+GEA+VATS +KFHRGSTS  KS+ SSS +K +KK
Subjt:  ------TSNGAIIDEQSQVNFILESLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATS-KKFHRGSTSAAKSVSSSSRSKTFKK

Query:  KKAIGKGPRPDPTAATAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLDA
        KK  G+G + +   A AK  K   A KG C HCN +GHWKRNCPKYLAEKKKA +GKYDLL                             GISSWRQL+ 
Subjt:  KKAIGKGPRPDPTAATAKKGKAKVAEKGKCLHCNIDGHWKRNCPKYLAEKKKANEGKYDLL-----------------------------GISSWRQLDA

Query:  GEMTLK
        GEMT++
Subjt:  GEMTLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGCATTCGGCGCTGTACAGCACCATGGCACCATGCCTTAGCGACGGGGCACTGCTGCTGCGGCTTTTTGTTGTAGCAGCGCCATGGCGCTGCCCTGAGGTGCCGAG
GCACTGTCCTGGGTTCATCTTGCAAGAGGATTGTCCTCAAGCTCCTACGCCTAACGCCACTGTGGTGGTGCGCAACGTCTATGACAGATGGATCAAGGCCAATGACAAGG
CCAAGGTTTACATCTTGGCGAGCATATCTGATGTGCTTGCTAAGAAGCACGAGGACACGTCGAACGGGGCTATCATAGACGAGCAGAGTCAGGTCAACTTCATTCTGGAA
TCTCTTCCGAAGAGTTTCCTGCCATTCCGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACCATGCTCTTGAATGAGCTACAGACTTATCAGTCTCTTATTAA
AAGTAAGGGACAAGAAGGGGAGGCAGACGTTGCCACCTCGAAAAAGTTCCACCGAGGTTCGACCTCTGCAGCCAAGTCTGTGTCATCCTCTTCTAGAAGTAAGACTTTTA
AGAAGAAGAAGGCCATTGGTAAGGGGCCTAGACCTGACCCCACTGCTGCCACTGCCAAGAAAGGCAAGGCCAAGGTTGCAGAGAAAGGAAAGTGTTTACACTGCAATATA
GACGGGCATTGGAAGCGAAACTGCCCAAAATACTTGGCCGAAAAGAAGAAAGCCAACGAAGGTAAATATGATTTACTTGGAATTAGTTCCTGGAGGCAGCTTGACGCCGG
AGAGATGACTCTTAAGGCGCCTGGCACCTTCCCCAATCCCGAGATTGCCCTTAATGTAAAACGCACATTTTGGTGTGATTTGAAGCCTTCCCAAGCTCTGCCACGTGTCT
TCCATGCATTAAAGGTTGCCAATTCAACAACTACCCAACCAAAAGGTCGTGAGTCCAAAGATTCGAGACCGAGTACTTTTGGTTTGGGTCCTTTACGTGCCAATTCTTCG
CAACTTGATTCGGTTCCTTCACATGCCGATTGTTCATTTCCTGTTGTGGCTCCTATTCTTGATTCTGTTTCCCCAATGCTAATTGATGGGGAGCTTATTGAGGTTCCCTC
GGATGAGGTTGTTTATGAAGGGGAGAAGCTTTCTTGCATGGCACGTCGTGTGATTGGGTCTTATCCAATGGGCCTTGGCATATTGGTGGTAAGCTGCTTCTCTTTCGTCG
TTAGACGCCTGGGTTTTAGTATTATTGCTACTGTGCTTGGGACCCTGTTAGGGCTTGATAAGGCGATAGAGAAGCATAGTCGGCTTTTATTTGCTAGGGTATGTATTGAG
ATGAGGGTTGCTTCTTCTTTTCCCACTTCTGTTAAAGTTTTATTCCCTATATCTGTTGAGTATGCCTGGAAGCCTCGACGTTGCTCTCAATGCTGTGTTTTTGGTCACTC
GGATACGGCTTGTCCTATTACTTCTAAACGGGCTCCTTTAGTTCCGGCTAGGGAGCCTGTGGTTCTGGATATGGTGCCAGTGGTTTCGGATCTAGGGCCGGTTCGGCCGG
TTGTTGGTGGAAATCGTTTTCCTACATTGGCGTCTAGTGAGGATGTTATTGAGGCTGATGCTGATACAAGTCAGGTGGTGTGTGTGAAGGATTTTTCTCCGCCTTTTGGG
GGTTTTGATGAGCAGGAGGATTTTACAACGGTTGTGTCCAACGAATTCCCCTTTCGAGCAATCTTCTCATGGTCTGGTCCTGAAGTGGTTATGGATGACTTTAATGTTAT
TCGTCGTTTGTCAGATGTTTTAGATGGTAATTCGGATATGACAGAGATGGTTCCGTTTGATGAGACGTTGCTGCAGGCTGATCTTGTTGAGCCTCGTGTGTCCAAGCCTT
GGTTTACTTGGACGAATAAGCAGTTAGAGGTTCATGTTCGCGAGTGGGATGCTTCCGATCATTGCCCTGTTGTATTTTCTGTGGGTGATGTTATTTCGAGGAAGCAGTCT
TCATTTAGAGCTCTTAAGACTGTGTTGAGTCGTTTGGGTCTCACTCTGAGTGCTATTCGAAAGGGTGTTCAAGAGGCGCGTGATCAGATGGTGGCTGTCCAGGCGATGGT
TGTTCAAAATGAGTTAGCTTCATTGATTGATGTTGCGGGCCAGTTGGTTATTGACCGTTCTGAGATTGCTCAGATGGTGGTTCAATTTTATTGCAATCTCTTAGGCACTA
AGCCTATGGGCTATCGTGATTTGATTCATCAGCTCGTGGGAATCCTTGATTTTATTTGGCCTGCTAATTGTAGTGCTGACTTTTGTCGTCTGCTGGCCGCTTTGGAGGTT
AAAAAGGTCTTATTTTCTATGAGTAATGGAAAGGCCTTTGTCCCTGATGGGTTTTCGGTAGAGGGATCTTGGTTCTTTATCGTTATTAAGGATATTTTGGCCTCTTTTGG
AGAGATATCGGGTCTTGTTGGGAATACTCGGAAGAGCTCGTTTTTTTGGGCGGGCCTTCCTGATGTTGTGGTGGAATGGTTGGCGGCTTTTTTAGGCTTTTCATTGGTTT
CGCTTCATGTGCGTTATCTCGGGTTCCGCTCCGCACCGACTTTCTCAACGCGATTGTTACCCTTTGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGCATTCGGCGCTGTACAGCACCATGGCACCATGCCTTAGCGACGGGGCACTGCTGCTGCGGCTTTTTGTTGTAGCAGCGCCATGGCGCTGCCCTGAGGTGCCGAG
GCACTGTCCTGGGTTCATCTTGCAAGAGGATTGTCCTCAAGCTCCTACGCCTAACGCCACTGTGGTGGTGCGCAACGTCTATGACAGATGGATCAAGGCCAATGACAAGG
CCAAGGTTTACATCTTGGCGAGCATATCTGATGTGCTTGCTAAGAAGCACGAGGACACGTCGAACGGGGCTATCATAGACGAGCAGAGTCAGGTCAACTTCATTCTGGAA
TCTCTTCCGAAGAGTTTCCTGCCATTCCGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACCATGCTCTTGAATGAGCTACAGACTTATCAGTCTCTTATTAA
AAGTAAGGGACAAGAAGGGGAGGCAGACGTTGCCACCTCGAAAAAGTTCCACCGAGGTTCGACCTCTGCAGCCAAGTCTGTGTCATCCTCTTCTAGAAGTAAGACTTTTA
AGAAGAAGAAGGCCATTGGTAAGGGGCCTAGACCTGACCCCACTGCTGCCACTGCCAAGAAAGGCAAGGCCAAGGTTGCAGAGAAAGGAAAGTGTTTACACTGCAATATA
GACGGGCATTGGAAGCGAAACTGCCCAAAATACTTGGCCGAAAAGAAGAAAGCCAACGAAGGTAAATATGATTTACTTGGAATTAGTTCCTGGAGGCAGCTTGACGCCGG
AGAGATGACTCTTAAGGCGCCTGGCACCTTCCCCAATCCCGAGATTGCCCTTAATGTAAAACGCACATTTTGGTGTGATTTGAAGCCTTCCCAAGCTCTGCCACGTGTCT
TCCATGCATTAAAGGTTGCCAATTCAACAACTACCCAACCAAAAGGTCGTGAGTCCAAAGATTCGAGACCGAGTACTTTTGGTTTGGGTCCTTTACGTGCCAATTCTTCG
CAACTTGATTCGGTTCCTTCACATGCCGATTGTTCATTTCCTGTTGTGGCTCCTATTCTTGATTCTGTTTCCCCAATGCTAATTGATGGGGAGCTTATTGAGGTTCCCTC
GGATGAGGTTGTTTATGAAGGGGAGAAGCTTTCTTGCATGGCACGTCGTGTGATTGGGTCTTATCCAATGGGCCTTGGCATATTGGTGGTAAGCTGCTTCTCTTTCGTCG
TTAGACGCCTGGGTTTTAGTATTATTGCTACTGTGCTTGGGACCCTGTTAGGGCTTGATAAGGCGATAGAGAAGCATAGTCGGCTTTTATTTGCTAGGGTATGTATTGAG
ATGAGGGTTGCTTCTTCTTTTCCCACTTCTGTTAAAGTTTTATTCCCTATATCTGTTGAGTATGCCTGGAAGCCTCGACGTTGCTCTCAATGCTGTGTTTTTGGTCACTC
GGATACGGCTTGTCCTATTACTTCTAAACGGGCTCCTTTAGTTCCGGCTAGGGAGCCTGTGGTTCTGGATATGGTGCCAGTGGTTTCGGATCTAGGGCCGGTTCGGCCGG
TTGTTGGTGGAAATCGTTTTCCTACATTGGCGTCTAGTGAGGATGTTATTGAGGCTGATGCTGATACAAGTCAGGTGGTGTGTGTGAAGGATTTTTCTCCGCCTTTTGGG
GGTTTTGATGAGCAGGAGGATTTTACAACGGTTGTGTCCAACGAATTCCCCTTTCGAGCAATCTTCTCATGGTCTGGTCCTGAAGTGGTTATGGATGACTTTAATGTTAT
TCGTCGTTTGTCAGATGTTTTAGATGGTAATTCGGATATGACAGAGATGGTTCCGTTTGATGAGACGTTGCTGCAGGCTGATCTTGTTGAGCCTCGTGTGTCCAAGCCTT
GGTTTACTTGGACGAATAAGCAGTTAGAGGTTCATGTTCGCGAGTGGGATGCTTCCGATCATTGCCCTGTTGTATTTTCTGTGGGTGATGTTATTTCGAGGAAGCAGTCT
TCATTTAGAGCTCTTAAGACTGTGTTGAGTCGTTTGGGTCTCACTCTGAGTGCTATTCGAAAGGGTGTTCAAGAGGCGCGTGATCAGATGGTGGCTGTCCAGGCGATGGT
TGTTCAAAATGAGTTAGCTTCATTGATTGATGTTGCGGGCCAGTTGGTTATTGACCGTTCTGAGATTGCTCAGATGGTGGTTCAATTTTATTGCAATCTCTTAGGCACTA
AGCCTATGGGCTATCGTGATTTGATTCATCAGCTCGTGGGAATCCTTGATTTTATTTGGCCTGCTAATTGTAGTGCTGACTTTTGTCGTCTGCTGGCCGCTTTGGAGGTT
AAAAAGGTCTTATTTTCTATGAGTAATGGAAAGGCCTTTGTCCCTGATGGGTTTTCGGTAGAGGGATCTTGGTTCTTTATCGTTATTAAGGATATTTTGGCCTCTTTTGG
AGAGATATCGGGTCTTGTTGGGAATACTCGGAAGAGCTCGTTTTTTTGGGCGGGCCTTCCTGATGTTGTGGTGGAATGGTTGGCGGCTTTTTTAGGCTTTTCATTGGTTT
CGCTTCATGTGCGTTATCTCGGGTTCCGCTCCGCACCGACTTTCTCAACGCGATTGTTACCCTTTGCTTGA
Protein sequenceShow/hide protein sequence
MLHSALYSTMAPCLSDGALLLRLFVVAAPWRCPEVPRHCPGFILQEDCPQAPTPNATVVVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTSNGAIIDEQSQVNFILE
SLPKSFLPFRSNAVMNKLEYTLTMLLNELQTYQSLIKSKGQEGEADVATSKKFHRGSTSAAKSVSSSSRSKTFKKKKAIGKGPRPDPTAATAKKGKAKVAEKGKCLHCNI
DGHWKRNCPKYLAEKKKANEGKYDLLGISSWRQLDAGEMTLKAPGTFPNPEIALNVKRTFWCDLKPSQALPRVFHALKVANSTTTQPKGRESKDSRPSTFGLGPLRANSS
QLDSVPSHADCSFPVVAPILDSVSPMLIDGELIEVPSDEVVYEGEKLSCMARRVIGSYPMGLGILVVSCFSFVVRRLGFSIIATVLGTLLGLDKAIEKHSRLLFARVCIE
MRVASSFPTSVKVLFPISVEYAWKPRRCSQCCVFGHSDTACPITSKRAPLVPAREPVVLDMVPVVSDLGPVRPVVGGNRFPTLASSEDVIEADADTSQVVCVKDFSPPFG
GFDEQEDFTTVVSNEFPFRAIFSWSGPEVVMDDFNVIRRLSDVLDGNSDMTEMVPFDETLLQADLVEPRVSKPWFTWTNKQLEVHVREWDASDHCPVVFSVGDVISRKQS
SFRALKTVLSRLGLTLSAIRKGVQEARDQMVAVQAMVVQNELASLIDVAGQLVIDRSEIAQMVVQFYCNLLGTKPMGYRDLIHQLVGILDFIWPANCSADFCRLLAALEV
KKVLFSMSNGKAFVPDGFSVEGSWFFIVIKDILASFGEISGLVGNTRKSSFFWAGLPDVVVEWLAAFLGFSLVSLHVRYLGFRSAPTFSTRLLPFA