; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g25980 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g25980
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr8:18671277..18680426
RNA-Seq ExpressionMoc08g25980
SyntenyMoc08g25980
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]4.8e-7357.25Show/hide
Query:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD
        + GEALNWWDS+AA EDHAN+ + W +FKDLLY+YY+ ETVKD+KEAEFLHL QG + VAQYERKFTELSRFA +LI   A+KIKRFVKGL KGIR  VD
Subjt:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD

Query:  HQHPATYAEAVRGALITNKDVSKRVQPLLENGSSS-------------------------GYPSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVT
         Q PA+YAEAVRGALI +KDVS +   L E GSSS                         G P +C   ++RH GQCWT  K CF+CG+E HFARECP++
Subjt:  HQHPATYAEAVRGALITNKDVSKRVQPLLENGSSS-------------------------GYPSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVT

Query:  SLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVL
        + NTQRLGQ     +  +G NQ+AR F LT KEA +AETVVTG      V  +V+
Subjt:  SLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVL

XP_022156067.1 uncharacterized protein LOC111023035 [Momordica charantia]3.8e-7042.59Show/hide
Query:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD
        + GEA+NWW+SVAA EDHAN+ VTW +FKDLLYEYYFP TV++ K AEFL LTQG++ VAQY+RKFTELSRF    IPTE +KI +F+ GL + I+ L+ 
Subjt:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD

Query:  HQHPATYAEAVRGALITNK---------------DVSKRVQPLLENGSSSGY---------PSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVTS
         +   TYA AVR AL+ +K                V ++      + SS G+         P  C + K+ H G CW   ++CF+C KEGHFARECP+T 
Subjt:  HQHPATYAEAVRGALITNK---------------DVSKRVQPLLENGSSSGY---------PSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVTS

Query:  LNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFISTAFVCQAKLELKLLVRPVVVSIYT--------IKK
         NTQ LGQ+ P+    +GG Q+AR F LT  +  +AE VVTGT+LV ++PAY LFDSGSSH+FI++ FV  A LEL+ L   + VS  +        + K
Subjt:  LNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFISTAFVCQAKLELKLLVRPVVVSIYT--------IKK

Query:  SNERSTFGLLHRPVLAELDCS--EVELTVDDISAVLARLLLDKSLCYEEVPIEILANETKMLRNWAIDLVKVLWRNHQ
          + S  G      L +LD    +V L +D ++A  A +   K    +EV   + + +    +     + +V++ +HQ
Subjt:  SNERSTFGLLHRPVLAELDCS--EVELTVDDISAVLARLLLDKSLCYEEVPIEILANETKMLRNWAIDLVKVLWRNHQ

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]3.8e-9463.93Show/hide
Query:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD
        + GEALNWWDSVAA ED+AN+ + W +FK+LLY+YY+PETVKD+KEAEFLHL QG + VAQYERKFTELSRFA +LIPTEA+KIKRFVKGL KGIR  VD
Subjt:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD

Query:  HQHPATYAEAVRGALITNKDVSKRVQPLLENGSSS-------------------------GYPSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVT
         Q P TYAEAVRGAL+ +KDVS +  PL E GSSS                         G P +C   ++RH GQCWT  K CF+CG+EGHFARECP++
Subjt:  HQHPATYAEAVRGALITNKDVSKRVQPLLENGSSS-------------------------GYPSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVT

Query:  SLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFISTAFVCQAKLELKLL
        + NTQRLGQ  P  +  +G NQ+AR F LT KEA +AETVVTGTVLVH+VPAYVLFDSGSSHTFIS+ FV QA LEL+ L
Subjt:  SLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFISTAFVCQAKLELKLL

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]3.8e-7041.16Show/hide
Query:  AYSYSMRLHSNVNLTFKGKNAADPLVPLAGAQAGVVPLFPSAVAQ--ERVVPPAPLLGVVVHKLSHPDIFKLLRVRPNSSRISNATGP----LPLIEESY
        A+  + R H+  +   +G+ AADP VP A    GV P  P A +Q   +V P   LL          +  ++L    N +  +    P    +P  E S 
Subjt:  AYSYSMRLHSNVNLTFKGKNAADPLVPLAGAQAGVVPLFPSAVAQ--ERVVPPAPLLGVVVHKLSHPDIFKLLRVRPNSSRISNATGP----LPLIEESY

Query:  SRRRVGQGVRSFLRV---SRLQRPVQGQRCGLYVEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFT
              + VR    +          + +     + GEA+NWW+SVAA EDHAN+ VTW +FKDLLYEYYFP TV++ K  EFL LTQG++ VA+YERKFT
Subjt:  SRRRVGQGVRSFLRV---SRLQRPVQGQRCGLYVEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFT

Query:  ELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVDHQHPATYAEAVRGALITNKDVSKRVQPLLENGSSSGY-------------------------PSICS
        ELSRF    IPT+ +KI +F+ GL + I+ L+  + P TYA AVR AL+ +K + +  Q     GSSSG                          P +C 
Subjt:  ELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVDHQHPATYAEAVRGALITNKDVSKRVQPLLENGSSSGY-------------------------PSICS

Query:  NQKRRHVGQCWTEIKVCFKCGKEGHFARECPVTSLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFIST
        + K+ H G CW   ++C++C KEGHFARECP+T  NTQ LGQ  P+    +GG  +AR F LT  +   AE VVT TVLV ++PAY LFDSGSSH+FI++
Subjt:  NQKRRHVGQCWTEIKVCFKCGKEGHFARECPVTSLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFIST

Query:  AFVCQAKLELKLL
         FV  A LEL+ L
Subjt:  AFVCQAKLELKLL

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia]9.0e-8057.5Show/hide
Query:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD
        + GEALNWWDSVA  EDHAN+ +TW +FKDLLY+YY+P+T+KD+KEAEFLH + G + VAQYERKFTELS FA +LIPTEA+KIKRFVKGL KGIR  VD
Subjt:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD

Query:  HQHPATYAEAVRGALITNKDVSKRVQPLLENGSSS-------------------------GYPSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVT
         Q PATYAEAVRG LI + DVS  VQPL+E GSSS                         G P +C + ++R  GQCWT  + CF+CG+EGHFAREC +T
Subjt:  HQHPATYAEAVRGALITNKDVSKRVQPLLENGSSS-------------------------GYPSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVT

Query:  SLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFISTAFVCQAKLELKLL
        + NTQRLGQ A   +  +G                       GT LVHNVPAYVLFD GSSHTFISTAFV QA LEL+ L
Subjt:  SLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFISTAFVCQAKLELKLL

TrEMBL top hitse value%identityAlignment
A0A6J1DL73 uncharacterized protein LOC1110221442.3e-7357.25Show/hide
Query:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD
        + GEALNWWDS+AA EDHAN+ + W +FKDLLY+YY+ ETVKD+KEAEFLHL QG + VAQYERKFTELSRFA +LI   A+KIKRFVKGL KGIR  VD
Subjt:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD

Query:  HQHPATYAEAVRGALITNKDVSKRVQPLLENGSSS-------------------------GYPSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVT
         Q PA+YAEAVRGALI +KDVS +   L E GSSS                         G P +C   ++RH GQCWT  K CF+CG+E HFARECP++
Subjt:  HQHPATYAEAVRGALITNKDVSKRVQPLLENGSSS-------------------------GYPSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVT

Query:  SLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVL
        + NTQRLGQ     +  +G NQ+AR F LT KEA +AETVVTG      V  +V+
Subjt:  SLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVL

A0A6J1DR22 uncharacterized protein LOC1110230351.8e-7042.59Show/hide
Query:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD
        + GEA+NWW+SVAA EDHAN+ VTW +FKDLLYEYYFP TV++ K AEFL LTQG++ VAQY+RKFTELSRF    IPTE +KI +F+ GL + I+ L+ 
Subjt:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD

Query:  HQHPATYAEAVRGALITNK---------------DVSKRVQPLLENGSSSGY---------PSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVTS
         +   TYA AVR AL+ +K                V ++      + SS G+         P  C + K+ H G CW   ++CF+C KEGHFARECP+T 
Subjt:  HQHPATYAEAVRGALITNK---------------DVSKRVQPLLENGSSSGY---------PSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVTS

Query:  LNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFISTAFVCQAKLELKLLVRPVVVSIYT--------IKK
         NTQ LGQ+ P+    +GG Q+AR F LT  +  +AE VVTGT+LV ++PAY LFDSGSSH+FI++ FV  A LEL+ L   + VS  +        + K
Subjt:  LNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFISTAFVCQAKLELKLLVRPVVVSIYT--------IKK

Query:  SNERSTFGLLHRPVLAELDCS--EVELTVDDISAVLARLLLDKSLCYEEVPIEILANETKMLRNWAIDLVKVLWRNHQ
          + S  G      L +LD    +V L +D ++A  A +   K    +EV   + + +    +     + +V++ +HQ
Subjt:  SNERSTFGLLHRPVLAELDCS--EVELTVDDISAVLARLLLDKSLCYEEVPIEILANETKMLRNWAIDLVKVLWRNHQ

A0A6J1DUM2 uncharacterized protein LOC1110232471.8e-9463.93Show/hide
Query:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD
        + GEALNWWDSVAA ED+AN+ + W +FK+LLY+YY+PETVKD+KEAEFLHL QG + VAQYERKFTELSRFA +LIPTEA+KIKRFVKGL KGIR  VD
Subjt:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD

Query:  HQHPATYAEAVRGALITNKDVSKRVQPLLENGSSS-------------------------GYPSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVT
         Q P TYAEAVRGAL+ +KDVS +  PL E GSSS                         G P +C   ++RH GQCWT  K CF+CG+EGHFARECP++
Subjt:  HQHPATYAEAVRGALITNKDVSKRVQPLLENGSSS-------------------------GYPSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVT

Query:  SLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFISTAFVCQAKLELKLL
        + NTQRLGQ  P  +  +G NQ+AR F LT KEA +AETVVTGTVLVH+VPAYVLFDSGSSHTFIS+ FV QA LEL+ L
Subjt:  SLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFISTAFVCQAKLELKLL

A0A6J1DWP4 uncharacterized protein LOC1110252151.8e-7041.16Show/hide
Query:  AYSYSMRLHSNVNLTFKGKNAADPLVPLAGAQAGVVPLFPSAVAQ--ERVVPPAPLLGVVVHKLSHPDIFKLLRVRPNSSRISNATGP----LPLIEESY
        A+  + R H+  +   +G+ AADP VP A    GV P  P A +Q   +V P   LL          +  ++L    N +  +    P    +P  E S 
Subjt:  AYSYSMRLHSNVNLTFKGKNAADPLVPLAGAQAGVVPLFPSAVAQ--ERVVPPAPLLGVVVHKLSHPDIFKLLRVRPNSSRISNATGP----LPLIEESY

Query:  SRRRVGQGVRSFLRV---SRLQRPVQGQRCGLYVEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFT
              + VR    +          + +     + GEA+NWW+SVAA EDHAN+ VTW +FKDLLYEYYFP TV++ K  EFL LTQG++ VA+YERKFT
Subjt:  SRRRVGQGVRSFLRV---SRLQRPVQGQRCGLYVEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFT

Query:  ELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVDHQHPATYAEAVRGALITNKDVSKRVQPLLENGSSSGY-------------------------PSICS
        ELSRF    IPT+ +KI +F+ GL + I+ L+  + P TYA AVR AL+ +K + +  Q     GSSSG                          P +C 
Subjt:  ELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVDHQHPATYAEAVRGALITNKDVSKRVQPLLENGSSSGY-------------------------PSICS

Query:  NQKRRHVGQCWTEIKVCFKCGKEGHFARECPVTSLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFIST
        + K+ H G CW   ++C++C KEGHFARECP+T  NTQ LGQ  P+    +GG  +AR F LT  +   AE VVT TVLV ++PAY LFDSGSSH+FI++
Subjt:  NQKRRHVGQCWTEIKVCFKCGKEGHFARECPVTSLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFIST

Query:  AFVCQAKLELKLL
         FV  A LEL+ L
Subjt:  AFVCQAKLELKLL

A0A6J1DYU5 uncharacterized protein LOC1110255174.4e-8057.5Show/hide
Query:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD
        + GEALNWWDSVA  EDHAN+ +TW +FKDLLY+YY+P+T+KD+KEAEFLH + G + VAQYERKFTELS FA +LIPTEA+KIKRFVKGL KGIR  VD
Subjt:  VEGEALNWWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVD

Query:  HQHPATYAEAVRGALITNKDVSKRVQPLLENGSSS-------------------------GYPSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVT
         Q PATYAEAVRG LI + DVS  VQPL+E GSSS                         G P +C + ++R  GQCWT  + CF+CG+EGHFAREC +T
Subjt:  HQHPATYAEAVRGALITNKDVSKRVQPLLENGSSS-------------------------GYPSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVT

Query:  SLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFISTAFVCQAKLELKLL
        + NTQRLGQ A   +  +G                       GT LVHNVPAYVLFD GSSHTFISTAFV QA LEL+ L
Subjt:  SLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVTGTVLVHNVPAYVLFDSGSSHTFISTAFVCQAKLELKLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G43590.1 zinc knuckle (CCHC-type) family protein5.0e-0440.48Show/hide
Query:  EIKVCFKCGKEGHFARECPVTSLNTQRLGQEAPSIILKRGGN
        E   C++CG+EGHFARECP +S  +   G+E+ ++  +  G+
Subjt:  EIKVCFKCGKEGHFARECPVTSLNTQRLGQEAPSIILKRGGN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAACTCAGTTATGGTCCGACCAATAATTGGAAGTCCCTCTCGAGCCAGTGAGAGTGTGGAGCCCCATGTTCATGTCTTGGAGTTAGCATGTAAGGTGTTG
AATCGACGCGTGGTTTGGAAGCATAGCAGTGTTTATTATCATGTCACTACTGGGCATCGAGGCTCCGAGTACAAATGGTGGAGGGTCGATAGACTAGTTTTGGTG
GAGATGAGGGTTAAGGCTCCGGAAGATGCCTACTCGTATAGTATGAGGCTGCATTCGAACGTCAATCTAACATTCAAAGGCAAGAATGCGGCGGACCCACTGGTC
CCTCTCGCAGGTGCCCAAGCAGGGGTAGTCCCTCTTTTTCCTTCAGCAGTAGCTCAGGAGAGGGTGGTTCCCCCAGCTCCCCTGTTGGGGGTGGTGGTGCACAAG
CTCAGCCACCCCGACATTTTCAAGCTCCTCAGAGTGAGGCCCAATTCATCAAGGATTTCAAACGCTACGGGCCCTCTACCTTTGATAGAGGAAAGCTACAGCCGC
AGAAGAGTAGGTCAGGGAGTTAGAAGCTTTTTACGTGTATCTAGGTTGCAACGACCAGTTCAAGGTCAACGGTGCGGTCTTTATGTTGAGGGCGAAGCCCTAAAT
TGGTGGGACTCAGTAGCAGCGATAGAAGACCATGCTAATATAGCTGTCACATGGGTGAAGTTCAAAGACTTGTTGTATGAATACTACTTTCCAGAGACCGTGAAA
GATGTAAAGGAGGCGGAGTTCCTCCATCTCACACAAGGCAATATGATAGTAGCACAATATGAAAGAAAGTTTACGGAACTCTCCCGTTTTGCTTCGGACCTAATT
CCCACCGAGGCAGTGAAGATCAAGAGGTTTGTTAAAGGCTTGTGCAAAGGGATCAGAGTACTAGTTGATCATCAGCACCCAGCCACTTACGCGGAAGCAGTTAGG
GGCGCCTTAATTACGAATAAGGATGTCTCCAAGAGGGTTCAACCTCTGCTAGAAAACGGTTCATCTTCAGGTTACCCTAGCATTTGTAGCAATCAAAAGCGGAGA
CATGTCGGGCAGTGTTGGACTGAAATTAAAGTTTGTTTCAAATGCGGGAAAGAAGGACATTTTGCAAGAGAGTGTCCTGTGACGAGTTTGAACACACAGAGGCTA
GGTCAGGAGGCCCCCTCAATAATTTTGAAGCGAGGGGGCAACCAGAAAGCTCGTGGTTTTACACTTACACCCAAGGAAGCGGTGAATGCCGAAACCGTTGTAACG
GGTACTGTCTTAGTCCATAATGTACCTGCTTATGTATTGTTTGATTCGGGGTCAAGCCACACTTTTATTTCCACTGCATTTGTTTGTCAAGCAAAACTTGAGCTA
AAACTGTTAGTTAGGCCTGTCGTTGTCAGTATCTACACCATCAAAAAGAGCAACGAGCGTAGTACCTTTGGCTTGCTTCATAGACCGGTACTAGCTGAGTTGGAT
TGTTCTGAGGTGGAGTTAACAGTGGATGACATTTCGGCAGTGTTAGCTCGACTCTTGTTAGATAAATCTCTGTGCTATGAAGAGGTACCAATTGAGATCTTAGCA
AATGAGACCAAGATGCTGAGAAACTGGGCGATTGACTTGGTGAAGGTTCTTTGGAGAAATCACCAAGTGGAGGATGCTACCTGGGAAAGGGAAGGCGAGATCAAA
GCCAAATTCCCTGAGATATTTGCTCAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACAACTCAGTTATGGTCCGACCAATAATTGGAAGTCCCTCTCGAGCCAGTGAGAGTGTGGAGCCCCATGTTCATGTCTTGGAGTTAGCATGTAAGGTGTTG
AATCGACGCGTGGTTTGGAAGCATAGCAGTGTTTATTATCATGTCACTACTGGGCATCGAGGCTCCGAGTACAAATGGTGGAGGGTCGATAGACTAGTTTTGGTG
GAGATGAGGGTTAAGGCTCCGGAAGATGCCTACTCGTATAGTATGAGGCTGCATTCGAACGTCAATCTAACATTCAAAGGCAAGAATGCGGCGGACCCACTGGTC
CCTCTCGCAGGTGCCCAAGCAGGGGTAGTCCCTCTTTTTCCTTCAGCAGTAGCTCAGGAGAGGGTGGTTCCCCCAGCTCCCCTGTTGGGGGTGGTGGTGCACAAG
CTCAGCCACCCCGACATTTTCAAGCTCCTCAGAGTGAGGCCCAATTCATCAAGGATTTCAAACGCTACGGGCCCTCTACCTTTGATAGAGGAAAGCTACAGCCGC
AGAAGAGTAGGTCAGGGAGTTAGAAGCTTTTTACGTGTATCTAGGTTGCAACGACCAGTTCAAGGTCAACGGTGCGGTCTTTATGTTGAGGGCGAAGCCCTAAAT
TGGTGGGACTCAGTAGCAGCGATAGAAGACCATGCTAATATAGCTGTCACATGGGTGAAGTTCAAAGACTTGTTGTATGAATACTACTTTCCAGAGACCGTGAAA
GATGTAAAGGAGGCGGAGTTCCTCCATCTCACACAAGGCAATATGATAGTAGCACAATATGAAAGAAAGTTTACGGAACTCTCCCGTTTTGCTTCGGACCTAATT
CCCACCGAGGCAGTGAAGATCAAGAGGTTTGTTAAAGGCTTGTGCAAAGGGATCAGAGTACTAGTTGATCATCAGCACCCAGCCACTTACGCGGAAGCAGTTAGG
GGCGCCTTAATTACGAATAAGGATGTCTCCAAGAGGGTTCAACCTCTGCTAGAAAACGGTTCATCTTCAGGTTACCCTAGCATTTGTAGCAATCAAAAGCGGAGA
CATGTCGGGCAGTGTTGGACTGAAATTAAAGTTTGTTTCAAATGCGGGAAAGAAGGACATTTTGCAAGAGAGTGTCCTGTGACGAGTTTGAACACACAGAGGCTA
GGTCAGGAGGCCCCCTCAATAATTTTGAAGCGAGGGGGCAACCAGAAAGCTCGTGGTTTTACACTTACACCCAAGGAAGCGGTGAATGCCGAAACCGTTGTAACG
GGTACTGTCTTAGTCCATAATGTACCTGCTTATGTATTGTTTGATTCGGGGTCAAGCCACACTTTTATTTCCACTGCATTTGTTTGTCAAGCAAAACTTGAGCTA
AAACTGTTAGTTAGGCCTGTCGTTGTCAGTATCTACACCATCAAAAAGAGCAACGAGCGTAGTACCTTTGGCTTGCTTCATAGACCGGTACTAGCTGAGTTGGAT
TGTTCTGAGGTGGAGTTAACAGTGGATGACATTTCGGCAGTGTTAGCTCGACTCTTGTTAGATAAATCTCTGTGCTATGAAGAGGTACCAATTGAGATCTTAGCA
AATGAGACCAAGATGCTGAGAAACTGGGCGATTGACTTGGTGAAGGTTCTTTGGAGAAATCACCAAGTGGAGGATGCTACCTGGGAAAGGGAAGGCGAGATCAAA
GCCAAATTCCCTGAGATATTTGCTCAGTAA
Protein sequenceShow/hide protein sequence
MNNSVMVRPIIGSPSRASESVEPHVHVLELACKVLNRRVVWKHSSVYYHVTTGHRGSEYKWWRVDRLVLVEMRVKAPEDAYSYSMRLHSNVNLTFKGKNAADPLV
PLAGAQAGVVPLFPSAVAQERVVPPAPLLGVVVHKLSHPDIFKLLRVRPNSSRISNATGPLPLIEESYSRRRVGQGVRSFLRVSRLQRPVQGQRCGLYVEGEALN
WWDSVAAIEDHANIAVTWVKFKDLLYEYYFPETVKDVKEAEFLHLTQGNMIVAQYERKFTELSRFASDLIPTEAVKIKRFVKGLCKGIRVLVDHQHPATYAEAVR
GALITNKDVSKRVQPLLENGSSSGYPSICSNQKRRHVGQCWTEIKVCFKCGKEGHFARECPVTSLNTQRLGQEAPSIILKRGGNQKARGFTLTPKEAVNAETVVT
GTVLVHNVPAYVLFDSGSSHTFISTAFVCQAKLELKLLVRPVVVSIYTIKKSNERSTFGLLHRPVLAELDCSEVELTVDDISAVLARLLLDKSLCYEEVPIEILA
NETKMLRNWAIDLVKVLWRNHQVEDATWEREGEIKAKFPEIFAQ