CuGenDBv2

Moc03g14200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g14200
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr3:9571588..9580627
RNA-Seq Expression: Moc03g14200
Synteny: Moc03g14200
Gene Ontology terms: NA
InterPro domains:
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (hit | e value | % identity | alignment)
KAA0047432.1 uncharacterized protein E6C27_scaffold498G00230 [Cucumis melo var. makuwa] | e value: 6.7e-39 | % identity: 51.11
Query:  WSTMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLK
        ++TMEEH  HLQ VF+KLK+NQLYVKREKCSFAQE                                          +N YRRFV+GF +RA PL ELLK
Subjt:  WSTMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLK

Query:  NNQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI
         +  WNW  ECQAAF+ L++A+MEG +L IADV +PFEVETDAS++ALGGVL Q+GH I YES KLN AER +   + E+
Subjt:  NNQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI

KAA0047432.1 uncharacterized protein E6C27_scaffold498G00230 [Cucumis melo var. makuwa] | e value: 1.0e-10 | % identity: 41.67
Query:  VVGELTNNFKTANDDVRSEIAELSTRVNLTMRVMGNQAPAGGHIQFNNVKVPDLKPFCGTRDTKAFENFILDLDY----NDTSQEHSWSTMEEHHI
        ++  ++ +F+   D VR+EIA+++ R++LTMR M NQAPA G I  + VK+ + KPFCG RD KA EN+I DL+      +T  E +  T+   H+
Subjt:  VVGELTNNFKTANDDVRSEIAELSTRVNLTMRVMGNQAPAGGHIQFNNVKVPDLKPFCGTRDTKAFENFILDLDY----NDTSQEHSWSTMEEHHI

KAA0047432.1 uncharacterized protein E6C27_scaffold498G00230 [Cucumis melo var. makuwa] | e value: 2.0e-38 | % identity: 51.4
Query:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN
        +TMEEH  HLQ VF+KLK+NQLYVKREKCSFAQE                                          +N YRRFV+GF +RA PL ELLK 
Subjt:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN

Query:  NQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI
        +  WNW  ECQAAF+ L++A+MEG +L IADV +PFEVETDAS++ALGGVL Q+GH I YES KLN AER +   + E+
Subjt:  NQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI

TYK01597.1 reverse transcriptase [Cucumis melo var. makuwa] | e value: 2.8e-13 | % identity: 43.88
Query:  LAVVGELTNNFKTANDDVRSEIAELSTRVNLTMRVMGNQAPAGGHIQFNNVKVPDLKPFCGTRDTKAFENFILDLDY----NDTSQEHSWSTMEEHHI
        L ++  ++ +F+   D VR+EIA+++ R++LTMR M NQAPAGG I  + VK+P+ KPFCG RD KA EN+I DL+      +T  E +  T+   H+
Subjt:  LAVVGELTNNFKTANDDVRSEIAELSTRVNLTMRVMGNQAPAGGHIQFNNVKVPDLKPFCGTRDTKAFENFILDLDY----NDTSQEHSWSTMEEHHI

TYK01597.1 reverse transcriptase [Cucumis melo var. makuwa] | e value: 3.9e-39 | % identity: 51.96
Query:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN
        +TMEEH  HLQ VF+KLK+NQLYVKREKCSFAQE                                          +N YRRFV+GFL+RA PL ELLK 
Subjt:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN

Query:  NQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI
        +  WNW  ECQAAF+ L++A+MEG +L IADV +PFEVETDAS++ALGGVL Q+GH I YES KLN AER +   + E+
Subjt:  NQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI

TYK07954.1 reverse transcriptase [Cucumis melo var. makuwa] | e value: 1.2e-11 | % identity: 34.59
Query:  LAVVGELTNNFKTANDDVRSEIAELSTRVNLTMRVMGNQAPAGGHIQFNNVKVPDLKPFCGTRDTKAFENFILDLD--YNDTSQEHSWSTMEEHHIHLQL
        L ++  ++ +F+   D VR+EIA+++ R++L MR M NQAPAGG I  + VK+P+ KPFCG R+ KA EN+I DL+  +  T+ E   +      +  +L
Subjt:  LAVVGELTNNFKTANDDVRSEIAELSTRVNLTMRVMGNQAPAGGHIQFNNVKVPDLKPFCGTRDTKAFENFILDLD--YNDTSQEHSWSTMEEHHIHLQL

Query:  VFEKLKQNQLYVKREKCSFAQESNCYRRFVDGF
          +   +N   + R K    + +   R +V  F
Subjt:  VFEKLKQNQLYVKREKCSFAQESNCYRRFVDGF

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia] | e value: 9.7e-46 | % identity: 58.43
Query:  TMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKNN
        TMEEH +HLQLVFEKLKQNQLYVKREKCSFAQE                                          +N YRRFV+GF +R  PL +LLK N
Subjt:  TMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKNN

Query:  QSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI
        Q WNWT EC AAFESL+KAMMEGS+L IADV RPFEVETDAS+FALGGVL QDGH I YES KLNDAER +   + E+
Subjt:  QSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia] | e value: 3.5e-03 | % identity: 64.58
Query:  DNLRCVESRLDEIFTKADGIDVLNACIDWLAVVGELTNNFKTANDDVR
        DNLR VESRLDEI TKADGIDV+NA ID LA + EL    +T  D V+
Subjt:  DNLRCVESRLDEIFTKADGIDVLNACIDWLAVVGELTNNFKTANDDVR

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia] | e value: 3.0e-39 | % identity: 52.51
Query:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN
        +TMEEH  HLQ VF+KLK+NQLYVKREKCSFAQE                                          +N YRRFV+GF +RA PL ELLK 
Subjt:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN

Query:  NQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI
        +  WNW  ECQAAF+ L++AMMEG +L IADV +PFEVETDASN+ALGGVL Q+GH I YES KLN AER +   + E+
Subjt:  NQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI

TrEMBL top hits (hit | e value | % identity | alignment)
A0A5A7TVD8 Reverse transcriptase domain-containing protein | e value: 3.3e-39 | % identity: 51.11
Query:  WSTMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLK
        ++TMEEH  HLQ VF+KLK+NQLYVKREKCSFAQE                                          +N YRRFV+GF +RA PL ELLK
Subjt:  WSTMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLK

Query:  NNQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI
         +  WNW  ECQAAF+ L++A+MEG +L IADV +PFEVETDAS++ALGGVL Q+GH I YES KLN AER +   + E+
Subjt:  NNQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI

A0A5A7TVD8 Reverse transcriptase domain-containing protein | e value: 4.9e-11 | % identity: 41.67
Query:  VVGELTNNFKTANDDVRSEIAELSTRVNLTMRVMGNQAPAGGHIQFNNVKVPDLKPFCGTRDTKAFENFILDLDY----NDTSQEHSWSTMEEHHI
        ++  ++ +F+   D VR+EIA+++ R++LTMR M NQAPA G I  + VK+ + KPFCG RD KA EN+I DL+      +T  E +  T+   H+
Subjt:  VVGELTNNFKTANDDVRSEIAELSTRVNLTMRVMGNQAPAGGHIQFNNVKVPDLKPFCGTRDTKAFENFILDLDY----NDTSQEHSWSTMEEHHI

A0A5A7TVD8 Reverse transcriptase domain-containing protein | e value: 9.5e-39 | % identity: 51.4
Query:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN
        +TMEEH  H+Q VF+KLK+NQLYVKREKCSFAQE                                          +N YRRFV+GF +RA PL ELLK 
Subjt:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN

Query:  NQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI
        +  WNW  ECQAAF+ L++AMMEG +L IADV +PFEVETDAS++ALGGVL Q+GH I YES KLN AER +   + E+
Subjt:  NQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI

A0A5D3BQE4 Reverse transcriptase | e value: 1.4e-13 | % identity: 43.88
Query:  LAVVGELTNNFKTANDDVRSEIAELSTRVNLTMRVMGNQAPAGGHIQFNNVKVPDLKPFCGTRDTKAFENFILDLDY----NDTSQEHSWSTMEEHHI
        L ++  ++ +F+   D VR+EIA+++ R++LTMR M NQAPAGG I  + VK+P+ KPFCG RD KA EN+I DL+      +T  E +  T+   H+
Subjt:  LAVVGELTNNFKTANDDVRSEIAELSTRVNLTMRVMGNQAPAGGHIQFNNVKVPDLKPFCGTRDTKAFENFILDLDY----NDTSQEHSWSTMEEHHI

A0A5D3BQE4 Reverse transcriptase | e value: 1.9e-39 | % identity: 51.96
Query:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN
        +TMEEH  HLQ VF+KLK+NQLYVKREKCSFAQE                                          +N YRRFV+GFL+RA PL ELLK 
Subjt:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN

Query:  NQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI
        +  WNW  ECQAAF+ L++A+MEG +L IADV +PFEVETDAS++ALGGVL Q+GH I YES KLN AER +   + E+
Subjt:  NQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI

A0A6J1DLQ6 uncharacterized protein LOC111022320 | e value: 4.7e-46 | % identity: 58.43
Query:  TMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKNN
        TMEEH +HLQLVFEKLKQNQLYVKREKCSFAQE                                          +N YRRFV+GF +R  PL +LLK N
Subjt:  TMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKNN

Query:  QSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI
        Q WNWT EC AAFESL+KAMMEGS+L IADV RPFEVETDAS+FALGGVL QDGH I YES KLNDAER +   + E+
Subjt:  QSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI

A0A6J1DLQ6 uncharacterized protein LOC111022320 | e value: 1.7e-03 | % identity: 64.58
Query:  DNLRCVESRLDEIFTKADGIDVLNACIDWLAVVGELTNNFKTANDDVR
        DNLR VESRLDEI TKADGIDV+NA ID LA + EL    +T  D V+
Subjt:  DNLRCVESRLDEIFTKADGIDVLNACIDWLAVVGELTNNFKTANDDVR

A0A6J1DLQ6 uncharacterized protein LOC111022320 | e value: 1.5e-39 | % identity: 52.51
Query:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN
        +TMEEH  HLQ VF+KLK+NQLYVKREKCSFAQE                                          +N YRRFV+GF +RA PL ELLK 
Subjt:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN

Query:  NQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI
        +  WNW  ECQAAF+ L++AMMEG +L IADV +PFEVETDASN+ALGGVL Q+GH I YES KLN AER +   + E+
Subjt:  NQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI

SwissProt top hits (hit | e value | % identity | alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | e value: 1.7e-13 | % identity: 28.89
Query:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN
        ++++EH   L LVFEKL +  L ++ +KC F ++                                          +  YR+F+  F + A+P+ + LK 
Subjt:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN

Query:  NQSWNWTS-ECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI
        N   + T+ E  +AF+ L+  + E  IL++ D  + F + TDAS+ ALG VL QDGH + Y S  LN+ E ++ T + E+
Subjt:  NQSWNWTS-ECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI

P10272 Gag-Pol polyprotein | e value: 3.2e-07 | % identity: 32.11
Query:  REKCSFAQESNCYRRFVDGFLERARPLIELLKNNQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQD----GHAIVYESW
        RE   F   +   R ++ GF E A PL  L K +  + W +E Q AFE+L+KA++    L + D ++PF +  D       GVL Q        + Y S 
Subjt:  REKCSFAQESNCYRRFVDGFLERARPLIELLKNNQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQD----GHAIVYESW

Query:  KLNDAERSW
        KL+     W
Subjt:  KLNDAERSW

P10394 Retrovirus-related Pol polyprotein from transposon 412 | e value: 4.0e-10 | % identity: 34.23
Query:  FAQESNCYRRFVDGFLERARPLIELLKNNQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQ--DGH--AIVYESWKLNDA
        F    N YRRF+  F + +R +  L K N  + WT ECQ AF  L+  ++  ++L+  D ++ F + TDAS  A G VL Q  +GH   + Y S      
Subjt:  FAQESNCYRRFVDGFLERARPLIELLKNNQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQ--DGH--AIVYESWKLNDA

Query:  ERSWLTFDPEI
        E +  T + E+
Subjt:  ERSWLTFDPEI

P20825 Retrovirus-related Pol polyprotein from transposon 297 | e value: 5.2e-10 | % identity: 26.67
Query:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN
        +++ EH   +QLVF KL    L ++ +KC F ++                                          +  YR+F+  + + A+P+   LK 
Subjt:  STMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQE------------------------------------------SNCYRRFVDGFLERARPLIELLKN

Query:  NQSWNWTS-ECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI
            +    E   AFE L+  ++   IL++ D  + F + TDASN ALG VL Q+GH I + S  LND E ++   + E+
Subjt:  NQSWNWTS-ECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEI

Q9UR07 Transposon Tf2-11 polyprotein | e value: 4.1e-07 | % identity: 24.62
Query:  YNDTSQEHSWSTMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQES------------------------------------------NCYRRFVDGFLER
        Y D    HS S   EH  H++ V +KLK   L + + KC F Q                                            N  R+F+    + 
Subjt:  YNDTSQEHSWSTMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQES------------------------------------------NCYRRFVDGFLER

Query:  ARPLIELLKNNQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDG-----HAIVYESWKLNDAERSWLTFDPEIERTLK
          PL  LLK +  W WT     A E++++ ++   +L   D ++   +ETDAS+ A+G VL Q       + + Y S K++ A+ ++   D E+   +K
Subjt:  ARPLIELLKNNQSWNWTSECQAAFESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDG-----HAIVYESWKLNDAERSWLTFDPEIERTLK

Arabidopsis top hits (hit | e value | % identity | alignment)
No hits found

Sequences
CDS sequence
ATGTTGCGACATATAGCTATTGTGCCACCGTTGGTATCAGAGCCTGGTTGGCTCCTCGAATATGAAAACGACAATCTTAGATGTGTGGAATCTCGGCTAGATGAG
ATCTTCACCAAAGCTGACGGAATTGACGTCCTGAATGCCTGTATAGATTGGCTTGCTGTGGTCGGCGAGTTAACTAACAACTTCAAAACCGCCAATGACGACGTG
AGGTCGGAGATTGCTGAATTAAGCACTAGAGTAAATCTCACTATGAGAGTAATGGGAAATCAAGCCCCAGCTGGGGGACATATTCAATTCAACAACGTGAAAGTT
CCAGATCTCAAGCCCTTCTGTGGGACCCGAGATACTAAAGCTTTCGAGAACTTCATCTTGGATCTTGATTACAACGATACATCTCAAGAGCATTCATGGTCAACC
ATGGAAGAACACCATATACATTTGCAACTTGTTTTCGAGAAGCTCAAGCAGAACCAGTTGTACGTCAAGCGGGAAAAATGTTCTTTCGCTCAAGAGTCCAACTGC
TATAGACGATTTGTAGATGGATTCTTGGAAAGGGCAAGACCCTTGATTGAACTACTAAAGAATAATCAATCGTGGAATTGGACTTCAGAGTGTCAAGCCGCGTTC
GAGAGCTTGGAAAAGGCAATGATGGAGGGGTCGATATTGGAGATTGCAGATGTTGCAAGACCATTTGAAGTAGAAACTGATGCTTCAAACTTTGCTCTAGGAGGA
GTGCTTTTCCAAGACGGCCACGCCATTGTATACGAGAGTTGGAAGTTGAATGATGCTGAAAGGAGTTGGTTGACCTTTGATCCAGAGATTGAAAGAACCCTTAAG
AGGAGAAGGCGTGAGCAGAGGTTAAGAAAACAAAGAGAAAATCAAAAAGAGAAAGAAGTTGAAGAAGAGACCATCGAGATGAATAGAAATCTACAAGATCCTCCA
TCTCCACAAAATTCACCTGTGAATGTAGATATAACAGGTGAAGGAGCAGCAAACCGAGAACGGTGCAAGGACATGGCTAAATGCGTTAGAACCAAATTCTATCAC
CAGATTGAGCAGTTTTATAGAGCATTGGATGTAACGTCCTGCGTTACTCAGTTTCACGACCACCGTCGGCGGCCCTCCGATGAACCCATTCTCCAACGGTCTCAC
CCAACGACGGCACGGCGACCCAACCCACTACAGCAGGTGTGGCGGACTGTTCGGGACAGTACCATGCAGAGGACAACATTTCCGTTCGTGACAGCGGCATGTGAC
GACAACCCCAACCCACGGCAGCAGGCGTGGCGGTCTATTCGGGATAGCAGCATGCAGTGGGCGGCGATTTCGTTAGTGACAGCGGCGTGCGGCGGTGACAGCAGT
TTTTCTGTATTCAGCGACTACTACCCACGTCGGTTAACAGCCCGATCTACGCAACCCACGCTCCACCGAGTGTTGAATCAACGCGTGTTTTGGAAGCCCAGTAGT
GTGGTGTCGATGCTCCGGGTACAAAAGTCGGGGGTCGATACACCAGTTTTGGTGGAGAGGAGTGTCGAGGCTTCGGGTGTAAACAAGACCCTCGGCCGTGGTGAT
GACGTGGAGGAGCTAATCCAGCATCTATTTTCTATTTATGACGGACCTTCACTTCCAAATGATGGAACCGATGTGGCTACACCTGTTCCTGCATCCACCTCCACT
CCACAACCAGAAAAGAAAGCAGAACCCATAAGTTTAGAAGAGAAAGGTAAAAAGGGAGACAAAGGTAAGCAAGTAGTGCCTTGCACTACTCTACAGGTAGTCACC
TTCAATGTCCTTGATGCAATGCGGCTCCCGGATGAAGTCGAGGATTGCTTTGCAATAGGGGCAATCATGGAGGAACTCAAGGAGATGATTGTGGAAGACTTAGAA
ACAGATTTGGAGCCCGTAGAAGAAGAAGCAAAAATTGCACCTGAGTATTTTGCCACAGTATGA
mRNA sequence
ATGTTGCGACATATAGCTATTGTGCCACCGTTGGTATCAGAGCCTGGTTGGCTCCTCGAATATGAAAACGACAATCTTAGATGTGTGGAATCTCGGCTAGATGAG
ATCTTCACCAAAGCTGACGGAATTGACGTCCTGAATGCCTGTATAGATTGGCTTGCTGTGGTCGGCGAGTTAACTAACAACTTCAAAACCGCCAATGACGACGTG
AGGTCGGAGATTGCTGAATTAAGCACTAGAGTAAATCTCACTATGAGAGTAATGGGAAATCAAGCCCCAGCTGGGGGACATATTCAATTCAACAACGTGAAAGTT
CCAGATCTCAAGCCCTTCTGTGGGACCCGAGATACTAAAGCTTTCGAGAACTTCATCTTGGATCTTGATTACAACGATACATCTCAAGAGCATTCATGGTCAACC
ATGGAAGAACACCATATACATTTGCAACTTGTTTTCGAGAAGCTCAAGCAGAACCAGTTGTACGTCAAGCGGGAAAAATGTTCTTTCGCTCAAGAGTCCAACTGC
TATAGACGATTTGTAGATGGATTCTTGGAAAGGGCAAGACCCTTGATTGAACTACTAAAGAATAATCAATCGTGGAATTGGACTTCAGAGTGTCAAGCCGCGTTC
GAGAGCTTGGAAAAGGCAATGATGGAGGGGTCGATATTGGAGATTGCAGATGTTGCAAGACCATTTGAAGTAGAAACTGATGCTTCAAACTTTGCTCTAGGAGGA
GTGCTTTTCCAAGACGGCCACGCCATTGTATACGAGAGTTGGAAGTTGAATGATGCTGAAAGGAGTTGGTTGACCTTTGATCCAGAGATTGAAAGAACCCTTAAG
AGGAGAAGGCGTGAGCAGAGGTTAAGAAAACAAAGAGAAAATCAAAAAGAGAAAGAAGTTGAAGAAGAGACCATCGAGATGAATAGAAATCTACAAGATCCTCCA
TCTCCACAAAATTCACCTGTGAATGTAGATATAACAGGTGAAGGAGCAGCAAACCGAGAACGGTGCAAGGACATGGCTAAATGCGTTAGAACCAAATTCTATCAC
CAGATTGAGCAGTTTTATAGAGCATTGGATGTAACGTCCTGCGTTACTCAGTTTCACGACCACCGTCGGCGGCCCTCCGATGAACCCATTCTCCAACGGTCTCAC
CCAACGACGGCACGGCGACCCAACCCACTACAGCAGGTGTGGCGGACTGTTCGGGACAGTACCATGCAGAGGACAACATTTCCGTTCGTGACAGCGGCATGTGAC
GACAACCCCAACCCACGGCAGCAGGCGTGGCGGTCTATTCGGGATAGCAGCATGCAGTGGGCGGCGATTTCGTTAGTGACAGCGGCGTGCGGCGGTGACAGCAGT
TTTTCTGTATTCAGCGACTACTACCCACGTCGGTTAACAGCCCGATCTACGCAACCCACGCTCCACCGAGTGTTGAATCAACGCGTGTTTTGGAAGCCCAGTAGT
GTGGTGTCGATGCTCCGGGTACAAAAGTCGGGGGTCGATACACCAGTTTTGGTGGAGAGGAGTGTCGAGGCTTCGGGTGTAAACAAGACCCTCGGCCGTGGTGAT
GACGTGGAGGAGCTAATCCAGCATCTATTTTCTATTTATGACGGACCTTCACTTCCAAATGATGGAACCGATGTGGCTACACCTGTTCCTGCATCCACCTCCACT
CCACAACCAGAAAAGAAAGCAGAACCCATAAGTTTAGAAGAGAAAGGTAAAAAGGGAGACAAAGGTAAGCAAGTAGTGCCTTGCACTACTCTACAGGTAGTCACC
TTCAATGTCCTTGATGCAATGCGGCTCCCGGATGAAGTCGAGGATTGCTTTGCAATAGGGGCAATCATGGAGGAACTCAAGGAGATGATTGTGGAAGACTTAGAA
ACAGATTTGGAGCCCGTAGAAGAAGAAGCAAAAATTGCACCTGAGTATTTTGCCACAGTATGA
Protein sequence
MLRHIAIVPPLVSEPGWLLEYENDNLRCVESRLDEIFTKADGIDVLNACIDWLAVVGELTNNFKTANDDVRSEIAELSTRVNLTMRVMGNQAPAGGHIQFNNVKV
PDLKPFCGTRDTKAFENFILDLDYNDTSQEHSWSTMEEHHIHLQLVFEKLKQNQLYVKREKCSFAQESNCYRRFVDGFLERARPLIELLKNNQSWNWTSECQAAF
ESLEKAMMEGSILEIADVARPFEVETDASNFALGGVLFQDGHAIVYESWKLNDAERSWLTFDPEIERTLKRRRREQRLRKQRENQKEKEVEEETIEMNRNLQDPP
SPQNSPVNVDITGEGAANRERCKDMAKCVRTKFYHQIEQFYRALDVTSCVTQFHDHRRRPSDEPILQRSHPTTARRPNPLQQVWRTVRDSTMQRTTFPFVTAACD
DNPNPRQQAWRSIRDSSMQWAAISLVTAACGGDSSFSVFSDYYPRRLTARSTQPTLHRVLNQRVFWKPSSVVSMLRVQKSGVDTPVLVERSVEASGVNKTLGRGD
DVEELIQHLFSIYDGPSLPNDGTDVATPVPASTSTPQPEKKAEPISLEEKGKKGDKGKQVVPCTTLQVVTFNVLDAMRLPDEVEDCFAIGAIMEELKEMIVEDLE
TDLEPVEEEAKIAPEYFATV