; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g02860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g02860
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRNA-directed DNA polymerase
Genome locationchr3:2168979..2186767
RNA-Seq ExpressionMoc03g02860
SyntenyMoc03g02860
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]1.2e-4738.48Show/hide
Query:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINY------------------------------KTNTIQ----LV
        M T  +KKV  VA KLK  ASAWWDQ+  NRQ+  K PIR WEKMKKLM+ RF+P NY                              +TN ++    L+
Subjt:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINY------------------------------KTNTIQ----LV

Query:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQR-W------NTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKLVEQPIKKTI
        + F+ GLR  +KEK++LQP  +L+EAI  A TVE+   N+ K   +R W       T+ G +K    + EK   Q  SSG K         V +  KK  
Subjt:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQR-W------NTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKLVEQPIKKTI

Query:  NNYNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFELKCT
        N Y RP  G C+RCG +GH SN+CPQRK + +  D+ D     L + D++   +E D+G+ ++CI++ +  S  +           + ++    F+ +CT
Subjt:  NNYNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFELKCT

Query:  VDGKICNNIIDSGSTENVVSSKLVAALHLK
        + GK+CN IIDSGS+EN VS KLV AL+LK
Subjt:  VDGKICNNIIDSGSTENVVSSKLVAALHLK

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]1.5e-4737.39Show/hide
Query:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINYK----------------------------------TNTIQLV
        M+TP+ KKV  VA KL+  ASAWWDQLE NRQR  K P+R WEKMKKL++ARFLP NY+                                   N    V
Subjt:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINYK----------------------------------TNTIQLV

Query:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQR--WNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKL-VEQPIKKTI----
        ARF+ GLR  IKEK++LQP  +L+EAI+ A TVE+  A + K   +R  W T+  ++K        +   +TS+ +K K  D  ++ VE+  ++T     
Subjt:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQR--WNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKL-VEQPIKKTI----

Query:  -NNYNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFELKC
         N+Y+RP+LGKCFRCG  GH S+ CPQRK + + ++     +  + +++++   +E DDGE+V+C+++ +  +  +            +++    F+ +C
Subjt:  -NNYNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFELKC

Query:  TVDGKICNNIIDSGSTENVVSSKLVAALHLKLVQFHP
        T++G++C+ IIDSGS+EN V+ KLV  L+LK  + HP
Subjt:  TVDGKICNNIIDSGSTENVVSSKLVAALHLKLVQFHP

XP_031743026.1 uncharacterized protein LOC116404533 [Cucumis sativus]1.4e-4838.07Show/hide
Query:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINYK----------------------------------TNTIQLV
        M+TP+ KKV  VA KL+  ASAWWDQLE NRQR  K P+R WEKMKKL++ARFLP NY+                                   N    V
Subjt:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINYK----------------------------------TNTIQLV

Query:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQR--WNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKL-VEQPIKKTI----
        ARF+ GLR  IKEK++LQP  +L+EAI+ A TVE+  A + K   +R  W T+  ++K        +   +TS+ +K K  D  ++ VE+  ++T     
Subjt:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQR--WNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKL-VEQPIKKTI----

Query:  -NNYNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFELKC
         NNY+RP+LGKCFRCG  GH SN CPQRK + + ++     +  + +++++   +E DDGE+V+C+++ +  +  +            +++    F+ +C
Subjt:  -NNYNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFELKC

Query:  TVDGKICNNIIDSGSTENVVSSKLVAALHLK
        T++G++C+ IIDSGS+EN V+ KLV  L+LK
Subjt:  TVDGKICNNIIDSGSTENVVSSKLVAALHLK

XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus]7.2e-4235.74Show/hide
Query:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINYKTNT----------IQLVARFIS-----GLRTYIKEKLQLQP
        M+TP+ KKV  VA KL+  ASAWWDQLE NRQR  K PIR WEKMKKL++ARFLP NY+             ++ VA +I        RT + E  Q Q 
Subjt:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINYKTNT----------IQLVARFIS-----GLRTYIKEKLQLQP

Query:  IGYLNEAIATAVTVEDQQANKLKFQYQRWNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKL-VEQPIKKTI-----NNYNRPTLGKCFRCGHVGHS
          ++ E +   + +  +  N+       W T+  ++K        +   +TS+  K K  D  ++ VE+  ++T      N+Y+RP+LGKCFRCG  GH 
Subjt:  IGYLNEAIATAVTVEDQQANKLKFQYQRWNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKL-VEQPIKKTI-----NNYNRPTLGKCFRCGHVGHS

Query:  SNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIP---YFELKCTVDGKICNNIIDSGSTEN
        SN CPQRK + + ++     +  + +++++   +E DDGE+V+C ++ +             LI+    +++     F+ +CT++G++C+ IIDSGS+EN
Subjt:  SNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIP---YFELKCTVDGKICNNIIDSGSTEN

Query:  VVSSKLVAALHLKLVQFHP
         V+ KLV  L+LK  + HP
Subjt:  VVSSKLVAALHLKLVQFHP

XP_040994264.1 uncharacterized protein LOC121240799 [Juglans microcarpa x Juglans regia]1.1e-3735Show/hide
Query:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINYK-------------TNTI---------------------QLV
        M  P+ ++VK VA+KL+  ASAWW+Q + NR+R  K P+R+W KMK+LMRARFLP +Y+               TI                     Q V
Subjt:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINYK-------------TNTI---------------------QLV

Query:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQRWNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKLVEQPIKKTI-------
        AR+I GLR  I++K+ L  +  L+EA+  A+ +E Q    L     R       TK      + +     SS    +  +  ++   P   TI       
Subjt:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQRWNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKLVEQPIKKTI-------

Query:  ---NNYNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFEL
           N YN+P  GKCFRC   GH SNECP RK VNLV D  DP K    +S++D  ++E D+G+ V C+++ +  +  +   ++  +I          F+ 
Subjt:  ---NNYNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFEL

Query:  KCTVDGKICNNIIDSGSTENVVSSKLVAALHLKLVQFHPE
        +CTV+ K+CN IIDSGS EN+VS  LV+ L L   + HP+
Subjt:  KCTVDGKICNNIIDSGSTENVVSSKLVAALHLKLVQFHPE

TrEMBL top hitse value%identityAlignment
A0A2I0VI82 RNA-directed DNA polymerase3.4e-3733.54Show/hide
Query:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINY----------------------------------KTNTIQLV
        M  P +K+VK+VA +LK  ASAWW QL+ NRQR  K P+R W +MK++MR  FLP +Y                                  + +  QLV
Subjt:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINY----------------------------------KTNTIQLV

Query:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQRWNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKLVEQPIKKTINN-YNRP
        AR+  GL+  +++KLQL  +  L++A+  A+  E Q + + K    R    D  T+    S  K     +++ S+     + +   +P     NN Y +P
Subjt:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQRWNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKLVEQPIKKTINN-YNRP

Query:  TLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDI-TYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFELKCTVDGKI
        T  KCFRC   GH SNECP R  + +V+   +    +  D++DD+   L PD+GE V CI+E +  +  +  +S+   I          F  +CT+ G++
Subjt:  TLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDI-TYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFELKCTVDGKI

Query:  CNNIIDSGSTENVVSSKLVAALHLK
        C  +ID+G TENVVS  LV AL LK
Subjt:  CNNIIDSGSTENVVSSKLVAALHLK

A0A5A7UXS4 CCHC-type domain-containing protein4.4e-3736.09Show/hide
Query:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINYKTNTIQLVARFISGLRT---YIKEKLQLQPIGYLNE------
        M+T D KKV  VA +L+  ASAWWDQLE NRQR  K PI  WEKMKKL++ARFLP NY+            G R+   YI+E  +L     L E      
Subjt:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINYKTNTIQLVARFISGLRT---YIKEKLQLQPIGYLNE------

Query:  AIATAVTVEDQQANKLKFQYQRWNTSDGQTKKGLLSYEKSSIQATSSGSKNKG---GDEVKLVEQPIK-KTINNYNRPTLGKCFRCGHVGHSSNECPQRK
        A     TVE+     LK   ++       +KK   S   +   +TS G K+K     D  K  +   K K+ N Y RP+L KCFRCG  GH SN CPQR+
Subjt:  AIATAVTVEDQQANKLKFQYQRWNTSDGQTKKGLLSYEKSSIQATSSGSKNKG---GDEVKLVEQPIK-KTINNYNRPTLGKCFRCGHVGHSSNECPQRK

Query:  IVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFELKCTVDGKICNNIIDSGSTENVVSSKLVAALH
         ++L D   +      ++ +++  ++E DDG++++ +++ +  +  +           ++ +    F+ +CT++ ++C+ IIDSGS+EN V+ KLV  L+
Subjt:  IVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFELKCTVDGKICNNIIDSGSTENVVSSKLVAALH

Query:  LK
        LK
Subjt:  LK

A0A5B7BER3 Uncharacterized protein1.6e-3935.34Show/hide
Query:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINYKT-----------------------NTI-----------QLV
        M   DDK+VK VA+KLK  ASAWWDQ++ NR+R  K P+R W+KM++L+R RFLP++Y+                        NT+           Q V
Subjt:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINYKT-----------------------NTI-----------QLV

Query:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANK-LKFQ---------YQRWNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKLVEQPIK
        AR++ GLR  I+++L L+ I  LNEA + A+ VE QQ+ + L+ Q          +     D Q +  +   +K + +  +S SKN+          P +
Subjt:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANK-LKFQ---------YQRWNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKLVEQPIK

Query:  KTINNYNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPF-------LQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSV-
        K+ N Y RP  GKCFRC   GH SNECP R+ VN+V  ++D    F        QD        E D+GE V+C+++ +             L+    V 
Subjt:  KTINNYNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPF-------LQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSV-

Query:  -RSIPYFELKCTVDGKICNNIIDSGSTENVVSSKLVAALHLKLVQFHP
         +    F  +CT++ K+C+ IIDSGS+EN+VS  LV AL LK  + HP
Subjt:  -RSIPYFELKCTVDGKICNNIIDSGSTENVVSSKLVAALHLKLVQFHP

A0A5D3DGR0 Reverse transcriptase5.6e-4838.48Show/hide
Query:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINY------------------------------KTNTIQ----LV
        M T  +KKV  VA KLK  ASAWWDQ+  NRQ+  K PIR WEKMKKLM+ RF+P NY                              +TN ++    L+
Subjt:  MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINY------------------------------KTNTIQ----LV

Query:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQR-W------NTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKLVEQPIKKTI
        + F+ GLR  +KEK++LQP  +L+EAI  A TVE+   N+ K   +R W       T+ G +K    + EK   Q  SSG K         V +  KK  
Subjt:  ARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQR-W------NTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKLVEQPIKKTI

Query:  NNYNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFELKCT
        N Y RP  G C+RCG +GH SN+CPQRK + +  D+ D     L + D++   +E D+G+ ++CI++ +  S  +           + ++    F+ +CT
Subjt:  NNYNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFELKCT

Query:  VDGKICNNIIDSGSTENVVSSKLVAALHLK
        + GK+CN IIDSGS+EN VS KLV AL+LK
Subjt:  VDGKICNNIIDSGSTENVVSSKLVAALHLK

A0A6J1CCQ8 uncharacterized protein LOC111009540 isoform X29.9e-3737.27Show/hide
Query:  NTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINY------------------------------KTNTIQL----VA
        NTP+DKKVK VAFK+++ ASAWWDQLE N +R  K PIR W +M +LMR RFLP N+                              KTN  +     +A
Subjt:  NTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINY------------------------------KTNTIQL----VA

Query:  RFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQRWNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDE------VKLVEQPIKKTINN
        RF+ GLR  I++++ +QPI  L +AI  A  +ED++  +   +   W+     +K       K     T+S S  K  D+       K  +   K+  N 
Subjt:  RFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQYQRWNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDE------VKLVEQPIKKTINN

Query:  YNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSK
        Y RPTLGKCFRCG V H SNECPQR+ + LVD     +      ++DD TY+EPD+G+ ++C+++ +   K
Subjt:  YNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQVACIMEHITDSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATACGCCGGATGACAAAAAGGTTAAGTTCGTCGCCTTCAAATTGAAAACTAGGGCATCTGCATGGTGGGATCAGCTTGAAACGAATCGTCAGCGTTTTTGCAAGGC
TCCTATTCGTATGTGGGAGAAAATGAAGAAGCTAATGAGGGCTCGGTTCTTGCCTATTAACTATAAAACAAATACTATACAACTAGTCGCAAGATTTATTAGTGGTTTAC
GTACATACATTAAAGAAAAACTTCAACTACAACCAATTGGATATTTGAATGAAGCAATTGCAACGGCTGTGACTGTGGAAGACCAACAGGCGAACAAGTTAAAATTTCAA
TACCAAAGATGGAATACAAGTGATGGTCAGACCAAAAAAGGATTATTGTCTTATGAAAAATCATCTATCCAAGCTACATCGTCGGGTTCTAAGAACAAGGGTGGTGACGA
AGTTAAGCTAGTGGAACAACCAATCAAGAAAACAATCAATAACTACAACAGACCTACGTTGGGTAAGTGTTTTCGGTGTGGACATGTAGGTCATTCATCCAATGAATGTC
CACAACGGAAAATCGTAAATCTTGTGGATGATTCCCAAGATCCTGATAAGCCATTTCTTCAAGATTCTGATGATGACATTACATACTTAGAACCAGATGATGGAGAGCAG
GTTGCGTGTATTATGGAACACATTACTGACTCCAAAAACAAATTTGGTTCCTCAAAGACATTCCTTATTTTGAACTCAAGTGTACGGTCGATTCCTTATTTTGAACTCAA
GTGTACGGTCGATGGCAAAATATGCAATAACATTATTGATAGTGGCAGTACAGAAAATGTAGTCTCCAGCAAACTTGTTGCAGCCTTGCATCTCAAGCTGGTTCAATTTC
ACCCTGAGACTGAGGTTCTATTGCTTCTGGACTCTCTTGAGTTTCTGTTTCTTCAGTGGGATGTTGGAGTTCAAGCAGATTCAGATACAGTCAACACCTCTGCTGAATTG
TATCCTTCATGGACATTGGCTGAGGACTCTCTCCCTACTTTGCGAGATATACAACATCACATTGATCTTTTGCTTGTTGGATGGAACGACAAGAGAGGCCCAACTGGTAC
GCAGGATCAACAAGGTACAGGGACTTTGAAGGTTGTCGTCAACCCAATTGCTCCTAATGGAGATCTGCCACGCAAGTCTGCTAGAACAAATCTCCCACTTGCTCCAATTC
ACCAACCCATCTGCTCGAACCGCAGTAGAAGTGTGATCACTAAAAAGACTGGGCGTAGACTGGTTGCTTATACCCTGCCTCCAACAATCCAACCCTCAGAACTTTCACCG
CTGCCCAACCCTGCCTCCGTCTTCATTTACAGAACCTTCATCGTCCAACAATCCCCAACCTTCACCGTCGCCCAACCTTCAGCCGCCTTCCATCCACGCCGGACGCTGCC
GATCGTCGCCGCCCCCTCAACGCCGAATGCGGAACGTCGGACCTCACTGCTTCACGCCGCCACGCCCCCTCCTGTCGCCACGTCGTTCGTCCCTCGTTCCTTCACGTTAA
ACGCAATCGCCACTGCATTGCCCATCCCGCCGTTGCATCGCCCATCCCGCCTCTGCATCGCCCACCCCGCCGCCGCTTCACCCAACCTGCAGCCGTGCTGCCGCCCTCCT
GTCGCCGCACCGCCGCCCCTCGTTCCTTCATGCCGAACGCAATTGCCCAGAATCCAACGTTCACTTGATCCGTCCAAACCAGATTTGACCCTCCCGCCACAGACTAAAAC
CGGAGTCCCCACCTTCTTCCCACCTTTTGGTCTCACGCAACTTCATGCGCCCCACAGCCATCATCTTCCTCACGAAGCAGATTCGCAAGCTGCCATCCCGCCACTCACCT
TCAAGGCCATCATCCCGTCTTGCTGCTCACCTCATGAAGCAAATCCGTGCACCGCTCACCTTCACGCCGCCGTCCACGAACCAAAACCAGACCCGTCACCGTTGCACCCA
ACTGCCGCCACCCTCCCCGTTGGCCGCAACATGTACCATATTTGCTTGATGCGCCACGTCCCAAACCAGATCTACCGTCCCGCCACCCCATTAGATCCGCCGGCCACCCC
AGACCAGATCCACCTACTGCCGCTTCGATATGTTTCCGATCTGCTGCCCAACTTCCATCTTCTAGCAACTGTTCGAGGAAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATACGCCGGATGACAAAAAGGTTAAGTTCGTCGCCTTCAAATTGAAAACTAGGGCATCTGCATGGTGGGATCAGCTTGAAACGAATCGTCAGCGTTTTTGCAAGGC
TCCTATTCGTATGTGGGAGAAAATGAAGAAGCTAATGAGGGCTCGGTTCTTGCCTATTAACTATAAAACAAATACTATACAACTAGTCGCAAGATTTATTAGTGGTTTAC
GTACATACATTAAAGAAAAACTTCAACTACAACCAATTGGATATTTGAATGAAGCAATTGCAACGGCTGTGACTGTGGAAGACCAACAGGCGAACAAGTTAAAATTTCAA
TACCAAAGATGGAATACAAGTGATGGTCAGACCAAAAAAGGATTATTGTCTTATGAAAAATCATCTATCCAAGCTACATCGTCGGGTTCTAAGAACAAGGGTGGTGACGA
AGTTAAGCTAGTGGAACAACCAATCAAGAAAACAATCAATAACTACAACAGACCTACGTTGGGTAAGTGTTTTCGGTGTGGACATGTAGGTCATTCATCCAATGAATGTC
CACAACGGAAAATCGTAAATCTTGTGGATGATTCCCAAGATCCTGATAAGCCATTTCTTCAAGATTCTGATGATGACATTACATACTTAGAACCAGATGATGGAGAGCAG
GTTGCGTGTATTATGGAACACATTACTGACTCCAAAAACAAATTTGGTTCCTCAAAGACATTCCTTATTTTGAACTCAAGTGTACGGTCGATTCCTTATTTTGAACTCAA
GTGTACGGTCGATGGCAAAATATGCAATAACATTATTGATAGTGGCAGTACAGAAAATGTAGTCTCCAGCAAACTTGTTGCAGCCTTGCATCTCAAGCTGGTTCAATTTC
ACCCTGAGACTGAGGTTCTATTGCTTCTGGACTCTCTTGAGTTTCTGTTTCTTCAGTGGGATGTTGGAGTTCAAGCAGATTCAGATACAGTCAACACCTCTGCTGAATTG
TATCCTTCATGGACATTGGCTGAGGACTCTCTCCCTACTTTGCGAGATATACAACATCACATTGATCTTTTGCTTGTTGGATGGAACGACAAGAGAGGCCCAACTGGTAC
GCAGGATCAACAAGGTACAGGGACTTTGAAGGTTGTCGTCAACCCAATTGCTCCTAATGGAGATCTGCCACGCAAGTCTGCTAGAACAAATCTCCCACTTGCTCCAATTC
ACCAACCCATCTGCTCGAACCGCAGTAGAAGTGTGATCACTAAAAAGACTGGGCGTAGACTGGTTGCTTATACCCTGCCTCCAACAATCCAACCCTCAGAACTTTCACCG
CTGCCCAACCCTGCCTCCGTCTTCATTTACAGAACCTTCATCGTCCAACAATCCCCAACCTTCACCGTCGCCCAACCTTCAGCCGCCTTCCATCCACGCCGGACGCTGCC
GATCGTCGCCGCCCCCTCAACGCCGAATGCGGAACGTCGGACCTCACTGCTTCACGCCGCCACGCCCCCTCCTGTCGCCACGTCGTTCGTCCCTCGTTCCTTCACGTTAA
ACGCAATCGCCACTGCATTGCCCATCCCGCCGTTGCATCGCCCATCCCGCCTCTGCATCGCCCACCCCGCCGCCGCTTCACCCAACCTGCAGCCGTGCTGCCGCCCTCCT
GTCGCCGCACCGCCGCCCCTCGTTCCTTCATGCCGAACGCAATTGCCCAGAATCCAACGTTCACTTGATCCGTCCAAACCAGATTTGACCCTCCCGCCACAGACTAAAAC
CGGAGTCCCCACCTTCTTCCCACCTTTTGGTCTCACGCAACTTCATGCGCCCCACAGCCATCATCTTCCTCACGAAGCAGATTCGCAAGCTGCCATCCCGCCACTCACCT
TCAAGGCCATCATCCCGTCTTGCTGCTCACCTCATGAAGCAAATCCGTGCACCGCTCACCTTCACGCCGCCGTCCACGAACCAAAACCAGACCCGTCACCGTTGCACCCA
ACTGCCGCCACCCTCCCCGTTGGCCGCAACATGTACCATATTTGCTTGATGCGCCACGTCCCAAACCAGATCTACCGTCCCGCCACCCCATTAGATCCGCCGGCCACCCC
AGACCAGATCCACCTACTGCCGCTTCGATATGTTTCCGATCTGCTGCCCAACTTCCATCTTCTAGCAACTGTTCGAGGAAGGTAA
Protein sequenceShow/hide protein sequence
MNTPDDKKVKFVAFKLKTRASAWWDQLETNRQRFCKAPIRMWEKMKKLMRARFLPINYKTNTIQLVARFISGLRTYIKEKLQLQPIGYLNEAIATAVTVEDQQANKLKFQ
YQRWNTSDGQTKKGLLSYEKSSIQATSSGSKNKGGDEVKLVEQPIKKTINNYNRPTLGKCFRCGHVGHSSNECPQRKIVNLVDDSQDPDKPFLQDSDDDITYLEPDDGEQ
VACIMEHITDSKNKFGSSKTFLILNSSVRSIPYFELKCTVDGKICNNIIDSGSTENVVSSKLVAALHLKLVQFHPETEVLLLLDSLEFLFLQWDVGVQADSDTVNTSAEL
YPSWTLAEDSLPTLRDIQHHIDLLLVGWNDKRGPTGTQDQQGTGTLKVVVNPIAPNGDLPRKSARTNLPLAPIHQPICSNRSRSVITKKTGRRLVAYTLPPTIQPSELSP
LPNPASVFIYRTFIVQQSPTFTVAQPSAAFHPRRTLPIVAAPSTPNAERRTSLLHAATPPPVATSFVPRSFTLNAIATALPIPPLHRPSRLCIAHPAAASPNLQPCCRPP
VAAPPPLVPSCRTQLPRIQRSLDPSKPDLTLPPQTKTGVPTFFPPFGLTQLHAPHSHHLPHEADSQAAIPPLTFKAIIPSCCSPHEANPCTAHLHAAVHEPKPDPSPLHP
TAATLPVGRNMYHICLMRHVPNQIYRPATPLDPPATPDQIHLLPLRYVSDLLPNFHLLATVRGR