; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0023129 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0023129
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr06:13782614..13783593
RNA-Seq ExpressionIVF0023129
SyntenyIVF0023129
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042872.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]7.31e-203100Show/hide
Query:  MKAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS
        MKAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS
Subjt:  MKAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS

Query:  VPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCW
        VPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCW
Subjt:  VPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCW

Query:  RMPDVKCRRFYLLGHIERLCKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQNL
        RMPDVKCRRFYLLGHIERLCKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQNL
Subjt:  RMPDVKCRRFYLLGHIERLCKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQNL

XP_008450210.1 PREDICTED: uncharacterized protein LOC103491872 [Cucumis melo]2.53e-17987.5Show/hide
Query:  MKAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS
        MKAKARACLHTAVS VIFN+LMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS
Subjt:  MKAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS

Query:  VPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCW
        VPKRYEATIASLENTKDLSKLKVIEVVS LQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSS ESLAKDVGSACKHYGKQNHPHFRCW
Subjt:  VPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCW

Query:  RMPDVKCRR--------FYLLGHIERL------CKAVTTQH-------GCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLIT
        RMPDVKCR          Y     E        C + TTQ+       GCTNHMTSYKDLFK+LDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLIT
Subjt:  RMPDVKCRR--------FYLLGHIERL------CKAVTTQH-------GCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLIT

Query:  EVLFVPEIDQNL
        EVLFVPEIDQNL
Subjt:  EVLFVPEIDQNL

XP_016669911.2 uncharacterized protein LOC107889873 [Gossypium hirsutum]1.18e-12165.45Show/hide
Query:  KAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVSV
        KAKAR+CL+ +VS  IFN++MA   AKEIW++ K+EY+G ERIK MKVLNL+REFER+Q+K+S+SI EYSDKLI IANK RA G DLSDSRLVQKILVSV
Subjt:  KAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVSV

Query:  PKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIE----GALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVG-------SACKHYG
        P++YEATIAS ENTKDL++LKV+E++S LQ QEQRRLM QEGSIE    GALKA+MQQGE  +E+KW G K +  S +E++AK          S+CK+ G
Subjt:  PKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIE----GALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVG-------SACKHYG

Query:  KQNHPHFRCWRMPDVKCRRFYLLGHIERLCKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQN
        K NHPHFRCWR PDVKC R  L+GHIE    +     GCTNH+T  + LFKDLD S KS++ IGNG YLEVKG+G V+IES AGTKLI++VLFVPEIDQN
Subjt:  KQNHPHFRCWRMPDVKCRRFYLLGHIERLCKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQN

Query:  L
        L
Subjt:  L

XP_022148138.1 uncharacterized protein LOC111016891 [Momordica charantia]1.13e-14671.91Show/hide
Query:  KAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVSV
        K KA ACL  AVS  IFN++MAL+SAKEIWEFLK+EYEG ERIKGMKVLNL+REFE MQ+KDS+SI EYSDKLIGIANK RALG DLS +RLVQKILVSV
Subjt:  KAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVSV

Query:  PKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCWR
        P+RYEATIASLENTKDLSKLKVIEVVSVLQ QEQRRL+ QEGS+EGALKARMQ GEGG+E KWKG K +G SS+E  +KDVGSACKH GK NHPHFRCWR
Subjt:  PKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCWR

Query:  MPDVKCRRFYLLGHIERLCKAVTTQH----------------------------------GCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTV
         P VKCR   LLGHIER  K   TQ                                   GCTNHMTS K+LFKDLD SFKSR+KI NG YLEVKGKGTV
Subjt:  MPDVKCRRFYLLGHIERLCKAVTTQH----------------------------------GCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTV

Query:  SIESCAGTKLITEVLFVPEIDQNL
        SIESC GTKLI EVLFVPEIDQNL
Subjt:  SIESCAGTKLITEVLFVPEIDQNL

XP_038889190.1 uncharacterized protein LOC120079069 [Benincasa hispida]4.90e-12167.92Show/hide
Query:  KAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVSV
        K KAR CL+ AVS  IFN++MAL+S KEIWEFLKSEYEG ERIKGMKVLNL+REFERMQ+KD KSI EYSDKLIGIANK RALG DLSD+RLV       
Subjt:  KAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVSV

Query:  PKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCWR
                              IEVVS LQ QEQ RL+RQEGSIEGALKAR+ QGEG +EKK     G+G SS+ES  KD GS CKH GKQNHPHFRCWR
Subjt:  PKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCWR

Query:  MPDVKCRRFYLLGHIERL---CKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQNL
         P+VKCR  +LLGHIERL   C       GCTNHMT+ K+LFKD+D SFK R+KIGNG YLEVKGK TVSIESCAG KLIT+VLFVPEIDQNL
Subjt:  MPDVKCRRFYLLGHIERL---CKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQNL

TrEMBL top hitse value%identityAlignment
A0A1S3BNQ3 uncharacterized protein LOC1034918723.3e-14087.5Show/hide
Query:  MKAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS
        MKAKARACLHTAVS VIFN+LMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS
Subjt:  MKAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS

Query:  VPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCW
        VPKRYEATIASLENTKDLSKLKVIEVVS LQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSS ESLAKDVGSACKHYGKQNHPHFRCW
Subjt:  VPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCW

Query:  RMPDVKC--------RRFYLLGHIER------LCKAVTTQH-------GCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLIT
        RMPDVKC        R  Y     E        C + TTQ+       GCTNHMTSYKDLFK+LDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLIT
Subjt:  RMPDVKC--------RRFYLLGHIER------LCKAVTTQH-------GCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLIT

Query:  EVLFVPEIDQNL
        EVLFVPEIDQNL
Subjt:  EVLFVPEIDQNL

A0A1U8HV73 uncharacterized protein LOC1078898732.8e-9965.78Show/hide
Query:  KAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVSV
        KAKAR+CL+ +VS  IFN++MA   AKEIW++ K+EY+G ERIK MKVLNL+REFER+Q+K+S+SI EYSDKLI IANK RA G DLSDSRLVQKILVSV
Subjt:  KAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVSV

Query:  PKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEG----SIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVG-------SACKHYG
        P++YEATIAS ENTKDL++LKV+E++S LQ QEQRRLM QEG    SIEGALKA+MQQGE G+E+KW G K +  S +E++AK          S+CK+ G
Subjt:  PKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEG----SIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVG-------SACKHYG

Query:  KQNHPHFRCWRMPDVKCRRFYLLGHIERLCKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQN
        K NHPHFRCWR PDVKC R  L+GHIE    +     GCTNH+T  + LFKDLD S KS++ IGNG YLEVKG+G V+IES AGTKLI++VLFVPEIDQN
Subjt:  KQNHPHFRCWRMPDVKCRRFYLLGHIERLCKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQN

Query:  L
        L
Subjt:  L

A0A5D3C1N2 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-158100Show/hide
Query:  MKAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS
        MKAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS
Subjt:  MKAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS

Query:  VPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCW
        VPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCW
Subjt:  VPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCW

Query:  RMPDVKCRRFYLLGHIERLCKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQNL
        RMPDVKCRRFYLLGHIERLCKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQNL
Subjt:  RMPDVKCRRFYLLGHIERLCKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQNL

A0A6J1D394 uncharacterized protein LOC1110168911.5e-11671.91Show/hide
Query:  KAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVSV
        K KA ACL  AVS  IFN++MAL+SAKEIWEFLK+EYEG ERIKGMKVLNL+REFE MQ+KDS+SI EYSDKLIGIANK RALG DLS +RLVQKILVSV
Subjt:  KAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVSV

Query:  PKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCWR
        P+RYEATIASLENTKDLSKLKVIEVVSVLQ QEQRRL+ QEGS+EGALKARMQ GEGG+E KWKG K +G SS+E  +KDVGSACKH GK NHPHFRCWR
Subjt:  PKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCWR

Query:  MPDVKCRRFYLLGHIERLCKAVTTQH----------------------------------GCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTV
         P VKCR   LLGHIER  K   TQ                                   GCTNHMTS K+LFKDLD SFKSR+KI NG YLEVKGKGTV
Subjt:  MPDVKCRRFYLLGHIERLCKAVTTQH----------------------------------GCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTV

Query:  SIESCAGTKLITEVLFVPEIDQNL
        SIESC GTKLI EVLFVPEIDQNL
Subjt:  SIESCAGTKLITEVLFVPEIDQNL

A0A6J1DFA6 uncharacterized protein LOC1110195545.3e-8263.08Show/hide
Query:  MKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVSVPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIE
        MKVLNL+REFERMQ+KDS+SI  YSDKLIGIA K RALG DLSD+RLVQKILVSVPKRYEATIASLENTK LSKLKVIEV                    
Subjt:  MKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVSVPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIE

Query:  GALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCWRMPDVKCRRFYLLGHIERLCKAVTTQH-------------------
                 GEGG+EKKWK  K +G SS+E  AKDVGS CKH GK NHPHFRCWR PDVKCRR  LLGHIER CK   TQ                    
Subjt:  GALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCWRMPDVKCRRFYLLGHIERLCKAVTTQH-------------------

Query:  ---------------GCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQNL
                       GCTNHMTS   LFKDLD SFKSR+KIGNG YL+VKGKGTVSIESC GTKLI +VLFVPEIDQNL
Subjt:  ---------------GCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein7.7e-0921.62Show/hide
Query:  AKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGG--ERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS
        AKA   L ++++  +F K ++  SAK++W+ L+   E     R++ + +  L ++ E +++ D +S + Y DK + I  +      + SD  + + +  +
Subjt:  AKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGG--ERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVS

Query:  VPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCW
        +   ++   + LE   D+ K+    +V     +        E +I G LK    +    K +KW G       + E     + +  +    +    +R  
Subjt:  VPKRYEATIASLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCW

Query:  RMPDVKCRRF----YLLGHIERLCKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAG-TKLITEVLFVPEIDQNL
         +P++  + +    +++  +  +            +MT Y   F  LD +FK+ +   +G  L V+GKG V I    G  K I  V+FVP +++N+
Subjt:  RMPDVKCRRF----YLLGHIERLCKAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAG-TKLITEVLFVPEIDQNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCAAAGGCAAGAGCTTGCCTACATACAGCTGTGTCTTCTGTTATATTCAACAAACTTATGGCCTTAGAGTCAGCAAAGGAGATCTGGGAGTTTCTCAAAAGTGA
GTACGAAGGTGGTGAAAGAATTAAAGGCATGAAGGTGTTGAACTTGATGCGAGAATTCGAGCGAATGCAAATAAAGGATTCTAAGTCCATCACAGAATACTCAGATAAGT
TGATTGGGATTGCTAATAAGGAAAGAGCATTAGGACGTGATTTATCTGATAGTAGATTGGTTCAGAAGATTCTGGTTTCAGTACCTAAGCGATATGAAGCAACTATTGCT
TCCTTAGAAAATACTAAAGACCTCTCAAAACTTAAGGTGATAGAAGTAGTTAGTGTTTTGCAAAAACAAGAGCAAAGGAGGTTGATGAGGCAAGAAGGAAGCATTGAGGG
GGCACTAAAAGCTAGAATGCAGCAGGGAGAAGGTGGAAAAGAGAAGAAGTGGAAAGGGAATAAGGGAAATGGCAAAAGTAGCACGGAGTCTCTTGCAAAGGATGTTGGTA
GTGCATGCAAGCACTATGGAAAACAGAATCATCCACATTTCAGATGCTGGAGAATGCCAGATGTGAAGTGTAGAAGGTTTTATTTATTGGGGCACATTGAACGATTATGT
AAGGCAGTAACCACTCAACATGGGTGTACCAATCACATGACAAGTTACAAAGATTTGTTCAAGGACCTTGACACGTCATTCAAGTCAAGGATGAAAATTGGAAATGGTGC
GTATTTAGAAGTAAAGGGGAAGGGCACAGTGTCAATAGAGAGTTGTGCTGGAACCAAGTTGATTACTGAAGTGTTGTTTGTCCCTGAGATTGATCAAAACTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCAAAGGCAAGAGCTTGCCTACATACAGCTGTGTCTTCTGTTATATTCAACAAACTTATGGCCTTAGAGTCAGCAAAGGAGATCTGGGAGTTTCTCAAAAGTGA
GTACGAAGGTGGTGAAAGAATTAAAGGCATGAAGGTGTTGAACTTGATGCGAGAATTCGAGCGAATGCAAATAAAGGATTCTAAGTCCATCACAGAATACTCAGATAAGT
TGATTGGGATTGCTAATAAGGAAAGAGCATTAGGACGTGATTTATCTGATAGTAGATTGGTTCAGAAGATTCTGGTTTCAGTACCTAAGCGATATGAAGCAACTATTGCT
TCCTTAGAAAATACTAAAGACCTCTCAAAACTTAAGGTGATAGAAGTAGTTAGTGTTTTGCAAAAACAAGAGCAAAGGAGGTTGATGAGGCAAGAAGGAAGCATTGAGGG
GGCACTAAAAGCTAGAATGCAGCAGGGAGAAGGTGGAAAAGAGAAGAAGTGGAAAGGGAATAAGGGAAATGGCAAAAGTAGCACGGAGTCTCTTGCAAAGGATGTTGGTA
GTGCATGCAAGCACTATGGAAAACAGAATCATCCACATTTCAGATGCTGGAGAATGCCAGATGTGAAGTGTAGAAGGTTTTATTTATTGGGGCACATTGAACGATTATGT
AAGGCAGTAACCACTCAACATGGGTGTACCAATCACATGACAAGTTACAAAGATTTGTTCAAGGACCTTGACACGTCATTCAAGTCAAGGATGAAAATTGGAAATGGTGC
GTATTTAGAAGTAAAGGGGAAGGGCACAGTGTCAATAGAGAGTTGTGCTGGAACCAAGTTGATTACTGAAGTGTTGTTTGTCCCTGAGATTGATCAAAACTTGTGA
Protein sequenceShow/hide protein sequence
MKAKARACLHTAVSSVIFNKLMALESAKEIWEFLKSEYEGGERIKGMKVLNLMREFERMQIKDSKSITEYSDKLIGIANKERALGRDLSDSRLVQKILVSVPKRYEATIA
SLENTKDLSKLKVIEVVSVLQKQEQRRLMRQEGSIEGALKARMQQGEGGKEKKWKGNKGNGKSSTESLAKDVGSACKHYGKQNHPHFRCWRMPDVKCRRFYLLGHIERLC
KAVTTQHGCTNHMTSYKDLFKDLDTSFKSRMKIGNGAYLEVKGKGTVSIESCAGTKLITEVLFVPEIDQNL