; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0018729 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0018729
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Descriptiontranscription initiation factor TFIID subunit 7-like
Genome locationchr04:2555183..2556531
RNA-Seq ExpressionIVF0018729
SyntenyIVF0018729
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136309.2 uncharacterized protein LOC101218277 [Cucumis sativus]7.88e-13390.48Show/hide
Query:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGG-DEVQSKRKEGGLCGLESLEKALP
        MEVL GPPTFSIEVPP SAFS VSLPSENPSAA+ QNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGG DEVQSK KEGGLCGLESLEKALP
Subjt:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGG-DEVQSKRKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNEN--DEQKQEEDGKDSGEESDEEDEGKGGRR
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSRKK SFYNWPNPKSMPLLALNEN  +E+++EEDGK+SGEES   DEGKGGRR
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNEN--DEQKQEEDGKDSGEESDEEDEGKGGRR

Query:  RNLGQRFHDGKLVNGFKFKSCFDLQECEQQQ
        R+LGQRFHDGKLVNG KFKSCFDLQE EQQQ
Subjt:  RNLGQRFHDGKLVNGFKFKSCFDLQECEQQQ

XP_008466301.1 PREDICTED: uncharacterized protein LOC103503753 [Cucumis melo]4.23e-153100Show/hide
Query:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPI
        MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPI
Subjt:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPI

Query:  KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEEDGKDSGEESDEEDEGKGGRRRNL
        KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEEDGKDSGEESDEEDEGKGGRRRNL
Subjt:  KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEEDGKDSGEESDEEDEGKGGRRRNL

Query:  GQRFHDGKLVNGFKFKSCFDLQECEQQQ
        GQRFHDGKLVNGFKFKSCFDLQECEQQQ
Subjt:  GQRFHDGKLVNGFKFKSCFDLQECEQQQ

XP_022937406.1 uncharacterized protein LOC111443706 [Cucurbita moschata]1.78e-9974.57Show/hide
Query:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAA-EAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALP
        MEV+ GPPTF+IEV   +AF GVSL  ENP+A  E QN  R+ F  SGSGSSIGENSS SSSSIGVPDGDSDDDGG  EVQSK KEGGLC L+SLE ALP
Subjt:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAA-EAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEED---GKDS------GEESDEED
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKR+RILMASKWSRK ASFYNWPNPKSMPLLAL+E++E+K  ++   G DS       +E DEED
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEED---GKDS------GEESDEED

Query:  EGKGGRRRNLGQRFHDGKLVNGFKFKSCFDLQ
        E    R R LG RFHD KLVNGFK KSCFDLQ
Subjt:  EGKGGRRRNLGQRFHDGKLVNGFKFKSCFDLQ

XP_022975769.1 uncharacterized protein LOC111476326 [Cucurbita maxima]1.70e-10073.19Show/hide
Query:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAA-EAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALP
        MEV+ GPPTF++EV   +AF GVSL  ENP+A  E QN  R+GF  SGS SSIGENSS SSSSIGVPDGDSDDDGG  EVQSK KEGGLC L+SLE ALP
Subjt:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAA-EAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEED------------GKDSGEESD
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKR+RILMASKWSRK ASFYNWPNPKSMPLLAL+E++E+K  ++            G D  +E D
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEED------------GKDSGEESD

Query:  EEDEGKGGRRRNLGQRFHDGKLVNGFKFKSCFDLQ
        EEDE    RRR LG RFHD KLVNGFK KSCFDLQ
Subjt:  EEDEGKGGRRRNLGQRFHDGKLVNGFKFKSCFDLQ

XP_038898503.1 uncharacterized protein LOC120086121 [Benincasa hispida]9.23e-11481.03Show/hide
Query:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPI
        MEVL GP TFSIEVPPP+AF+GVS+P ENPS  E QN ARSGF  SGSGSSIGENSS SSSSIG+PD DSDDDG  DEVQSK  EGGLCGLESLE+ALPI
Subjt:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPI

Query:  KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQ-----KQEEDGKDSGEESDEEDEGKGG
        KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRK ASFYNWPNPKSMPLLALNE++E+      +E D +D   ESDEEDE K  
Subjt:  KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQ-----KQEEDGKDSGEESDEEDEGKGG

Query:  RRRNLGQRFHDGKLVNGFKFKSCFDLQECEQQ
        RRR LGQRFHD KLVNGFK KSCFDLQE EQQ
Subjt:  RRRNLGQRFHDGKLVNGFKFKSCFDLQECEQQ

TrEMBL top hitse value%identityAlignment
A0A0A0LEH9 Uncharacterized protein4.7e-10290.48Show/hide
Query:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDD-GGGDEVQSKRKEGGLCGLESLEKALP
        MEVL GPPTFSIEVPP SAFS VSLPSENPSAA+ QNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDD GGGDEVQSK KEGGLCGLESLEKALP
Subjt:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDD-GGGDEVQSKRKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNEN--DEQKQEEDGKDSGEESDEEDEGKGGRR
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSRKK SFYNWPNPKSMPLLALNEN  +E+++EEDGK+SGEES   DEGKGGRR
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNEN--DEQKQEEDGKDSGEESDEEDEGKGGRR

Query:  RNLGQRFHDGKLVNGFKFKSCFDLQECEQQQ
        R+LGQRFHDGKLVNG KFKSCFDLQE EQQQ
Subjt:  RNLGQRFHDGKLVNGFKFKSCFDLQECEQQQ

A0A1S3CQX1 uncharacterized protein LOC1035037531.4e-117100Show/hide
Query:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPI
        MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPI
Subjt:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPI

Query:  KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEEDGKDSGEESDEEDEGKGGRRRNL
        KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEEDGKDSGEESDEEDEGKGGRRRNL
Subjt:  KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEEDGKDSGEESDEEDEGKGGRRRNL

Query:  GQRFHDGKLVNGFKFKSCFDLQECEQQQ
        GQRFHDGKLVNGFKFKSCFDLQECEQQQ
Subjt:  GQRFHDGKLVNGFKFKSCFDLQECEQQQ

A0A6J1C5V2 transcription initiation factor TFIID subunit 7-like1.1e-7470.54Show/hide
Query:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEA----------QNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCG
        MEVL   PTFSIEVPP +AF GVS+  ENP+ A +            L+ SG   SGSGSS+GE SSESSSSIGVPD DS+DDGGG+EVQSK KEGGLCG
Subjt:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEA----------QNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCG

Query:  LESLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEEDGKDS----GEES
        L+SLE ALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSR K SFYNWPNPKSMPLLALNE++EQ +EE+ +++     E+ 
Subjt:  LESLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEEDGKDS----GEES

Query:  DEEDEGKGGRRRNLGQRFHDGKLVNGFKFKSCFDLQECEQQ
        DEEDE    RRR L  +FHD KLVNG K KSCFDLQE + +
Subjt:  DEEDEGKGGRRRNLGQRFHDGKLVNGFKFKSCFDLQECEQQ

A0A6J1FA94 uncharacterized protein LOC1114437065.3e-7774.57Show/hide
Query:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSA-AEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALP
        MEV+ GPPTF+IEV   +AF GVSL  ENP+A  E QN  R+ F  SGSGSSIGENSS SSSSIGVPDGDSDDDGG  EVQSK KEGGLC L+SLE ALP
Subjt:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSA-AEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEED---GKDS------GEESDEED
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKR+RILMASKWSR KASFYNWPNPKSMPLLAL+E++E+K  ++   G DS       +E DEED
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEED---GKDS------GEESDEED

Query:  EGKGGRRRNLGQRFHDGKLVNGFKFKSCFDLQ
        E    R R LG RFHD KLVNGFK KSCFDLQ
Subjt:  EGKGGRRRNLGQRFHDGKLVNGFKFKSCFDLQ

A0A6J1ILJ2 uncharacterized protein LOC1114763268.1e-7873.19Show/hide
Query:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSA-AEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALP
        MEV+ GPPTF++EV   +AF GVSL  ENP+A  E QN  R+GF  SGS SSIGENSS SSSSIGVPDGDSDDDGG  EVQSK KEGGLC L+SLE ALP
Subjt:  MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSA-AEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEED------------GKDSGEESD
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKR+RILMASKWSR KASFYNWPNPKSMPLLAL+E++E+K  ++            G D  +E D
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEED------------GKDSGEESD

Query:  EEDEGKGGRRRNLGQRFHDGKLVNGFKFKSCFDLQ
        EEDE    RRR LG RFHD KLVNGFK KSCFDLQ
Subjt:  EEDEGKGGRRRNLGQRFHDGKLVNGFKFKSCFDLQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G24550.1 unknown protein1.4e-2140.93Show/hide
Query:  GFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGL-ESLEKALPIKRGLSSHFSGKSKSFANLSEVI-QVKDLEKPENPFNKRRR
        G   S + +   E SS+SSSSIG    + +++   D V  +R  G L     SLE +LPIKRGLS+H+ GKSKSF NL E   + KDLEK ENPFNKRRR
Subjt:  GFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGL-ESLEKALPIKRGLSSHFSGKSKSFANLSEVI-QVKDLEKPENPFNKRRR

Query:  ILMASKWSRK-----KASFYNWPNPKSMPLLALNENDEQKQEEDGKDSGEESDEEDEGKGGRRRNLGQRFHDGKLVNGFKFKSCFDLQECEQQ
        +++A+K  R+      ++FY+W NP SMPLLAL E +E+       D      E+D+G G   R +     + K +   + +SCF L   +++
Subjt:  ILMASKWSRK-----KASFYNWPNPKSMPLLALNENDEQKQEEDGKDSGEESDEEDEGKGGRRRNLGQRFHDGKLVNGFKFKSCFDLQECEQQ

AT3G43850.1 unknown protein1.3e-1142.31Show/hide
Query:  NSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPIKRGLSSHFSGKSKSFANLSEV--IQVKDLEKPENPFNKRRRILMA----SKWSR
        +SS SS SIG  +   DD+GG +E++S    G L  +ESLE+ALPIKR +S  + GKSKSF +LSE   + VKDL KPEN +++RRR L++    S+   
Subjt:  NSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPIKRGLSSHFSGKSKSFANLSEV--IQVKDLEKPENPFNKRRRILMA----SKWSR

Query:  KKASFYNWPNPKSMPLLALNENDEQKQEED
         K  F      KS+  ++  E D     +D
Subjt:  KKASFYNWPNPKSMPLLALNENDEQKQEED

AT4G31510.1 unknown protein4.2e-1837.38Show/hide
Query:  IEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPIKRGLSSHFSGK
        +EV   S F   S  + +  A  A   +R G  R G          ESSSS+G    + +D+   D V S +         SLE +LPIKRGLS+H+ GK
Subjt:  IEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPIKRGLSSHFSGK

Query:  SKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKA----SFYNWPNPKSMPLLALNENDEQKQEEDGKDSGEESDEEDEGKGGRRRNLGQRFHDG
        SKSF NL E     DL K E+P NKRRR+L+A+K  R+ +    S Y   NP SMPLLAL E+D +  + +  D  ++S  +DE    + + +    H  
Subjt:  SKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKA----SFYNWPNPKSMPLLALNENDEQKQEEDGKDSGEESDEEDEGKGGRRRNLGQRFHDG

Query:  KLVNGFKFKSCFDL
         +V   + KSCF L
Subjt:  KLVNGFKFKSCFDL

AT5G21940.1 unknown protein1.3e-1139.71Show/hide
Query:  PSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPIKRGLSSHFSGKSKSFAN--------
        PS +PS +       S    S + SSIG NS +   S      D  DD G +EV+S  K G L  +ESLE+ LP+++G+S ++SGKSKSF N        
Subjt:  PSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPIKRGLSSHFSGKSKSFAN--------

Query:  LSEVIQVKDLEKPENPFNKRRRILMASK-WSRKKAS
        L+    +KDL KPENP+++RRR L+  + W   K +
Subjt:  LSEVIQVKDLEKPENPFNKRRRILMASK-WSRKKAS

AT5G24890.1 unknown protein5.4e-2639.58Show/hide
Query:  LLGPPTFSIEVPPPSAFSGVSLP-----SENPSAAEAQN---LARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDD-----GGGDEVQSKRKE-GGLC
        L+  PTFSIEV   S +    LP     S + S+ E  N   +  SG  R  SG +   + S  SSSIG P GDS++D        D+V SK     GL 
Subjt:  LLGPPTFSIEVPPPSAFSGVSLP-----SENPSAAEAQN---LARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDD-----GGGDEVQSKRKE-GGLC

Query:  GLESLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEEDGKDSGEESDEE
         + SLE +LP KRGLS+H+ GKSKSF NL E+  VK++ K ENP NKRRR+ + +K +RK  SFY+W NPKSMPLL +NE+++   E+D ++  +   +E
Subjt:  GLESLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEEDGKDSGEESDEE

Query:  DEGKGGRRRNLGQRFHDGKLVN-GFKFKSCFDLQECEQQQ
        ++               G   N  +K +SCF L +  +++
Subjt:  DEGKGGRRRNLGQRFHDGKLVN-GFKFKSCFDLQECEQQQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTTTTGCTCGGTCCTCCAACCTTCAGCATCGAGGTTCCGCCGCCCTCGGCCTTCTCCGGCGTATCTCTTCCGTCGGAGAATCCTTCCGCCGCCGAGGCTCAGAA
CCTAGCCCGGTCGGGTTTTCTCCGCTCTGGATCTGGTAGCTCGATTGGGGAAAATTCGTCGGAGAGTTCGTCGTCGATTGGAGTACCTGACGGCGATTCCGATGACGATG
GAGGTGGTGATGAGGTGCAGAGCAAGCGAAAGGAAGGAGGATTGTGCGGATTAGAGTCTCTGGAAAAAGCTCTTCCGATTAAAAGGGGATTATCGAGCCATTTCTCAGGG
AAATCGAAGTCGTTTGCGAATCTATCAGAGGTGATTCAAGTGAAAGATTTAGAGAAGCCAGAGAACCCTTTCAACAAGAGAAGACGGATTTTAATGGCGTCGAAATGGTC
AAGAAAGAAAGCTTCGTTCTACAACTGGCCAAACCCTAAATCGATGCCTTTGTTAGCTCTGAACGAAAATGATGAACAAAAACAAGAAGAAGATGGTAAAGACTCCGGTG
AGGAATCCGATGAAGAAGATGAAGGTAAAGGAGGAAGAAGACGAAACCTAGGGCAAAGATTCCATGATGGGAAACTCGTTAATGGCTTCAAATTCAAAAGCTGTTTTGAT
CTGCAAGAATGTGAACAACAACAATGA
mRNA sequenceShow/hide mRNA sequence
TTCAAACTAAATTCTTCCTCAAAACTCTTAAAATATCCCTTTCTCTGTAAAGTTCATCGAGACCCCCCTTATCCCCTACTCTCTCTTTCTCTCTCTCTCTCTACCCGAAA
AGTCAGTGCCAATCACCCCAACCGCCGCCGGTTTATGACAATCAACCCTAATCCTTGGGCGTCACCGGTGGGTTGAACCACAGCGGAGGAAGGTGGGAAGTCCGCCCCAT
CTCACTAGTGGGACGGAATCGATCTCCACCGCTAACTAACTGATGGAAGTTTTGCTCGGTCCTCCAACCTTCAGCATCGAGGTTCCGCCGCCCTCGGCCTTCTCCGGCGT
ATCTCTTCCGTCGGAGAATCCTTCCGCCGCCGAGGCTCAGAACCTAGCCCGGTCGGGTTTTCTCCGCTCTGGATCTGGTAGCTCGATTGGGGAAAATTCGTCGGAGAGTT
CGTCGTCGATTGGAGTACCTGACGGCGATTCCGATGACGATGGAGGTGGTGATGAGGTGCAGAGCAAGCGAAAGGAAGGAGGATTGTGCGGATTAGAGTCTCTGGAAAAA
GCTCTTCCGATTAAAAGGGGATTATCGAGCCATTTCTCAGGGAAATCGAAGTCGTTTGCGAATCTATCAGAGGTGATTCAAGTGAAAGATTTAGAGAAGCCAGAGAACCC
TTTCAACAAGAGAAGACGGATTTTAATGGCGTCGAAATGGTCAAGAAAGAAAGCTTCGTTCTACAACTGGCCAAACCCTAAATCGATGCCTTTGTTAGCTCTGAACGAAA
ATGATGAACAAAAACAAGAAGAAGATGGTAAAGACTCCGGTGAGGAATCCGATGAAGAAGATGAAGGTAAAGGAGGAAGAAGACGAAACCTAGGGCAAAGATTCCATGAT
GGGAAACTCGTTAATGGCTTCAAATTCAAAAGCTGTTTTGATCTGCAAGAATGTGAACAACAACAATGAAAATGAAAATGAAAATGAAAATGAAAAATCTTCAAGCCCTC
TTTGTTCTTCACTTTTTAAAGATTTGTTTATCTGAACAAAAAAGGAAGGCAACCAAAAAATTTATTTTTTTCTTTTTTCAATTTTATTTTATTTTTTATGTTTAATGAGT
CCATGTAGATAAATTAACAATATAGGTTCTATATGATTATCTAATTAGATATGTATATGTATTGTATTATATTGCTGGTTTATGATATGAATGTTTGATGGAGATGGCAA
CTTAGGG
Protein sequenceShow/hide protein sequence
MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSAAEAQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPIKRGLSSHFSG
KSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEEDGKDSGEESDEEDEGKGGRRRNLGQRFHDGKLVNGFKFKSCFD
LQECEQQQ