; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G44680 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G44680
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptiontranscription initiation factor TFIID subunit 7-like
Genome locationChr3:38235282..38236358
RNA-Seq ExpressionCSPI03G44680
SyntenyCSPI03G44680
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136309.2 uncharacterized protein LOC101218277 [Cucumis sativus]1.7e-11499.56Show/hide
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
        MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENN-EEEEEEEDGKESGEESDEGKGGRRRSL
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENN EEEEEEEDGKESGEESDEGKGGRRRSL
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENN-EEEEEEEDGKESGEESDEGKGGRRRSL

Query:  GQRFHDGKLVNGLKFKSCFDLQEYEQQQ
        GQRFHDGKLVNGLKFKSCFDLQEYEQQQ
Subjt:  GQRFHDGKLVNGLKFKSCFDLQEYEQQQ

XP_008466301.1 PREDICTED: uncharacterized protein LOC103503753 [Cucumis melo]1.3e-10190.87Show/hide
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
        MEVL GPPTFSIEVPP SAFS VSLPSENPSAA+ QNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDD GGGDEVQSK KEGGLCGLESLEKALP
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEEEDGKESGEES---DEGKGGRRR
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSRKK SFYNWPNPKSMPLLALNE N+E+++EEDGK+SGEES   DEGKGGRRR
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEEEDGKESGEES---DEGKGGRRR

Query:  SLGQRFHDGKLVNGLKFKSCFDLQEYEQQQ
        +LGQRFHDGKLVNG KFKSCFDLQE EQQQ
Subjt:  SLGQRFHDGKLVNGLKFKSCFDLQEYEQQQ

XP_022137206.1 transcription initiation factor TFIID subunit 7-like [Momordica charantia]3.5e-7572.38Show/hide
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-----DTQNLARSGF--LRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLE
        MEVLF  PTFSIEVPPS+AF  VS+  ENP+ A      TQ+    G     SGSGSS+GE SSESSSSIGVPD DS+DD GGG+EVQSKPKEGGLCGL+
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-----DTQNLARSGF--LRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLE

Query:  SLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEEE------DGKESGEE
        SLE ALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSR KGSFYNWPNPKSMPLLALNE+ E+ EEEE         E G+E
Subjt:  SLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEEE------DGKESGEE

Query:  SDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ
         DE    RRR+L  +FHD KLVNG+K KSCFDLQEY+ +
Subjt:  SDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ

XP_022937406.1 uncharacterized protein LOC111443706 [Cucurbita moschata]5.6e-7371.67Show/hide
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSA-ADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKAL
        MEV+FGPPTF+IEV  ++AF  VSL  ENP+A  +TQN  R+ F  SGSGSSIGENSS SSSSIGVPDGDSDDDGG G EVQSK KEGGLC L+SLE AL
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSA-ADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKAL

Query:  PIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEE-----------EEEEDGKESGEES
        PIKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKR+RILMASKWSR K SFYNWPNPKSMPLLAL+E+ EE+           E+ + G +  +E 
Subjt:  PIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEE-----------EEEEDGKESGEES

Query:  DEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQ
        DE    R R+LG RFHD KLVNG K KSCFDLQ
Subjt:  DEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQ

XP_038898503.1 uncharacterized protein LOC120086121 [Benincasa hispida]7.0e-8478.97Show/hide
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
        MEVLFG PTFSIEVPP +AF+ VS+P ENPS  +TQN ARSGF  SGSGSSIGENSS SSSSIG+PD DSDDD G  DEVQSKP EGGLCGLESLE+ALP
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEE-------EDGKESGEESDEGKG
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSR K SFYNWPNPKSMPLLALNE+ EE+ +E       EDG    +E DE K 
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEE-------EDGKESGEESDEGKG

Query:  GRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ
         RRR+LGQRFHD KLVNG K KSCFDLQEYEQQ
Subjt:  GRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ

TrEMBL top hitse value%identityAlignment
A0A0A0LEH9 Uncharacterized protein8.3e-11599.56Show/hide
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
        MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENN-EEEEEEEDGKESGEESDEGKGGRRRSL
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENN EEEEEEEDGKESGEESDEGKGGRRRSL
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENN-EEEEEEEDGKESGEESDEGKGGRRRSL

Query:  GQRFHDGKLVNGLKFKSCFDLQEYEQQQ
        GQRFHDGKLVNGLKFKSCFDLQEYEQQQ
Subjt:  GQRFHDGKLVNGLKFKSCFDLQEYEQQQ

A0A1S3CQX1 uncharacterized protein LOC1035037536.2e-10290.87Show/hide
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
        MEVL GPPTFSIEVPP SAFS VSLPSENPSAA+ QNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDD GGGDEVQSK KEGGLCGLESLEKALP
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEEEDGKESGEES---DEGKGGRRR
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSRKK SFYNWPNPKSMPLLALNE N+E+++EEDGK+SGEES   DEGKGGRRR
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEEEDGKESGEES---DEGKGGRRR

Query:  SLGQRFHDGKLVNGLKFKSCFDLQEYEQQQ
        +LGQRFHDGKLVNG KFKSCFDLQE EQQQ
Subjt:  SLGQRFHDGKLVNGLKFKSCFDLQEYEQQQ

A0A6J1C5V2 transcription initiation factor TFIID subunit 7-like1.7e-7572.38Show/hide
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-----DTQNLARSGF--LRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLE
        MEVLF  PTFSIEVPPS+AF  VS+  ENP+ A      TQ+    G     SGSGSS+GE SSESSSSIGVPD DS+DD GGG+EVQSKPKEGGLCGL+
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-----DTQNLARSGF--LRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLE

Query:  SLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEEE------DGKESGEE
        SLE ALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSR KGSFYNWPNPKSMPLLALNE+ E+ EEEE         E G+E
Subjt:  SLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEEE------DGKESGEE

Query:  SDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ
         DE    RRR+L  +FHD KLVNG+K KSCFDLQEY+ +
Subjt:  SDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ

A0A6J1FA94 uncharacterized protein LOC1114437062.7e-7371.67Show/hide
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSA-ADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKAL
        MEV+FGPPTF+IEV  ++AF  VSL  ENP+A  +TQN  R+ F  SGSGSSIGENSS SSSSIGVPDGDSDDDGG G EVQSK KEGGLC L+SLE AL
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSA-ADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKAL

Query:  PIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEE-----------EEEEDGKESGEES
        PIKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKR+RILMASKWSR K SFYNWPNPKSMPLLAL+E+ EE+           E+ + G +  +E 
Subjt:  PIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEE-----------EEEEDGKESGEES

Query:  DEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQ
        DE    R R+LG RFHD KLVNG K KSCFDLQ
Subjt:  DEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQ

A0A6J1ILJ2 uncharacterized protein LOC1114763264.6e-7370.76Show/hide
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSA-ADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKAL
        MEV+FGPPTF++EV  ++AF  VSL  ENP+A  +TQN  R+GF  SGS SSIGENSS SSSSIGVPDGDSDDDGG G EVQSK KEGGLC L+SLE AL
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSA-ADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKAL

Query:  PIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEEE--------------DGKESG
        PIKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKR+RILMASKWSR K SFYNWPNPKSMPLLAL+E+ EE+  +E              D ++  
Subjt:  PIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEEE--------------DGKESG

Query:  EESDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQ
        +E DE    RRR+LG RFHD KLVNG K KSCFDLQ
Subjt:  EESDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G24550.1 unknown protein2.0e-2041.15Show/hide
Query:  GFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGL-ESLEKALPIKRGLSSHFSGKSKSFANLSEVI-QVKDLEKAENPFNKRR
        G   S + +   E SS+SSSSIG    +++++    D V    + G L     SLE +LPIKRGLS+H+ GKSKSF NL E   + KDLEK ENPFNKRR
Subjt:  GFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGL-ESLEKALPIKRGLSSHFSGKSKSFANLSEVI-QVKDLEKAENPFNKRR

Query:  RILMASKWSRK-----KGSFYNWPNPKSMPLLALNENNEEEEEEEDGKESGEESDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ
        R+++A+K  R+       +FY+W NP SMPLLAL E NEE+    +      E D+G G   R +     + K +   + +SCF L   +++
Subjt:  RILMASKWSRK-----KGSFYNWPNPKSMPLLALNENNEEEEEEEDGKESGEESDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ

AT3G43850.1 unknown protein2.1e-0938.31Show/hide
Query:  NSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFSGKSKSFANLSEV--IQVKDLEKAENPFNKRRRILMASKWSRKKG
        +SS SS SIG    +SDDD GG +E++S    G L  +ESLE+ALPIKR +S  + GKSKSF +LSE   + VKDL K EN +++RRR L++ +   + G
Subjt:  NSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFSGKSKSFANLSEV--IQVKDLEKAENPFNKRRRILMASKWSRKKG

Query:  SFYNWPNPKSMPLLALNENNEEEEEEEDGKESGEESDEGKGGRRRSLGQRFHDG
                    +LA++      + E D   SG++S        ++L  R   G
Subjt:  SFYNWPNPKSMPLLALNENNEEEEEEEDGKESGEESDEGKGGRRRSLGQRFHDG

AT4G31510.1 unknown protein1.0e-1634.56Show/hide
Query:  IEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFSG
        +EV   S F   S  + +  A      +R G  R G          ESSSS+G    + +D+    D V S           SLE +LPIKRGLS+H+ G
Subjt:  IEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFSG

Query:  KSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKG----SFYNWPNPKSMPLLALNENNEEEEEEEDGKESGEESDEGKGGRRRSLGQRFHDGK
        KSKSF NL E     DL K E+P NKRRR+L+A+K  R+      S Y   NP SMPLLAL E++ E+ +  D  +  + S + +  + +    +  + +
Subjt:  KSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKG----SFYNWPNPKSMPLLALNENNEEEEEEEDGKESGEESDEGKGGRRRSLGQRFHDGK

Query:  LVNGLKFKSCFDLQEYE
             + KSCF L  ++
Subjt:  LVNGLKFKSCFDLQEYE

AT5G21940.1 unknown protein4.2e-1038.06Show/hide
Query:  SENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFSGKSKSFAN--------
        S +PS +D+     S    S + SSIG NS +   S      +   D  G +EV+S P +G L  +ESLE+ LP+++G+S ++SGKSKSF N        
Subjt:  SENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFSGKSKSFAN--------

Query:  LSEVIQVKDLEKAENPFNKRRRILMASK-WSRKK
        L+    +KDL K ENP+++RRR L+  + W   K
Subjt:  LSEVIQVKDLEKAENPFNKRRRILMASK-WSRKK

AT5G24890.1 unknown protein3.2e-2640.42Show/hide
Query:  LFGPPTFSIEVPPSSAFSAVSLP-----SENPSAADTQN---LARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDD----GGGGDEVQSKPKE-GGLC
        L   PTFSIEV   S +    LP     S + S+ +T N   +  SG  R  SG +   + S  SSSIG P GDS++D        D+V SK     GL 
Subjt:  LFGPPTFSIEVPPSSAFSAVSLP-----SENPSAADTQN---LARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDD----GGGGDEVQSKPKE-GGLC

Query:  GLESLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEEEDGKESGEESDE
         + SLE +LP KRGLS+H+ GKSKSF NL E+  VK++ K ENP NKRRR+ + +K +RK  SFY+W NPKSMPLL +NE+ ++++E++D ++     DE
Subjt:  GLESLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEEEDGKESGEESDE

Query:  GKGGRRRSLGQR--FHDGKLVN-GLKFKSCFDLQEYEQQQ
         K        ++     G   N   K +SCF L +  +++
Subjt:  GKGGRRRSLGQR--FHDGKLVN-GLKFKSCFDLQEYEQQQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTTTTGTTCGGTCCTCCAACCTTTAGCATCGAGGTTCCGCCGTCCTCGGCGTTCTCCGCCGTCTCTCTTCCGTCGGAGAATCCATCCGCTGCCGACACTCAGAA
CCTAGCTCGGTCGGGTTTTCTCCGCTCTGGATCTGGTAGCTCGATTGGGGAAAATTCGTCGGAGAGTTCGTCGTCGATTGGAGTTCCTGACGGTGATTCCGATGACGATG
GAGGCGGTGGTGATGAGGTGCAGAGCAAGCCAAAGGAAGGAGGATTGTGCGGATTAGAGTCTCTTGAAAAAGCTCTTCCGATTAAAAGGGGATTATCGAGCCATTTCTCA
GGGAAATCGAAGTCGTTTGCGAATCTATCAGAGGTGATTCAAGTGAAAGATTTAGAGAAGGCAGAGAATCCTTTCAACAAGAGAAGAAGGATTTTAATGGCGTCGAAATG
GTCAAGAAAGAAAGGTTCGTTCTACAACTGGCCAAACCCTAAATCGATGCCTCTGTTAGCTCTGAACGAAAACAATGAAGAAGAAGAAGAAGAAGAAGATGGTAAAGAAT
CCGGCGAGGAATCCGATGAAGGTAAAGGAGGAAGAAGACGAAGCCTAGGGCAGAGATTCCATGATGGGAAGCTCGTTAATGGCTTAAAATTCAAAAGCTGTTTTGATCTG
CAAGAATATGAACAACAACAATGA
mRNA sequenceShow/hide mRNA sequence
ACAAATTCCAAACTAAATTCTTCCTGAAAACTCTTAAAATATCCCTTTCTCTGTAAAGTTCATCGAGACCCCCCTTATCCTTTACTCTCTCTTTCTCTTTCTCTCTACCA
GAAAAGTCAGTGCCAATCACCAAACCGCCGCCGGTTTATGACAACGAACCCTAATCCTTTGGCGTCGCCGGTGGGCTGAACTACAGCGGAGGAAGCTCGGAAGCCCGCCC
CATCTCACTAGTGGGACGGAATCCATCTCCACCGCTAACTCATGGAAGTTTTGTTCGGTCCTCCAACCTTTAGCATCGAGGTTCCGCCGTCCTCGGCGTTCTCCGCCGTC
TCTCTTCCGTCGGAGAATCCATCCGCTGCCGACACTCAGAACCTAGCTCGGTCGGGTTTTCTCCGCTCTGGATCTGGTAGCTCGATTGGGGAAAATTCGTCGGAGAGTTC
GTCGTCGATTGGAGTTCCTGACGGTGATTCCGATGACGATGGAGGCGGTGGTGATGAGGTGCAGAGCAAGCCAAAGGAAGGAGGATTGTGCGGATTAGAGTCTCTTGAAA
AAGCTCTTCCGATTAAAAGGGGATTATCGAGCCATTTCTCAGGGAAATCGAAGTCGTTTGCGAATCTATCAGAGGTGATTCAAGTGAAAGATTTAGAGAAGGCAGAGAAT
CCTTTCAACAAGAGAAGAAGGATTTTAATGGCGTCGAAATGGTCAAGAAAGAAAGGTTCGTTCTACAACTGGCCAAACCCTAAATCGATGCCTCTGTTAGCTCTGAACGA
AAACAATGAAGAAGAAGAAGAAGAAGAAGATGGTAAAGAATCCGGCGAGGAATCCGATGAAGGTAAAGGAGGAAGAAGACGAAGCCTAGGGCAGAGATTCCATGATGGGA
AGCTCGTTAATGGCTTAAAATTCAAAAGCTGTTTTGATCTGCAAGAATATGAACAACAACAATGA
Protein sequenceShow/hide protein sequence
MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFS
GKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEEEDGKESGEESDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDL
QEYEQQQ