CuGenDBv2

CsGy3G042120 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy3G042120
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: transcription initiation factor TFIID subunit 7-like
Genome location: Gy14Chr3:39272634..39274388
RNA-Seq Expression: CsGy3G042120
Synteny: CsGy3G042120
Gene Ontology terms: NA
InterPro domains: NA
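
The genome location field can be split into its parts programmatically. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of CuGenDB, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end, length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
    return chrom, start, end, end - start + 1


print(parse_location("Gy14Chr3:39272634..39274388"))
```

Under that assumption the gene spans 1755 bp on Gy14Chr3.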


Homology
GenBank top hits (e value, % identity, alignment)
XP_004136309.2 uncharacterized protein LOC101218277 [Cucumis sativus] (e value: 3.59e-149; % identity: 98.68)
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
        MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEE---DGKESGEESDEGKGGRRRSL
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEE   DGKESGEESDEGKGGRRRSL
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEE---DGKESGEESDEGKGGRRRSL

Query:  GQRFHDGKLVNGLKFKSCFDLQEYEQQQ
        GQRFHDGKLVNGLKFKSCFDLQEYEQQQ
Subjt:  GQRFHDGKLVNGLKFKSCFDLQEYEQQQ

XP_008466301.1 PREDICTED: uncharacterized protein LOC103503753 [Cucumis melo] (e value: 1.79e-133; % identity: 91.27)
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
        MEVL GPPTFSIEVPP SAFS VSLPSENPSAA+ QNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGG DEVQSK KEGGLCGLESLEKALP
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEE-EEEDGKESGEESDE---GKGGRRRS
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSRKK SFYNWPNPKSMPLLALNEN+E+ +EEDGK+SGEESDE   GKGGRRR+
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEE-EEEDGKESGEESDE---GKGGRRRS

Query:  LGQRFHDGKLVNGLKFKSCFDLQEYEQQQ
        LGQRFHDGKLVNG KFKSCFDLQE EQQQ
Subjt:  LGQRFHDGKLVNGLKFKSCFDLQEYEQQQ

XP_022137206.1 transcription initiation factor TFIID subunit 7-like [Momordica charantia] (e value: 1.46e-97; % identity: 71.55)
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-----DTQNLARSGFLRSGSGS--SIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLE
        MEVLF  PTFSIEVPPS+AF  VS+  ENP+ A      TQ+    G   SGSGS  S+GE SSESSSSIGVPD DS+DDGGG +EVQSKPKEGGLCGL+
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-----DTQNLARSGFLRSGSGS--SIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLE

Query:  SLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEDGKES--------GEE
        SLE ALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSRK GSFYNWPNPKSMPLLALNE+ E+ EE+ +E         G+E
Subjt:  SLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEDGKES--------GEE

Query:  SDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ
         DE    RRR+L  +FHD KLVNG+K KSCFDLQEY+ +
Subjt:  SDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ

XP_022937406.1 uncharacterized protein LOC111443706 [Cucurbita moschata] (e value: 8.51e-95; % identity: 72.1)
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-DTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKAL
        MEV+FGPPTF+IEV  ++AF  VSL  ENP+A  +TQN  R+ F  SGSGSSIGENSS SSSSIGVPDGDSDDDGG G EVQSK KEGGLC L+SLE AL
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-DTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKAL

Query:  PIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEE------------EEED-GKESGEES
        PIKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKR+RILMASKWSRK  SFYNWPNPKSMPLLAL+E+ EE            E+ D G +  +E 
Subjt:  PIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEE------------EEED-GKESGEES

Query:  DEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQ
        DE    R R+LG RFHD KLVNG K KSCFDLQ
Subjt:  DEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQ

XP_038898503.1 uncharacterized protein LOC120086121 [Benincasa hispida] (e value: 2.21e-109; % identity: 78.54)
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
        MEVLFGP TFSIEVPP +AF+ VS+P ENPS  +TQN ARSGF  SGSGSSIGENSS SSSSIG+PD DSDDDG   DEVQSKP EGGLCGLESLE+ALP
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEE---------EDGKESGEESDEGKG
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSRK  SFYNWPNPKSMPLLALNE+ EE+          EDG    +E DE K 
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEE---------EDGKESGEESDEGKG

Query:  GRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ
         RRR+LGQRFHD KLVNG K KSCFDLQEYEQQ
Subjt:  GRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LEH9 Uncharacterized protein (e value: 1.74e-149; % identity: 98.68)
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
        MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEE---DGKESGEESDEGKGGRRRSL
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEE   DGKESGEESDEGKGGRRRSL
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEE---DGKESGEESDEGKGGRRRSL

Query:  GQRFHDGKLVNGLKFKSCFDLQEYEQQQ
        GQRFHDGKLVNGLKFKSCFDLQEYEQQQ
Subjt:  GQRFHDGKLVNGLKFKSCFDLQEYEQQQ

A0A1S3CQX1 uncharacterized protein LOC103503753 (e value: 8.66e-134; % identity: 91.27)
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP
        MEVL GPPTFSIEVPP SAFS VSLPSENPSAA+ QNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGG DEVQSK KEGGLCGLESLEKALP
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEE-EEEDGKESGEESDE---GKGGRRRS
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSRKK SFYNWPNPKSMPLLALNEN+E+ +EEDGK+SGEESDE   GKGGRRR+
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEE-EEEDGKESGEESDE---GKGGRRRS

Query:  LGQRFHDGKLVNGLKFKSCFDLQEYEQQQ
        LGQRFHDGKLVNG KFKSCFDLQE EQQQ
Subjt:  LGQRFHDGKLVNGLKFKSCFDLQEYEQQQ

A0A6J1C5V2 transcription initiation factor TFIID subunit 7-like (e value: 7.06e-98; % identity: 71.55)
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-----DTQNLARSGFLRSGSGS--SIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLE
        MEVLF  PTFSIEVPPS+AF  VS+  ENP+ A      TQ+    G   SGSGS  S+GE SSESSSSIGVPD DS+DDGGG +EVQSKPKEGGLCGL+
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-----DTQNLARSGFLRSGSGS--SIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLE

Query:  SLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEDGKES--------GEE
        SLE ALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSRK GSFYNWPNPKSMPLLALNE+ E+ EE+ +E         G+E
Subjt:  SLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEDGKES--------GEE

Query:  SDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ
         DE    RRR+L  +FHD KLVNG+K KSCFDLQEY+ +
Subjt:  SDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ

A0A6J1FA94 uncharacterized protein LOC111443706 (e value: 4.12e-95; % identity: 72.1)
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-DTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKAL
        MEV+FGPPTF+IEV  ++AF  VSL  ENP+A  +TQN  R+ F  SGSGSSIGENSS SSSSIGVPDGDSDDDGG G EVQSK KEGGLC L+SLE AL
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-DTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKAL

Query:  PIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEE------------EEED-GKESGEES
        PIKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKR+RILMASKWSRK  SFYNWPNPKSMPLLAL+E+ EE            E+ D G +  +E 
Subjt:  PIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEE------------EEED-GKESGEES

Query:  DEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQ
        DE    R R+LG RFHD KLVNG K KSCFDLQ
Subjt:  DEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQ

A0A6J1ILJ2 uncharacterized protein LOC111476326 (e value: 9.18e-95; % identity: 70.34)
Query:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-DTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKAL
        MEV+FGPPTF++EV  ++AF  VSL  ENP+A  +TQN  R+GF  SGS SSIGENSS SSSSIGVPDGDSDDDGG G EVQSK KEGGLC L+SLE AL
Subjt:  MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAA-DTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKAL

Query:  PIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEE----------------DGKESG
        PIKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKR+RILMASKWSRK  SFYNWPNPKSMPLLAL+E+ EE+                  D ++  
Subjt:  PIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEE----------------DGKESG

Query:  EESDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQ
        +E DE    RRR+LG RFHD KLVNG K KSCFDLQ
Subjt:  EESDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G24550.1 unknown protein (e value: 3.1e-21; % identity: 41.58)
Query:  GFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGL-ESLEKALPIKRGLSSHFSGKSKSFANLSEVI-QVKDLEKAENPFNKRR
        G   S + +   E SS+SSSSIG    +++++    D V    + G L     SLE +LPIKRGLS+H+ GKSKSF NL E   + KDLEK ENPFNKRR
Subjt:  GFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGL-ESLEKALPIKRGLSSHFSGKSKSFANLSEVI-QVKDLEKAENPFNKRR

Query:  RILMASKWSRK-----KGSFYNWPNPKSMPLLALNENNEEEEEDGKESGEESDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ
        R+++A+K  R+       +FY+W NP SMPLLAL E NEE+     +  E+ D+G G   R +     + K +   + +SCF L   +++
Subjt:  RILMASKWSRK-----KGSFYNWPNPKSMPLLALNENNEEEEEDGKESGEESDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQEYEQQ

AT3G43850.1 unknown protein (e value: 5.4e-10; % identity: 38.82)
Query:  NSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFSGKSKSFANLSEV--IQVKDLEKAENPFNKRRRILMASKWSRKKG
        +SS SS SIG    +SDDD GG +E++S    G L  +ESLE+ALPIKR +S  + GKSKSF +LSE   + VKDL K EN +++RRR L++ +   + G
Subjt:  NSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFSGKSKSFANLSEV--IQVKDLEKAENPFNKRRRILMASKWSRKKG

Query:  SFYNWPNPKSMPLLALNENNEEEEEDGKESGEESDEGKGGRRRSLGQRFHDG
                    +LA++    + E D   SG++S        ++L  R   G
Subjt:  SFYNWPNPKSMPLLALNENNEEEEEDGKESGEESDEGKGGRRRSLGQRFHDG

AT4G31510.1 unknown protein (e value: 7.8e-17; % identity: 35.62)
Query:  IEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFSG
        +EV   S F   S  + +  A      +R G  R G          ESSSS+G    + +D+    D V S           SLE +LPIKRGLS+H+ G
Subjt:  IEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFSG

Query:  KSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKG----SFYNWPNPKSMPLLALNENNEEE----EEDGKESGEESDEGKGGRRRSLGQRFHD
        KSKSF NL E     DL K E+P NKRRR+L+A+K  R+      S Y   NP SMPLLAL E++ E+    ++D  +     DE    + + +    H 
Subjt:  KSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKG----SFYNWPNPKSMPLLALNENNEEE----EEDGKESGEESDEGKGGRRRSLGQRFHD

Query:  GKLVNGLKFKSCFDLQEYE
          +V   + KSCF L  ++
Subjt:  GKLVNGLKFKSCFDLQEYE

AT5G21940.1 unknown protein (e value: 4.2e-10; % identity: 38.06)
Query:  SENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFSGKSKSFAN--------
        S +PS +D+     S    S + SSIG NS +   S      +   D  G +EV+S P +G L  +ESLE+ LP+++G+S ++SGKSKSF N        
Subjt:  SENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFSGKSKSFAN--------

Query:  LSEVIQVKDLEKAENPFNKRRRILMASK-WSRKK
        L+    +KDL K ENP+++RRR L+  + W   K
Subjt:  LSEVIQVKDLEKAENPFNKRRRILMASK-WSRKK

AT5G24890.1 unknown protein (e value: 3.2e-26; % identity: 41.25)
Query:  LFGPPTFSIEVPPSSAFSAVSLP-----SENPSAADTQN---LARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDD----GGGGDEVQSKPKE-GGLC
        L   PTFSIEV   S +    LP     S + S+ +T N   +  SG  R  SG +   + S  SSSIG P GDS++D        D+V SK     GL 
Subjt:  LFGPPTFSIEVPPSSAFSAVSLP-----SENPSAADTQN---LARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDD----GGGGDEVQSKPKE-GGLC

Query:  GLESLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEDGKESGEES--DE
         + SLE +LP KRGLS+H+ GKSKSF NL E+  VK++ K ENP NKRRR+ + +K +RK  SFY+W NPKSMPLL +NE+ ++++ED  E   +S  DE
Subjt:  GLESLEKALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEDGKESGEES--DE

Query:  GKGGRRRSLGQR--FHDGKLVN-GLKFKSCFDLQEYEQQQ
         K        ++     G   N   K +SCF L +  +++
Subjt:  GKGGRRRSLGQR--FHDGKLVN-GLKFKSCFDLQEYEQQQ


Sequences
CDS sequence
ATGGAAGTTTTGTTCGGTCCTCCAACCTTTAGCATCGAGGTTCCGCCGTCCTCGGCGTTCTCCGCCGTCTCTCTTCCGTCGGAGAATCCATCCGCTGCCGACACTCAGAA
CCTAGCTCGGTCGGGTTTTCTCCGCTCTGGATCTGGTAGCTCGATTGGGGAAAATTCGTCGGAGAGTTCGTCGTCGATTGGAGTTCCTGACGGTGATTCCGATGACGATG
GAGGCGGTGGTGATGAGGTGCAGAGCAAGCCAAAGGAAGGAGGATTGTGCGGATTAGAGTCTCTTGAAAAAGCTCTTCCGATTAAAAGGGGATTATCGAGCCATTTCTCA
GGGAAATCGAAGTCGTTTGCGAATCTATCAGAGGTGATTCAAGTGAAAGATTTAGAGAAGGCAGAGAATCCTTTCAACAAGAGAAGAAGGATTTTAATGGCGTCGAAATG
GTCAAGAAAGAAAGGTTCGTTCTACAACTGGCCAAACCCTAAATCGATGCCTCTGTTAGCTCTGAACGAAAACAATGAAGAAGAAGAAGAAGATGGTAAAGAATCCGGCG
AGGAATCCGATGAAGGTAAAGGAGGAAGAAGACGAAGCCTAGGGCAGAGATTCCATGATGGGAAGCTCGTTAATGGCTTAAAATTCAAAAGCTGTTTTGATCTGCAAGAA
TATGAACAACAACAATGA
mRNA sequence
ATTTGTAATTTAGTTAAAGGAAAAAGAAAAAGAAAAGAAAAACAAAAAACAAATTCCAAACTAAATTCTTCCTGAAAACTCTTAAAATATCCCTTTCTCTGTAAAGTTCA
TCGAGACCCCCCTTATCCTTTACTCTCTCTTTCTCTTTCTCTCTACCAGAAAAGTCAGTGCCAATCACCAAACCGCCGCCGGTTTATGACAACGAACCCTAATCCTTTGG
CGTCGCCGGTGGGCTGAACTACAGCGGAGGAAGCTCGGAAGCCCGCCCCATCTCACTAGTGGGACGGAATCCATCTCCACCGCTAACTCATGGAAGTTTTGTTCGGTCCT
CCAACCTTTAGCATCGAGGTTCCGCCGTCCTCGGCGTTCTCCGCCGTCTCTCTTCCGTCGGAGAATCCATCCGCTGCCGACACTCAGAACCTAGCTCGGTCGGGTTTTCT
CCGCTCTGGATCTGGTAGCTCGATTGGGGAAAATTCGTCGGAGAGTTCGTCGTCGATTGGAGTTCCTGACGGTGATTCCGATGACGATGGAGGCGGTGGTGATGAGGTGC
AGAGCAAGCCAAAGGAAGGAGGATTGTGCGGATTAGAGTCTCTTGAAAAAGCTCTTCCGATTAAAAGGGGATTATCGAGCCATTTCTCAGGGAAATCGAAGTCGTTTGCG
AATCTATCAGAGGTGATTCAAGTGAAAGATTTAGAGAAGGCAGAGAATCCTTTCAACAAGAGAAGAAGGATTTTAATGGCGTCGAAATGGTCAAGAAAGAAAGGTTCGTT
CTACAACTGGCCAAACCCTAAATCGATGCCTCTGTTAGCTCTGAACGAAAACAATGAAGAAGAAGAAGAAGATGGTAAAGAATCCGGCGAGGAATCCGATGAAGGTAAAG
GAGGAAGAAGACGAAGCCTAGGGCAGAGATTCCATGATGGGAAGCTCGTTAATGGCTTAAAATTCAAAAGCTGTTTTGATCTGCAAGAATATGAACAACAACAATGAAAA
TGAAAAATCTTTAAGCCTCTTTGTTCTTCACTTTTTAAAGATTTATTTATCTGAACAAAAAGGAAGGCAACAAAAAAATATTTATTTTTTTCCTTTTTTTTATTTTCTTT
TTTTATGTTTAATGTGTCCATGTAGATAAATTAATAATATAGGTTCTATATAATTATCTATTTAGATATGTATATGTATTCTATTATATTGGTGTCTCAATGTGTGTATG
ATTTATGATATGAATGTTTGAGGGAGATGGCAAGTGATGGTGAAATTATGCAAATCTCCTCGGACATGGTTCGGGCTAAGAGTAGGATGAGTTATAATCCTTCTTAGTTT
TAGGTCATGAATTCAATCCAAGATGATCGTCTATCTAGAATTAATTTTCTATGGGTTTACTATCGTTCAAATTTTGTAAGATCGAACGTGTTGTCCTATGAGAACACTCA
CAAGTATATTAAGAATTGGTTTGGTTTTAGTTGAAGATTACGATTATATTTTAAAATAATTCACTCTCTTACTATTCTTCTACCAAAGGTATTTGTTTCAAGGGTACTTA
TGTACTGATAGATCGTCGTTGGCCAACCTTTGATCACCATCTTTGGCGACGTCGGTCAATAGCCAACTTCAGTGATTATTGAC
Protein sequence
MEVLFGPPTFSIEVPPSSAFSAVSLPSENPSAADTQNLARSGFLRSGSGSSIGENSSESSSSIGVPDGDSDDDGGGGDEVQSKPKEGGLCGLESLEKALPIKRGLSSHFS
GKSKSFANLSEVIQVKDLEKAENPFNKRRRILMASKWSRKKGSFYNWPNPKSMPLLALNENNEEEEEDGKESGEESDEGKGGRRRSLGQRFHDGKLVNGLKFKSCFDLQE
YEQQQ
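
The protein sequence should correspond to the CDS sequence translated codon by codon. A minimal sketch of that check follows, assuming the standard nuclear genetic code. `translate` and `CODON_TABLE` are illustrative names, not part of any CuGenDB tooling.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS (line breaks allowed) up to the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First eight codons of the CDS above.
print(translate("ATGGAAGTTTTGTTCGGTCCTCCA"))  # prints "MEVLFGPP"
```

Passing the full CDS, with its line breaks, and comparing the result with the protein sequence gives a quick consistency check on the record.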