; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013519 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013519
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptiontranscription initiation factor TFIID subunit 7-like
Genome locationChr02:2300079..2300961
RNA-Seq ExpressionHG10013519
SyntenyHG10013519
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136309.2 uncharacterized protein LOC101218277 [Cucumis sativus]8.7e-9082.48Show/hide
Query:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAAAETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDND-GGGDEVQSKPKEGGLCGLESLEDALP
        MEVLFGPPTFSIEVPP +AFS VSLP ENP+AA+TQN ARSGF+ SGSGSSIGENSSESSSSIGVPD DSD+D GGGDEVQSKPKEGGLCGLESLE ALP
Subjt:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAAAETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDND-GGGDEVQSKPKEGGLCGLESLEDALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSR-KASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEEN
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSR K SFYNWPNPKSMPLLALNEN EEEE +E       EDG    +E  E  
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSR-KASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEEN

Query:  ERRRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
          RRR+LGQRFHD KLVNG K KSCFDLQEYEQQ
Subjt:  ERRRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

XP_008466301.1 PREDICTED: uncharacterized protein LOC103503753 [Cucumis melo]5.5e-9283.26Show/hide
Query:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAAAETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALPI
        MEVL GPPTFSIEVPPP+AFSGVSLP ENP+AAE QN ARSGF+ SGSGSSIGENSSESSSSIGVPD DSD+DGGGDEVQSK KEGGLCGLESLE ALPI
Subjt:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAAAETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALPI

Query:  KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSR-KASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEENE
        KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSR KASFYNWPNPKSMPLLALNEN+E+++ +      D +D   ESDEE E   
Subjt:  KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSR-KASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEENE

Query:  RRRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
         RRR LGQRFHD KLVNGFK KSCFDLQE EQQ
Subjt:  RRRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

XP_022937406.1 uncharacterized protein LOC111443706 [Cucurbita moschata]7.6e-9485.28Show/hide
Query:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAA-AETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALP
        MEV+FGPPTF+IEV   TAF GVSL PENPAA  ETQNR R+ F GSGSGSSIGENSS SSSSIGVPD DSD+DGG  EVQSK KEGGLC L+SLEDALP
Subjt:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAA-AETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDR---ESDEEYE
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKR+RILMASKWSRKASFYNWPNPKSMPLLAL+E+EEE+ HKEAAAGSDSED DR   E DEE E
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDR---ESDEEYE

Query:  ENERRRRTLGQRFHDRKLVNGFKSKSCFDLQ
        ENERR RTLG RFHDRKLVNGFKSKSCFDLQ
Subjt:  ENERRRRTLGQRFHDRKLVNGFKSKSCFDLQ

XP_022975769.1 uncharacterized protein LOC111476326 [Cucurbita maxima]1.3e-9383.76Show/hide
Query:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAA-AETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALP
        MEV+FGPPTF++EV   TAF GVSL PENPAA  ETQNR R+GF GSGS SSIGENSS SSSSIGVPD DSD+DGG  EVQSK KEGGLC L+SLEDALP
Subjt:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAA-AETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDR------ESDE
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKR+RILMASKWSRKASFYNWPNPKSMPLLAL+E+EEE+ HKEAAAGSDSED DR      E DE
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDR------ESDE

Query:  EYEENERRRRTLGQRFHDRKLVNGFKSKSCFDLQ
        E EENERRRRTLG RFHD+KLVNGFKSKSCFDLQ
Subjt:  EYEENERRRRTLGQRFHDRKLVNGFKSKSCFDLQ

XP_038898503.1 uncharacterized protein LOC120086121 [Benincasa hispida]1.6e-9988.36Show/hide
Query:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAAAETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALPI
        MEVLFG PTFSIEVPPPTAF+GVS+P ENP+  ETQNRARSGF  SGSGSSIGENSS SSSSIG+PD DSD+DG  DEVQSKP EGGLCGLESLE+ALPI
Subjt:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAAAETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALPI

Query:  KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEENER
        KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNE +EEE+ KEA+  SDSEDGD ESDEE EE ER
Subjt:  KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEENER

Query:  RRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
        RRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
Subjt:  RRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

TrEMBL top hitse value%identityAlignment
A0A0A0LEH9 Uncharacterized protein4.2e-9082.48Show/hide
Query:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAAAETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDND-GGGDEVQSKPKEGGLCGLESLEDALP
        MEVLFGPPTFSIEVPP +AFS VSLP ENP+AA+TQN ARSGF+ SGSGSSIGENSSESSSSIGVPD DSD+D GGGDEVQSKPKEGGLCGLESLE ALP
Subjt:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAAAETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDND-GGGDEVQSKPKEGGLCGLESLEDALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSR-KASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEEN
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEK ENPFNKRRRILMASKWSR K SFYNWPNPKSMPLLALNEN EEEE +E       EDG    +E  E  
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSR-KASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEEN

Query:  ERRRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
          RRR+LGQRFHD KLVNG K KSCFDLQEYEQQ
Subjt:  ERRRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

A0A1S3CQX1 uncharacterized protein LOC1035037532.6e-9283.26Show/hide
Query:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAAAETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALPI
        MEVL GPPTFSIEVPPP+AFSGVSLP ENP+AAE QN ARSGF+ SGSGSSIGENSSESSSSIGVPD DSD+DGGGDEVQSK KEGGLCGLESLE ALPI
Subjt:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAAAETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALPI

Query:  KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSR-KASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEENE
        KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSR KASFYNWPNPKSMPLLALNEN+E+++ +      D +D   ESDEE E   
Subjt:  KRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSR-KASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEENE

Query:  RRRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
         RRR LGQRFHD KLVNGFK KSCFDLQE EQQ
Subjt:  RRRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

A0A6J1C5V2 transcription initiation factor TFIID subunit 7-like7.2e-9080.08Show/hide
Query:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAAA-----ETQNRARSGF--IGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLES
        MEVLF  PTFSIEVPP TAF GVS+ PENPA A     +TQ+R   G    GSGSGSS+GE SSESSSSIGVPDDDS++DGGG+EVQSKPKEGGLCGL+S
Subjt:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAAA-----ETQNRARSGF--IGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLES

Query:  LEDALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENEE--EEEHKEAAAGSDSEDGDRES
        LEDALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRK SFYNWPNPKSMPLLALNE+EE  EEE +E A  SDSEDG    
Subjt:  LEDALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENEE--EEEHKEAAAGSDSEDGDRES

Query:  DEEYEENERRRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
        DEE EENERRRRTL  +FHDRKLVNG KSKSCFDLQEY+ +
Subjt:  DEEYEENERRRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

A0A6J1FA94 uncharacterized protein LOC1114437063.7e-9485.28Show/hide
Query:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAA-AETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALP
        MEV+FGPPTF+IEV   TAF GVSL PENPAA  ETQNR R+ F GSGSGSSIGENSS SSSSIGVPD DSD+DGG  EVQSK KEGGLC L+SLEDALP
Subjt:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAA-AETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDR---ESDEEYE
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKR+RILMASKWSRKASFYNWPNPKSMPLLAL+E+EEE+ HKEAAAGSDSED DR   E DEE E
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDR---ESDEEYE

Query:  ENERRRRTLGQRFHDRKLVNGFKSKSCFDLQ
        ENERR RTLG RFHDRKLVNGFKSKSCFDLQ
Subjt:  ENERRRRTLGQRFHDRKLVNGFKSKSCFDLQ

A0A6J1ILJ2 uncharacterized protein LOC1114763266.3e-9483.76Show/hide
Query:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAA-AETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALP
        MEV+FGPPTF++EV   TAF GVSL PENPAA  ETQNR R+GF GSGS SSIGENSS SSSSIGVPD DSD+DGG  EVQSK KEGGLC L+SLEDALP
Subjt:  MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAA-AETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALP

Query:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDR------ESDE
        IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKR+RILMASKWSRKASFYNWPNPKSMPLLAL+E+EEE+ HKEAAAGSDSED DR      E DE
Subjt:  IKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDR------ESDE

Query:  EYEENERRRRTLGQRFHDRKLVNGFKSKSCFDLQ
        E EENERRRRTLG RFHD+KLVNGFKSKSCFDLQ
Subjt:  EYEENERRRRTLGQRFHDRKLVNGFKSKSCFDLQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G24550.1 unknown protein1.3e-2241.03Show/hide
Query:  SGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGL-ESLEDALPIKRGLSSHFSGKSKSFANLSEVI-QVKDLEKPENPFNKRRRILMA
        S + +   E SS+SSSSIG   ++ + +   D V    + G L     SLED+LPIKRGLS+H+ GKSKSF NL E   + KDLEK ENPFNKRRR+++A
Subjt:  SGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGL-ESLEDALPIKRGLSSHFSGKSKSFANLSEVI-QVKDLEKPENPFNKRRRILMA

Query:  SKWSRK------ASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEENERRRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
        +K  R+      ++FY+W NP SMPLLAL E  EE+ H       D ED D + D+        R+ +    + ++L+   +++SCF L   +++
Subjt:  SKWSRK------ASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEENERRRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

AT3G43850.1 unknown protein2.0e-1240.15Show/hide
Query:  NSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALPIKRGLSSHFSGKSKSFANLSEV--IQVKDLEKPENPFNKRRRILMASKWSRKASF
        +SS SS SIG   ++SD+D GG+        G L  +ESLE+ALPIKR +S  + GKSKSF +LSE   + VKDL KPEN +++RRR L++ +   +   
Subjt:  NSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALPIKRGLSSHFSGKSKSFANLSEV--IQVKDLEKPENPFNKRRRILMASKWSRKASF

Query:  YNWPNPKSMPLLALNENEEEEEHKEAAAGSDS
           P      +LA+++ E +     +++G DS
Subjt:  YNWPNPKSMPLLALNENEEEEEHKEAAAGSDS

AT4G31510.1 unknown protein4.5e-2042.46Show/hide
Query:  ESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGL-ESLEDALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKA-----S
        ESSSS+G   + S+N+   D+  S  +   L     SLED+LPIKRGLS+H+ GKSKSF NL E     DL K E+P NKRRR+L+A+K  R++     S
Subjt:  ESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGL-ESLEDALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKA-----S

Query:  FYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEENERRRRTLGQRFHDRKLVNGFKSKSCFDLQEYE
         Y   NP SMPLLAL E+ + E+HK      D +D D  SD+E  + + +R  +    H   +V   ++KSCF L  ++
Subjt:  FYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEENERRRRTLGQRFHDRKLVNGFKSKSCFDLQEYE

AT5G21940.1 unknown protein2.0e-1240Show/hide
Query:  SGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALPIKRGLSSHFSGKSKSFAN--------LSEVIQVKDLEKPENPFNKR
        S + SSIG NS +   S     +D  +D G +EV+S P +G L  +ESLE  LP+++G+S ++SGKSKSF N        L+    +KDL KPENP+++R
Subjt:  SGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALPIKRGLSSHFSGKSKSFAN--------LSEVIQVKDLEKPENPFNKR

Query:  RRILMASKWSRKASFYNWPNPKSMP
        RR L+  +         W N K+ P
Subjt:  RRILMASKWSRKASFYNWPNPKSMP

AT5G24890.1 unknown protein3.6e-2540.83Show/hide
Query:  LFGPPTFSIEVPPPTAFSGVSLPPENPAAA-----ETQNR---ARSGFIGSGSGSSIGENSSESSSSIGVP----DDDSDNDGGGDEVQSKPKE-GGLCG
        L   PTFSIEV   + +    LP    A++     ET N      SG     SG +   + S  SSSIG P    +D+ +++   D+V SK     GL  
Subjt:  LFGPPTFSIEVPPPTAFSGVSLPPENPAAA-----ETQNR---ARSGFIGSGSGSSIGENSSESSSSIGVP----DDDSDNDGGGDEVQSKPKE-GGLCG

Query:  LESLEDALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENE--EEEEHKEAAAGSDSEDGD
        + SLED+LP KRGLS+H+ GKSKSF NL E+  VK++ K ENP NKRRR+ + +K +RK SFY+W NPKSMPLL +NE+E  ++E+  E    S  ++  
Subjt:  LESLEDALPIKRGLSSHFSGKSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENE--EEEEHKEAAAGSDSEDGD

Query:  RESDEEYEENERRRRTLGQRFHDRKLVNGFKSKSCFDLQE
          SDEE  +    R+     F +R     +KS+SCF L +
Subjt:  RESDEEYEENERRRRTLGQRFHDRKLVNGFKSKSCFDLQE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGTTTTGTTCGGTCCTCCGACTTTCAGCATCGAGGTTCCGCCGCCCACGGCGTTCTCCGGCGTCTCTTTACCGCCGGAGAATCCAGCCGCCGCCGAGACTCAGAA
TCGAGCCCGGTCGGGTTTTATCGGGTCTGGATCTGGTAGCTCGATTGGGGAAAATTCGTCGGAGAGTTCGTCGTCGATTGGAGTTCCCGATGACGATTCCGATAACGACG
GAGGCGGTGATGAGGTGCAAAGCAAGCCGAAGGAAGGAGGATTATGCGGATTGGAATCTCTCGAAGATGCTCTTCCGATTAAAAGGGGATTATCGAGCCATTTCTCAGGG
AAATCGAAGTCGTTCGCGAATCTATCAGAGGTTATTCAAGTGAAAGATTTAGAGAAGCCGGAGAATCCTTTCAATAAGAGAAGAAGAATTTTAATGGCGTCGAAATGGTC
GAGAAAAGCTTCGTTCTACAACTGGCCAAACCCTAAATCGATGCCTCTGTTGGCCCTGAACGAAAACGAAGAAGAAGAGGAACATAAAGAAGCGGCTGCGGGTTCCGATT
CAGAGGACGGAGATCGAGAAAGTGATGAAGAATATGAAGAGAACGAGCGAAGAAGAAGAACCCTAGGGCAAAGGTTCCATGATCGGAAGCTCGTTAATGGCTTTAAATCC
AAGAGCTGTTTTGATCTGCAAGAATATGAACAACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGTTTTGTTCGGTCCTCCGACTTTCAGCATCGAGGTTCCGCCGCCCACGGCGTTCTCCGGCGTCTCTTTACCGCCGGAGAATCCAGCCGCCGCCGAGACTCAGAA
TCGAGCCCGGTCGGGTTTTATCGGGTCTGGATCTGGTAGCTCGATTGGGGAAAATTCGTCGGAGAGTTCGTCGTCGATTGGAGTTCCCGATGACGATTCCGATAACGACG
GAGGCGGTGATGAGGTGCAAAGCAAGCCGAAGGAAGGAGGATTATGCGGATTGGAATCTCTCGAAGATGCTCTTCCGATTAAAAGGGGATTATCGAGCCATTTCTCAGGG
AAATCGAAGTCGTTCGCGAATCTATCAGAGGTTATTCAAGTGAAAGATTTAGAGAAGCCGGAGAATCCTTTCAATAAGAGAAGAAGAATTTTAATGGCGTCGAAATGGTC
GAGAAAAGCTTCGTTCTACAACTGGCCAAACCCTAAATCGATGCCTCTGTTGGCCCTGAACGAAAACGAAGAAGAAGAGGAACATAAAGAAGCGGCTGCGGGTTCCGATT
CAGAGGACGGAGATCGAGAAAGTGATGAAGAATATGAAGAGAACGAGCGAAGAAGAAGAACCCTAGGGCAAAGGTTCCATGATCGGAAGCTCGTTAATGGCTTTAAATCC
AAGAGCTGTTTTGATCTGCAAGAATATGAACAACAATGA
Protein sequenceShow/hide protein sequence
MEVLFGPPTFSIEVPPPTAFSGVSLPPENPAAAETQNRARSGFIGSGSGSSIGENSSESSSSIGVPDDDSDNDGGGDEVQSKPKEGGLCGLESLEDALPIKRGLSSHFSG
KSKSFANLSEVIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNENEEEEEHKEAAAGSDSEDGDRESDEEYEENERRRRTLGQRFHDRKLVNGFKS
KSCFDLQEYEQQ