CuGenDBv2

Tan0012257 (gene) of Snake gourd v1 genome

Gene ID: Tan0012257
Organism: Trichosanthes anguina (Snake gourd v1)
Description: transcription initiation factor TFIID subunit 7-like
Genome location: LG01:102147099..102148501
RNA-Seq Expression: Tan0012257
Synteny: Tan0012257
Gene Ontology terms: NA
InterPro domains: NA
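
The genome location above follows a `sequence:start..end` pattern. The following is a minimal sketch for splitting it into its parts. It assumes the coordinates are 1-based and inclusive, which the page does not state, and `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split 'SEQ:START..END' into (sequence name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seq_name, start, end = parse_location("LG01:102147099..102148501")
# Span length under the assumed 1-based, inclusive convention.
span = end - start + 1
print(seq_name, start, end, span)
```

Under that assumption the gene spans 1403 bases on LG01.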


Homology
GenBank top hits (e value, % identity, alignment)
XP_008466301.1 PREDICTED: uncharacterized protein LOC103503753 [Cucumis melo] (e value: 3.8e-85; % identity: 80.69)
Query:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP
        MEVL GP TFSIEV PP+AF+GVS P ENP+  AEAQN   SGF  SGSGSSIGENSSESSSSIGVPD DSDDDGGGDEVQSK KEGGLCGL+SLE ALP
Subjt:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP

Query:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSR-KASFYNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRESDEEDEENE
        IKRGLSSHF+GKSKSFANLSEVIQVKD+EKPENPFNKRRRILMASKWSR KASFYNWPNPKSMPLLALNE++E+  KQ  +G DS +   ESDEEDE   
Subjt:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSR-KASFYNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRESDEEDEENE

Query:  RRRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
         RR+ LGQRFHD KLVNGFK KSCFDLQE EQQ
Subjt:  RRRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

XP_022137206.1 transcription initiation factor TFIID subunit 7-like [Momordica charantia] (e value: 1.3e-85; % identity: 78.01)
Query:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPA----EAQNR--PGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDS
        MEVLF   TFSIEV P TAF GVS  PENPA  A    + Q+R  PG     SGSGSS+GE SSESSSSIGVPDDDS+DDGGG+EVQSKPKEGGLCGLDS
Subjt:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPA----EAQNR--PGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDS

Query:  LEDALPIKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDE---EEEVKQPAEGSDSEDRDRES
        LEDALPIKRGLSSHF+GKSKSFANLSEVIQVKD+EKPENPFNKRRRILMASKWSRK SFYNWPNPKSMPLLALNEDE   EEE ++ A+ SDSED     
Subjt:  LEDALPIKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDE---EEEVKQPAEGSDSEDRDRES

Query:  DEEDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
        DEEDEENERRR+TL  +FHDRKLVNG KSKSCFDLQEY+ +
Subjt:  DEEDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

XP_022937406.1 uncharacterized protein LOC111443706 [Cucurbita moschata] (e value: 9.6e-89; % identity: 82.25)
Query:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP
        MEV+FGP TF+IEV   TAF GVS  PENPA   E QNR  + FR SGSGSSIGENSS SSSSIGVPD DSDDDGG  EVQSK KEGGLC LDSLEDALP
Subjt:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP

Query:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEV-KQPAEGSDSEDRDR---ESDEEDE
        IKRGLSSHF+GKSKSFANLSEVIQVKD+EKPENPFNKR+RILMASKWSRKASFYNWPNPKSMPLLAL+EDEEE+  K+ A GSDSEDRDR   E DEEDE
Subjt:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEV-KQPAEGSDSEDRDR---ESDEEDE

Query:  ENERRRQTLGQRFHDRKLVNGFKSKSCFDLQ
        ENERR +TLG RFHDRKLVNGFKSKSCFDLQ
Subjt:  ENERRRQTLGQRFHDRKLVNGFKSKSCFDLQ

XP_022975769.1 uncharacterized protein LOC111476326 [Cucurbita maxima] (e value: 1.6e-88; % identity: 80.77)
Query:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP
        MEV+FGP TF++EV   TAF GVS  PENPA   E QNR  +GFR SGS SSIGENSS SSSSIGVPD DSDDDGG  EVQSK KEGGLC LDSLEDALP
Subjt:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP

Query:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEV-KQPAEGSDSEDRDR------ESDE
        IKRGLSSHF+GKSKSFANLSEVIQVKD+EKPENPFNKR+RILMASKWSRKASFYNWPNPKSMPLLAL+EDEEE+  K+ A GSDSEDRDR      E DE
Subjt:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEV-KQPAEGSDSEDRDR------ESDE

Query:  EDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQ
        EDEENERRR+TLG RFHD+KLVNGFKSKSCFDLQ
Subjt:  EDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQ

XP_038898503.1 uncharacterized protein LOC120086121 [Benincasa hispida] (e value: 4.3e-97; % identity: 87.07)
Query:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP
        MEVLFGP TFSIEV PPTAFAGVS P ENP+ P E QNR  SGFRESGSGSSIGENSS SSSSIG+PD DSDDDG  DEVQSKP EGGLCGL+SLE+ALP
Subjt:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP

Query:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRESDEEDEENER
        IKRGLSSHF+GKSKSFANLSEVIQVKD+EKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEE+ K+ +E SDSED D ESDEEDEE ER
Subjt:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRESDEEDEENER

Query:  RRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
        RR+TLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
Subjt:  RRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CQX1 uncharacterized protein LOC103503753 (e value: 1.8e-85; % identity: 80.69)
Query:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP
        MEVL GP TFSIEV PP+AF+GVS P ENP+  AEAQN   SGF  SGSGSSIGENSSESSSSIGVPD DSDDDGGGDEVQSK KEGGLCGL+SLE ALP
Subjt:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP

Query:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSR-KASFYNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRESDEEDEENE
        IKRGLSSHF+GKSKSFANLSEVIQVKD+EKPENPFNKRRRILMASKWSR KASFYNWPNPKSMPLLALNE++E+  KQ  +G DS +   ESDEEDE   
Subjt:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSR-KASFYNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRESDEEDEENE

Query:  RRRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
         RR+ LGQRFHD KLVNGFK KSCFDLQE EQQ
Subjt:  RRRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

A0A6J1C5V2 transcription initiation factor TFIID subunit 7-like (e value: 6.3e-86; % identity: 78.01)
Query:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPA----EAQNR--PGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDS
        MEVLF   TFSIEV P TAF GVS  PENPA  A    + Q+R  PG     SGSGSS+GE SSESSSSIGVPDDDS+DDGGG+EVQSKPKEGGLCGLDS
Subjt:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPA----EAQNR--PGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDS

Query:  LEDALPIKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDE---EEEVKQPAEGSDSEDRDRES
        LEDALPIKRGLSSHF+GKSKSFANLSEVIQVKD+EKPENPFNKRRRILMASKWSRK SFYNWPNPKSMPLLALNEDE   EEE ++ A+ SDSED     
Subjt:  LEDALPIKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDE---EEEVKQPAEGSDSEDRDRES

Query:  DEEDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
        DEEDEENERRR+TL  +FHDRKLVNG KSKSCFDLQEY+ +
Subjt:  DEEDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

A0A6J1FA94 uncharacterized protein LOC111443706 (e value: 4.7e-89; % identity: 82.25)
Query:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP
        MEV+FGP TF+IEV   TAF GVS  PENPA   E QNR  + FR SGSGSSIGENSS SSSSIGVPD DSDDDGG  EVQSK KEGGLC LDSLEDALP
Subjt:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP

Query:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEV-KQPAEGSDSEDRDR---ESDEEDE
        IKRGLSSHF+GKSKSFANLSEVIQVKD+EKPENPFNKR+RILMASKWSRKASFYNWPNPKSMPLLAL+EDEEE+  K+ A GSDSEDRDR   E DEEDE
Subjt:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEV-KQPAEGSDSEDRDR---ESDEEDE

Query:  ENERRRQTLGQRFHDRKLVNGFKSKSCFDLQ
        ENERR +TLG RFHDRKLVNGFKSKSCFDLQ
Subjt:  ENERRRQTLGQRFHDRKLVNGFKSKSCFDLQ

A0A6J1FKY0 uncharacterized protein LOC111446357 (e value: 4.5e-84; % identity: 79.74)
Query:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP
        MEVL G  TFSIEV           P EN +  AE  NR GSGFRESGSGSSIGENSSE SSSIGVPDDDSDD    DEVQSKPKEGGL GLDSLEDAL 
Subjt:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP

Query:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRESDEEDEENER
        IK GLS HF+GKSKSFANLSEVIQVKD+EKP+NPFNKRRRILMASKWSRKASFY+W NPKSMPLLALNEDEE++  QPAEGSDSE+R+RESDE+D++NER
Subjt:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRESDEEDEENER

Query:  RRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
        RRQTLGQR+HDRKLVNGFKS SCFDLQEYEQQ
Subjt:  RRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

A0A6J1ILJ2 uncharacterized protein LOC111476326 (e value: 7.9e-89; % identity: 80.77)
Query:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP
        MEV+FGP TF++EV   TAF GVS  PENPA   E QNR  +GFR SGS SSIGENSS SSSSIGVPD DSDDDGG  EVQSK KEGGLC LDSLEDALP
Subjt:  MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALP

Query:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEV-KQPAEGSDSEDRDR------ESDE
        IKRGLSSHF+GKSKSFANLSEVIQVKD+EKPENPFNKR+RILMASKWSRKASFYNWPNPKSMPLLAL+EDEEE+  K+ A GSDSEDRDR      E DE
Subjt:  IKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEV-KQPAEGSDSEDRDR------ESDE

Query:  EDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQ
        EDEENERRR+TLG RFHD+KLVNGFKSKSCFDLQ
Subjt:  EDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G24550.1 unknown protein (e value: 1.3e-22; % identity: 40.5)
Query:  GSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGL-DSLEDALPIKRGLSSHFTGKSKSFANLSEVI-QVKDIEKPENPFNKR
        G G R S + +   E SS+SSSSIG   ++ +++   D V    + G L     SLED+LPIKRGLS+H+ GKSKSF NL E   + KD+EK ENPFNKR
Subjt:  GSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGL-DSLEDALPIKRGLSSHFTGKSKSFANLSEVI-QVKDIEKPENPFNKR

Query:  RRILMASKWSRK------ASFYNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRESDEEDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ
        RR+++A+K  R+      ++FY+W NP SMPLLAL E  EE+        D ED D + D+        R+ +    + ++L+   +++SCF L   +++
Subjt:  RRILMASKWSRK------ASFYNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRESDEEDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ

AT3G43850.1 unknown protein (e value: 1.2e-12; % identity: 48.39)
Query:  NSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALPIKRGLSSHFTGKSKSFANLSEV--IQVKDIEKPENPFNKRRRILMASK
        +SS SS SIG   ++SDDD GG+        G L  ++SLE+ALPIKR +S  + GKSKSF +LSE   + VKD+ KPEN +++RRR L++ +
Subjt:  NSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALPIKRGLSSHFTGKSKSFANLSEV--IQVKDIEKPENPFNKRRRILMASK

AT4G31510.1 unknown protein (e value: 7.8e-20; % identity: 40.11)
Query:  ESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALPIKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKA-----SF
        ESSSS+G   ++ +D+   D V S           SLED+LPIKRGLS+H+ GKSKSF NL E     D+ K E+P NKRRR+L+A+K  R++     S 
Subjt:  ESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALPIKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKA-----SF

Query:  YNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRESDEEDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQEYE
        Y   NP SMPLLAL E + E+ K      + +D D +S  +DE ++ + + +    H   +V   ++KSCF L  ++
Subjt:  YNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRESDEEDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQEYE

AT5G21940.1 unknown protein (e value: 1.7e-11; % identity: 34.39)
Query:  QPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALPIKRGLSSHFTGKSK
        + P      SP P     P+++ + P      S + SSIG NS +   S     +D  DD G +EV+S P +G L  ++SLE  LP+++G+S +++GKSK
Subjt:  QPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALPIKRGLSSHFTGKSK

Query:  SFAN--------LSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMP
        SF N        L+    +KD+ KPENP+++RRR L+  +         W N K+ P
Subjt:  SFAN--------LSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMP

AT5G24890.1 unknown protein (e value: 1.6e-25; % identity: 40.93)
Query:  MEVLFGPSTFSIEV-QPPTAFAGVSPPPENPAGPAEAQNRPG---SGFRESGSGSSIGENSSESSSSIGVPDDDSDD----DGGGDEVQSKPKE-GGLCG
        ME++  P TFSIEV Q  T     +    + +   E  N  G   SG     SG +   + S  SSSIG P D  +D    +   D+V SK     GL  
Subjt:  MEVLFGPSTFSIEV-QPPTAFAGVSPPPENPAGPAEAQNRPG---SGFRESGSGSSIGENSSESSSSIGVPDDDSDD----DGGGDEVQSKPKE-GGLCG

Query:  LDSLEDALPIKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRES
        + SLED+LP KRGLS+H+ GKSKSF NL E+  VK++ K ENP NKRRR+ + +K +RK SFY+W NPKSMPLL +NEDE+++ +   E       D   
Subjt:  LDSLEDALPIKRGLSSHFTGKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRES

Query:  DEEDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQE
           DEE  ++       F +R     +KS+SCF L +
Subjt:  DEEDEENERRRQTLGQRFHDRKLVNGFKSKSCFDLQE
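
In BLAST-style pairwise output such as the alignments above, the middle line of each block carries a letter where the two residues are identical, a `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below counts identities for a single segment. `segment_identity` is a hypothetical helper, and the segment length is taken from the Query line because trailing spaces on the midline may have been trimmed.

```python
def segment_identity(query, midline):
    """Return (identical positions, aligned length) for one alignment segment."""
    # Letters on the midline mark identical residues; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(query)

# Final segment of the Cucumis melo (XP_008466301.1) alignment.
query = "RRRQTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ"
midline = " RR+ LGQRFHD KLVNGFK KSCFDLQE EQQ"
identical, length = segment_identity(query, midline)
print(identical, length, round(100 * identical / length, 2))
```

The % identity reported for each hit covers its full alignment, so a single segment will generally give a different figure.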


Sequences
CDS sequence
ATGGAGGTTTTGTTCGGTCCTTCGACGTTCAGCATCGAAGTTCAGCCGCCCACAGCGTTCGCCGGCGTCTCTCCCCCACCGGAGAATCCCGCCGGCCCCGCGGAGGCTCA
GAATCGTCCCGGGTCGGGTTTTCGCGAGTCCGGATCTGGTAGCTCGATTGGGGAAAATTCGTCGGAGAGTTCGTCATCGATTGGAGTACCCGACGACGATTCCGACGACG
ACGGCGGCGGCGATGAGGTGCAGAGCAAGCCAAAGGAAGGAGGATTATGCGGTTTGGATTCACTGGAAGACGCTCTTCCAATCAAAAGGGGATTATCGAGCCATTTCACG
GGGAAATCGAAGTCGTTTGCGAATTTATCAGAGGTTATTCAAGTAAAAGATATAGAGAAGCCGGAGAATCCTTTCAACAAGAGGAGAAGAATTTTAATGGCGTCGAAATG
GTCGAGAAAAGCCTCATTCTACAACTGGCCGAACCCTAAATCGATGCCTTTGCTGGCCCTGAACGAAGACGAAGAAGAAGAAGTAAAACAACCGGCCGAGGGTTCCGATT
CAGAGGACAGAGATCGAGAGAGCGATGAAGAAGATGAAGAGAACGAACGAAGAAGACAAACCCTGGGGCAAAGGTTCCATGATCGGAAGCTCGTTAATGGCTTCAAATCC
AAGAGCTGTTTTGATCTGCAAGAATATGAACAGCAATAA
mRNA sequence
AAAAAAATGACAAAAGAAAAAAACAGAGTTCCAAACCAAATTCTCCCTCAAAATTAAAATATCCCTTTCTCAGTAAAGTTCACAGAGACCCCCCTTATCCCCTTCTCTCT
TTCTCTCTCTATCTTCTCTCTCTTTCTACCAGAAAAGTCAGTACCAATCAGCCAACCGCCGCCGGTTGATGACAACGAACCCTAATCCTTGGTCGTCGCCGGCAAGAAGA
ACAGAGCGGAGGAAGGTGGGAAGCCCGCCCCGTCACCACTGCGTCGGAATCGATCTCCAGGCGGTGACTGATGGAGGTTTTGTTCGGTCCTTCGACGTTCAGCATCGAAG
TTCAGCCGCCCACAGCGTTCGCCGGCGTCTCTCCCCCACCGGAGAATCCCGCCGGCCCCGCGGAGGCTCAGAATCGTCCCGGGTCGGGTTTTCGCGAGTCCGGATCTGGT
AGCTCGATTGGGGAAAATTCGTCGGAGAGTTCGTCATCGATTGGAGTACCCGACGACGATTCCGACGACGACGGCGGCGGCGATGAGGTGCAGAGCAAGCCAAAGGAAGG
AGGATTATGCGGTTTGGATTCACTGGAAGACGCTCTTCCAATCAAAAGGGGATTATCGAGCCATTTCACGGGGAAATCGAAGTCGTTTGCGAATTTATCAGAGGTTATTC
AAGTAAAAGATATAGAGAAGCCGGAGAATCCTTTCAACAAGAGGAGAAGAATTTTAATGGCGTCGAAATGGTCGAGAAAAGCCTCATTCTACAACTGGCCGAACCCTAAA
TCGATGCCTTTGCTGGCCCTGAACGAAGACGAAGAAGAAGAAGTAAAACAACCGGCCGAGGGTTCCGATTCAGAGGACAGAGATCGAGAGAGCGATGAAGAAGATGAAGA
GAACGAACGAAGAAGACAAACCCTGGGGCAAAGGTTCCATGATCGGAAGCTCGTTAATGGCTTCAAATCCAAGAGCTGTTTTGATCTGCAAGAATATGAACAGCAATAAC
TCGTAATGAAATGAAATCGTTTGGCTTTGTTCTCCACTTCTTTAAAATTTAATGCTCCGAACAAAAAGAAAAGAAAAAAAAAACTATATTTTTCCTTTTTCCTTTTTTTT
GTTTTTTTGTTTTTATTTTTATTTGTATGTTTCATGAGTCCATGTAGATAAATTAATAATTTAGATTTATCTAGTTATCAATTGCGGTATATAATTATATATTTTTATGT
ATATGTATATGTGGTTGTGCC
Protein sequence
MEVLFGPSTFSIEVQPPTAFAGVSPPPENPAGPAEAQNRPGSGFRESGSGSSIGENSSESSSSIGVPDDDSDDDGGGDEVQSKPKEGGLCGLDSLEDALPIKRGLSSHFT
GKSKSFANLSEVIQVKDIEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEEVKQPAEGSDSEDRDRESDEEDEENERRRQTLGQRFHDRKLVNGFKS
KSCFDLQEYEQQ
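
As a quick consistency check, the CDS listed above can be translated with the standard genetic code and compared with the protein sequence. This is a minimal, dependency-free sketch. It assumes the standard nuclear code (NCBI translation table 1) applies, and `translate` is a hypothetical helper rather than the method used to produce this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 51 nt and last 21 nt of the CDS sequence shown in this record.
print(translate("ATGGAGGTTTTGTTCGGTCCTTCGACGTTCAGCATCGAAGTTCAGCCGCCC"))
print(translate("CAAGAATATGAACAGCAATAA"))
```

The two fragments translate to the start (MEVLFGPSTFSIEVQPP) and end (QEYEQQ) of the protein sequence listed in this record. Passing the full CDS to `translate` allows the same comparison over the whole protein.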