CuGenDBv2

Carg21174 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg21174
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: transcription initiation factor TFIID subunit 7-like
Genome location: Carg_Chr01:11052024..11053098
RNA-Seq Expression: Carg21174
Synteny: Carg21174
Gene Ontology terms: NA
InterPro domains: NA
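
The genome location field follows a `sequence:start..end` pattern. A minimal Python sketch for splitting it into its parts is shown here; the function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("Carg_Chr01:11052024..11053098")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

The span covers the whole gene region on the chromosome, so it can be longer than the CDS if the gene contains untranslated regions or introns.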


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6608207.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]  e value: 4.5e-93  % identity: 99.47
Query:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSF
        MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSS+SSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSF
Subjt:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSF

Query:  ANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQR
        ANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQR
Subjt:  ANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQR

XP_022940912.1 uncharacterized protein LOC111446357 [Cucurbita moschata]  e value: 1.7e-108  % identity: 100
Query:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSF
        MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSF
Subjt:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSF

Query:  ANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGF
        ANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGF
Subjt:  ANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGF

Query:  KSMSCFDLQEYEQQ
        KSMSCFDLQEYEQQ
Subjt:  KSMSCFDLQEYEQQ

XP_022981755.1 uncharacterized protein LOC111480811 [Cucurbita maxima]  e value: 1.5e-104  % identity: 97.67
Query:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDS-DDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKS
        MEVLLGSPTFSIEVP ENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDS DDDEVQSK KEGGLLGLDSLEDALVIKGGLSRHFSGKSKS
Subjt:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDS-DDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKS

Query:  FANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNG
        FANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEE+QQPA+GSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNG
Subjt:  FANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNG

Query:  FKSMSCFDLQEYEQQ
        FKSMSCFDLQEYEQQ
Subjt:  FKSMSCFDLQEYEQQ

XP_023524376.1 uncharacterized protein LOC111788285 [Cucurbita pepo subsp. pepo]  e value: 4.2e-107  % identity: 99.07
Query:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSF
        MEVLLGSPTFSIEVP ENSSAVAETHNRDG GFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSF
Subjt:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSF

Query:  ANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGF
        ANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGF
Subjt:  ANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGF

Query:  KSMSCFDLQEYEQQ
        KSMSCFDLQEYEQQ
Subjt:  KSMSCFDLQEYEQQ

XP_038898503.1 uncharacterized protein LOC120086121 [Benincasa hispida]  e value: 2.5e-75  % identity: 75.43
Query:  MEVLLGSPTFSIEV-----------PLENSSAVAETHNRDGSGFRESGSGSSIGENSS-ESSSIGVPDDDSDD----DEVQSKPKEGGLLGLDSLEDALV
        MEVL G PTFSIEV           PLEN S   ET NR  SGFRESGSGSSIGENSS  SSSIG+PD DSDD    DEVQSKP EGGL GL+SLE+AL 
Subjt:  MEVLLGSPTFSIEV-----------PLENSSAVAETHNRDGSGFRESGSGSSIGENSS-ESSSIGVPDDDSDD----DEVQSKPKEGGLLGLDSLEDALV

Query:  IKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQ--QQPAEGSDSEERNRESDEDDDQNER
        IK GLS HFSGKSKSFANLSEVIQVKDLEKP+NPFNKRRRILMASKWSRKASFY+W NPKSMPLLALNEDEE+  ++ +E SDSE+ + ESDE+D++ ER
Subjt:  IKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQ--QQPAEGSDSEERNRESDEDDDQNER

Query:  RRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ
        RR+TLGQR+HDRKLVNGFKS SCFDLQEYEQQ
Subjt:  RRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3CQX1 uncharacterized protein LOC103503753  e value: 3.2e-68  % identity: 72.73
Query:  MEVLLGSPTFSIEV-----------PLENSSAVAETHNRDGSGFRESGSGSSIGENSSE-SSSIGVPDDDSDD----DEVQSKPKEGGLLGLDSLEDALV
        MEVLLG PTFSIEV           P EN SA AE  N   SGF  SGSGSSIGENSSE SSSIGVPD DSDD    DEVQSK KEGGL GL+SLE AL 
Subjt:  MEVLLGSPTFSIEV-----------PLENSSAVAETHNRDGSGFRESGSGSSIGENSSE-SSSIGVPDDDSDD----DEVQSKPKEGGLLGLDSLEDALV

Query:  IKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPFNKRRRILMASKWSR-KASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERR
        IK GLS HFSGKSKSFANLSEVIQVKDLEKP+NPFNKRRRILMASKWSR KASFY+W NPKSMPLLALNE++EQ+Q  +G DS E   ESDE+D+    R
Subjt:  IKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPFNKRRRILMASKWSR-KASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERR

Query:  RQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ
        R+ LGQR+HD KLVNGFK  SCFDLQE EQQ
Subjt:  RQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ

A0A6J1FA94 uncharacterized protein LOC111443706  e value: 9.0e-71  % identity: 71.43
Query:  MEVLLGSPTFSIEVPL-----------ENSSAVAETHNRDGSGFRESGSGSSIGENSS-ESSSIGVPDDDSDDD----EVQSKPKEGGLLGLDSLEDALV
        MEV+ G PTF+IEV             EN +AV ET NR  + FR SGSGSSIGENSS  SSSIGVPD DSDDD    EVQSK KEGGL  LDSLEDAL 
Subjt:  MEVLLGSPTFSIEVPL-----------ENSSAVAETHNRDGSGFRESGSGSSIGENSS-ESSSIGVPDDDSDDD----EVQSKPKEGGLLGLDSLEDALV

Query:  IKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQ---QQPAEGSDSEERNR---ESDEDDD
        IK GLS HFSGKSKSFANLSEVIQVKDLEKP+NPFNKR+RILMASKWSRKASFY+W NPKSMPLLAL+EDEE+   ++ A GSDSE+R+R   E DE+D+
Subjt:  IKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQ---QQPAEGSDSEERNR---ESDEDDD

Query:  QNERRRQTLGQRYHDRKLVNGFKSMSCFDLQ
        +NERR +TLG R+HDRKLVNGFKS SCFDLQ
Subjt:  QNERRRQTLGQRYHDRKLVNGFKSMSCFDLQ

A0A6J1FKY0 uncharacterized protein LOC111446357  e value: 8.3e-109  % identity: 100
Query:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSF
        MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSF
Subjt:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSF

Query:  ANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGF
        ANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGF
Subjt:  ANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGF

Query:  KSMSCFDLQEYEQQ
        KSMSCFDLQEYEQQ
Subjt:  KSMSCFDLQEYEQQ

A0A6J1ILJ2 uncharacterized protein LOC111476326  e value: 1.5e-70  % identity: 70.09
Query:  MEVLLGSPTFSIEVPL-----------ENSSAVAETHNRDGSGFRESGSGSSIGENSS-ESSSIGVPDDDSDDD----EVQSKPKEGGLLGLDSLEDALV
        MEV+ G PTF++EV             EN +AV ET NR  +GFR SGS SSIGENSS  SSSIGVPD DSDDD    EVQSK KEGGL  LDSLEDAL 
Subjt:  MEVLLGSPTFSIEVPL-----------ENSSAVAETHNRDGSGFRESGSGSSIGENSS-ESSSIGVPDDDSDDD----EVQSKPKEGGLLGLDSLEDALV

Query:  IKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQ---QQPAEGSDSEERNR------ESDE
        IK GLS HFSGKSKSFANLSEVIQVKDLEKP+NPFNKR+RILMASKWSRKASFY+W NPKSMPLLAL+EDEE+   ++ A GSDSE+R+R      E DE
Subjt:  IKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQ---QQPAEGSDSEERNR------ESDE

Query:  DDDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQ
        +D++NERRR+TLG R+HD+KLVNGFKS SCFDLQ
Subjt:  DDDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQ

A0A6J1J0I5 uncharacterized protein LOC111480811  e value: 7.3e-105  % identity: 97.67
Query:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDS-DDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKS
        MEVLLGSPTFSIEVP ENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDS DDDEVQSK KEGGLLGLDSLEDALVIKGGLSRHFSGKSKS
Subjt:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDS-DDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKS

Query:  FANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNG
        FANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEE+QQPA+GSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNG
Subjt:  FANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNG

Query:  FKSMSCFDLQEYEQQ
        FKSMSCFDLQEYEQQ
Subjt:  FKSMSCFDLQEYEQQ

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G24550.1 unknown protein  e value: 1.6e-19  % identity: 38.78
Query:  GSGFRESGSGSSIGENSSE-SSSIGVPDDDSDDDEVQSKPK-EGGLLG--LDSLEDALVIKGGLSRHFSGKSKSFANLSEVI-QVKDLEKPDNPFNKRRR
        G G R S + +   E SS+ SSSIG   ++ +++E       + G L     SLED+L IK GLS H+ GKSKSF NL E   + KDLEK +NPFNKRRR
Subjt:  GSGFRESGSGSSIGENSSE-SSSIGVPDDDSDDDEVQSKPK-EGGLLG--LDSLEDALVIKGGLSRHFSGKSKSFANLSEVI-QVKDLEKPDNPFNKRRR

Query:  ILMASKWSRK------ASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ
        +++A+K  R+      ++FYSW+NP SMPLLAL E  E+       D    N + ++DD   +  R+ +    + ++L+   ++ SCF L   +++
Subjt:  ILMASKWSRK------ASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ

AT3G43850.1 unknown protein  e value: 2.0e-09  % identity: 43.3
Query:  SGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEV--IQVKDLEKPDNPFNKRRRILMASK
        S S  SIGENS         DD+  ++E++S    G L  ++SLE+AL IK  +S+ + GKSKSF +LSE   + VKDL KP+N +++RRR L++ +
Subjt:  SGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEV--IQVKDLEKPDNPFNKRRRILMASK

AT4G31510.1 unknown protein  e value: 2.8e-16  % identity: 34.08
Query:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESG------SGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFS
        MEVL+GS TF     +               G R  G      S SS+GE S         +++ +DD V S           SLED+L IK GLS H+ 
Subjt:  MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESG------SGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFS

Query:  GKSKSFANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKA-----SFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQ
        GKSKSF NL E     DL K ++P NKRRR+L+A+K  R++     S Y+  NP SMPLLAL E + +       D ++   +S  DD+ ++ + + +  
Subjt:  GKSKSFANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKA-----SFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQ

Query:  RYHDRKLVNGFKSMSCFDLQEYE
          H   +V   ++ SCF L  ++
Subjt:  RYHDRKLVNGFKSMSCFDLQEYE

AT5G21940.1 unknown protein  e value: 1.8e-10  % identity: 38.46
Query:  DGSGFRESGSGSSIGENSSE---SSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFAN--------LSEVIQVKDLEKPDN
        D S    S + SSIG NS +   SS  G   DD+ ++EV+S P +G L  ++SLE  L ++ G+S+++SGKSKSF N        L+    +KDL KP+N
Subjt:  DGSGFRESGSGSSIGENSSE---SSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFAN--------LSEVIQVKDLEKPDN

Query:  PFNKRRRILMASKWSRKASFYSWRNPKSMP
        P+++RRR L+  +         W N K+ P
Subjt:  PFNKRRRILMASKWSRKASFYSWRNPKSMP

AT5G24890.1 unknown protein  e value: 3.9e-26  % identity: 41.28
Query:  LLGSPTFSIEV------------PLENSSAVAETHNRDG---SGFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQS------KPKEGGLLGL---DSL
        L+  PTFSIEV               +SS+  ET N +G   SG     SG +  + SS+SSSIG P D  +D+E           KE GL GL    SL
Subjt:  LLGSPTFSIEV------------PLENSSAVAETHNRDG---SGFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQS------KPKEGGLLGL---DSL

Query:  EDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDED---
        ED+L  K GLS H+ GKSKSF NL E+  VK++ K +NP NKRRR+ + +K +RK SFYSW+NPKSMPLL +NEDE+     E  D E+     DE+   
Subjt:  EDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDED---

Query:  -DDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQE
         D++  ++       + +R     +KS SCF L +
Subjt:  -DDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQE
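
The % identity values in the hit tables can be related to the unlabeled middle line of each alignment. The Python sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a similar one, a space a mismatch or gap) and assumes identity is taken over all aligned columns, gaps included. The function name `percent_identity` and the short example rows are made up for illustration and are not taken from this record.

```python
def percent_identity(query_rows, midline_rows):
    """Percent identity over all aligned columns, gaps included."""
    aligned = 0
    identical = 0
    for query, midline in zip(query_rows, midline_rows):
        # Scraped midlines may lose trailing spaces, so pad to the query width.
        midline = midline.ljust(len(query))[:len(query)]
        aligned += len(query)
        # Only letters on the midline count as identical positions;
        # '+' (similar) and ' ' (mismatch or gap) do not.
        identical += sum(ch.isalpha() for ch in midline)
    return round(100.0 * identical / aligned, 2)

# Toy two-row alignment (hypothetical data, one row per alignment block).
query = ["MEVLLGSPTF", "SIEV-PLE"]
midline = ["MEVL+G PTF", "SIEV P E"]
print(percent_identity(query, midline))
```

Passing the rows block by block mirrors how the alignments are wrapped on this page, so each Query line is paired with the midline directly beneath it.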


Sequences
CDS sequence
ATGGAGGTTTTGCTCGGTTCTCCGACGTTCAGCATCGAAGTCCCGTTGGAGAACTCCTCCGCCGTCGCGGAGACTCATAATCGGGACGGGTCGGGTTTTCGCGAGTCCGG
ATCTGGTAGTTCGATTGGGGAAAACTCGTCGGAGTCGTCGTCGATTGGAGTTCCCGACGATGATTCCGACGACGATGAGGTGCAGAGCAAGCCAAAGGAAGGAGGATTGC
TCGGATTGGATTCTCTCGAAGACGCTCTTGTAATCAAAGGAGGCTTATCGAGGCATTTCTCGGGGAAATCGAAGTCGTTTGCGAATTTATCAGAGGTGATTCAAGTGAAA
GATTTAGAGAAGCCGGATAATCCTTTCAACAAGAGGAGAAGAATTTTAATGGCGTCAAAATGGTCGAGAAAAGCCTCATTCTACAGCTGGCGAAACCCTAAATCGATGCC
TCTGCTTGCCTTAAACGAAGACGAAGAACAACAACAACCGGCGGAGGGTTCCGATTCAGAGGAAAGAAATCGAGAGAGCGATGAAGATGATGATCAAAACGAACGAAGAA
GACAAACCCTAGGGCAAAGGTACCACGATCGGAAGCTCGTTAATGGCTTCAAATCAATGAGCTGTTTTGATCTGCAAGAATATGAACAGCAATAA
mRNA sequence
CAAAAACAGTTCACAGAGTGATAGTCCCCCCCTTATCCCCTTCTCTCTCTCTGTCTTCCTATCTTCTCTCTCCCTATCAGAAAAGTCGGTACCAATCTGCCGCCGGTTGA
TGACAACGAGCCCTAACCCTTGGGCGTTGCCGGCGAGGAGAAGCAGAGCGGAGGAAGGTTCAGAAATCTCTACTGCGACGGACTTGATCTCCCGCCTCTGAGTGATGGAG
GTTTTGCTCGGTTCTCCGACGTTCAGCATCGAAGTCCCGTTGGAGAACTCCTCCGCCGTCGCGGAGACTCATAATCGGGACGGGTCGGGTTTTCGCGAGTCCGGATCTGG
TAGTTCGATTGGGGAAAACTCGTCGGAGTCGTCGTCGATTGGAGTTCCCGACGATGATTCCGACGACGATGAGGTGCAGAGCAAGCCAAAGGAAGGAGGATTGCTCGGAT
TGGATTCTCTCGAAGACGCTCTTGTAATCAAAGGAGGCTTATCGAGGCATTTCTCGGGGAAATCGAAGTCGTTTGCGAATTTATCAGAGGTGATTCAAGTGAAAGATTTA
GAGAAGCCGGATAATCCTTTCAACAAGAGGAGAAGAATTTTAATGGCGTCAAAATGGTCGAGAAAAGCCTCATTCTACAGCTGGCGAAACCCTAAATCGATGCCTCTGCT
TGCCTTAAACGAAGACGAAGAACAACAACAACCGGCGGAGGGTTCCGATTCAGAGGAAAGAAATCGAGAGAGCGATGAAGATGATGATCAAAACGAACGAAGAAGACAAA
CCCTAGGGCAAAGGTACCACGATCGGAAGCTCGTTAATGGCTTCAAATCAATGAGCTGTTTTGATCTGCAAGAATATGAACAGCAATAACGCGTAGTGTAATGAAATCGT
TTGGCCTTTCTTCACTTATTGCTCTG
Protein sequence
MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSDDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVK
DLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ
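
The CDS and protein sequences above are related by translation. A minimal Python sketch of that step follows, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and the example uses only the first six codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First six codons of the CDS sequence shown above.
print(translate("ATGGAGGTTTTGCTCGGT"))
```

Because whitespace is removed before translation, the wrapped CDS lines can be pasted in as a single multi-line string and compared against the protein sequence.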