CuGenDBv2

Spg010680 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg010680
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Enzymatic polyprotein
Genome location: scaffold7:35614396..35619999
RNA-Seq Expression: Spg010680
Synteny: Spg010680
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR036875 - Zinc finger, CCHC-type superfamily
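
The CCHC-type zinc finger annotation (IPR001878) can be illustrated directly on the protein. The sketch below assumes the commonly cited zinc knuckle spacing C-X2-C-X4-H-X4-C and searches a fragment copied from this record's protein sequence. The regular expression and the names CCHC_PATTERN and find_cchc are illustrative assumptions; this is not the procedure InterPro uses to assign the domain.

```python
import re

# Assumed consensus spacing for a CCHC zinc knuckle: C-X2-C-X4-H-X4-C.
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")

def find_cchc(protein):
    """Return every non-overlapping substring matching the assumed CCHC spacing."""
    return [m.group(0) for m in CCHC_PATTERN.finditer(protein)]

# Fragment taken from the protein sequence of Spg010680 in this record.
fragment = "KGQSSKGNVQCYKCRAHGHFANKCPMKKKINELEIDD"

print(find_cchc(fragment))
```

A single match in this fragment is consistent with the zinc ion binding GO term, although a pattern match alone is only a hint and not evidence of a functional domain.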


Homology
GenBank top hits (e value, % identity, alignment)
KAA0059217.1 Enzymatic polyprotein [Cucumis melo var. makuwa] | e value: 5.0e-36 | % identity: 28.45
Query:  FRIYYKLMNTNISPRALRSSPKGSTVLLEANLDRSAVTVPKSLSWEQITRNPTWKLTEAFTPPKKNSNLAQIVEHTDGTVEVKFSEEPETSKVKEFMSSR
        +RIY+KLM+TN+SP+AL  SPKG T+L+E N+++S++T+P++L W+++T+NP WKL     P  ++S  A I E  DG VEV+F+      +V E MSSR
Subjt:  FRIYYKLMNTNISPRALRSSPKGSTVLLEANLDRSAVTVPKSLSWEQITRNPTWKLTEAFTPPKKNSNLAQIVEHTDGTVEVKFSEEPETSKVKEFMSSR

Query:  PSTSGTSSEIESKFYFDRSNSLRVKSVNIEQNVANVRYEEQ--PQSPTQTDMDNRS-VFASQINVLIKEFSIDKEAL------------------KRDFL
        PSTS   SE   +    RS S+R  SV+    + +V YE++    SPTQ+DM+ RS    +QINV+    S DKE                    ++ FL
Subjt:  PSTSGTSSEIESKFYFDRSNSLRVKSVNIEQNVANVRYEEQ--PQSPTQTDMDNRS-VFASQINVLIKEFSIDKEAL------------------KRDFL

Query:  SF-------------KNSA----------------------------------KRKAFFK-----------------NFDKTQRSEIR------------
        +              KN A                                  + +A+F                  N DK Q+  +R            
Subjt:  SF-------------KNSA----------------------------------KRKAFFK-----------------NFDKTQRSEIR------------

Query:  ----SKWYSSMEEAEQNIPFFTWMNKSKDEINVLQESWKTTQRGEGLRLCNESKIQSKLNKSF-------GSR-------------------RAELGIFC
            S    ++E  E + P     N    +IN  Q  ++ +    G    + S   +++N+         GS+                   R ELG FC
Subjt:  ----SKWYSSMEEAEQNIPFFTWMNKSKDEINVLQESWKTTQRGEGLRLCNESKIQSKLNKSF-------GSR-------------------RAELGIFC

Query:  DQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFYKGQSSKGNVQCYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLN
         QYG    Q P   +K   K+ ++Y S++ +R      +++ ++    Y K    KG SSK +  C+KC   GH+AN+CP+K KIN + ID+ TKQ LL 
Subjt:  DQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFYKGQSSKGNVQCYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLN

Query:  IIQSDSE--ESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKRGKSPASS------SKSPAPMSSDQYAM-DLGFTPVNRPRTRSASLQIRESMESLT
         I+SD +       S++E  I  LQEE  S     Y + +     G  P +       S     ++ DQ  + DL     +    R+  L++++S+E   
Subjt:  IIQSDSE--ESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKRGKSPASS------SKSPAPMSSDQYAM-DLGFTPVNRPRTRSASLQIRESMESLT

Query:  P
        P
Subjt:  P

XP_023520850.1 uncharacterized protein LOC111784362 [Cucurbita pepo subsp. pepo] | e value: 9.2e-30 | % identity: 47.37
Query:  WKTTQRG--------EGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFY
        W+T   G        EGLRLCNESKIQ+KLN S  S R ELG FCDQYGC  I+ P T+R+  +K   K     SYRP   ++ K  +    +Y +R + 
Subjt:  WKTTQRG--------EGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFY

Query:  KGQSSKGNVQ--CYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKR
          ++ +G  +  C+KCR  GH+A KCP+K KINEL+ID   K QLL +  ++SE+S      EG+ILKLQEE+DS SS+ YE E+ EGKR
Subjt:  KGQSSKGNVQ--CYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKR

XP_023520850.1 uncharacterized protein LOC111784362 [Cucurbita pepo subsp. pepo] | e value: 1.4e-17 | % identity: 52.08
Query:  MDNRSVFASQINVLIKEFSIDKEALKRDFLSFKNSAKRKAFFKNFDKTQRSEIRSKWYSSMEEAEQNIPFFTWMNKSKDEINVL-QESWKTTQRGE
        MD +SV+ SQ+NV+  +F IDKE LK DF+S  NS+KR AFF+ + + +R+E+R++WYS ME  ++NIPFF W  ++  +I  L Q SWKTT+RGE
Subjt:  MDNRSVFASQINVLIKEFSIDKEALKRDFLSFKNSAKRKAFFKNFDKTQRSEIRSKWYSSMEEAEQNIPFFTWMNKSKDEINVL-QESWKTTQRGE

XP_023520850.1 uncharacterized protein LOC111784362 [Cucurbita pepo subsp. pepo] | e value: 1.2e-29 | % identity: 47.89
Query:  WKTTQRG--------EGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFY
        W+T   G        EGLRL NESKIQ+KLN S  S R ELG FCDQYGC  I+ PST+ ++ +K   K     SYRP   ++ K  +    +Y +R + 
Subjt:  WKTTQRG--------EGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFY

Query:  --KGQSSKGNVQCYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKR
          K    K    C+KCR  GH+ANKCP++ KINEL+ID   K QLL +  +DSE+S      EG+IL+LQEE+DSYSS++YE E+ EGKR
Subjt:  --KGQSSKGNVQCYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKR

XP_023522280.1 uncharacterized protein LOC111786173, partial [Cucurbita pepo subsp. pepo] | e value: 2.1e-29 | % identity: 46.32
Query:  WKTTQRG--------EGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFY
        W+T   G        EGLRLCNESKIQ+KL+    S R ELG FCDQYGC  I+ PST+R+  +K   K     SYRP   ++ K  +    +Y +R + 
Subjt:  WKTTQRG--------EGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFY

Query:  KGQSSKGNVQ--CYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKR
          ++ +G  +  C+KCR  GH+AN+CP++ KINEL+ID   K QLL +  +DSE+S      EG+IL+LQEE+DSYSS++YE  + EGKR
Subjt:  KGQSSKGNVQ--CYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKR

XP_023552915.1 uncharacterized protein LOC111810441 [Cucurbita pepo subsp. pepo] | e value: 7.5e-32 | % identity: 47.26
Query:  WKTTQRG--------EGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFY
        W+T   G        EGLRLCNESKIQ+KLN S  S R ELG FCDQYGC  I+ PST+R+   K   K     SYRP   ++ K  +    +Y +R + 
Subjt:  WKTTQRG--------EGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFY

Query:  KGQSSKGNVQ--CYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKRGKSPASSSKS
          +  KG  +  C+KCR  GH+ANKCP++ KINELEID   K QLL +  +DSE+S      +G+IL+LQEE+DSYS+++YE E+ EGKR K   +  K 
Subjt:  KGQSSKGNVQ--CYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKRGKSPASSSKS

Query:  P
        P
Subjt:  P

TrEMBL top hits (e value, % identity, alignment)
A0A5A7URX9 Enzymatic polyprotein | e value: 8.4e-29 | % identity: 43.09
Query:  FRIYYKLMNTNISPRALRSSPKGSTVLLEANLDRSAVTVPKSLSWEQITRNPTWKLTEAFTPPKKNSNLAQIVEHTDGTVEVKFSEEPETSKVKEFMSSR
        +RIY+KLM+TN+SP+AL  SPKG T+L+E N+++S++T+P++L W+++T+NP WKL    TP K++S  A I+E  DG VEV+F+      ++ E MSSR
Subjt:  FRIYYKLMNTNISPRALRSSPKGSTVLLEANLDRSAVTVPKSLSWEQITRNPTWKLTEAFTPPKKNSNLAQIVEHTDGTVEVKFSEEPETSKVKEFMSSR

Query:  PSTSGTSSEIESKFYFDRSNSLRVKSVNIEQNVANVRYEEQPQ--SPTQTDMDNRS-VFASQINVLIKEFSIDKEALKRDF
        PSTS   +E   +    RS S+R  SV+    + +V YE++ +  SPTQ++M+ RS    +QINV+    S DKE  +  +
Subjt:  PSTSGTSSEIESKFYFDRSNSLRVKSVNIEQNVANVRYEEQPQ--SPTQTDMDNRS-VFASQINVLIKEFSIDKEALKRDF

A0A5A7URX9 Enzymatic polyprotein | e value: 2.2e-13 | % identity: 30.04
Query:  IRSKWYSSMEEAEQNIPFFTWMNKSKDEINVLQESWKTTQRGEGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSR
        I  K+Y +M E          +N+  D  N+      +T +   + LC E+K  +K+ K     R ELG FC QYG       S   K   K+ +KY S+
Subjt:  IRSKWYSSMEEAEQNIPFFTWMNKSKDEINVLQESWKTTQRGEGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSR

Query:  QSYRPYGPFQKKNYKKPFNSYKKRPFYKGQSSKGNVQCYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEES--IIYSTDEGQILKLQEET
        + +R   P  +++ ++  + Y K    K  SSK N  C+KC   GH+AN+CP++ KIN L ID+ TKQ +L  I+SD + S     S++E  I  LQEE 
Subjt:  QSYRPYGPFQKKNYKKPFNSYKKRPFYKGQSSKGNVQCYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEES--IIYSTDEGQILKLQEET

Query:  DSYSSSDYEEEEYEGKRGKSPASSSKSPA------PMSSDQYAM-DLGFTPVNRPRTRSASLQIRESMESLTP
         S     Y + +     G  P +   +         ++ DQ  + DL    ++    R+  L++++S+E   P
Subjt:  DSYSSSDYEEEEYEGKRGKSPASSSKSPA------PMSSDQYAM-DLGFTPVNRPRTRSASLQIRESMESLTP

A0A5A7URX9 Enzymatic polyprotein | e value: 1.4e-28 | % identity: 45.56
Query:  FRIYYKLMNTNISPRALRSSPKGSTVLLEANLDRSAVTVPKSLSWEQITRNPTWKLTEAFTPPKKNSNLAQIVEHTDGTVEVKFSEEPETSKVKEFMSSR
        +RIY+KLM+TN+SP+AL  SPKG T+L+E N+++S++T+P++L W+++T+NP WKL    TP K++S  A I E  DG VEV+F+      ++ E MSSR
Subjt:  FRIYYKLMNTNISPRALRSSPKGSTVLLEANLDRSAVTVPKSLSWEQITRNPTWKLTEAFTPPKKNSNLAQIVEHTDGTVEVKFSEEPETSKVKEFMSSR

Query:  PSTSGTSSEIESKFYFDRSNSLRVKSVNIEQNVANVRYEEQPQS--PTQTDMDNRS-VFASQINVLIKE
        PSTS   +E   K    RS S+R  SV+    + +V YE++  S  PTQ+DM+ RS    +QINV+  E
Subjt:  PSTSGTSSEIESKFYFDRSNSLRVKSVNIEQNVANVRYEEQPQS--PTQTDMDNRS-VFASQINVLIKE

A0A5A7UX67 Enzymatic polyprotein | e value: 2.4e-36 | % identity: 28.45
Query:  FRIYYKLMNTNISPRALRSSPKGSTVLLEANLDRSAVTVPKSLSWEQITRNPTWKLTEAFTPPKKNSNLAQIVEHTDGTVEVKFSEEPETSKVKEFMSSR
        +RIY+KLM+TN+SP+AL  SPKG T+L+E N+++S++T+P++L W+++T+NP WKL     P  ++S  A I E  DG VEV+F+      +V E MSSR
Subjt:  FRIYYKLMNTNISPRALRSSPKGSTVLLEANLDRSAVTVPKSLSWEQITRNPTWKLTEAFTPPKKNSNLAQIVEHTDGTVEVKFSEEPETSKVKEFMSSR

Query:  PSTSGTSSEIESKFYFDRSNSLRVKSVNIEQNVANVRYEEQ--PQSPTQTDMDNRS-VFASQINVLIKEFSIDKEAL------------------KRDFL
        PSTS   SE   +    RS S+R  SV+    + +V YE++    SPTQ+DM+ RS    +QINV+    S DKE                    ++ FL
Subjt:  PSTSGTSSEIESKFYFDRSNSLRVKSVNIEQNVANVRYEEQ--PQSPTQTDMDNRS-VFASQINVLIKEFSIDKEAL------------------KRDFL

Query:  SF-------------KNSA----------------------------------KRKAFFK-----------------NFDKTQRSEIR------------
        +              KN A                                  + +A+F                  N DK Q+  +R            
Subjt:  SF-------------KNSA----------------------------------KRKAFFK-----------------NFDKTQRSEIR------------

Query:  ----SKWYSSMEEAEQNIPFFTWMNKSKDEINVLQESWKTTQRGEGLRLCNESKIQSKLNKSF-------GSR-------------------RAELGIFC
            S    ++E  E + P     N    +IN  Q  ++ +    G    + S   +++N+         GS+                   R ELG FC
Subjt:  ----SKWYSSMEEAEQNIPFFTWMNKSKDEINVLQESWKTTQRGEGLRLCNESKIQSKLNKSF-------GSR-------------------RAELGIFC

Query:  DQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFYKGQSSKGNVQCYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLN
         QYG    Q P   +K   K+ ++Y S++ +R      +++ ++    Y K    KG SSK +  C+KC   GH+AN+CP+K KIN + ID+ TKQ LL 
Subjt:  DQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFYKGQSSKGNVQCYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLN

Query:  IIQSDSE--ESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKRGKSPASS------SKSPAPMSSDQYAM-DLGFTPVNRPRTRSASLQIRESMESLT
         I+SD +       S++E  I  LQEE  S     Y + +     G  P +       S     ++ DQ  + DL     +    R+  L++++S+E   
Subjt:  IIQSDSE--ESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKRGKSPASS------SKSPAPMSSDQYAM-DLGFTPVNRPRTRSASLQIRESMESLT

Query:  P
        P
Subjt:  P

A0A5D3C4I7 Movement protein | e value: 6.7e-02 | % identity: 32.56
Query:  GHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEES--IIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKRGKSPASS------SKSPAPMSSDQYAM
        GH+AN+CP+K KIN ++ID+ TKQ LL  I+SD + S     S++E  I  LQEE  S     Y + +     G  P +       S     ++ DQ  +
Subjt:  GHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEES--IIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKRGKSPASS------SKSPAPMSSDQYAM

Query:  -DLGFTPVNRPRTRSASLQIRESMESLTP
         DL     +    R+  L++++S+E   P
Subjt:  -DLGFTPVNRPRTRSASLQIRESMESLTP

A0A6J1EW44 uncharacterized protein LOC111436618 | e value: 4.9e-29 | % identity: 46.2
Query:  WKTTQRG--------EGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFY
        W+T   G        EGLRLCNESKIQ+KLN S  S R ELG FCDQYGC  I+ PST+R+  +K   K     SYRP   +  K  +     Y +R + 
Subjt:  WKTTQRG--------EGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFY

Query:  KGQSSKGNVQ--CYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEESIIYSTDEGQILKLQEETDSYSSSDYEEE
          ++ +G  +   +KCR  GH+ NKCP++ KINEL+ID   K QLL +  +DSE+S      EG+IL+LQ+E+DSYSS++YE E
Subjt:  KGQSSKGNVQ--CYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEESIIYSTDEGQILKLQEETDSYSSSDYEEE

A0A6J1EYM2 uncharacterized protein LOC111439730 | e value: 5.8e-30 | % identity: 47.89
Query:  WKTTQRG--------EGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFY
        W+T   G        EGLRL NESKIQ+KLN S  S R ELG FCDQYGC  I+ PST+ ++ +K   K     SYRP   ++ K  +    +Y +R + 
Subjt:  WKTTQRG--------EGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFY

Query:  --KGQSSKGNVQCYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKR
          K    K    C+KCR  GH+ANKCP++ KINEL+ID   K QLL +  +DSE+S      EG+IL+LQEE+DSYSS++YE E+ EGKR
Subjt:  --KGQSSKGNVQCYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATTTTTAGAATCTATTACAAGCTTATGAATACTAATATCTCTCCGAGAGCTTTGAGATCCTCACCTAAAGGATCAACTGTTCTTTTAGAAGCAAATCTAGATAGGTC
TGCAGTCACTGTTCCCAAAAGCCTATCCTGGGAACAGATCACAAGGAACCCCACATGGAAGCTTACAGAAGCTTTCACTCCACCAAAAAAGAACTCAAACTTAGCACAAA
TTGTTGAACATACCGATGGAACGGTAGAAGTCAAATTTTCTGAAGAACCAGAAACATCAAAGGTTAAAGAATTTATGTCCTCTAGACCAAGTACCTCTGGAACTTCTTCA
GAAATCGAGTCCAAATTCTATTTTGACCGATCCAACTCATTAAGAGTAAAATCAGTCAATATAGAGCAAAATGTGGCAAATGTTCGATATGAAGAACAACCACAATCTCC
CACACAAACAGACATGGATAATCGATCTGTGTTTGCCAGCCAGATTAATGTCCTTATCAAAGAATTTTCAATCGATAAAGAAGCCCTTAAGAGAGATTTCCTTTCTTTTA
AAAATAGTGCAAAAAGAAAAGCCTTTTTCAAAAATTTTGATAAAACCCAAAGATCTGAGATAAGATCAAAGTGGTATTCTTCCATGGAAGAAGCCGAACAAAATATTCCG
TTTTTCACTTGGATGAATAAATCCAAGGATGAGATCAATGTGCTTCAAGAATCCTGGAAAACTACTCAAAGAGGAGAAGGTCTCCGCCTCTGCAATGAATCAAAAATTCA
ATCCAAACTGAATAAATCCTTTGGATCTAGACGAGCAGAATTGGGAATTTTCTGCGATCAATATGGATGTGGAAAAATTCAACCTCCATCAACGGCGAGAAAATCCTCCA
TCAAAAGATTTCAGAAATATCAAAGTAGACAAAGCTACAGGCCATATGGGCCATTTCAAAAGAAAAATTACAAAAAACCTTTTAATTCATACAAAAAACGCCCCTTTTAC
AAAGGACAAAGTTCAAAAGGAAACGTTCAATGCTACAAATGTAGAGCTCATGGTCATTTTGCAAACAAGTGTCCTATGAAAAAGAAGATTAATGAGTTAGAAATAGATGA
CAATACGAAACAACAACTTCTAAACATTATTCAATCTGATTCTGAAGAATCAATCATCTACAGTACAGATGAGGGTCAAATTCTTAAATTACAAGAAGAAACCGATTCAT
ATTCCAGTAGTGATTATGAAGAGGAAGAATATGAAGGCAAAAGAGGCAAAAGCCCAGCTTCATCTTCAAAATCTCCAGCTCCTATGAGCTCAGATCAATATGCCATGGAT
CTTGGCTTCACTCCAGTGAATCGTCCAAGAACCAGAAGTGCATCCCTTCAAATTAGAGAATCAATGGAGTCTTTAACTCCACCACCGAGGCCATCCGCCTCTCTTGCACG
GCCTTCTGCCGTTACTCCAATGAGACCAACTGTCTCTCCAACCTCTGTAACTCAATCGAGACCTACTACAATTACATATTCATATTTCAAACAACTTGTCAAAATTCTTC
AAATCAAATGGTGGGATAAATATAATTTTTCTCACGCCGGCATCAAAGAAGTCAAACAATGGTTCGCCGATAATGGCTACCTTCAAGACTTATCAAAAAGGAAGAATGCT
GAATTCCTTAATTCAAAATCAAAACTACTAGCTGCTTTAGCTCAAACGACGACAGAAGCAGACTTTCAAAAGATCGTCAATCAAGCGACATCAATAGCGTCATCATCCTC
TCACCACAACTCCGATGTTGAGAACGAAGAAGAAAAGGGTGAATATGACCTCAATGACCCTTTCTTAGATTCACAACCCATGTGA
mRNA sequence
ATGATTTTTAGAATCTATTACAAGCTTATGAATACTAATATCTCTCCGAGAGCTTTGAGATCCTCACCTAAAGGATCAACTGTTCTTTTAGAAGCAAATCTAGATAGGTC
TGCAGTCACTGTTCCCAAAAGCCTATCCTGGGAACAGATCACAAGGAACCCCACATGGAAGCTTACAGAAGCTTTCACTCCACCAAAAAAGAACTCAAACTTAGCACAAA
TTGTTGAACATACCGATGGAACGGTAGAAGTCAAATTTTCTGAAGAACCAGAAACATCAAAGGTTAAAGAATTTATGTCCTCTAGACCAAGTACCTCTGGAACTTCTTCA
GAAATCGAGTCCAAATTCTATTTTGACCGATCCAACTCATTAAGAGTAAAATCAGTCAATATAGAGCAAAATGTGGCAAATGTTCGATATGAAGAACAACCACAATCTCC
CACACAAACAGACATGGATAATCGATCTGTGTTTGCCAGCCAGATTAATGTCCTTATCAAAGAATTTTCAATCGATAAAGAAGCCCTTAAGAGAGATTTCCTTTCTTTTA
AAAATAGTGCAAAAAGAAAAGCCTTTTTCAAAAATTTTGATAAAACCCAAAGATCTGAGATAAGATCAAAGTGGTATTCTTCCATGGAAGAAGCCGAACAAAATATTCCG
TTTTTCACTTGGATGAATAAATCCAAGGATGAGATCAATGTGCTTCAAGAATCCTGGAAAACTACTCAAAGAGGAGAAGGTCTCCGCCTCTGCAATGAATCAAAAATTCA
ATCCAAACTGAATAAATCCTTTGGATCTAGACGAGCAGAATTGGGAATTTTCTGCGATCAATATGGATGTGGAAAAATTCAACCTCCATCAACGGCGAGAAAATCCTCCA
TCAAAAGATTTCAGAAATATCAAAGTAGACAAAGCTACAGGCCATATGGGCCATTTCAAAAGAAAAATTACAAAAAACCTTTTAATTCATACAAAAAACGCCCCTTTTAC
AAAGGACAAAGTTCAAAAGGAAACGTTCAATGCTACAAATGTAGAGCTCATGGTCATTTTGCAAACAAGTGTCCTATGAAAAAGAAGATTAATGAGTTAGAAATAGATGA
CAATACGAAACAACAACTTCTAAACATTATTCAATCTGATTCTGAAGAATCAATCATCTACAGTACAGATGAGGGTCAAATTCTTAAATTACAAGAAGAAACCGATTCAT
ATTCCAGTAGTGATTATGAAGAGGAAGAATATGAAGGCAAAAGAGGCAAAAGCCCAGCTTCATCTTCAAAATCTCCAGCTCCTATGAGCTCAGATCAATATGCCATGGAT
CTTGGCTTCACTCCAGTGAATCGTCCAAGAACCAGAAGTGCATCCCTTCAAATTAGAGAATCAATGGAGTCTTTAACTCCACCACCGAGGCCATCCGCCTCTCTTGCACG
GCCTTCTGCCGTTACTCCAATGAGACCAACTGTCTCTCCAACCTCTGTAACTCAATCGAGACCTACTACAATTACATATTCATATTTCAAACAACTTGTCAAAATTCTTC
AAATCAAATGGTGGGATAAATATAATTTTTCTCACGCCGGCATCAAAGAAGTCAAACAATGGTTCGCCGATAATGGCTACCTTCAAGACTTATCAAAAAGGAAGAATGCT
GAATTCCTTAATTCAAAATCAAAACTACTAGCTGCTTTAGCTCAAACGACGACAGAAGCAGACTTTCAAAAGATCGTCAATCAAGCGACATCAATAGCGTCATCATCCTC
TCACCACAACTCCGATGTTGAGAACGAAGAAGAAAAGGGTGAATATGACCTCAATGACCCTTTCTTAGATTCACAACCCATGTGA
Protein sequence
MIFRIYYKLMNTNISPRALRSSPKGSTVLLEANLDRSAVTVPKSLSWEQITRNPTWKLTEAFTPPKKNSNLAQIVEHTDGTVEVKFSEEPETSKVKEFMSSRPSTSGTSS
EIESKFYFDRSNSLRVKSVNIEQNVANVRYEEQPQSPTQTDMDNRSVFASQINVLIKEFSIDKEALKRDFLSFKNSAKRKAFFKNFDKTQRSEIRSKWYSSMEEAEQNIP
FFTWMNKSKDEINVLQESWKTTQRGEGLRLCNESKIQSKLNKSFGSRRAELGIFCDQYGCGKIQPPSTARKSSIKRFQKYQSRQSYRPYGPFQKKNYKKPFNSYKKRPFY
KGQSSKGNVQCYKCRAHGHFANKCPMKKKINELEIDDNTKQQLLNIIQSDSEESIIYSTDEGQILKLQEETDSYSSSDYEEEEYEGKRGKSPASSSKSPAPMSSDQYAMD
LGFTPVNRPRTRSASLQIRESMESLTPPPRPSASLARPSAVTPMRPTVSPTSVTQSRPTTITYSYFKQLVKILQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKRKNA
EFLNSKSKLLAALAQTTTEADFQKIVNQATSIASSSSHHNSDVENEEEKGEYDLNDPFLDSQPM
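
The relationship between the CDS and the protein sequence above can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) and uses only the first 30 and last 12 nucleotides of the CDS copied from this record; the names CODON_TABLE and translate are illustrative and do not come from CuGenDBv2.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 12 nucleotides of the CDS listed in this record.
cds_start = "ATGATTTTTAGAATCTATTACAAGCTTATG"
cds_end = "CAACCCATGTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two translated fragments can be compared against the beginning and end of the protein sequence; running the same function over the full CDS would be the complete check.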