; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G013150 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G013150
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionzinc finger homeobox protein 4-like isoform X1
Genome locationchr08:21520315..21521915
RNA-Seq ExpressionLsi08G013150
SyntenyLsi08G013150
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444812.1 PREDICTED: uncharacterized protein LOC103488048 [Cucumis melo]2.9e-8966.55Show/hide
Query:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE
        STH+CS SF+SD+FTP +HFVA IL +  LL+Q+S+FSLGL PSW +RRKRSAV SPPD +SV+ QPPPPPSSSE  KESSPTTPLS N   L RSESDE
Subjt:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE

Query:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL
        N  N K+SK+K  ++KKFQYLETIDKLTH+N+AL  DV+AMK+H+  LKTINS+LKAKKQE  MILGGS NQSEIPEIGTSSS          KSS SN+
Subjt:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL

Query:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAA
        EN+LH+ +PS+KNQTAP+ EQSN  QN QIP G IPL D    PMGIPDLNL+ E+  Q   +++MAA+ARQNRIRIWKNK  +N+NG A
Subjt:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAA

XP_008444813.1 PREDICTED: uncharacterized protein LOC103488049 [Cucumis melo]2.7e-10372.43Show/hide
Query:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL
        MAST+STHQCS SFDSDDF+P++ FVAQIL   PLL+Q+S FSLGLSPSW IRRKRSAVDSPPD+ S+I QPP PP     SSE  KESSPTTPLSLNSL
Subjt:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA
        PLSRSESDEN T  K+SKKK  ++KK QYLETIDKLTH+ +ALEGD++AMKRH+  LKTINS+LKAKKQE IL G  N S  PEIGTSSSVAME+AKLT 
Subjt:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA

Query:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAAKLQ
        KSS SN+EN+  + EPS+KNQT P  EQ NS +N+QIPIG IPLYDPSLGPMGIPDLNLS E+I     ++++AARARQNRI+IWKNKN +NNNGA KLQ
Subjt:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAAKLQ

Query:  S
        S
Subjt:  S

XP_011649663.1 uncharacterized protein LOC105434650 [Cucumis sativus]3.2e-7260.14Show/hide
Query:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE
        ST   S S +SDDFTP+DH VA IL + PLL+Q+S+FSLGL PSW IRRKRSAV SP   ++V+ QPPPPPSSSE  KE+SPTTPLSL+SL LSRSESDE
Subjt:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE

Query:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL
        N  N K+SK+K  + KKF+  E++DKLTH+N+AL  + +A K+ +   KTINS+LKAKKQE  MILGGS N+SEIPE GTS+S          KSS  N+
Subjt:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL

Query:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNN
        EN+LH+ EPS KNQTAPM EQSN  QN QIPI  IPL D     MGIPDLNL+ E+  +    + +AA+ARQNR RI KNK    N
Subjt:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNN

XP_011649664.1 myocardin-related transcription factor A [Cucumis sativus]1.4e-10471.85Show/hide
Query:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPP----SSSENVKESSPTTPLSLNSL
        MAST+STHQCS SFDSDDF+P++HFVAQIL   PLL+QQS FSLGLSPSW IRRKRSAVDSPPD++S+I QPP PP     SSE  KESSPTTPLSL+SL
Subjt:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPP----SSSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA
        PLSRSESDEN T  K+SKKK  ++KK QYLETI+KLTH+ +ALEGD++AMKRH+  LKTINS+LKAKKQE ILGG +N S  P+ GTS+SVAME+AKLT 
Subjt:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA

Query:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNK-NKHNNNGAAKL
        KSS SN+EN+  + EPS+KNQT P+ EQSNS QN+QIPIG IPLYDPSLGPMGIPDLNLS E+I     ++++AA+ARQNRI+IWKNK N +NNNGA KL
Subjt:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNK-NKHNNNGAAKL

Query:  QS
        QS
Subjt:  QS

XP_022996985.1 zinc finger homeobox protein 4-like isoform X1 [Cucurbita maxima]3.2e-7258.17Show/hide
Query:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVI----PQPPPPPSSSENVKESSPTTPLSLNSL
        MAST   HQC+   D D  TPD+    QIL +FPLLVQQ +FSLGL P+W +R KRSAV SPPDS S++    P PPPPP SS   KESSPTTP SL+SL
Subjt:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVI----PQPPPPPSSSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA
        PLSR ESDE N    L  KK  ++KK QYLET+ +LT +N+AL G V  +KRHYN+LKT NS+LKAK+Q+MI   S  +S  PEI  SSS A++  K T 
Subjt:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA

Query:  KSSASNLENHLHQREPSIKNQTA-----PMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIW--KNKNKHNN
        K      ++H HQ +P IKNQTA        EQSNS QN +IP GAI +YDPS GP GIPDLNLSF+EI+Q   +R MAA+ARQNRI+IW  KN N +NN
Subjt:  KSSASNLENHLHQREPSIKNQTA-----PMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIW--KNKNKHNN

Query:  NGAAKL
        NGAA+L
Subjt:  NGAAKL

TrEMBL top hitse value%identityAlignment
A0A0A0LRP1 Uncharacterized protein6.9e-10571.85Show/hide
Query:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPP----SSSENVKESSPTTPLSLNSL
        MAST+STHQCS SFDSDDF+P++HFVAQIL   PLL+QQS FSLGLSPSW IRRKRSAVDSPPD++S+I QPP PP     SSE  KESSPTTPLSL+SL
Subjt:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPP----SSSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA
        PLSRSESDEN T  K+SKKK  ++KK QYLETI+KLTH+ +ALEGD++AMKRH+  LKTINS+LKAKKQE ILGG +N S  P+ GTS+SVAME+AKLT 
Subjt:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA

Query:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNK-NKHNNNGAAKL
        KSS SN+EN+  + EPS+KNQT P+ EQSNS QN+QIPIG IPLYDPSLGPMGIPDLNLS E+I     ++++AA+ARQNRI+IWKNK N +NNNGA KL
Subjt:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNK-NKHNNNGAAKL

Query:  QS
        QS
Subjt:  QS

A0A1S3BAR4 uncharacterized protein LOC1034880491.3e-10372.43Show/hide
Query:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL
        MAST+STHQCS SFDSDDF+P++ FVAQIL   PLL+Q+S FSLGLSPSW IRRKRSAVDSPPD+ S+I QPP PP     SSE  KESSPTTPLSLNSL
Subjt:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA
        PLSRSESDEN T  K+SKKK  ++KK QYLETIDKLTH+ +ALEGD++AMKRH+  LKTINS+LKAKKQE IL G  N S  PEIGTSSSVAME+AKLT 
Subjt:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA

Query:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAAKLQ
        KSS SN+EN+  + EPS+KNQT P  EQ NS +N+QIPIG IPLYDPSLGPMGIPDLNLS E+I     ++++AARARQNRI+IWKNKN +NNNGA KLQ
Subjt:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAAKLQ

Query:  S
        S
Subjt:  S

A0A1S3BC34 uncharacterized protein LOC1034880481.4e-8966.55Show/hide
Query:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE
        STH+CS SF+SD+FTP +HFVA IL +  LL+Q+S+FSLGL PSW +RRKRSAV SPPD +SV+ QPPPPPSSSE  KESSPTTPLS N   L RSESDE
Subjt:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE

Query:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL
        N  N K+SK+K  ++KKFQYLETIDKLTH+N+AL  DV+AMK+H+  LKTINS+LKAKKQE  MILGGS NQSEIPEIGTSSS          KSS SN+
Subjt:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL

Query:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAA
        EN+LH+ +PS+KNQTAP+ EQSN  QN QIP G IPL D    PMGIPDLNL+ E+  Q   +++MAA+ARQNRIRIWKNK  +N+NG A
Subjt:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAA

A0A5A7VA15 Uncharacterized protein1.4e-8966.55Show/hide
Query:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE
        STH+CS SF+SD+FTP +HFVA IL +  LL+Q+S+FSLGL PSW +RRKRSAV SPPD +SV+ QPPPPPSSSE  KESSPTTPLS N   L RSESDE
Subjt:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE

Query:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL
        N  N K+SK+K  ++KKFQYLETIDKLTH+N+AL  DV+AMK+H+  LKTINS+LKAKKQE  MILGGS NQSEIPEIGTSSS          KSS SN+
Subjt:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL

Query:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAA
        EN+LH+ +PS+KNQTAP+ EQSN  QN QIP G IPL D    PMGIPDLNL+ E+  Q   +++MAA+ARQNRIRIWKNK  +N+NG A
Subjt:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAA

A0A5A7VHE1 Uncharacterized protein1.3e-10372.43Show/hide
Query:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL
        MAST+STHQCS SFDSDDF+P++ FVAQIL   PLL+Q+S FSLGLSPSW IRRKRSAVDSPPD+ S+I QPP PP     SSE  KESSPTTPLSLNSL
Subjt:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA
        PLSRSESDEN T  K+SKKK  ++KK QYLETIDKLTH+ +ALEGD++AMKRH+  LKTINS+LKAKKQE IL G  N S  PEIGTSSSVAME+AKLT 
Subjt:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA

Query:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAAKLQ
        KSS SN+EN+  + EPS+KNQT P  EQ NS +N+QIPIG IPLYDPSLGPMGIPDLNLS E+I     ++++AARARQNRI+IWKNKN +NNNGA KLQ
Subjt:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAAKLQ

Query:  S
        S
Subjt:  S

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GTGAAACCCAGTCCCCAAGCAACTCTCACTCTTCTTCTTCTTCTTCCTCTTTTCTTCAATAAATCCCTCAATTCCTTCTCTTCCACTCTCCCATGGAATTTCTCTTCCTC
TGAATCTTCGCAAGAATCCTCTCAAACAACCAGATTCCCCATGGCTTCCACTACTTCCACTCATCAATGCTCCACCTCCTTCGATTCCGACGACTTCACCCCTGATGACC
ACTTCGTCGCTCAAATCCTCATCGATTTTCCTCTTCTCGTTCAACAATCACAGTTTTCTCTTGGCTTATCCCCTTCCTGGTCTATCCGACGCAAGAGATCCGCCGTCGAT
TCCCCGCCGGACTCCGCCTCCGTCATCCCCCAACCGCCGCCTCCTCCATCGTCGTCCGAGAACGTCAAGGAGTCTAGCCCTACTACTCCACTTTCACTCAACTCTTTACC
TTTGTCGCGGAGTGAATCTGACGAGAATAACACCAACACGAAGCTCTCCAAGAAGAAAACCTCTATCAATAAGAAATTTCAGTATTTGGAAACCATTGACAAATTGACCC
ACCGGAATCGAGCTCTGGAAGGGGACGTTGATGCTATGAAGCGACATTATAATCAACTGAAAACTATTAATTCGAAGTTGAAAGCCAAGAAGCAAGAGATGATTCTGGGT
GGTTCCAATAATCAATCAGAAATTCCAGAAATTGGGACCTCAAGTTCGGTCGCCATGGAAATGGCTAAGTTAACTGCGAAATCCTCAGCCTCAAATCTGGAGAATCATCT
TCATCAACGTGAACCGTCGATCAAGAATCAGACGGCTCCGATGGAAGAACAGAGCAACAGTAAACAGAATTTCCAAATTCCAATTGGGGCAATTCCTTTATATGATCCTT
CATTAGGTCCAATGGGGATTCCTGATTTGAACCTCTCTTTTGAAGAAATTAATCAGGGATATTGTTCAAGACACATGGCTGCTAGAGCAAGACAGAACAGGATTCGGATC
TGGAAGAACAAGAACAAGCACAACAACAATGGAGCTGCCAAATTGCAATCCTAA
mRNA sequenceShow/hide mRNA sequence
GTGAAACCCAGTCCCCAAGCAACTCTCACTCTTCTTCTTCTTCTTCCTCTTTTCTTCAATAAATCCCTCAATTCCTTCTCTTCCACTCTCCCATGGAATTTCTCTTCCTC
TGAATCTTCGCAAGAATCCTCTCAAACAACCAGATTCCCCATGGCTTCCACTACTTCCACTCATCAATGCTCCACCTCCTTCGATTCCGACGACTTCACCCCTGATGACC
ACTTCGTCGCTCAAATCCTCATCGATTTTCCTCTTCTCGTTCAACAATCACAGTTTTCTCTTGGCTTATCCCCTTCCTGGTCTATCCGACGCAAGAGATCCGCCGTCGAT
TCCCCGCCGGACTCCGCCTCCGTCATCCCCCAACCGCCGCCTCCTCCATCGTCGTCCGAGAACGTCAAGGAGTCTAGCCCTACTACTCCACTTTCACTCAACTCTTTACC
TTTGTCGCGGAGTGAATCTGACGAGAATAACACCAACACGAAGCTCTCCAAGAAGAAAACCTCTATCAATAAGAAATTTCAGTATTTGGAAACCATTGACAAATTGACCC
ACCGGAATCGAGCTCTGGAAGGGGACGTTGATGCTATGAAGCGACATTATAATCAACTGAAAACTATTAATTCGAAGTTGAAAGCCAAGAAGCAAGAGATGATTCTGGGT
GGTTCCAATAATCAATCAGAAATTCCAGAAATTGGGACCTCAAGTTCGGTCGCCATGGAAATGGCTAAGTTAACTGCGAAATCCTCAGCCTCAAATCTGGAGAATCATCT
TCATCAACGTGAACCGTCGATCAAGAATCAGACGGCTCCGATGGAAGAACAGAGCAACAGTAAACAGAATTTCCAAATTCCAATTGGGGCAATTCCTTTATATGATCCTT
CATTAGGTCCAATGGGGATTCCTGATTTGAACCTCTCTTTTGAAGAAATTAATCAGGGATATTGTTCAAGACACATGGCTGCTAGAGCAAGACAGAACAGGATTCGGATC
TGGAAGAACAAGAACAAGCACAACAACAATGGAGCTGCCAAATTGCAATCCTAATCCATCCCCTGTATGTCATCATACAGTTCATCAACAAATTTCCACGTTTTTTTTTC
TTAACCTTAGGATTCAATTCATCAATTTGATGGATGGCCTGTTTTGATTCTGGAATTGGGGTTGTTTTTTTTTTCCTTTTTTTTTTTTAATTTTTTAATTGGGTTACAGG
AATTGTAAAGGTAGATA
Protein sequenceShow/hide protein sequence
VKPSPQATLTLLLLLPLFFNKSLNSFSSTLPWNFSSSESSQESSQTTRFPMASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVD
SPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILG
GSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRI
WKNKNKHNNNGAAKLQS