; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010158 (gene) of Snake gourd v1 genome

Gene IDTan0010158
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionKelch repeat-containing protein
Genome locationLG05:73506947..73508252
RNA-Seq ExpressionTan0010158
SyntenyTan0010158
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8652542.1 hypothetical protein Csa_013076 [Cucumis sativus]5.8e-7955.62Show/hide
Query:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGDA
        MDG+  DPFDSL +LC++S SQEDILR CSFAG P S+D S    SQ LHP P  +  AS PESL+QREQL   +A Q PPEQS G++P+ VDDPS+ DA
Subjt:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGDA

Query:  VAVAGGTVEAVEGGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTSSVPI--------VPIAE
         A       AV GGCV +V TGVDLGKN +LG  LEVQST+QT  IEIIGVRR ++SES    E ESASKRLKLSNEALG  SSVP         VP+ E
Subjt:  VAVAGGTVEAVEGGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTSSVPI--------VPIAE

Query:  SGGESKLVD---HNGEETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKNSKESETEETPSGNSGKPTSIIMEILK
          G  K+ D    NGEETHC K     ++ + EK+VENSQPE P ++    NRD  R  L S +N         SKE   + T SG SG   SIIMEILK
Subjt:  SGGESKLVD---HNGEETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKNSKESETEETPSGNSGKPTSIIMEILK

Query:  ILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWPE
        IL++ E  +ED KLA+M+++E+   RGMTFPRPCWWPE
Subjt:  ILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWPE

KAG6577658.1 hypothetical protein SDJN03_25232, partial [Cucurbita argyrosperma subsp. sororia]2.7e-2836.81Show/hide
Query:  VLSDSQEDILRRCSFAGNPKSVDG---SGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGDAVAVAGGTVEAVEG
        V+  S  ++ RRC F G PKS D    S S+DS   H +   IFM SLP+SL++     PH +Y+  PE+ V   P   D        +VA   + A   
Subjt:  VLSDSQEDILRRCSFAGNPKSVDG---SGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGDAVAVAGGTVEAVEG

Query:  GCVCDVATGVDLGKNPDLGVQLEVQSTQQTAE----IEIIGVRRREISESEGSR------EVE---SASKRLKLSNEALGTTSSVPI------VPIAESG
            D    +DLG++ D+G+  EVQST +T E    IE+I   R   SE++  R      +V    S+SK+L+LS EALG +S   I      V    SG
Subjt:  GCVCDVATGVDLGKNPDLGVQLEVQSTQQTAE----IEIIGVRRREISESEGSR------EVE---SASKRLKLSNEALGTTSSVPI------VPIAESG

Query:  GESKL--VDHNGE------------------------ETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKE--ATNRDPWRRVLPSTINTSKNNREKNSKE
        G SK+  VD+N +                        E HC  KE+  EKEV E  V+NSQ ++ S  +E  A       RVLP +++  K  R+ + + 
Subjt:  GESKL--VDHNGE------------------------ETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKE--ATNRDPWRRVLPSTINTSKNNREKNSKE

Query:  SETEETPSGNSGKPTSI-IMEILKILAEE---ESEEEDNKLANMSILEIVTRRGMTFPRPCWWP
        S T E        P  + ++EILKILA E   E+E  D  L+N+SIL+IV RRGMTFPRP WWP
Subjt:  SETEETPSGNSGKPTSI-IMEILKILAEE---ESEEEDNKLANMSILEIVTRRGMTFPRPCWWP

KAG6584353.1 hypothetical protein SDJN03_20285, partial [Cucurbita argyrosperma subsp. sororia]6.0e-9264.55Show/hide
Query:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILH-PQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGD
        MDGNGDDPFDSLT+ CVLSDSQEDILRRCSFAGNP+SVDGS ST SQI H  QP  +FMASLPESL+QREQ+T HNAYQAPPEQS GQ+PMA DDPSL D
Subjt:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILH-PQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGD

Query:  AVAVAGGTVEAVEGGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTSSVPIV----PI-AESG
        A  VAGG    VEGGCV DVAT VDLGKN DLG +LEVQSTQQT EIE++GVRRR++SES+   E ESASKRL  SNEALGT SSVPIV    P+  ESG
Subjt:  AVAVAGGTVEAVEGGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTSSVPIV----PI-AESG

Query:  GESKLVDH------------------------NGEETHC-KKKEKFAEKEVCEKRVENSQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKNSKESET
        G SKLVDH                        NGEETHC K KEKFA     EKRVENSQPEEP  ++    RD WR VLPST+N SKN+ E ++     
Subjt:  GESKLVDH------------------------NGEETHC-KKKEKFAEKEVCEKRVENSQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKNSKESET

Query:  EETPSGNSGKPTSIIMEILKILAEEESEEE
                  P SIIMEILK+LAEEE EE+
Subjt:  EETPSGNSGKPTSIIMEILKILAEEESEEE

KAG7019938.1 hypothetical protein SDJN02_18905 [Cucurbita argyrosperma subsp. argyrosperma]5.4e-10965.21Show/hide
Query:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILH-PQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGD
        MDGNGDDPFDSLT+ CVLSDSQEDILRRCSFAGNP+SVDGS ST SQI H  QP  +FMASLPESL+QREQ+T HNAYQAPPEQS GQ+PMA DDPSL D
Subjt:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILH-PQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGD

Query:  AVAVAGGTVEAVEGGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTSSVPIV----PI-AESG
        A  VAGG    VEGGCV DVAT VDLGKN DLG +LEVQSTQ T EIE++GVRRR++SES+   E ESASKRL  SNEALGT SSVPIV    P+  ESG
Subjt:  AVAVAGGTVEAVEGGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTSSVPIV----PI-AESG

Query:  GESKLVDH------------------------NGEETHC-KKKEKFAEKEVCEKRVENSQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKNSKESET
        G SKLVDH                        NGEETHC K KEKFA     EKRVENSQPEEP  ++    RD WR VLPST+N SKN+ E ++     
Subjt:  GESKLVDH------------------------NGEETHC-KKKEKFAEKEVCEKRVENSQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKNSKESET

Query:  EETPSGNSGKPTSIIMEILKILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWPEGWDFSFK
                  P SIIMEILK+LAEEE  EED + A+MSILE+V+ RGMTFPRPCWWPEG +FSFK
Subjt:  EETPSGNSGKPTSIIMEILKILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWPEGWDFSFK

TYJ98997.1 hypothetical protein E5676_scaffold248G002010 [Cucumis melo var. makuwa]7.1e-8555.59Show/hide
Query:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGDA
        MDG+G DPFDSL +LCV+S SQEDILR  SFAG PKS+D SG   SQ L   P  +  AS PESL+QREQL   +AY+ PPEQS G++P+AVDDPS+ DA
Subjt:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGDA

Query:  VAVAGGTVEAVEGGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTSSVPIVPIAESGGESKLV
         A       AV GGCV +V TGVDLGKN +LG  LEVQSTQQT EIEIIGVRR ++SES    E ESASKRLKLSNEALG  SSVP+V + ESG ES L+
Subjt:  VAVAGGTVEAVEGGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTSSVPIVPIAESGGESKLV

Query:  D-----------------HNGEETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKNSKESETEETPSGNSGKPTSI
        +                  NGEETHC K     ++++ EK+VENS PEEP ++    N DPWR  L ST+N         SKE + + + SG SG+  SI
Subjt:  D-----------------HNGEETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKNSKESETEETPSGNSGKPTSI

Query:  IMEILKILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWPEGWDFS
        IMEILKI+++ E  +ED KLANM ++E+   RGMTFPRPCWWPE + ++
Subjt:  IMEILKILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWPEGWDFS

TrEMBL top hitse value%identityAlignment
A0A0A0L1F4 Uncharacterized protein5.8e-2435Show/hide
Query:  VLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAP----PEQSVGQQPMAVDDPSLGDAVAVAGGTVEAVE
        +++ S  ++LRRC F GN  S   S S+DS+    +   IFM SLP+S+++++   PH+   AP    P  S  ++ +A D  S G  V     + E V+
Subjt:  VLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAP----PEQSVGQQPMAVDDPSLGDAVAVAGGTVEAVE

Query:  GGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAE----IEIIGVRRREISESEGSREV---------ESASKRLKLSNEALGTTSSVPIVPIAESG-----
         G        VDLG++ D+G+  EVQST +T E    I +IGV     SE + S  +         ES+SK+L+LS EALG +S         SG     
Subjt:  GGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAE----IEIIGVRRREISESEGSREV---------ESASKRLKLSNEALGTTSSVPIVPIAESG-----

Query:  --GESKLVDHNGEETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKE--ATNRDPWRRVLPSTINTSKNNREKNSKESETEETPSGNSGKPTSI----IME
           + K+ +   +E  C  KEK AE EV E  V+N +  + +KF+E  A       RVLP +I+  KNN          EE  + +  +P  +    I+ 
Subjt:  --GESKLVDHNGEETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKE--ATNRDPWRRVLPSTINTSKNNREKNSKESETEETPSGNSGKPTSI----IME

Query:  ILKILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWP
        ILK   +   E +D  L+ +SILEI   RGMTFPRP WWP
Subjt:  ILKILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWP

A0A0A0LT00 Uncharacterized protein2.8e-7955.62Show/hide
Query:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGDA
        MDG+  DPFDSL +LC++S SQEDILR CSFAG P S+D S    SQ LHP P  +  AS PESL+QREQL   +A Q PPEQS G++P+ VDDPS+ DA
Subjt:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGDA

Query:  VAVAGGTVEAVEGGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTSSVPI--------VPIAE
         A       AV GGCV +V TGVDLGKN +LG  LEVQST+QT  IEIIGVRR ++SES    E ESASKRLKLSNEALG  SSVP         VP+ E
Subjt:  VAVAGGTVEAVEGGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTSSVPI--------VPIAE

Query:  SGGESKLVD---HNGEETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKNSKESETEETPSGNSGKPTSIIMEILK
          G  K+ D    NGEETHC K     ++ + EK+VENSQPE P ++    NRD  R  L S +N         SKE   + T SG SG   SIIMEILK
Subjt:  SGGESKLVD---HNGEETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKNSKESETEETPSGNSGKPTSIIMEILK

Query:  ILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWPE
        IL++ E  +ED KLA+M+++E+   RGMTFPRPCWWPE
Subjt:  ILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWPE

A0A2N9G3N6 Uncharacterized protein2.8e-2633.79Show/hide
Query:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFA----GNPKSVDGSGSTDSQILHPQPAA--IFMASLPESLQQREQLTPHNAYQAPPEQSV------GQQ
        +D   +DPF S+T+LC +S SQE+ LR CSFA    G     DG  S  +Q +    A+  I M S PES ++++   PH+ +Q P E S+       QQ
Subjt:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFA----GNPKSVDGSGSTDSQILHPQPAA--IFMASLPESLQQREQLTPHNAYQAPPEQSV------GQQ

Query:  PMAVDDPSLGDAVAVAGGTVEAVEGGCVCDVATGVDLGKNPDLG---VQL--EVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTS
        PMAVD P   DA A+AG T          D +  VDLGK+ DLG   V+L   V       E E +G+ RRE S  E     ES SK+LK+ +  L + +
Subjt:  PMAVDDPSLGDAVAVAGGTVEAVEGGCVCDVATGVDLGKNPDLG---VQL--EVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTS

Query:  SVPIVPIAESGG----------ESKLVDHNGE------ETHCKKKEKF--AEKEVCEKRVEN--SQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKN
          P                   E   V+ N E      E+  K+K  F   + E+ +   E    + EE     E  N     RVLP +++    NR   
Subjt:  SVPIVPIAESGG----------ESKLVDHNGE------ETHCKKKEKF--AEKEVCEKRVEN--SQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKN

Query:  SKESETEETPSGNSGKPTSIIMEILKILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWPE
           +E  +   G +GK  + I+++LK L ++  +EED+ L ++SI ++  ++GMTFP+PCWWPE
Subjt:  SKESETEETPSGNSGKPTSIIMEILKILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWPE

A0A5A7TCP7 Uncharacterized protein6.8e-2535.14Show/hide
Query:  DILRRCSFAGNPKSVDGSGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAP----PEQSVGQQPMAVDDPSLGDAVAVAGGTVEAVEGGCVCDV
        ++ RRC F GN  S   S S+DS+ +  +   IFM SLP+S+++++   PH+   AP    P  S G++ +A D  S G  V     + E V+ G     
Subjt:  DILRRCSFAGNPKSVDGSGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAP----PEQSVGQQPMAVDDPSLGDAVAVAGGTVEAVEGGCVCDV

Query:  ATGVDLGKNPDLGVQLEVQSTQQTAE----IEIIGVRRREISESEGSREV---------ESASKRLKLSNEALGTTS-------SVPIVPIAESGG----
           VDLG++ D+G+  EVQST +T E    IE+IG R    SE++ S  +         ES+SK+L+LS EALG +S         P V +  SGG    
Subjt:  ATGVDLGKNPDLGVQLEVQSTQQTAE----IEIIGVRRREISESEGSREV---------ESASKRLKLSNEALGTTS-------SVPIVPIAESGG----

Query:  -------------ESKLVDHNGEETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKE--ATNRDPWRRVLPSTINTSKNNREKNSKESETEETPSGNSGKP
                     + K+ +   +E  C  KEK AEKEV E  V+N Q  + +KF+E  A       RVLP ++   K N          EE  + +  +P
Subjt:  -------------ESKLVDHNGEETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKE--ATNRDPWRRVLPSTINTSKNNREKNSKESETEETPSGNSGKP

Query:  TSI-IMEILKILAEEE---SEEEDNKLANMSILEIVTRRGMTFPRPCWWP
          + +++IL IL  E+    E +D  L+ +SILEI   RGMTFPRP WWP
Subjt:  TSI-IMEILKILAEEE---SEEEDNKLANMSILEIVTRRGMTFPRPCWWP

A0A5D3BIQ1 Uncharacterized protein3.4e-8555.59Show/hide
Query:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGDA
        MDG+G DPFDSL +LCV+S SQEDILR  SFAG PKS+D SG   SQ L   P  +  AS PESL+QREQL   +AY+ PPEQS G++P+AVDDPS+ DA
Subjt:  MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGDA

Query:  VAVAGGTVEAVEGGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTSSVPIVPIAESGGESKLV
         A       AV GGCV +V TGVDLGKN +LG  LEVQSTQQT EIEIIGVRR ++SES    E ESASKRLKLSNEALG  SSVP+V + ESG ES L+
Subjt:  VAVAGGTVEAVEGGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTSSVPIVPIAESGGESKLV

Query:  D-----------------HNGEETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKNSKESETEETPSGNSGKPTSI
        +                  NGEETHC K     ++++ EK+VENS PEEP ++    N DPWR  L ST+N         SKE + + + SG SG+  SI
Subjt:  D-----------------HNGEETHCKKKEKFAEKEVCEKRVENSQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKNSKESETEETPSGNSGKPTSI

Query:  IMEILKILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWPEGWDFS
        IMEILKI+++ E  +ED KLANM ++E+   RGMTFPRPCWWPE + ++
Subjt:  IMEILKILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWPEGWDFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGGTAATGGCGATGATCCCTTCGATTCACTCACCCAACTCTGCGTCCTTTCCGACTCCCAAGAGGATATTTTGCGTCGCTGCTCTTTCGCCGGCAACCCCAAATC
CGTCGATGGCTCTGGTTCTACCGACTCCCAAATCCTTCACCCTCAGCCCGCTGCTATTTTCATGGCTTCTCTTCCCGAGAGTTTGCAACAGAGAGAACAACTCACCCCTC
ATAATGCCTACCAGGCTCCACCGGAACAGTCCGTCGGACAGCAGCCCATGGCTGTGGACGATCCTTCTCTTGGAGACGCGGTGGCGGTCGCCGGCGGTACGGTGGAAGCC
GTCGAAGGCGGATGTGTGTGCGATGTTGCCACGGGAGTCGATTTGGGGAAGAATCCTGACCTAGGGGTTCAATTGGAAGTTCAATCGACACAACAAACCGCTGAAATCGA
AATCATCGGTGTTCGTAGAAGAGAGATTTCGGAGTCTGAAGGTAGCAGAGAGGTTGAATCGGCATCGAAAAGGTTGAAATTGTCGAATGAAGCTTTGGGCACAACCTCTT
CTGTACCAATCGTACCAATTGCAGAGTCTGGTGGAGAATCGAAACTCGTGGATCACAACGGCGAAGAAACTCACTGTAAGAAGAAGGAAAAATTTGCAGAGAAGGAAGTG
TGTGAGAAGAGAGTCGAAAATTCCCAACCCGAAGAACCAAGCAAGTTCAAAGAAGCGACCAACCGTGATCCATGGAGGCGCGTTCTGCCATCAACAATCAATACATCAAA
GAACAATAGAGAGAAGAATTCGAAGGAGAGTGAAACTGAAGAAACGCCCTCTGGTAATTCTGGAAAGCCCACATCCATTATTATGGAAATTTTAAAGATTCTTGCAGAAG
AAGAAAGTGAAGAAGAAGACAATAAGTTGGCAAATATGAGCATACTGGAAATCGTAACACGTCGTGGAATGACATTTCCTCGGCCGTGTTGGTGGCCGGAAGGGTGGGAT
TTCAGCTTCAAGTAG
mRNA sequenceShow/hide mRNA sequence
TAAAAATTGTATCTCCTTTGATTTTGTTTTACTCCGCTGCTTTCCATGGACGGTAATGGCGATGATCCCTTCGATTCACTCACCCAACTCTGCGTCCTTTCCGACTCCCA
AGAGGATATTTTGCGTCGCTGCTCTTTCGCCGGCAACCCCAAATCCGTCGATGGCTCTGGTTCTACCGACTCCCAAATCCTTCACCCTCAGCCCGCTGCTATTTTCATGG
CTTCTCTTCCCGAGAGTTTGCAACAGAGAGAACAACTCACCCCTCATAATGCCTACCAGGCTCCACCGGAACAGTCCGTCGGACAGCAGCCCATGGCTGTGGACGATCCT
TCTCTTGGAGACGCGGTGGCGGTCGCCGGCGGTACGGTGGAAGCCGTCGAAGGCGGATGTGTGTGCGATGTTGCCACGGGAGTCGATTTGGGGAAGAATCCTGACCTAGG
GGTTCAATTGGAAGTTCAATCGACACAACAAACCGCTGAAATCGAAATCATCGGTGTTCGTAGAAGAGAGATTTCGGAGTCTGAAGGTAGCAGAGAGGTTGAATCGGCAT
CGAAAAGGTTGAAATTGTCGAATGAAGCTTTGGGCACAACCTCTTCTGTACCAATCGTACCAATTGCAGAGTCTGGTGGAGAATCGAAACTCGTGGATCACAACGGCGAA
GAAACTCACTGTAAGAAGAAGGAAAAATTTGCAGAGAAGGAAGTGTGTGAGAAGAGAGTCGAAAATTCCCAACCCGAAGAACCAAGCAAGTTCAAAGAAGCGACCAACCG
TGATCCATGGAGGCGCGTTCTGCCATCAACAATCAATACATCAAAGAACAATAGAGAGAAGAATTCGAAGGAGAGTGAAACTGAAGAAACGCCCTCTGGTAATTCTGGAA
AGCCCACATCCATTATTATGGAAATTTTAAAGATTCTTGCAGAAGAAGAAAGTGAAGAAGAAGACAATAAGTTGGCAAATATGAGCATACTGGAAATCGTAACACGTCGT
GGAATGACATTTCCTCGGCCGTGTTGGTGGCCGGAAGGGTGGGATTTCAGCTTCAAGTAGATGAAGAAGAAAGCTGAAGCTCATAATTATTAGCTACTGGAAGAAGAATG
AATCCCTTTTGGTAATTAACATTTCTGAGCTGAAGAAGAAACGTATTTAGTTTCTTTTTGAGGTGCTTGAATGTTGTTTTATGTAATGTATGAATTTATTATTATAGCAC
CATTGTGTTTTTGAGATGCTCCGAATTGGAGAAGGAAAAACACAGATAGATGAGTTTGTATTTTGTGAAGACTTTGAATGTAAATGGTTTTGTTTC
Protein sequenceShow/hide protein sequence
MDGNGDDPFDSLTQLCVLSDSQEDILRRCSFAGNPKSVDGSGSTDSQILHPQPAAIFMASLPESLQQREQLTPHNAYQAPPEQSVGQQPMAVDDPSLGDAVAVAGGTVEA
VEGGCVCDVATGVDLGKNPDLGVQLEVQSTQQTAEIEIIGVRRREISESEGSREVESASKRLKLSNEALGTTSSVPIVPIAESGGESKLVDHNGEETHCKKKEKFAEKEV
CEKRVENSQPEEPSKFKEATNRDPWRRVLPSTINTSKNNREKNSKESETEETPSGNSGKPTSIIMEILKILAEEESEEEDNKLANMSILEIVTRRGMTFPRPCWWPEGWD
FSFK