; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008711 (gene) of Snake gourd v1 genome

Gene IDTan0008711
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLate embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putative
Genome locationLG08:26232079..26234333
RNA-Seq ExpressionTan0008711
SyntenyTan0008711
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022964992.1 uncharacterized protein At1g08160-like [Cucurbita moschata]1.3e-7470.53Show/hide
Query:  EEASASSKGTGQERRSSS----HGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKRATI
        EEAS+S+    +  + SS    HGT KRT+LMRIIGRSLL +M LVGLAIVICWL+VFPKTP   L NGHVTPHSLTDRKLNASI+FTIKSYNPNKRA+I
Subjt:  EEASASSKGTGQERRSSS----HGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKRATI

Query:  HIHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDN
        H+  M MT+D+MGQ F T+IPTF Q PGNQT   P + V+F+YPFG +K+  L DG+NPEL FSA +SYI+E+W SKRRS+EIYCDR RLKINGSTPFDN
Subjt:  HIHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDN

Query:  TKCKVDL
        TKCKVDL
Subjt:  TKCKVDL

XP_022970341.1 uncharacterized protein At1g08160-like [Cucurbita maxima]6.2e-7470.39Show/hide
Query:  EASASSKGTGQERRSS----SHGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKRATIH
        EAS+SS    +  + S     HGT KRT+LMRIIGRSLL +M LVGLAIVICWL+VFPKTP   L NGHVTPHSLTDRKLNASI+FTIKSYNPNKRA+IH
Subjt:  EASASSKGTGQERRSS----SHGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKRATIH

Query:  IHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDNT
        +  M MT+D+MGQ F T+IPTF Q PGNQT  TP + V+F+YPFG +K+  L +G+NPEL FSA +SYI+E+W SKRRS+EIYCDR RLKINGSTPFDNT
Subjt:  IHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDNT

Query:  KCKVDL
        KCKVDL
Subjt:  KCKVDL

XP_023520091.1 uncharacterized protein At1g08160-like [Cucurbita pepo subsp. pepo]2.1e-7470.39Show/hide
Query:  EASASSKGTGQERRSSS----HGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKRATIH
        EAS+S+    +  + SS    HGT KRT+LMRIIGRSLL +M LVGLAIVICWL+VFPKTP   L NGHVTPHSLTDRKLNASI+FTIKSYNPNKRA+IH
Subjt:  EASASSKGTGQERRSSS----HGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKRATIH

Query:  IHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDNT
        +  M MT+D+MGQ F T+IPTF Q PGNQT   P + V+F+YPFG +K+  L DG+NPEL FSA +SYI+E+W SKRRS+EIYCDR RLKINGSTPFDNT
Subjt:  IHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDNT

Query:  KCKVDL
        KCKVDL
Subjt:  KCKVDL

XP_023537505.1 uncharacterized protein LOC111798522 [Cucurbita pepo subsp. pepo]5.1e-6058.88Show/hide
Query:  SSSSRGADEEASASSKGTGQERRSSSHG---TGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYN
        SS++   D+ +S+SS      +R SS+G   T  RTRL+RIIGRS+LGLM+LVGLA+V CWL+V PK P+FSL  G VT HSLTDRKLNAS++F I+S+N
Subjt:  SSSSRGADEEASASSKGTGQERRSSSHG---TGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYN

Query:  PNKRATIHIHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKIN
        PNK+A IHI YMTMTI+EMG+ F   +  F Q PGN T+ +P I +DF+YP   L+E+   DG++PEL+ SA I YI+  W SKRRS+EIYC+R  LKIN
Subjt:  PNKRATIHIHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKIN

Query:  GSTPFDNTKCKVDL
        GSTP DN KC VDL
Subjt:  GSTPFDNTKCKVDL

XP_038895440.1 uncharacterized protein LOC120083674 [Benincasa hispida]7.1e-6260.4Show/hide
Query:  EASASSKGTGQERRSSSHGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKRATIHIHYM
        EAS+SSK   +      HGT KRT+L+RI GRSLLG+MILVG+ I+ICWL+VFPKTP  ++ +G V PH LTDRKL A+I FT+KSYNPNKRATIH+  M
Subjt:  EASASSKGTGQERRSSSHGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKRATIHIHYM

Query:  TMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDNTKCKV
         M + +MGQ F + IP F Q PGNQT++T  I  +F+YPFGH+KE    +G++P+LRFSA++SYI++RW SK R +EIYC   RLK N STPFDN KC V
Subjt:  TMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDNTKCKV

Query:  DL
        DL
Subjt:  DL

TrEMBL top hitse value%identityAlignment
A0A0A0LSV0 Uncharacterized protein2.8e-5653.81Show/hide
Query:  SSSRGADEEASASSKGTGQERRSSSHGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKR
        S+ +  +     SS  + +      HGT KRTR++RI GR+LLGLMILV +A++ICWL+VFP+ P   +  G V PHSLTDRKLNA+I FT+ SYNPNK+
Subjt:  SSSRGADEEASASSKGTGQERRSSSHGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKR

Query:  ATIHIHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTP
        A+I +  M M + +MG  F + IP+F Q P N+T+ T  I  +F+YPFGH+KE    +G++PELRFSA++SYI+ERW S+ R VE+YCD  RLK N ST 
Subjt:  ATIHIHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTP

Query:  FDNTKCKVDL
        FDN KCKVDL
Subjt:  FDNTKCKVDL

A0A6J1FBI3 uncharacterized protein LOC1114439281.4e-5857.48Show/hide
Query:  SSSSRGADEEASASSKGTGQERRSSSHG---TGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYN
        SS++   D+ +S+SS      +R SS+G   T  RTR++RIIGRS+LGLM+LVGLA+V CWL+V PK P+FSL  G VT HSLTDRKLNAS++F I+S+N
Subjt:  SSSSRGADEEASASSKGTGQERRSSSHG---TGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYN

Query:  PNKRATIHIHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKIN
        PNK+A IHI YM MTI+EMG+KF   +  F Q PGN  + +P I +DF+YP   L+E+   DG++PEL  SA I YI+  W SKRR +EIYC+R  LKIN
Subjt:  PNKRATIHIHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKIN

Query:  GSTPFDNTKCKVDL
        GSTP DN KC VDL
Subjt:  GSTPFDNTKCKVDL

A0A6J1HJ52 uncharacterized protein At1g08160-like6.1e-7570.53Show/hide
Query:  EEASASSKGTGQERRSSS----HGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKRATI
        EEAS+S+    +  + SS    HGT KRT+LMRIIGRSLL +M LVGLAIVICWL+VFPKTP   L NGHVTPHSLTDRKLNASI+FTIKSYNPNKRA+I
Subjt:  EEASASSKGTGQERRSSS----HGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKRATI

Query:  HIHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDN
        H+  M MT+D+MGQ F T+IPTF Q PGNQT   P + V+F+YPFG +K+  L DG+NPEL FSA +SYI+E+W SKRRS+EIYCDR RLKINGSTPFDN
Subjt:  HIHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDN

Query:  TKCKVDL
        TKCKVDL
Subjt:  TKCKVDL

A0A6J1I064 uncharacterized protein LOC1114685196.8e-5857.48Show/hide
Query:  SSSSRGADEEASASSKGTGQERRSSSHG---TGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYN
        SS+S+  D+ +S+SS      +R+SS+G   T  RTR++RIIGRS+LGLM+LVGLA+V CWL+V PK P+FSL  G VT H LTDRKLNAS++F I+S+N
Subjt:  SSSSRGADEEASASSKGTGQERRSSSHG---TGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYN

Query:  PNKRATIHIHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKIN
        PNK+A IHI YM MTI+EMG+KF   +  F Q PGN T+ +P I +DF+YP   L+E+   DG++PEL  SA I YI+  W SK R +EIYC+R  LKIN
Subjt:  PNKRATIHIHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKIN

Query:  GSTPFDNTKCKVDL
        GSTP DN KC VDL
Subjt:  GSTPFDNTKCKVDL

A0A6J1I0B9 uncharacterized protein At1g08160-like3.0e-7470.39Show/hide
Query:  EASASSKGTGQERRSS----SHGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKRATIH
        EAS+SS    +  + S     HGT KRT+LMRIIGRSLL +M LVGLAIVICWL+VFPKTP   L NGHVTPHSLTDRKLNASI+FTIKSYNPNKRA+IH
Subjt:  EASASSKGTGQERRSS----SHGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKRATIH

Query:  IHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDNT
        +  M MT+D+MGQ F T+IPTF Q PGNQT  TP + V+F+YPFG +K+  L +G+NPEL FSA +SYI+E+W SKRRS+EIYCDR RLKINGSTPFDNT
Subjt:  IHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDNT

Query:  KCKVDL
        KCKVDL
Subjt:  KCKVDL

SwissProt top hitse value%identityAlignment
Q8VZ13 Uncharacterized protein At1g081601.2e-0826.18Show/hide
Query:  GKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSL--TDRKLNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQKF-RTSIPT
        G+R   +  I  +L+ L +LVGLAI+I +L + PK  I+++    V   ++   D  +NA  ++ IKSYNP K  ++  H M ++     Q     +I  
Subjt:  GKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSL--TDRKLNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQKF-RTSIPT

Query:  FLQSPGNQTLFTPVILVDFL----YPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGST--PFDNTKCKVDL
        F Q P N+T     ++   +    +    L+    +  +  E+  +A++SY    + S+RR+++  C    + +  S+   F    CK  L
Subjt:  FLQSPGNQTLFTPVILVDFL----YPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGST--PFDNTKCKVDL

Q9SJ52 NDR1/HIN1-like protein 104.3e-0926.77Show/hide
Query:  HGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVT--PHSLTDRKLNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQKFRT-S
        HG G    L+ +  + ++ L++++G+A +I WL+V P+   F + +  +T   H+  D  L  ++  T+   NPNKR  ++   +       G++F T +
Subjt:  HGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVT--PHSLTDRKLNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQKFRT-S

Query:  IPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALE----DGV-NPELRFSAQISYILERWESKRRSVEIYCDRQRLKI---NGSTPFDNT---KCKVD
        +  F Q   N T+ TP      L  F   + R L      GV N E++F  ++ + L   + +R   ++ CD  RL +   NG+T        KC  D
Subjt:  IPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALE----DGV-NPELRFSAQISYILERWESKRRSVEIYCDRQRLKI---NGSTPFDNT---KCKVD

Arabidopsis top hitse value%identityAlignment
AT1G08160.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family8.8e-1026.18Show/hide
Query:  GKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSL--TDRKLNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQKF-RTSIPT
        G+R   +  I  +L+ L +LVGLAI+I +L + PK  I+++    V   ++   D  +NA  ++ IKSYNP K  ++  H M ++     Q     +I  
Subjt:  GKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSL--TDRKLNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQKF-RTSIPT

Query:  FLQSPGNQTLFTPVILVDFL----YPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGST--PFDNTKCKVDL
        F Q P N+T     ++   +    +    L+    +  +  E+  +A++SY    + S+RR+++  C    + +  S+   F    CK  L
Subjt:  FLQSPGNQTLFTPVILVDFL----YPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGST--PFDNTKCKVDL

AT2G35980.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.0e-1026.77Show/hide
Query:  HGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVT--PHSLTDRKLNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQKFRT-S
        HG G    L+ +  + ++ L++++G+A +I WL+V P+   F + +  +T   H+  D  L  ++  T+   NPNKR  ++   +       G++F T +
Subjt:  HGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVT--PHSLTDRKLNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQKFRT-S

Query:  IPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALE----DGV-NPELRFSAQISYILERWESKRRSVEIYCDRQRLKI---NGSTPFDNT---KCKVD
        +  F Q   N T+ TP      L  F   + R L      GV N E++F  ++ + L   + +R   ++ CD  RL +   NG+T        KC  D
Subjt:  IPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALE----DGV-NPELRFSAQISYILERWESKRRSVEIYCDRQRLKI---NGSTPFDNT---KCKVD

AT5G06320.1 NDR1/HIN1-like 31.1e-0424.34Show/hide
Query:  LMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSL---TDRKLNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQKFRTS--IPTFLQ
        ++ +I   L+ + +L+G+A +I WL+  P    F + +  +T  +L    + + N  + FTI+  NPN+R  ++   + +      Q+F  S  I  F Q
Subjt:  LMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSL---TDRKLNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQKFRTS--IPTFLQ

Query:  SPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPEL-----RFSAQISYILERWESKRRSVEIYCDRQRLKINGSTP---FDNTKCKVD
           N T+    ++   L      + + L + VN ++     +   +I +     +S R   +I CD +    + ST    F  TKC VD
Subjt:  SPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPEL-----RFSAQISYILERWESKRRSVEIYCDRQRLKINGSTP---FDNTKCKVD

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family7.7e-0621.39Show/hide
Query:  KRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLT-DRKLNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQ--KFRTSIPTF
        +R  L+  I   +L L+ +  +  +I WL   PK   +++ N  V   +LT D  ++A+  FTI+S+NPN R +++   + + +    Q   F T  P  
Subjt:  KRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLT-DRKLNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQ--KFRTSIPTF

Query:  LQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNP---ELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDNTKCKVDL
              + +   +I  +      + K+   ++ +     E+   A++ + +  W+S  R+ +I C    + ++      N+ C  D+
Subjt:  LQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNP---ELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDNTKCKVDL

AT5G53730.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.9e-0432.61Show/hide
Query:  LAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRK---LNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQKF--RTSIPTFLQSPGNQTLFT
        L I + WL++ P+ P FSL    +   +LT      LN+S+  T+ S NPNK+  I+   + +     GQ+     S+P F QS     L T
Subjt:  LAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRK---LNASITFTIKSYNPNKRATIHIHYMTMTIDEMGQKF--RTSIPTFLQSPGNQTLFT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGATGGTGTCGAGTAGTAGTAGTAGAGGAGCAGATGAAGAAGCATCAGCATCCTCAAAAGGGACAGGACAAGAAAGGAGGTCGTCGAGCCATGGGACAGGAAAGCG
GACAAGACTGATGAGAATCATAGGAAGAAGTTTGTTGGGATTAATGATATTAGTTGGACTTGCAATAGTGATATGTTGGCTTCTGGTGTTCCCAAAAACACCAATATTCA
GTTTGGTAAATGGACATGTTACACCACATAGTTTAACTGATAGAAAGCTAAATGCCTCCATAACTTTCACAATAAAAAGCTACAACCCCAACAAAAGAGCCACCATACAT
ATTCATTATATGACAATGACAATCGATGAGATGGGCCAGAAGTTCCGCACTTCCATTCCCACCTTCTTACAGTCTCCCGGAAACCAGACCCTCTTCACTCCGGTCATCCT
CGTCGACTTTCTTTACCCCTTTGGCCACTTGAAAGAAAGGGCCCTCGAAGACGGCGTCAATCCCGAGCTTCGCTTCTCCGCCCAAATCAGTTACATTCTCGAGAGATGGG
AGTCGAAACGTCGGTCCGTAGAGATCTACTGTGATCGCCAAAGGCTTAAGATCAATGGTTCTACACCTTTTGATAATACCAAATGCAAAGTGGATCTTTGA
mRNA sequenceShow/hide mRNA sequence
CCAACACCCCAAAACGAAGAACAATAAGTTGAATTAGTTGAAAGGCATTGAAGAAGAAGAAGGAAGAAGAAGAAGAAGAAGAAGATGATGATGGTGTCGAGTAGTAGTAG
TAGAGGAGCAGATGAAGAAGCATCAGCATCCTCAAAAGGGACAGGACAAGAAAGGAGGTCGTCGAGCCATGGGACAGGAAAGCGGACAAGACTGATGAGAATCATAGGAA
GAAGTTTGTTGGGATTAATGATATTAGTTGGACTTGCAATAGTGATATGTTGGCTTCTGGTGTTCCCAAAAACACCAATATTCAGTTTGGTAAATGGACATGTTACACCA
CATAGTTTAACTGATAGAAAGCTAAATGCCTCCATAACTTTCACAATAAAAAGCTACAACCCCAACAAAAGAGCCACCATACATATTCATTATATGACAATGACAATCGA
TGAGATGGGCCAGAAGTTCCGCACTTCCATTCCCACCTTCTTACAGTCTCCCGGAAACCAGACCCTCTTCACTCCGGTCATCCTCGTCGACTTTCTTTACCCCTTTGGCC
ACTTGAAAGAAAGGGCCCTCGAAGACGGCGTCAATCCCGAGCTTCGCTTCTCCGCCCAAATCAGTTACATTCTCGAGAGATGGGAGTCGAAACGTCGGTCCGTAGAGATC
TACTGTGATCGCCAAAGGCTTAAGATCAATGGTTCTACACCTTTTGATAATACCAAATGCAAAGTGGATCTTTGAGCTTTGATTTTGGTGTGTGTTCTTCTCTTCTTCAC
CATGTTCGAACTCAATATTACCCCAAAAGATTGTAACTGAAAAGTTTCATCCTTTTTTTTCAGTTCAATTGTTTCTTTATGATTAGAG
Protein sequenceShow/hide protein sequence
MMMVSSSSSRGADEEASASSKGTGQERRSSSHGTGKRTRLMRIIGRSLLGLMILVGLAIVICWLLVFPKTPIFSLVNGHVTPHSLTDRKLNASITFTIKSYNPNKRATIH
IHYMTMTIDEMGQKFRTSIPTFLQSPGNQTLFTPVILVDFLYPFGHLKERALEDGVNPELRFSAQISYILERWESKRRSVEIYCDRQRLKINGSTPFDNTKCKVDL