CuGenDBv2

Lcy06g005580 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy06g005580
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Ankyrin repeat-containing protein
Genome location: Chr06:5149464..5153330
RNA-Seq Expression: Lcy06g005580
Synteny: Lcy06g005580
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR002110 - Ankyrin repeat
  IPR020683 - Ankyrin repeat-containing domain
  IPR036770 - Ankyrin repeat-containing domain superfamily
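
The genome location given in the record above can be split into its parts programmatically. The following is a minimal sketch, assuming the `Chr:start..end` layout shown in this record and assuming 1-based inclusive coordinates (the page itself does not state the coordinate convention); `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a string like 'Chr06:5149464..5153330' into its parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("Chr06:5149464..5153330")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the locus spans 3867 bp.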


Homology
GenBank top hits (e value, % identity, alignment)
KAA0055385.1 ankyrin repeat-containing protein [Cucumis melo var. makuwa] (e value: 1.7e-46; % identity: 58.19)
Query:  AALLNSTTVESQQTIDVDAPSDSEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKY
        A L N     S    + DAPS  E       K FL++ ALKGEW  VE L+++ PHY R  +T+N+ET+LH+AAGAKQT FV++L+HRM+  DMT+QNKY
Subjt:  AALLNSTTVESQQTIDVDAPSDSEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKY

Query:  GNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI
        GNTALCFAA SG+VRIAQL+V KN++LPLIRGF+N+ TPLF+AVSYK   M +YL ++TD+ QL  ++QIELLIA+I
Subjt:  GNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI

KGN64473.1 hypothetical protein Csa_013828 [Cucumis sativus] (e value: 3.9e-67; % identity: 66.82)
Query:  MMKRLRKSQSFPSISNPHYEN--ENGKTISAEHEHDVGDQFLAALLNSTTVESQQTIDVDAPSDSE-FQTSAATKIFLHQYALKGEWEYVELLMDECPHY
        M KR+RKS SFPSI  P+Y +  E  +    E  H   ++  A +            D D PSD   FQ  AAT+IFL+Q ALKGEWEYVELL+DE P+ 
Subjt:  MMKRLRKSQSFPSISNPHYEN--ENGKTISAEHEHDVGDQFLAALLNSTTVESQQTIDVDAPSDSE-FQTSAATKIFLHQYALKGEWEYVELLMDECPHY

Query:  VRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLS
        VRS ITRN+ETILHIAAGAKQ EFV KLL+RM+  DM LQN++GNTALCFAAASGVVRIA+LMVEKN NLPLIRGFNN VTPLFIAVSYKC EMVSYLLS
Subjt:  VRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLS

Query:  ITDLDQLNNQEQIELLIATI
        +TDL+QL  QEQIELLIATI
Subjt:  ITDLDQLNNQEQIELLIATI

XP_008440605.2 PREDICTED: uncharacterized protein LOC103484975 isoform X1 [Cucumis melo] (e value: 1.7e-46; % identity: 58.19)
Query:  AALLNSTTVESQQTIDVDAPSDSEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKY
        A L N     S    + DAPS  E       K FL++ ALKGEW  VE L+++ PHY R  +T+N+ET+LH+AAGAKQT FV++L+HRM+  DMT+QNKY
Subjt:  AALLNSTTVESQQTIDVDAPSDSEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKY

Query:  GNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI
        GNTALCFAA SG+VRIAQL+V KN++LPLIRGF+N+ TPLF+AVSYK   M +YL ++TD+ QL  ++QIELLIA+I
Subjt:  GNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI

XP_011652468.2 uncharacterized protein LOC101218503 isoform X1 [Cucumis sativus] (e value: 3.9e-67; % identity: 66.82)
Query:  MMKRLRKSQSFPSISNPHYEN--ENGKTISAEHEHDVGDQFLAALLNSTTVESQQTIDVDAPSDSE-FQTSAATKIFLHQYALKGEWEYVELLMDECPHY
        M KR+RKS SFPSI  P+Y +  E  +    E  H   ++  A +            D D PSD   FQ  AAT+IFL+Q ALKGEWEYVELL+DE P+ 
Subjt:  MMKRLRKSQSFPSISNPHYEN--ENGKTISAEHEHDVGDQFLAALLNSTTVESQQTIDVDAPSDSE-FQTSAATKIFLHQYALKGEWEYVELLMDECPHY

Query:  VRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLS
        VRS ITRN+ETILHIAAGAKQ EFV KLL+RM+  DM LQN++GNTALCFAAASGVVRIA+LMVEKN NLPLIRGFNN VTPLFIAVSYKC EMVSYLLS
Subjt:  VRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLS

Query:  ITDLDQLNNQEQIELLIATI
        +TDL+QL  QEQIELLIATI
Subjt:  ITDLDQLNNQEQIELLIATI

XP_011652476.2 uncharacterized protein LOC101218503 isoform X2 [Cucumis sativus] (e value: 3.9e-67; % identity: 66.82)
Query:  MMKRLRKSQSFPSISNPHYEN--ENGKTISAEHEHDVGDQFLAALLNSTTVESQQTIDVDAPSDSE-FQTSAATKIFLHQYALKGEWEYVELLMDECPHY
        M KR+RKS SFPSI  P+Y +  E  +    E  H   ++  A +            D D PSD   FQ  AAT+IFL+Q ALKGEWEYVELL+DE P+ 
Subjt:  MMKRLRKSQSFPSISNPHYEN--ENGKTISAEHEHDVGDQFLAALLNSTTVESQQTIDVDAPSDSE-FQTSAATKIFLHQYALKGEWEYVELLMDECPHY

Query:  VRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLS
        VRS ITRN+ETILHIAAGAKQ EFV KLL+RM+  DM LQN++GNTALCFAAASGVVRIA+LMVEKN NLPLIRGFNN VTPLFIAVSYKC EMVSYLLS
Subjt:  VRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLS

Query:  ITDLDQLNNQEQIELLIATI
        +TDL+QL  QEQIELLIATI
Subjt:  ITDLDQLNNQEQIELLIATI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LRD1 ANK_REP_REGION domain-containing protein (e value: 4.1e-46; % identity: 58.56)
Query:  AALLNSTTVESQQTIDVDAPSDSE----FQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTL
        A L N TT E  +TI  D   DS+     +   + ++ L++ ALKG+W+  EL++++ PHYVR  ITRNKET+LH+AAGAKQ+ FVE+L+ RMT  DM L
Subjt:  AALLNSTTVESQQTIDVDAPSDSE----FQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTL

Query:  QNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI
        ++KYGNTALCFAA S +V+IA+LMVEKN  LPLIR F    TPL IAVSYK  +M+SYLLS+TDL QL  QE+IELLIATI
Subjt:  QNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI

A0A0A0LRT6 ANK_REP_REGION domain-containing protein (e value: 1.9e-67; % identity: 66.82)
Query:  MMKRLRKSQSFPSISNPHYEN--ENGKTISAEHEHDVGDQFLAALLNSTTVESQQTIDVDAPSDSE-FQTSAATKIFLHQYALKGEWEYVELLMDECPHY
        M KR+RKS SFPSI  P+Y +  E  +    E  H   ++  A +            D D PSD   FQ  AAT+IFL+Q ALKGEWEYVELL+DE P+ 
Subjt:  MMKRLRKSQSFPSISNPHYEN--ENGKTISAEHEHDVGDQFLAALLNSTTVESQQTIDVDAPSDSE-FQTSAATKIFLHQYALKGEWEYVELLMDECPHY

Query:  VRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLS
        VRS ITRN+ETILHIAAGAKQ EFV KLL+RM+  DM LQN++GNTALCFAAASGVVRIA+LMVEKN NLPLIRGFNN VTPLFIAVSYKC EMVSYLLS
Subjt:  VRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLS

Query:  ITDLDQLNNQEQIELLIATI
        +TDL+QL  QEQIELLIATI
Subjt:  ITDLDQLNNQEQIELLIATI

A0A1S3B243 uncharacterized protein LOC103484975 isoform X1 (e value: 8.3e-47; % identity: 58.19)
Query:  AALLNSTTVESQQTIDVDAPSDSEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKY
        A L N     S    + DAPS  E       K FL++ ALKGEW  VE L+++ PHY R  +T+N+ET+LH+AAGAKQT FV++L+HRM+  DMT+QNKY
Subjt:  AALLNSTTVESQQTIDVDAPSDSEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKY

Query:  GNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI
        GNTALCFAA SG+VRIAQL+V KN++LPLIRGF+N+ TPLF+AVSYK   M +YL ++TD+ QL  ++QIELLIA+I
Subjt:  GNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI

A0A1S4DY91 uncharacterized protein LOC103484975 isoform X2 (e value: 8.3e-47; % identity: 58.19)
Query:  AALLNSTTVESQQTIDVDAPSDSEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKY
        A L N     S    + DAPS  E       K FL++ ALKGEW  VE L+++ PHY R  +T+N+ET+LH+AAGAKQT FV++L+HRM+  DMT+QNKY
Subjt:  AALLNSTTVESQQTIDVDAPSDSEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKY

Query:  GNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI
        GNTALCFAA SG+VRIAQL+V KN++LPLIRGF+N+ TPLF+AVSYK   M +YL ++TD+ QL  ++QIELLIA+I
Subjt:  GNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI

A0A5A7UP23 Ankyrin repeat-containing protein (e value: 8.3e-47; % identity: 58.19)
Query:  AALLNSTTVESQQTIDVDAPSDSEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKY
        A L N     S    + DAPS  E       K FL++ ALKGEW  VE L+++ PHY R  +T+N+ET+LH+AAGAKQT FV++L+HRM+  DMT+QNKY
Subjt:  AALLNSTTVESQQTIDVDAPSDSEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKY

Query:  GNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI
        GNTALCFAA SG+VRIAQL+V KN++LPLIRGF+N+ TPLF+AVSYK   M +YL ++TD+ QL  ++QIELLIA+I
Subjt:  GNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI

SwissProt top hits (e value, % identity, alignment)
P0C6S7 Ankyrin repeat and sterile alpha motif domain-containing protein 1B (e value: 2.9e-04; % identity: 34.04)
Query:  LHQYALKGEWEYVELLMDECPHYVRSPITRNK-ETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNL
        +H  A KG+ E V++L+   P + R     N+ ET LH AA    +E V  LL  +T  D T++N    T L  AA  G +R+ ++++  + NL
Subjt:  LHQYALKGEWEYVELLMDECPHYVRSPITRNK-ETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNL

P16157 Ankyrin-1 (e value: 2.9e-04; % identity: 34.43)
Query:  LHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFN
        LH  A  G    V+LL++   +   +  T    T LHIAA     E V  LL +   A      K G T L  AA  G VR+A+L++E++ + P   G N
Subjt:  LHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFN

Query:  NIVTPLFIAVSYKCIEMVSYLL
         + TPL +AV +  +++V  LL
Subjt:  NIVTPLFIAVSYKCIEMVSYLL

Q1RI31 Putative ankyrin repeat protein RBE_0902 (e value: 3.4e-05; % identity: 27.98)
Query:  ALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVT
        A  G  +  E L+ +      + +T N +T+L +AA     +  E L+ +MT   +   NK GNTAL  AA+S + +I + ++ K  +   I   NN   
Subjt:  ALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVT

Query:  PLFIAVSYKCIEMVSYLL--SITD--LDQLNNQEQIELLIAT-ILLCGICRFLICSRQFTAIDDFCLN
           IA +   +E V   L   +T+  ++Q N+Q    L+ A    L  +C  LI    + AI+ +  N
Subjt:  PLFIAVSYKCIEMVSYLL--SITD--LDQLNNQEQIELLIAT-ILLCGICRFLICSRQFTAIDDFCLN

Q7Z6G8 Ankyrin repeat and sterile alpha motif domain-containing protein 1B (e value: 2.9e-04; % identity: 34.04)
Query:  LHQYALKGEWEYVELLMDECPHYVRSPITRNK-ETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNL
        +H  A KG+ E V++L+   P + R     N+ ET LH AA    +E V  LL  +T  D T++N    T L  AA  G +R+ ++++  + NL
Subjt:  LHQYALKGEWEYVELLMDECPHYVRSPITRNK-ETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNL

Q8BIZ1 Ankyrin repeat and sterile alpha motif domain-containing protein 1B (e value: 2.9e-04; % identity: 34.04)
Query:  LHQYALKGEWEYVELLMDECPHYVRSPITRNK-ETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNL
        +H  A KG+ E V++L+   P + R     N+ ET LH AA    +E V  LL  +T  D T++N    T L  AA  G +R+ ++++  + NL
Subjt:  LHQYALKGEWEYVELLMDECPHYVRSPITRNK-ETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNL

Arabidopsis top hits (e value, % identity, alignment)
AT3G18670.1 Ankyrin repeat family protein (e value: 4.7e-10; % identity: 28.66)
Query:  TIDVDAPSD---SEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTL--QNKYGNTALCFA
        ++ +D  +D    E +   +T + L +    GE E  +  +D  P  + + +T N +T +H A  +   + VE+++ R+   +  L  +N  G TAL +A
Subjt:  TIDVDAPSD---SEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTL--QNKYGNTALCFA

Query:  AASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLN
        A  G+VRIA+ +V K   L  +R     + P+ +A  Y    +V YL S T L  L+
Subjt:  AASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLN

AT3G54070.1 Ankyrin repeat family protein (e value: 8.5e-20; % identity: 37.33)
Query:  SEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVE
        SE  T   ++  +++  L G+W+    L+      V   IT N E  LHIA  AK  +FV  LL  M   D++L+NK GNT L FAAA G +  A++++ 
Subjt:  SEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVE

Query:  KNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIEL
          ++LP I      +TP+ IA  Y   EMV YL S T +  LN+Q+ + L
Subjt:  KNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIEL

AT4G05040.1 ankyrin repeat family protein (e value: 2.0e-05; % identity: 27.4)
Query:  LHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLL-------HRMTSAD------MTLQNKYGNTALCFAAASGVVRIAQLMV
        LH  A  G  E V+ ++ ECP  V   +    +  LH+AA A  +  VE L+        R+   D        L++KYGNTAL  A     + +A  +V
Subjt:  LHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLL-------HRMTSAD------MTLQNKYGNTALCFAAASGVVRIAQLMV

Query:  EKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQ
         +N+N   +   N  ++ L++AV    + +V  +L     + L  +
Subjt:  EKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQ

AT4G05040.2 ankyrin repeat family protein (e value: 2.0e-05; % identity: 27.4)
Query:  LHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLL-------HRMTSAD------MTLQNKYGNTALCFAAASGVVRIAQLMV
        LH  A  G  E V+ ++ ECP  V   +    +  LH+AA A  +  VE L+        R+   D        L++KYGNTAL  A     + +A  +V
Subjt:  LHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLL-------HRMTSAD------MTLQNKYGNTALCFAAASGVVRIAQLMV

Query:  EKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQ
         +N+N   +   N  ++ L++AV    + +V  +L     + L  +
Subjt:  EKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQ

AT5G35830.1 Ankyrin repeat family protein (e value: 7.2e-27; % identity: 44.59)
Query:  ATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPL
        A  + L+Q ALKG+W+    ++ E  + +   IT   ET+LHIA  AK   FV  LL  + S D+ L+N  GNTALCFAAASGVV IA++++EKNK+LP+
Subjt:  ATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPL

Query:  IRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI
        IRG     TP+ +A  +   EMV YL   T   + N++E + L  A I
Subjt:  IRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATI
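
In the alignment blocks above, the unlabeled middle row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Because the Subjt rows in this listing repeat the Query rows, the middle row is the only one that carries match information. The sketch below counts identities in a single block from that row; `midline_identity` and the 8-column prefix width are illustrative assumptions based on the layout shown here. The % identity values reported for each hit cover the full alignment, so a single block will generally differ from them.

```python
def midline_identity(query_row, midline_row, prefix_len=8):
    """Count identical positions in one alignment block from its midline.

    prefix_len is the width of the 'Query:  ' label column (assumed to be 8).
    """
    query = query_row[prefix_len:]
    # Pad the midline in case trailing spaces were stripped during extraction.
    midline = midline_row[prefix_len:].ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(query), 100.0 * identical / len(query)


# Last block of the Cucumis sativus hit, copied from the alignment above.
query_row = "Query:  ITDLDQLNNQEQIELLIATI"
midline_row = "        +TDL+QL  QEQIELLIATI"
print(midline_identity(query_row, midline_row))  # (16, 20, 80.0)
```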


Sequences
CDS sequence
ATGATGATGAAGCGTTTGAGAAAGTCACAGAGTTTCCCTTCAATTTCAAATCCACACTACGAAAATGAAAATGGGAAAACTATATCTGCTGAACATGAACATGATGTTGG
AGACCAATTTCTTGCCGCTCTGTTGAACAGCACGACAGTGGAGAGCCAGCAGACAATAGATGTCGATGCACCCTCGGATTCCGAATTCCAAACTAGTGCTGCAACGAAAA
TTTTTCTGCATCAATATGCACTAAAGGGTGAGTGGGAATATGTGGAATTACTAATGGACGAGTGCCCACATTATGTTCGTTCCCCAATAACAAGAAACAAAGAGACCATT
CTTCATATTGCTGCGGGAGCCAAACAAACTGAATTTGTGGAGAAATTGCTGCACAGAATGACCTCTGCTGACATGACTCTGCAAAACAAATATGGAAACACAGCCCTTTG
TTTTGCTGCTGCTTCGGGAGTTGTAAGAATTGCTCAGCTAATGGTGGAAAAGAACAAAAATCTTCCACTTATTCGTGGCTTCAACAATATTGTGACTCCACTTTTCATTG
CTGTATCATACAAGTGCATAGAGATGGTTTCTTATCTCTTGTCTATCACTGATCTCGACCAACTAAACAACCAAGAACAAATCGAGCTTCTTATTGCCACCATATTACTC
TGCGGGATATGTCGTTTTTTAATTTGCTCGAGACAATTTACGGCCATTGATGATTTTTGCTTGAACTGTCACCTGGTCTCCCATGACGAAAAATGTCTCTGCAACTAG
mRNA sequence
AAAAGGAGCTTTAATAATGTTGTCTGCCATTAGTTTGCCTGCGGAGATATTTTGTGAGGCAGCCATCATATGGGTTTTGCTGCCAAAAGTTGAACTTGAGCCGCCGTCCA
TCATCTGTGTTTGGTCCGATGATGATGAAGCGTTTGAGAAAGTCACAGAGTTTCCCTTCAATTTCAAATCCACACTACGAAAATGAAAATGGGAAAACTATATCTGCTGA
ACATGAACATGATGTTGGAGACCAATTTCTTGCCGCTCTGTTGAACAGCACGACAGTGGAGAGCCAGCAGACAATAGATGTCGATGCACCCTCGGATTCCGAATTCCAAA
CTAGTGCTGCAACGAAAATTTTTCTGCATCAATATGCACTAAAGGGTGAGTGGGAATATGTGGAATTACTAATGGACGAGTGCCCACATTATGTTCGTTCCCCAATAACA
AGAAACAAAGAGACCATTCTTCATATTGCTGCGGGAGCCAAACAAACTGAATTTGTGGAGAAATTGCTGCACAGAATGACCTCTGCTGACATGACTCTGCAAAACAAATA
TGGAAACACAGCCCTTTGTTTTGCTGCTGCTTCGGGAGTTGTAAGAATTGCTCAGCTAATGGTGGAAAAGAACAAAAATCTTCCACTTATTCGTGGCTTCAACAATATTG
TGACTCCACTTTTCATTGCTGTATCATACAAGTGCATAGAGATGGTTTCTTATCTCTTGTCTATCACTGATCTCGACCAACTAAACAACCAAGAACAAATCGAGCTTCTT
ATTGCCACCATATTACTCTGCGGGATATGTCGTTTTTTAATTTGCTCGAGACAATTTACGGCCATTGATGATTTTTGCTTGAACTGTCACCTGGTCTCCCATGACGAAAA
ATGTCTCTGCAACTAGTTTGGACTGTGGTTGGTCATTTAAGGAAAGTGGTGGCAATGTATGTAGCTGATTTTGTTCCTCCATTGTTTTCCTTTTGTTTGTTGTGTTGGGA
AAATGGCGGCGGGAGTTCACTTCAGCAAAATTTCATTGTATTTTTCAGATATCAAGAGAAAATGATGCTGCAAACTCGGAAAGTTGAAAACGATGTAGTTTCAAAATAAA
GCAAATTACTTTTTGTGGAGAAAACTGCTTCCAACCCTATTAATTCCTTTACATAATTCGTTTTGTGGGGTGTTTCGAGATAGGAAAGAATTTGCAGAAGAATAGGGAAA
AGGGTACCAAAAACAGAAAGATTTTGATGAGAAAAATTGGTTTTAGTTTCCTTTTTAAACTGCAGAGGCAGACAAATTCAAGGGATTTTCAGAAGCCTGAAACCATTAGA
GACCAGAAGAAGAAAAGAAAGAGTTTCAGTGGAAGAGCAGAATTGTAGAATGCCAGAAGTTTGGTGTACTCTGAAGAAATCATTATCCTGCACCAAATCATTCCTATGTG
ACGTTCATGAGCCAGTAGCCATTGGTGATTCAATCGCCAAGGAAAGAACAGAGCGAGACTTGGGTGGATGCTTAAGGTCTAAATCTAACCTCAGAGACATCATTTGTGGA
AGCAAAAGACACTCGCAGAAGCCATCGCCAAGTTCCAGCACGAGATCAATGGCGACCAGTGAAGTTCTCCATACCATAATCCATGAAATTGAAGCCCAAATGAAAACAGA
TAAAATATCTGTTCCCCAGGAAGAAAAGGTCAGTCTTAAGCCTGATTCAACTTCAATGAAACTCTACACAAATTCTGTTACAAGAAATGGCCAGGTCAGTTCTGCAACTT
CAGACAGAATTGAGCAATCCTACGACGATTGTGGATTGATTTGCCAAGAATGTGGTGGGGTTTTTCAGAACTCGGATGCCGTCGAGTCACATAATCTTTCCAAGCATGCT
GGTAATGAAAGATTAAACTGTTTCTTTTGTTTTTCGAATTCTTCAAATTCTAGATTATAATGAGGTTTTCATGTTGCAGTCAAGGAAATAGTACAAGGGGATTCATCCAA
AAAAGTGATAGAATTAATCTGCCAAAGAAATTGGCCAATGTCCAAGTCCCATCACATTGAGAAGGTTTTCAAAGTCTGCAACTCGCCTAAAACTCTCTCTCTGTTTGAAG
AACACCGAGAAATGGTGAAAACCAAAGCAAGCAAACTGGAGAAGGAAAACCAACGTTGTTTAGTCGATGGAAACGAGCTCTTGAGGTTCCATGGCACAACAATCGCATGC
TCATTAGCCACTAATGGCTCTCAGATTCTCTGTAATTTGGACAACTGTGGAGTTTGCCAAATTTTAAGACATGGATTCAATGGTGTGTTTACTTGTGCAACAAGTGGAAG
AGCCTTCGAATCTATTGCGATGAATGAAGAAGATGGTGGCTTAAGAAGAGCTTTAATGGTGTGCAGAGTGATAGCAGGGAGGATCTATGATGGATGCATGGAGGAAGATA
AAGTGAATACAAATTCCGGGTTCGATTCATCGAGTGGAAAAACAGGTCAGAATTCAAAACTAGAACTTTTTGTCTTCAATCCTAGAGCTGTACTCCCTTGTTTTGTAGTA
ACTTACACACGGTAGATCATTGGATGAGGTTATTGTTGTTTATGTAGATTCAAAATTATATGTCCCTGATAATCACTTCATATAAATTCAAAATACGTTTTTGAATGATG
CAAAGCTGGCTCTCT
Protein sequence
MMMKRLRKSQSFPSISNPHYENENGKTISAEHEHDVGDQFLAALLNSTTVESQQTIDVDAPSDSEFQTSAATKIFLHQYALKGEWEYVELLMDECPHYVRSPITRNKETI
LHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVEKNKNLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATILL
CGICRFLICSRQFTAIDDFCLNCHLVSHDEKCLCN
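
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for illustration, and the input shown is only the first 30 nucleotides of the CDS.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGATGATGAAGCGTTTGAGAAAGTCACAG"))  # MMMKRLRKSQ
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence gives a quick consistency check on the record.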