; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0000763 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0000763
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionDUF1336 domain-containing protein
Genome locationchr10:2713342..2716690
RNA-Seq ExpressionIVF0000763
SyntenyIVF0000763
Gene Ontology termsNA
InterPro domainsIPR009769 - Protein ENHANCED DISEASE RESISTANCE 2, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135210.1 uncharacterized protein LOC101206832 [Cucumis sativus]5.22e-28890.02Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS
        MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINS E AS
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS

Query:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC
        +SSISSSGDANHGDHNVNRHSAT DQIHRPGNSARVHSVSSSESQVARDSH QA+NPDDAEPQLKGCGHSSE NEPVFIDEISSTAGESSAKGDGILDNC
Subjt:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC

Query:  GILPSNCLPCLATTITLL-KEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD
        GILPSNCLPCLA+TI  + K KSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYA KD
Subjt:  GILPSNCLPCLATTITLL-KEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD

Query:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE
        KKKEFAPNH AYYPFGVDVFLSHRKVDHIARFVEMP AT SGTLPPILVVN  IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAE+LTSHFQENI+  I+
Subjt:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE

Query:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
                              ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
Subjt:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL

XP_008446285.1 PREDICTED: uncharacterized protein LOC103489062 [Cucumis melo]2.31e-29892.84Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS
        MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS

Query:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC
        SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC
Subjt:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC

Query:  GILPSNCLPCLATTITLL-KEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD
        GILPSNCLPCLATTI  + K KSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYA KD
Subjt:  GILPSNCLPCLATTITLL-KEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD

Query:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE
        KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN  IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIR  I+
Subjt:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE

Query:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
                              ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
Subjt:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL

XP_022151157.1 uncharacterized protein LOC111019152 isoform X1 [Momordica charantia]7.70e-26082.68Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQ-GSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGA
        MGACVSTP+GCVGGKFKKSS++KNRR+RR+G KT AFS +S+GSHRSDPIDH SFSNPTFQ GS DEAWFDTV KFESDCDEDYQS+PDD QSI S EGA
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQ-GSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGA

Query:  SSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDN
        S+SS+SSSGDANHGDHN +RHSATPDQIHRPGNSARVHSVSSS  QV+RDSHSQ MN DDAEPQ KG GH SE NEPVFIDEISSTAGESS KGDGILDN
Subjt:  SSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDN

Query:  CGILPSNCLPCLATTITLL-KEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASK
        CGILPSNCLPCLA+TI  + K KSLSSSPPSGLKKAALKLSFKWKEGN NAALFSSKALLQRP+AGSQVPFCPA KKMLDCWSHIEP SFKVRGVNYA K
Subjt:  CGILPSNCLPCLATTITLL-KEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASK

Query:  DKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTI
        DKKKEFA +HAAY PFGVDVFLSHRKVDHIARFVE+P A YSG LPPILVVN  IPLY AAIFQGETDGEG+SIVLYFKLSDAYA+ELTSHFQENIR  I
Subjt:  DKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTI

Query:  E----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
        +                      ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
Subjt:  E----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL

XP_022151158.1 uncharacterized protein LOC111019152 isoform X2 [Momordica charantia]1.11e-26182.86Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS
        MGACVSTP+GCVGGKFKKSS++KNRR+RR+G KT AFS +S+GSHRSDPIDH SFSNPTFQGS DEAWFDTV KFESDCDEDYQS+PDD QSI S EGAS
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS

Query:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC
        +SS+SSSGDANHGDHN +RHSATPDQIHRPGNSARVHSVSSS  QV+RDSHSQ MN DDAEPQ KG GH SE NEPVFIDEISSTAGESS KGDGILDNC
Subjt:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC

Query:  GILPSNCLPCLATTITLL-KEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD
        GILPSNCLPCLA+TI  + K KSLSSSPPSGLKKAALKLSFKWKEGN NAALFSSKALLQRP+AGSQVPFCPA KKMLDCWSHIEP SFKVRGVNYA KD
Subjt:  GILPSNCLPCLATTITLL-KEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD

Query:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE
        KKKEFA +HAAY PFGVDVFLSHRKVDHIARFVE+P A YSG LPPILVVN  IPLY AAIFQGETDGEG+SIVLYFKLSDAYA+ELTSHFQENIR  I+
Subjt:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE

Query:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
                              ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
Subjt:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL

XP_038892099.1 uncharacterized protein LOC120081366 isoform X1 [Benincasa hispida]2.35e-28488.94Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS
        MGACVSTP+GCVG KFKKSSKRKNRRRRRKGSKT AFSALS+GSHRSDPIDHCSFSNPTFQGS DEAWFDTVGKFESDCDEDYQSLPDD QSINSFEGAS
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS

Query:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC
        +SSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVS+SESQVAR+SHSQAMNPDDAEPQLKG GHSSE NEPVFIDEISSTAGESSAKGDGILDNC
Subjt:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC

Query:  GILPSNCLPCLATTITLL-KEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD
        GILPSNCLPCLA+TI  + K K+LSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYA KD
Subjt:  GILPSNCLPCLATTITLL-KEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD

Query:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE
        KKKEFAP+HAAYYPFGVDVFLSHRKVDHIARFVE+P A YSGTLPPILVVN  IPLY AAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIR  I+
Subjt:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE

Query:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
                              ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
Subjt:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL

TrEMBL top hitse value%identityAlignment
A0A0A0KVS2 DUF1336 domain-containing protein1.3e-22690.02Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS
        MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINS E AS
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS

Query:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC
        +SSISSSGDANHGDHNVNRHSAT DQIHRPGNSARVHSVSSSESQVARDSH QA+NPDDAEPQLKGCGHSSE NEPVFIDEISSTAGESSAKGDGILDNC
Subjt:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC

Query:  GILPSNCLPCLATTI-TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD
        GILPSNCLPCLA+TI ++ K KSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYA KD
Subjt:  GILPSNCLPCLATTI-TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD

Query:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENI-----
        KKKEFAPNH AYYPFGVDVFLSHRKVDHIARFVEMP AT SGTLPPILVVN  IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAE+LTSHFQENI     
Subjt:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENI-----

Query:  -----------------RGTIEILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
                         R  ++ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
Subjt:  -----------------RGTIEILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL

A0A1S3BE73 uncharacterized protein LOC1034890621.6e-23492.84Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS
        MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS

Query:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC
        SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC
Subjt:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC

Query:  GILPSNCLPCLATTI-TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD
        GILPSNCLPCLATTI ++ K KSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYA KD
Subjt:  GILPSNCLPCLATTI-TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD

Query:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE
        KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN  IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIR  I+
Subjt:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE

Query:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
                              ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
Subjt:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL

A0A5A7SYL5 DUF1336 domain-containing protein2.6e-20082.43Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS
        MGACVSTPQGCVG                                                GSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS

Query:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC
        SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC
Subjt:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC

Query:  GILPSNCLPCLATTI-TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD
        GILPSNCLPCLATTI ++ K KSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYA KD
Subjt:  GILPSNCLPCLATTI-TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD

Query:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE
        KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN  IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIR  I+
Subjt:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE

Query:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
                              ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
Subjt:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL

A0A6J1DBF2 uncharacterized protein LOC111019152 isoform X21.4e-20682.86Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS
        MGACVSTP+GCVGGKFKKSS++KNRR+RR+G KT AFS +S+GSHRSDPIDH SFSNPTFQGS DEAWFDTV KFESDCDEDYQS+PDD QSI S EGAS
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGAS

Query:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC
        +SS+SSSGDANHGDHN +RHSATPDQIHRPGNSARVHSVSSS  QV+RDSHSQ MN DDAEPQ KG GH SE NEPVFIDEISSTAGESS KGDGILDNC
Subjt:  SSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNC

Query:  GILPSNCLPCLATTI-TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD
        GILPSNCLPCLA+TI ++ K KSLSSSPPSGLKKAALKLSFKWKEGN NAALFSSKALLQRP+AGSQVPFCPA KKMLDCWSHIEP SFKVRGVNYA KD
Subjt:  GILPSNCLPCLATTI-TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKD

Query:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE
        KKKEFA +HAAY PFGVDVFLSHRKVDHIARFVE+P A YSG LPPILVVN  IPLY AAIFQGETDGEG+SIVLYFKLSDAYA+ELTSHFQENIR  I+
Subjt:  KKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIE

Query:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
                              ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
Subjt:  ----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL

A0A6J1DCQ8 uncharacterized protein LOC111019152 isoform X13.5e-20582.68Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQ-GSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGA
        MGACVSTP+GCVGGKFKKSS++KNRR+RR+G KT AFS +S+GSHRSDPIDH SFSNPTFQ GS DEAWFDTV KFESDCDEDYQS+PDD QSI S EGA
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQ-GSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGA

Query:  SSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDN
        S+SS+SSSGDANHGDHN +RHSATPDQIHRPGNSARVHSVSSS  QV+RDSHSQ MN DDAEPQ KG GH SE NEPVFIDEISSTAGESS KGDGILDN
Subjt:  SSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDN

Query:  CGILPSNCLPCLATTI-TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASK
        CGILPSNCLPCLA+TI ++ K KSLSSSPPSGLKKAALKLSFKWKEGN NAALFSSKALLQRP+AGSQVPFCPA KKMLDCWSHIEP SFKVRGVNYA K
Subjt:  CGILPSNCLPCLATTI-TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASK

Query:  DKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTI
        DKKKEFA +HAAY PFGVDVFLSHRKVDHIARFVE+P A YSG LPPILVVN  IPLY AAIFQGETDGEG+SIVLYFKLSDAYA+ELTSHFQENIR  I
Subjt:  DKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTI

Query:  E----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
        +                      ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
Subjt:  E----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10410.1 Protein of unknown function (DUF1336)1.2e-10748.81Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSA-LSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGA
        MG CVSTP+ CVG K + S +RK+RRRR+   K  A S+ LS+GS  +    H +FSNP+ + + ++AWF++   FE+DCD+D+ S+ +D  S+N  E  
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSA-LSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGA

Query:  SSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDN
        S SS +++      D N                                          +   Q K  G  ++ N+P  ID         S+  +G+L+N
Subjt:  SSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDN

Query:  CGILPSNCLPCLATTI--TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYAS
        C ILPSNCLPCL TT   ++ K +SLSSSPPS  KK++L+LS+KW+EG+ + ALF SK  L+RPIAGSQVPFCP +KKMLDCWS I+P+SF+VRG  Y  
Subjt:  CGILPSNCLPCLATTI--TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYAS

Query:  KDKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGT
        ++KKKEFAP+HAAY PFGVDVFLS  K+ H+A++V++P  T S  LP ILVVN  IPLY  AIFQGE+DGEGM+IVLYFKLSD Y++EL  HFQE+IR  
Subjt:  KDKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGT

Query:  IE----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL
        I+                      ILGRVANV+DL +S  E+KLMQAYNEKPVLSRPQHEFYL
Subjt:  IE----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYL

AT1G59650.1 Protein of unknown function (DUF1336)1.8e-11352.38Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRR-KGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGA
        MG CVSTP+ CVGGK + S +RK R RR+ +  + ++ S LS+GS           +NPTF+ S DEAWFD+   FE+DCD+D+ S+ +D  S+N  E  
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRR-KGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGA

Query:  SSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDN
        S SS+SS  D+N G                                 AR+S S  ++   +E  L       +  + VFIDEISS A  SS K +G+L+N
Subjt:  SSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDN

Query:  CGILPSNCLPCLATTI-TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASK
        CGILPSNCLPCL +T+ ++ K +SLSSSPPS  KKAA+KLSFKW+EG+P   LFS+   LQRP+AGSQVPFCP EKKM D WS IEP SF+VR   Y  +
Subjt:  CGILPSNCLPCLATTI-TLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASK

Query:  DKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGT-LPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENI---
        DKKKE APN+AAY PFGVDVFLS RKV+HIA++VE+P  T + T LP ILVVN  IPLY AAIF GETDGEGM+ VLYFKLSD Y +EL  HFQE+I   
Subjt:  DKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGT-LPPILVVN--IPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENI---

Query:  -------------------RGTIEILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFY
                           R  ++ILGRVANV+DL ++ AE+KLM AYNEKPVLSRPQHEFY
Subjt:  -------------------RGTIEILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFY

AT3G29180.1 Protein of unknown function (DUF1336)1.9e-4933.97Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNP-------TFQGSYDEAWFDTVGKFESDCDEDYQSLPDDN-QS
        MG CVST    +         R  R+ RR+ SK          S  SD + H +   P       +F  S D+AWFD+V   +SD DED+ SLP++N  S
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNP-------TFQGSYDEAWFDTVGKFESDCDEDYQSLPDDN-QS

Query:  INSFEGASSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVH-SVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSA
          S  GA+ ++I +          V +  ++   +   G     H +    +   A    S+ M  D +     G    +  N+   +D  +S  G    
Subjt:  INSFEGASSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVH-SVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSA

Query:  KGDGILDNCGILPSNCLPCLATTITLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVR
        K +          S  +P ++       +K+L+S      K A  +LSFK +       +   + LL RP AG  +P    EK+    WS I P +FK+R
Subjt:  KGDGILDNCGILPSNCLPCLATTITLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVR

Query:  GVNYASKDKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVNI--PLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQ
        G  Y  KDKKK  APN   Y P GVD+F+  RK+DHIA+ +E+P       LP +LVVNI  P Y AA+F G++DGEGMSIVLYFKL D + +E +  +Q
Subjt:  GVNYASKDKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVNI--PLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQ

Query:  ENI----------------------RGTIEILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFY
        E+I                      R  ++I+  + N EDL +S+ E+KL+QAYNEKPVLSRPQH F+
Subjt:  ENI----------------------RGTIEILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFY

AT3G29180.2 Protein of unknown function (DUF1336)1.9e-4933.97Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNP-------TFQGSYDEAWFDTVGKFESDCDEDYQSLPDDN-QS
        MG CVST    +         R  R+ RR+ SK          S  SD + H +   P       +F  S D+AWFD+V   +SD DED+ SLP++N  S
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNP-------TFQGSYDEAWFDTVGKFESDCDEDYQSLPDDN-QS

Query:  INSFEGASSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVH-SVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSA
          S  GA+ ++I +          V +  ++   +   G     H +    +   A    S+ M  D +     G    +  N+   +D  +S  G    
Subjt:  INSFEGASSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVH-SVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSA

Query:  KGDGILDNCGILPSNCLPCLATTITLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVR
        K +          S  +P ++       +K+L+S      K A  +LSFK +       +   + LL RP AG  +P    EK+    WS I P +FK+R
Subjt:  KGDGILDNCGILPSNCLPCLATTITLLKEKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVR

Query:  GVNYASKDKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVNI--PLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQ
        G  Y  KDKKK  APN   Y P GVD+F+  RK+DHIA+ +E+P       LP +LVVNI  P Y AA+F G++DGEGMSIVLYFKL D + +E +  +Q
Subjt:  GVNYASKDKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVNI--PLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQ

Query:  ENI----------------------RGTIEILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFY
        E+I                      R  ++I+  + N EDL +S+ E+KL+QAYNEKPVLSRPQH F+
Subjt:  ENI----------------------RGTIEILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFY

AT5G39430.1 Protein of unknown function (DUF1336)1.4e-3932.83Show/hide
Query:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPD-DNQSINSFEGA
        MG C+ST              R  R+ RR+ SK I  S +S+    SD     SF       S ++AWFD+   F SD D+D+ SL + DN  +   EG 
Subjt:  MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPD-DNQSINSFEGA

Query:  SSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILD-
            I +          V    A+   +   GN    H     ES +  D  ++       +    G    + G   +  +        SS KG   LD 
Subjt:  SSSSISSSGDANHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILD-

Query:  --NCGILPSNCLPCLATTITLLKEKSLSSSPPSGLKKAALKLSFKWK--EGNPNAALFSSKALLQRPIAGSQVPFCPAEK-KMLDCWSHIEPDSFKVRGV
              L SN    +        +K+L+S      K A  ++SFK +  +G       SSK LL RP AG  +P    EK +    W  I P + K+RG 
Subjt:  --NCGILPSNCLPCLATTITLLKEKSLSSSPPSGLKKAALKLSFKWK--EGNPNAALFSSKALLQRPIAGSQVPFCPAEK-KMLDCWSHIEPDSFKVRGV

Query:  NYASKDKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVNI--PLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQEN
         Y  KDK+K  APN   Y P GVD+F+  RK+DHIA+ +E+P       LP +L+VNI  P Y AA+F G+++GEGMSIVLYFKL + +  E++  +Q++
Subjt:  NYASKDKKKEFAPNHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVNI--PLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQEN

Query:  IRGTIE----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFY
        I+  +E                      I+  + N ++L +S+ E+KL+QAYNEKPVLSRPQH F+
Subjt:  IRGTIE----------------------ILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGCTTGCGTTTCAACGCCGCAGGGCTGCGTCGGAGGTAAATTTAAGAAATCTTCTAAGAGAAAGAACCGCCGTAGAAGGAGGAAAGGTTCCAAAACCATCGCCTT
TTCTGCTCTCTCTGAGGGATCGCACCGCTCTGACCCGATTGACCACTGCTCATTTTCCAATCCCACTTTTCAAGGAAGTTATGATGAGGCATGGTTTGATACAGTTGGCA
AATTCGAATCGGATTGCGATGAAGATTACCAAAGCCTTCCTGATGACAATCAGTCAATTAATAGCTTTGAGGGTGCATCATCGTCGAGCATTTCATCTTCTGGCGATGCT
AATCATGGAGATCATAATGTGAATCGACATAGCGCAACCCCGGATCAGATTCATAGACCAGGAAATTCGGCAAGAGTACATTCTGTGAGCAGTTCCGAGAGTCAAGTTGC
AAGGGATTCACATTCGCAGGCCATGAATCCAGATGATGCAGAACCACAACTTAAGGGATGTGGACACTCGAGCGAGGGAAATGAACCAGTATTCATCGACGAGATCTCCT
CTACTGCTGGTGAAAGTTCGGCCAAAGGCGATGGAATATTGGATAATTGTGGGATTTTACCAAGCAATTGTTTACCCTGTCTTGCAACAACAATAACTCTGTTGAAAGAG
AAATCACTAAGTTCTAGTCCTCCAAGTGGACTCAAGAAGGCCGCCTTGAAACTTTCATTCAAATGGAAAGAAGGAAATCCCAATGCTGCTTTATTTTCATCAAAAGCGCT
TCTACAAAGACCTATAGCAGGTTCTCAAGTACCCTTTTGCCCAGCTGAAAAGAAAATGCTAGACTGTTGGTCACATATTGAGCCAGATAGTTTCAAAGTTAGGGGAGTAA
ACTATGCAAGTAAGGACAAAAAGAAAGAATTTGCTCCCAATCATGCTGCCTACTATCCCTTTGGAGTTGATGTGTTCTTGTCTCACCGAAAAGTAGATCACATTGCTCGA
TTTGTTGAAATGCCTACAGCTACTTATTCTGGAACACTTCCACCTATCCTTGTTGTTAATATTCCTTTGTATTCGGCGGCGATTTTTCAAGGAGAAACCGACGGAGAAGG
AATGAGTATTGTCTTGTACTTTAAGCTTTCTGATGCATATGCAGAGGAACTTACATCTCATTTTCAAGAAAACATCAGGGGAACGATTGAAATTTTGGGGCGCGTTGCAA
ATGTAGAGGATCTTCCGATGAGCGCTGCAGAGAGAAAACTTATGCAGGCTTACAATGAAAAGCCCGTTCTTTCTCGTCCTCAACACGAATTTTACTTGGTAAGTTTAGTT
TTTACATTGCATTACAGACTGATTTCTAGATATTACATTTTCCAGAATGATTCAAATGAAAGAACTGATGAATTATAG
mRNA sequenceShow/hide mRNA sequence
ATTTTTATTCATAATTAATAATAAACAAGACAGTGTAGAGAGAGAAAGAGAAAGAGGGGACTTGGGATTTTTAACAAACAAACGGTAGCTTTAACTTTTTCCCTCTCTGT
ATTCTTCAACGAAGAAAAAAAATTCGAGTGTTGTTTATTAGGAGCTGCCCTGTCCTGCCTCTCTTTCTCTGTTTTTTTTTTTTTTTCCTCCTATGTTATTTTTCGGTACG
TGATTTCTTGTCGTTCTTCCGACATGGGTGCTTGCGTTTCAACGCCGCAGGGCTGCGTCGGAGGTAAATTTAAGAAATCTTCTAAGAGAAAGAACCGCCGTAGAAGGAGG
AAAGGTTCCAAAACCATCGCCTTTTCTGCTCTCTCTGAGGGATCGCACCGCTCTGACCCGATTGACCACTGCTCATTTTCCAATCCCACTTTTCAAGGAAGTTATGATGA
GGCATGGTTTGATACAGTTGGCAAATTCGAATCGGATTGCGATGAAGATTACCAAAGCCTTCCTGATGACAATCAGTCAATTAATAGCTTTGAGGGTGCATCATCGTCGA
GCATTTCATCTTCTGGCGATGCTAATCATGGAGATCATAATGTGAATCGACATAGCGCAACCCCGGATCAGATTCATAGACCAGGAAATTCGGCAAGAGTACATTCTGTG
AGCAGTTCCGAGAGTCAAGTTGCAAGGGATTCACATTCGCAGGCCATGAATCCAGATGATGCAGAACCACAACTTAAGGGATGTGGACACTCGAGCGAGGGAAATGAACC
AGTATTCATCGACGAGATCTCCTCTACTGCTGGTGAAAGTTCGGCCAAAGGCGATGGAATATTGGATAATTGTGGGATTTTACCAAGCAATTGTTTACCCTGTCTTGCAA
CAACAATAACTCTGTTGAAAGAGAAATCACTAAGTTCTAGTCCTCCAAGTGGACTCAAGAAGGCCGCCTTGAAACTTTCATTCAAATGGAAAGAAGGAAATCCCAATGCT
GCTTTATTTTCATCAAAAGCGCTTCTACAAAGACCTATAGCAGGTTCTCAAGTACCCTTTTGCCCAGCTGAAAAGAAAATGCTAGACTGTTGGTCACATATTGAGCCAGA
TAGTTTCAAAGTTAGGGGAGTAAACTATGCAAGTAAGGACAAAAAGAAAGAATTTGCTCCCAATCATGCTGCCTACTATCCCTTTGGAGTTGATGTGTTCTTGTCTCACC
GAAAAGTAGATCACATTGCTCGATTTGTTGAAATGCCTACAGCTACTTATTCTGGAACACTTCCACCTATCCTTGTTGTTAATATTCCTTTGTATTCGGCGGCGATTTTT
CAAGGAGAAACCGACGGAGAAGGAATGAGTATTGTCTTGTACTTTAAGCTTTCTGATGCATATGCAGAGGAACTTACATCTCATTTTCAAGAAAACATCAGGGGAACGAT
TGAAATTTTGGGGCGCGTTGCAAATGTAGAGGATCTTCCGATGAGCGCTGCAGAGAGAAAACTTATGCAGGCTTACAATGAAAAGCCCGTTCTTTCTCGTCCTCAACACG
AATTTTACTTGGTAAGTTTAGTTTTTACATTGCATTACAGACTGATTTCTAGATATTACATTTTCCAGAATGATTCAAATGAAAGAACTGATGAATTATAGGGAGAAAAC
TACTTGGAAATCGACTTGGATATGCACAGATTTAGTTACATTTCGAGGAAAGGTTTTGAAGCATTTCTTGATAGACTCAAGTGCTGCATTTTGGATGTTGGCCTAACAAT
TCAGGGGAACAGACCTGAAGAATTGCCAGAGGAGATTTTATGTTGCATTAGATTAAATGGAATTGATTACGTAAATTATCAGCAATTGGGATGGGTCCAGAGATTCTGTA
AATTGCGATGCCATTGTATTATACAAAAATTTCACTAAAAAGCAAATTTTAATGAGAATTATTTTTT
Protein sequenceShow/hide protein sequence
MGACVSTPQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEAWFDTVGKFESDCDEDYQSLPDDNQSINSFEGASSSSISSSGDA
NHGDHNVNRHSATPDQIHRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCGHSSEGNEPVFIDEISSTAGESSAKGDGILDNCGILPSNCLPCLATTITLLKE
KSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYASKDKKKEFAPNHAAYYPFGVDVFLSHRKVDHIAR
FVEMPTATYSGTLPPILVVNIPLYSAAIFQGETDGEGMSIVLYFKLSDAYAEELTSHFQENIRGTIEILGRVANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYLVSLV
FTLHYRLISRYYIFQNDSNERTDEL