; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007814 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007814
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationChr10:14092408..14101474
RNA-Seq ExpressionHG10007814
SyntenyHG10007814
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443756.1 PREDICTED: uncharacterized protein LOC103487273 [Cucumis melo]1.3e-21686.39Show/hide
Query:  KRGVSNSNSISISNSSSLDLSFALLFFLLVLLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKI
        K GV  S SISISN     L F L+ F  V+ QRFTLVCGLNYTY+++SSLRL+RIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKI
Subjt:  KRGVSNSNSISISNSSSLDLSFALLFFLLVLLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKI

Query:  QRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGS
        QRGPTEWPKTK VKENKE+  ERRAGSG   AFQTWRVNGTRCPKG+IPVRR+TV DVLRSKSLFDFGKK+RPILLDR+IDAPDVVSGNGHEHAIAYTGS
Subjt:  QRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGS

Query:  TEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPI
        +EEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ                  SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPI
Subjt:  TEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPI

Query:  SSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLS
        SS SGSQYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASYFRNLEIVD+DNSLS
Subjt:  SSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLS

Query:  AVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ
         VQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPG+NPKCQ
Subjt:  AVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ

XP_011660253.1 uncharacterized protein LOC101208882 [Cucumis sativus]2.7e-21484.98Show/hide
Query:  MGKIKRGVSNSNSISISNSSSLDLSFALLFFLLVLLQRFTLVCGLNYTY-KQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLL
        MG    GV  S SISISN     L F L+ F  V+ QRFTLVCGLNYTY K +SSLRL+RIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLL
Subjt:  MGKIKRGVSNSNSISISNSSSLDLSFALLFFLLVLLQRFTLVCGLNYTY-KQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLL

Query:  KNHKIQRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAI
        KNHKIQRGPTEWPKTK  KENKE+ SERRAGSG   +FQTWRVNGTRCPKG++PVRR+TV DVLRSKSLFDFGKKKRPILLDR+IDAPDVVSGNGHEHAI
Subjt:  KNHKIQRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAI

Query:  AYTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGA
        AYTGS+EEMYGAKATINVWDPSI+MVNEFSLSQIWILSGSFDGSDLNSIEAGWQ                  SDAYQATGCYNLLC+GFVQTNSKIAIGA
Subjt:  AYTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGA

Query:  AISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDT
        AISPISS +GSQYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASYFRNLEIVD+
Subjt:  AISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDT

Query:  DNSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ
        DNSLS+VQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPG+NPKCQ
Subjt:  DNSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ

XP_022961080.1 uncharacterized protein LOC111461698 [Cucurbita moschata]6.7e-21386.76Show/hide
Query:  LSFAL-LFFLLVLLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKKVKENKE
        L FAL L   L+L+QRF LVCGLN++ KQVSSLRL+RIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPK KKVKEN+E
Subjt:  LSFAL-LFFLLVLLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKKVKENKE

Query:  DGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSI
        D  + RAGSG GG FQTWRVNGTRCPKGSIPVRRSTV DVLR+KS+FD+GKKKRPILLDR+IDAPDVVSGNGHEHAIAYT S+ EMYGAKATINVWDPSI
Subjt:  DGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSI

Query:  QMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDP
        Q+VNEFSLSQ+WI+SGSFDGSDLNSIEAGWQ                  SDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SSFSGSQYDITILIWKDP
Subjt:  QMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDP

Query:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIM
        KLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVD+DNSLSAVQDISIMAENTNCYNIM
Subjt:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIM

Query:  SSYNDQWGTHFYYGGPGKNPKCQ
        SSYNDQWGTHFYYGGPG+NPKCQ
Subjt:  SSYNDQWGTHFYYGGPGKNPKCQ

XP_038878455.1 uncharacterized protein LOC120070684 isoform X1 [Benincasa hispida]1.7e-22488.94Show/hide
Query:  IKRGVSNSNSISISNSSS--LDLSFALLFFLLV-LLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLT-------IQSPDGDIIDCVHKRKQPA
        IKRGVS S SISISNSSS    L FALLF LLV LLQRFTLVCGLNYTYKQVSSLRLERIQRHLD+INKPPLLT       IQSPDGDIIDCVHKRKQPA
Subjt:  IKRGVSNSNSISISNSSS--LDLSFALLFFLLV-LLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLT-------IQSPDGDIIDCVHKRKQPA

Query:  LDHPLLKNHKIQRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGN
        LDHPLLKNHKIQRGPTEWPKTKKV EN+E  S R AGSG GGA QTWRVNGTRCPKGSIPVRRSTV DVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGN
Subjt:  LDHPLLKNHKIQRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGN

Query:  GHEHAIAYTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNS
        GHEHAIAYT S+EEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ                  SDAYQATGCYNLLCAGFVQTN+
Subjt:  GHEHAIAYTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNS

Query:  KIAIGAAISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRN
        KIAIGAAISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRN
Subjt:  KIAIGAAISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRN

Query:  LEIVDTDNSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ
        LEIVD+DNSLSAVQDISI+AENTNCYNIMSSYNDQWGTHFYYGGPG+NPKCQ
Subjt:  LEIVDTDNSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ

XP_038878456.1 uncharacterized protein LOC120070684 isoform X2 [Benincasa hispida]1.4e-22690.34Show/hide
Query:  IKRGVSNSNSISISNSSS--LDLSFALLFFLLV-LLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLK
        IKRGVS S SISISNSSS    L FALLF LLV LLQRFTLVCGLNYTYKQVSSLRLERIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLK
Subjt:  IKRGVSNSNSISISNSSS--LDLSFALLFFLLV-LLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLK

Query:  NHKIQRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIA
        NHKIQRGPTEWPKTKKV EN+E  S R AGSG GGA QTWRVNGTRCPKGSIPVRRSTV DVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIA
Subjt:  NHKIQRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIA

Query:  YTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAA
        YT S+EEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ                  SDAYQATGCYNLLCAGFVQTN+KIAIGAA
Subjt:  YTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAA

Query:  ISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTD
        ISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVD+D
Subjt:  ISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTD

Query:  NSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ
        NSLSAVQDISI+AENTNCYNIMSSYNDQWGTHFYYGGPG+NPKCQ
Subjt:  NSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ

TrEMBL top hitse value%identityAlignment
A0A0A0LXY9 Uncharacterized protein1.3e-21484.98Show/hide
Query:  MGKIKRGVSNSNSISISNSSSLDLSFALLFFLLVLLQRFTLVCGLNYTY-KQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLL
        MG    GV  S SISISN     L F L+ F  V+ QRFTLVCGLNYTY K +SSLRL+RIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLL
Subjt:  MGKIKRGVSNSNSISISNSSSLDLSFALLFFLLVLLQRFTLVCGLNYTY-KQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLL

Query:  KNHKIQRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAI
        KNHKIQRGPTEWPKTK  KENKE+ SERRAGSG   +FQTWRVNGTRCPKG++PVRR+TV DVLRSKSLFDFGKKKRPILLDR+IDAPDVVSGNGHEHAI
Subjt:  KNHKIQRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAI

Query:  AYTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGA
        AYTGS+EEMYGAKATINVWDPSI+MVNEFSLSQIWILSGSFDGSDLNSIEAGWQ                  SDAYQATGCYNLLC+GFVQTNSKIAIGA
Subjt:  AYTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGA

Query:  AISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDT
        AISPISS +GSQYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASYFRNLEIVD+
Subjt:  AISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDT

Query:  DNSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ
        DNSLS+VQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPG+NPKCQ
Subjt:  DNSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ

A0A1S3B8R5 uncharacterized protein LOC1034872736.3e-21786.39Show/hide
Query:  KRGVSNSNSISISNSSSLDLSFALLFFLLVLLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKI
        K GV  S SISISN     L F L+ F  V+ QRFTLVCGLNYTY+++SSLRL+RIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKI
Subjt:  KRGVSNSNSISISNSSSLDLSFALLFFLLVLLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKI

Query:  QRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGS
        QRGPTEWPKTK VKENKE+  ERRAGSG   AFQTWRVNGTRCPKG+IPVRR+TV DVLRSKSLFDFGKK+RPILLDR+IDAPDVVSGNGHEHAIAYTGS
Subjt:  QRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGS

Query:  TEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPI
        +EEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ                  SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPI
Subjt:  TEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPI

Query:  SSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLS
        SS SGSQYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASYFRNLEIVD+DNSLS
Subjt:  SSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLS

Query:  AVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ
         VQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPG+NPKCQ
Subjt:  AVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ

A0A6J1DDR6 uncharacterized protein LOC1110195901.2e-20784.09Show/hide
Query:  LFFLLVLLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKKVKENKE----DG
        L  +LV+ +RF+LV GLNYTYKQVSSLRL+RIQRHLD INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ KK+KE  E    DG
Subjt:  LFFLLVLLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKKVKENKE----DG

Query:  SERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSIQM
        SE R GSG GGA+QTWRVNGTRCPKGSIPVRRSTV DVLR+KSLFDFGKK+RPILLDR++DAPDVVSGNGHEHAIAYTGS++EMYGAKATINVWDPSIQ+
Subjt:  SERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSIQM

Query:  VNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDPKL
        VNEFSLSQIWILSGSFDGSDLNSIEAGWQ                  SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF+GSQYD+TILIWKDPKL
Subjt:  VNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDPKL

Query:  GNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIMSS
        GNWWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSG F ++GFGKASYFRNLEIVD+DNSLSAVQ+IS +AEN +CYNIMSS
Subjt:  GNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIMSS

Query:  YNDQWGTHFYYGGPGKNPKCQ
        YNDQWGTHFYYGGPG+NP+CQ
Subjt:  YNDQWGTHFYYGGPGKNPKCQ

A0A6J1HAZ7 uncharacterized protein LOC1114616983.2e-21386.76Show/hide
Query:  LSFAL-LFFLLVLLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKKVKENKE
        L FAL L   L+L+QRF LVCGLN++ KQVSSLRL+RIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPK KKVKEN+E
Subjt:  LSFAL-LFFLLVLLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKKVKENKE

Query:  DGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSI
        D  + RAGSG GG FQTWRVNGTRCPKGSIPVRRSTV DVLR+KS+FD+GKKKRPILLDR+IDAPDVVSGNGHEHAIAYT S+ EMYGAKATINVWDPSI
Subjt:  DGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSI

Query:  QMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDP
        Q+VNEFSLSQ+WI+SGSFDGSDLNSIEAGWQ                  SDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SSFSGSQYDITILIWKDP
Subjt:  QMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDP

Query:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIM
        KLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVD+DNSLSAVQDISIMAENTNCYNIM
Subjt:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIM

Query:  SSYNDQWGTHFYYGGPGKNPKCQ
        SSYNDQWGTHFYYGGPG+NPKCQ
Subjt:  SSYNDQWGTHFYYGGPGKNPKCQ

A0A6J1JIQ1 uncharacterized protein LOC1114854601.6e-21286.73Show/hide
Query:  LSFALLFFL-LVLLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKKVKENKE
        L FAL F L L+L+QRF LVCGLN++ KQVSSLRL+RIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPK KKVKEN+E
Subjt:  LSFALLFFL-LVLLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKKVKENKE

Query:  DGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSI
        D  + RAGSG GG FQTWRVNGTRCPKGSIPVRRSTV DVLR+KS+FD+GKKKRPILLDR+IDAPDVVSGNGHEHAIAYT S+ EMYGAKATINVWDPSI
Subjt:  DGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSI

Query:  QMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDP
        Q+VNEFSLSQ+WI+SGSFDGSDLNSIEAGWQ                  SDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SSFSGSQYDITILIWKDP
Subjt:  QMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDP

Query:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIM
        KLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVD+DNSLS VQDISIMAENTNCYNIM
Subjt:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIM

Query:  SSYNDQWGTHFYYGGPGKNPKC
        SSYNDQWGTHFYYGGPG+NPKC
Subjt:  SSYNDQWGTHFYYGGPGKNPKC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)2.1e-11951.9Show/hide
Query:  LQRFTLVC-------GLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPK----TKKVKENKEDG
        L R  LVC        L+Y  +   S +   +++HL+ +NKP + +IQS DGD+IDCV   KQPA DHP LK+HKIQ  P   P+      KV   K + 
Subjt:  LQRFTLVC-------GLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPK----TKKVKENKEDG

Query:  SERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSIQM
         E       G   Q W   G +C +G+IP+RR+   DVLR+ S+  +GKKKR  +   +   PD+++ +GH+HAIAY    ++ YGAKATINVW+P IQ 
Subjt:  SERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSIQM

Query:  VNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDPKL
         NEFSLSQIW+L GSF G DLNSIEAGWQ                  SDAYQATGCYNLLC+GF+Q NS IA+GA+ISP+S +  SQYDI+ILIWKDPK 
Subjt:  VNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDPKL

Query:  GNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIMSS
        G+WWM FG+  ++GYWP+ LF++L + A+M+EWGGEVVNS+++GQHTSTQMGSG FP++GF KASYFRN+++VD  N+L A + +    E +NCY++ + 
Subjt:  GNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIMSS

Query:  YNDQWGTHFYYGGPGKNPKC
         ND WG +FYYGGPGKN KC
Subjt:  YNDQWGTHFYYGGPGKNPKC

AT3G13510.1 Protein of Unknown Function (DUF239)1.6e-11953.32Show/hide
Query:  SSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSI
        SS +   +++HL+ +NKPP+ TIQSPDGDIIDC+   KQPA DHP LK+HKIQ  P+  P+     +NK     +   + +    Q W   G +C +G+I
Subjt:  SSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSI

Query:  PVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGW
        P+RR+   DVLR+ S+  +GKKK   +   +   PD+++ NGH+HAIAY    ++ YGAKAT+NVW+P IQ  NEFSLSQIW+L GSF G DLNSIEAGW
Subjt:  PVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGW

Query:  Q------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHA
        Q                  SDAYQATGCYNLLC+GF+Q NS IA+GA+ISP+S +  SQYDI+ILIWKDPK G+WWM FG+  ++GYWP+ LF++L + A
Subjt:  Q------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHA

Query:  TMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKC
        +M+EWGGEVVNS++ G HT TQMGSGHFP++GF KASYFRN+++VD  N+L A + +    E +NCY++ +  ND WG +FYYGGPGKN  C
Subjt:  TMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKC

AT5G18460.1 Protein of Unknown Function (DUF239)7.5e-17067.92Show/hide
Query:  ALLFFLLV----LLQRFTLVCGLNYT--YKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKKVKEN
        +LL FLL+    L Q+   +   N T  Y+QVSSLRL RIQ+HL+ INK P+ TIQSPDGD+IDCV KRKQPALDHPLLK+HKIQ+ P + PK K     
Subjt:  ALLFFLLV----LLQRFTLVCGLNYT--YKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKKVKEN

Query:  KEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDP
         +D   + A + + GA+Q W VNGTRCPKG++P+RR+T+ DVLR+KSLFDFGKK+R I LD+R + PD +  NGHEHAIAYT S+ E+YGAKATINVWDP
Subjt:  KEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDP

Query:  SIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWK
         I+ VNEFSLSQIWILSGSF G DLNSIEAGWQ                  SD+YQATGCYNLLC+GF+QTN+KIAIGAAISP+S+F G+Q+DITILIWK
Subjt:  SIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ------------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWK

Query:  DPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYN
        DPK+GNWWMG GD+TLVGYWPAELFTHLADHAT VEWGGEVVN+RA+G+HT+TQMGSGHFPD+GFGKASYFRNLE+VD+DNSL  V D+ I+AENT CY+
Subjt:  DPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYN

Query:  IMSSYNDQWGTHFYYGGPGKNPKC
        I SSY+++WGT+FYYGGPG NP+C
Subjt:  IMSSYNDQWGTHFYYGGPGKNPKC

AT5G56530.1 Protein of Unknown Function (DUF239)1.3e-12154.99Show/hide
Query:  IQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKT----KKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVR
        + +HL+ +NKP + +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP+  P++     KV E  ++         V    Q W  NG  C +G+IPVR
Subjt:  IQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKT----KKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVR

Query:  RSTVYDVLRSKSLFDFGKKKR-PILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ-
        R+   DVLR+ S+  +GKKK   + L R  D PD+++ +GH+HAIAY     + YGAKATINVW+P +Q  NEFSLSQ+WIL GSF G DLNSIEAGWQ 
Subjt:  RSTVYDVLRSKSLFDFGKKKR-PILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ-

Query:  -----------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATM
                         SDAYQATGCYNLLC+GF+Q NS+IA+GA+ISP+S F   QYDI+I IWKDPK G+WWM FGD  ++GYWP+ LF++LAD A++
Subjt:  -----------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATM

Query:  VEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ
        VEWGGEVVN   +G HT+TQMGSG FPD+GF KASYFRN+++VD+ N+L   + ++   E +NCY++    ND WG +FYYGGPG+NP CQ
Subjt:  VEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ

AT5G56530.2 Protein of Unknown Function (DUF239)1.3e-12154.99Show/hide
Query:  IQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKT----KKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVR
        + +HL+ +NKP + +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP+  P++     KV E  ++         V    Q W  NG  C +G+IPVR
Subjt:  IQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKT----KKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVR

Query:  RSTVYDVLRSKSLFDFGKKKR-PILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ-
        R+   DVLR+ S+  +GKKK   + L R  D PD+++ +GH+HAIAY     + YGAKATINVW+P +Q  NEFSLSQ+WIL GSF G DLNSIEAGWQ 
Subjt:  RSTVYDVLRSKSLFDFGKKKR-PILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQ-

Query:  -----------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATM
                         SDAYQATGCYNLLC+GF+Q NS+IA+GA+ISP+S F   QYDI+I IWKDPK G+WWM FGD  ++GYWP+ LF++LAD A++
Subjt:  -----------------SDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATM

Query:  VEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ
        VEWGGEVVN   +G HT+TQMGSG FPD+GF KASYFRN+++VD+ N+L   + ++   E +NCY++    ND WG +FYYGGPG+NP CQ
Subjt:  VEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAATTAAAAGAGGGGTTTCAAATTCAAATTCAATTTCAATATCAAATTCATCCTCTCTTGATCTTTCTTTTGCTTTGCTTTTCTTTCTTCTTGTTCTTCTTCA
AAGATTCACTTTGGTCTGTGGCCTCAATTATACTTATAAACAAGTCAGTAGCTTGAGATTGGAAAGGATTCAAAGGCATTTGGATACCATTAACAAGCCTCCTCTTCTCA
CCATTCAGAGCCCAGATGGTGATATTATAGATTGTGTTCATAAAAGAAAACAACCAGCTTTAGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAACGGAG
TGGCCAAAAACGAAGAAGGTGAAAGAGAATAAAGAAGATGGGAGTGAAAGGAGGGCGGGATCAGGTGTGGGAGGTGCATTTCAGACTTGGCGTGTGAACGGGACACGGTG
TCCAAAAGGGAGTATTCCAGTGCGACGCAGCACCGTCTATGATGTGCTAAGATCCAAGTCTTTGTTTGACTTTGGGAAGAAAAAACGACCGATTCTCCTTGATCGAAGAA
TAGACGCTCCTGATGTAGTCAGTGGGAATGGTCACGAGCATGCGATCGCGTACACTGGATCAACGGAAGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCG
TCAATTCAAATGGTCAACGAGTTTAGCCTCTCTCAGATTTGGATCCTCTCGGGATCATTCGACGGCTCTGATCTCAACAGCATCGAAGCTGGTTGGCAGAGTGATGCATA
TCAGGCAACGGGTTGCTATAATCTTTTATGTGCTGGATTTGTACAAACAAACAGCAAAATCGCGATCGGAGCCGCCATTTCTCCCATTTCTTCTTTCTCCGGCAGCCAAT
ATGACATTACCATTCTCATTTGGAAGGATCCGAAGTTGGGGAATTGGTGGATGGGATTTGGGGACAACACACTGGTTGGGTATTGGCCAGCAGAGCTATTTACTCACTTA
GCCGACCACGCCACAATGGTGGAGTGGGGTGGGGAAGTGGTAAACTCTAGGGCCAATGGACAGCACACTTCAACCCAAATGGGCTCCGGCCACTTCCCCGATGACGGCTT
TGGCAAAGCCAGCTACTTTCGAAACCTCGAGATCGTTGACACCGACAACAGCCTCAGCGCCGTTCAAGATATCTCAATCATGGCTGAAAACACCAACTGCTATAATATTA
TGAGCTCTTATAATGACCAATGGGGCACTCACTTCTACTATGGTGGTCCTGGTAAAAATCCTAAATGCCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAAATTAAAAGAGGGGTTTCAAATTCAAATTCAATTTCAATATCAAATTCATCCTCTCTTGATCTTTCTTTTGCTTTGCTTTTCTTTCTTCTTGTTCTTCTTCA
AAGATTCACTTTGGTCTGTGGCCTCAATTATACTTATAAACAAGTCAGTAGCTTGAGATTGGAAAGGATTCAAAGGCATTTGGATACCATTAACAAGCCTCCTCTTCTCA
CCATTCAGAGCCCAGATGGTGATATTATAGATTGTGTTCATAAAAGAAAACAACCAGCTTTAGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAACGGAG
TGGCCAAAAACGAAGAAGGTGAAAGAGAATAAAGAAGATGGGAGTGAAAGGAGGGCGGGATCAGGTGTGGGAGGTGCATTTCAGACTTGGCGTGTGAACGGGACACGGTG
TCCAAAAGGGAGTATTCCAGTGCGACGCAGCACCGTCTATGATGTGCTAAGATCCAAGTCTTTGTTTGACTTTGGGAAGAAAAAACGACCGATTCTCCTTGATCGAAGAA
TAGACGCTCCTGATGTAGTCAGTGGGAATGGTCACGAGCATGCGATCGCGTACACTGGATCAACGGAAGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCG
TCAATTCAAATGGTCAACGAGTTTAGCCTCTCTCAGATTTGGATCCTCTCGGGATCATTCGACGGCTCTGATCTCAACAGCATCGAAGCTGGTTGGCAGAGTGATGCATA
TCAGGCAACGGGTTGCTATAATCTTTTATGTGCTGGATTTGTACAAACAAACAGCAAAATCGCGATCGGAGCCGCCATTTCTCCCATTTCTTCTTTCTCCGGCAGCCAAT
ATGACATTACCATTCTCATTTGGAAGGATCCGAAGTTGGGGAATTGGTGGATGGGATTTGGGGACAACACACTGGTTGGGTATTGGCCAGCAGAGCTATTTACTCACTTA
GCCGACCACGCCACAATGGTGGAGTGGGGTGGGGAAGTGGTAAACTCTAGGGCCAATGGACAGCACACTTCAACCCAAATGGGCTCCGGCCACTTCCCCGATGACGGCTT
TGGCAAAGCCAGCTACTTTCGAAACCTCGAGATCGTTGACACCGACAACAGCCTCAGCGCCGTTCAAGATATCTCAATCATGGCTGAAAACACCAACTGCTATAATATTA
TGAGCTCTTATAATGACCAATGGGGCACTCACTTCTACTATGGTGGTCCTGGTAAAAATCCTAAATGCCAATGA
Protein sequenceShow/hide protein sequence
MGKIKRGVSNSNSISISNSSSLDLSFALLFFLLVLLQRFTLVCGLNYTYKQVSSLRLERIQRHLDTINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTE
WPKTKKVKENKEDGSERRAGSGVGGAFQTWRVNGTRCPKGSIPVRRSTVYDVLRSKSLFDFGKKKRPILLDRRIDAPDVVSGNGHEHAIAYTGSTEEMYGAKATINVWDP
SIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHL
ADHATMVEWGGEVVNSRANGQHTSTQMGSGHFPDDGFGKASYFRNLEIVDTDNSLSAVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGKNPKCQ