; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G21622 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G21622
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionProtein of Unknown Function (DUF239)
Genome locationctg949:862234..868755
RNA-Seq ExpressionCucsat.G21622
SyntenyCucsat.G21622
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443756.1 PREDICTED: uncharacterized protein LOC103487273 [Cucumis melo]0.097.04Show/hide
Query:  MGTKTGGVSFSISISNLLPFGLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
        MGTKTG VSFSISISNLLPFGLIFCFV+TQRFTLVCGLNYTYQK LSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
Subjt:  MGTKTGGVSFSISISNLLPFGLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR

Query:  GPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE
        GPTEWPKTKV KENKEEV ERRAGSGALA+FQTWRVNGTRCPKGT+PVRRTTVKDVLRSKSLFDFGKK+RPILLDRKIDAPDVVSGNGHEHAIAYTGSSE
Subjt:  GPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE

Query:  EMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS
        EMYGAKATINVWDPSI+MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISPISS
Subjt:  EMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS

Query:  IAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSV
        ++GSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLS V
Subjt:  IAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSV

Query:  QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

XP_011660253.1 uncharacterized protein LOC101208882 [Cucumis sativus]0.0100Show/hide
Query:  MGTKTGGVSFSISISNLLPFGLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
        MGTKTGGVSFSISISNLLPFGLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
Subjt:  MGTKTGGVSFSISISNLLPFGLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR

Query:  GPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE
        GPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE
Subjt:  GPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE

Query:  EMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS
        EMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS
Subjt:  EMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS

Query:  IAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSV
        IAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSV
Subjt:  IAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSV

Query:  QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

XP_023516458.1 uncharacterized protein LOC111780317 [Cucurbita pepo subsp. pepo]9.16e-28387.32Show/hide
Query:  NLLPFGLIFCF--VITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKE
        +LLPF L F    ++ QRF LVCGLN++  K +SSLRLDRIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ K  KE
Subjt:  NLLPFGLIFCF--VITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKE

Query:  NKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD
        N+E+V + RAGSGA   FQTWRVNGTRCPKG++PVRR+TV DVLR+KS+FD+GKKKRPILLDR+IDAPDVVSGNGHEHAIAYT SSE MYGAKATINVWD
Subjt:  NKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD

Query:  PSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIW
        PSI++VNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAY+ATGCYNLLCSGFVQTN+KIAIGAAISP+SS +GSQYDITILIW
Subjt:  PSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIW

Query:  KDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCY
        KDPKLG+WWMGFG+NTLVGYWPAELFTHL DHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASYFRNLEIVDSDNSLSSVQDISIMAENTNCY
Subjt:  KDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPKCQ
        NIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPKCQ

XP_038878455.1 uncharacterized protein LOC120070684 isoform X1 [Benincasa hispida]1.01e-28886.37Show/hide
Query:  MGTKTG-GVSFSISISN------LLPFGLIFCFVIT--QRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQ-------SPDGDIIDCVHKRK
        MG K G  +S SISISN      +L F L+F  ++   QRFTLVCGLNYTY K +SSLRL+RIQRHLDSINKPPLLTIQ       SPDGDIIDCVHKRK
Subjt:  MGTKTG-GVSFSISISN------LLPFGLIFCFVIT--QRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQ-------SPDGDIIDCVHKRK

Query:  QPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVV
        QPALDHPLLKNHKIQRGPTEWPKTK   EN+E  S R AGSGA  + QTWRVNGTRCPKG++PVRR+TV DVLRSKSLFDFGKKKRPILLDR+IDAPDVV
Subjt:  QPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVV

Query:  SGNGHEHAIAYTGSSEEMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQ
        SGNGHEHAIAYT SSEEMYGAKATINVWDPSI+MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQ
Subjt:  SGNGHEHAIAYTGSSEEMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQ

Query:  TNSKIAIGAAISPISSIAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASY
        TN+KIAIGAAISPISS +GSQYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASY
Subjt:  TNSKIAIGAAISPISSIAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASY

Query:  FRNLEIVDSDNSLSSVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        FRNLEIVDSDNSLS+VQDISI+AENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  FRNLEIVDSDNSLSSVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

XP_038878456.1 uncharacterized protein LOC120070684 isoform X2 [Benincasa hispida]1.41e-29187.72Show/hide
Query:  MGTKTG-GVSFSISISN------LLPFGLIFCFVIT--QRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHP
        MG K G  +S SISISN      +L F L+F  ++   QRFTLVCGLNYTY K +SSLRL+RIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHP
Subjt:  MGTKTG-GVSFSISISN------LLPFGLIFCFVIT--QRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHP

Query:  LLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEH
        LLKNHKIQRGPTEWPKTK   EN+E  S R AGSGA  + QTWRVNGTRCPKG++PVRR+TV DVLRSKSLFDFGKKKRPILLDR+IDAPDVVSGNGHEH
Subjt:  LLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEH

Query:  AIAYTGSSEEMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAI
        AIAYT SSEEMYGAKATINVWDPSI+MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAI
Subjt:  AIAYTGSSEEMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAI

Query:  GAAISPISSIAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIV
        GAAISPISS +GSQYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASYFRNLEIV
Subjt:  GAAISPISSIAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIV

Query:  DSDNSLSSVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        DSDNSLS+VQDISI+AENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  DSDNSLSSVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

TrEMBL top hitse value%identityAlignment
A0A0A0LXY9 Uncharacterized protein0.0100Show/hide
Query:  MGTKTGGVSFSISISNLLPFGLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
        MGTKTGGVSFSISISNLLPFGLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
Subjt:  MGTKTGGVSFSISISNLLPFGLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR

Query:  GPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE
        GPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE
Subjt:  GPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE

Query:  EMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS
        EMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS
Subjt:  EMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS

Query:  IAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSV
        IAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSV
Subjt:  IAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSV

Query:  QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

A0A1S3B8R5 uncharacterized protein LOC1034872730.097.04Show/hide
Query:  MGTKTGGVSFSISISNLLPFGLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
        MGTKTG VSFSISISNLLPFGLIFCFV+TQRFTLVCGLNYTYQK LSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
Subjt:  MGTKTGGVSFSISISNLLPFGLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR

Query:  GPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE
        GPTEWPKTKV KENKEEV ERRAGSGALA+FQTWRVNGTRCPKGT+PVRRTTVKDVLRSKSLFDFGKK+RPILLDRKIDAPDVVSGNGHEHAIAYTGSSE
Subjt:  GPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE

Query:  EMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS
        EMYGAKATINVWDPSI+MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISPISS
Subjt:  EMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS

Query:  IAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSV
        ++GSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLS V
Subjt:  IAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSV

Query:  QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

A0A6J1DDR6 uncharacterized protein LOC1110195903.09e-27284.27Show/hide
Query:  LPFGLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKE-
        L   L    V+ +RF+LV GLNYTY K +SSLRLDRIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ K  KE  E 
Subjt:  LPFGLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKE-

Query:  ---EVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD
           + SE R GSGA  ++QTWRVNGTRCPKG++PVRR+TV DVLR+KSLFDFGKK+RPILLDR++DAPDVVSGNGHEHAIAYTGSS+EMYGAKATINVWD
Subjt:  ---EVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD

Query:  PSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIW
        PSI++VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISPISS  GSQYD+TILIW
Subjt:  PSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIW

Query:  KDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCY
        KDPKLGNWWMGFG+NTLVGYWPAELFTHL DHATMVEWGGEVVNS+  G+HTSTQMGSG F ++GF KASYFRNLEIVDSDNSLS+VQ+IS +AEN +CY
Subjt:  KDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPKCQ
        NIMSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPKCQ

A0A6J1HAZ7 uncharacterized protein LOC1114616981.22e-28287.32Show/hide
Query:  NLLPFGLIFCF--VITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKE
        +LLPF L      ++ QRF LVCGLN++  K +SSLRLDRIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPK K  KE
Subjt:  NLLPFGLIFCF--VITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKE

Query:  NKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD
        N+E+V + RAGSGA   FQTWRVNGTRCPKG++PVRR+TV DVLR+KS+FD+GKKKRPILLDR+IDAPDVVSGNGHEHAIAYT SSE MYGAKATINVWD
Subjt:  NKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD

Query:  PSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIW
        PSI++VNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTN+KIAIGAAISP+SS +GSQYDITILIW
Subjt:  PSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIW

Query:  KDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCY
        KDPKLG+WWMGFG+NTLVGYWPAELFTHL DHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASYFRNLEIVDSDNSLS+VQDISIMAENTNCY
Subjt:  KDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPKCQ
        NIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPKCQ

A0A6J1JIQ1 uncharacterized protein LOC1114854601.74e-28287.53Show/hide
Query:  NLLPFGLIFCF--VITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKE
        +LLPF L F    ++ QRF LVCGLN++  K +SSLRLDRIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPK K  KE
Subjt:  NLLPFGLIFCF--VITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKE

Query:  NKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD
        N+E+V + RAGSGA   FQTWRVNGTRCPKG++PVRR+TV DVLR+KS+FD+GKKKRPILLDR+IDAPDVVSGNGHEHAIAYT SSE MYGAKATINVWD
Subjt:  NKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD

Query:  PSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIW
        PSI++VNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTN+KIAIGAAISP+SS +GSQYDITILIW
Subjt:  PSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIW

Query:  KDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCY
        KDPKLG+WWMGFG+NTLVGYWPAELFTHL DHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASYFRNLEIVDSDNSLS VQDISIMAENTNCY
Subjt:  KDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPKC
        NIMSSYNDQWGTHFYYGGPGRNPKC
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPKC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)1.3e-13254.55Show/hide
Query:  GLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSE
        G + C  +   F+    L+Y  +  +S  + + +++HL+ +NKP + +IQS DGD+IDCV   KQPA DHP LK+HKIQ  P   P+  +  +NK    +
Subjt:  GLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSE

Query:  RRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIEMVN
             G +   Q W   G +C +GT+P+RRT   DVLR+ S+  +GKKKR  +   K   PD+++ +GH+HAIAY    ++ YGAKATINVW+P I+  N
Subjt:  RRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIEMVN

Query:  EFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIWKDPKLGN
        EFSLSQIW+L GSF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLCSGF+Q NS IA+GA+ISP+S    SQYDI+ILIWKDPK G+
Subjt:  EFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIWKDPKLGN

Query:  WWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCYNIMSSYN
        WWM FG   ++GYWP+ LF++L + A+M+EWGGEVVNS+ +GQHTSTQMGSG FP++GF+KASYFRN+++VD  N+L + + +    E +NCY++ +  N
Subjt:  WWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCYNIMSSYN

Query:  DQWGTHFYYGGPGRNPKC
        D WG +FYYGGPG+N KC
Subjt:  DQWGTHFYYGGPGRNPKC

AT3G13510.1 Protein of Unknown Function (DUF239)1.5e-13357.4Show/hide
Query:  SSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTV
        SS +   +++HL+ +NKPP+ TIQSPDGDIIDC+   KQPA DHP LK+HKIQ  P+  P+   G  +  +VS    G       Q W   G +C +GT+
Subjt:  SSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTV

Query:  PVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGW
        P+RRT   DVLR+ S+  +GKKK   +   K   PD+++ NGH+HAIAY    ++ YGAKAT+NVW+P I+  NEFSLSQIW+L GSF G DLNSIEAGW
Subjt:  PVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGW

Query:  QVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHA
        QVSP+LYGD+  RLFTYWTSDAYQATGCYNLLCSGF+Q NS IA+GA+ISP+S    SQYDI+ILIWKDPK G+WWM FG   ++GYWP+ LF++L + A
Subjt:  QVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHA

Query:  TMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKC
        +M+EWGGEVVNS+  G HT TQMGSGHFP++GF+KASYFRN+++VD  N+L + + +    E +NCY++ +  ND WG +FYYGGPG+N  C
Subjt:  TMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKC

AT5G18460.1 Protein of Unknown Function (DUF239)1.5e-18169.91Show/hide
Query:  VSFSISISNLLPFGLIFCFVITQRFTLVCGLNYTY-QKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP
        V+   S  +LL F LI    ++Q+   +   N T   + +SSLRL RIQ+HL+ INK P+ TIQSPDGD+IDCV KRKQPALDHPLLK+HKIQ+ P + P
Subjt:  VSFSISISNLLPFGLIFCFVITQRFTLVCGLNYTY-QKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP

Query:  KTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAK
        K K   ++ +E      G     ++Q W VNGTRCPKGTVP+RR T+ DVLR+KSLFDFGKK+R I LD++ + PD +  NGHEHAIAYT SS E+YGAK
Subjt:  KTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAK

Query:  ATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQY
        ATINVWDP IE VNEFSLSQIWILSGSF G DLNSIEAGWQVSPELYGD+RPRLFTYWTSD+YQATGCYNLLCSGF+QTN+KIAIGAAISP+S+  G+Q+
Subjt:  ATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQY

Query:  DITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIM
        DITILIWKDPK+GNWWMG G++TLVGYWPAELFTHLADHAT VEWGGEVVN+R +G+HT+TQMGSGHFPD+GF KASYFRNLE+VDSDNSL  V D+ I+
Subjt:  DITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIM

Query:  AENTNCYNIMSSYNDQWGTHFYYGGPGRNPKC
        AENT CY+I SSY+++WGT+FYYGGPG NP+C
Subjt:  AENTNCYNIMSSYNDQWGTHFYYGGPGRNPKC

AT5G56530.1 Protein of Unknown Function (DUF239)2.5e-13659.69Show/hide
Query:  IQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTV
        + +HL+ +NKP + +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP+  P++  G   + +VSE+   S      Q W  NG  C +GT+PVRRT  
Subjt:  IQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTV

Query:  KDVLRSKSLFDFGKKKR-PILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPEL
        +DVLR+ S+  +GKKK   + L R  D PD+++ +GH+HAIAY     + YGAKATINVW+P ++  NEFSLSQ+WIL GSF G DLNSIEAGWQVSP+L
Subjt:  KDVLRSKSLFDFGKKKR-PILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPEL

Query:  YGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWG
        YGD+  RLFTYWTSDAYQATGCYNLLCSGF+Q NS+IA+GA+ISP+S     QYDI+I IWKDPK G+WWM FG+  ++GYWP+ LF++LAD A++VEWG
Subjt:  YGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWG

Query:  GEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        GEVVN   +G HT+TQMGSG FPD+GF KASYFRN+++VDS N+L   + ++   E +NCY++    ND WG +FYYGGPGRNP CQ
Subjt:  GEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

AT5G56530.2 Protein of Unknown Function (DUF239)2.5e-13659.69Show/hide
Query:  IQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTV
        + +HL+ +NKP + +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP+  P++  G   + +VSE+   S      Q W  NG  C +GT+PVRRT  
Subjt:  IQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTV

Query:  KDVLRSKSLFDFGKKKR-PILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPEL
        +DVLR+ S+  +GKKK   + L R  D PD+++ +GH+HAIAY     + YGAKATINVW+P ++  NEFSLSQ+WIL GSF G DLNSIEAGWQVSP+L
Subjt:  KDVLRSKSLFDFGKKKR-PILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIEMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPEL

Query:  YGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWG
        YGD+  RLFTYWTSDAYQATGCYNLLCSGF+Q NS+IA+GA+ISP+S     QYDI+I IWKDPK G+WWM FG+  ++GYWP+ LF++LAD A++VEWG
Subjt:  YGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWG

Query:  GEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        GEVVN   +G HT+TQMGSG FPD+GF KASYFRN+++VDS N+L   + ++   E +NCY++    ND WG +FYYGGPGRNP CQ
Subjt:  GEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAACAAAAACAGGAGGGGTTTCATTTTCGATTTCAATATCAAATCTTCTTCCTTTTGGTTTGATTTTCTGTTTTGTTATCACTCAAAGATTCACTTTGGTTTGTGG
ACTCAATTATACTTATCAAAAACATCTCAGTAGCTTGAGACTGGACAGGATTCAAAGGCATTTGGATTCCATTAACAAGCCTCCCCTCCTCACCATTCAGAGCCCAGATG
GTGATATCATAGATTGTGTTCATAAAAGAAAACAGCCAGCTCTAGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAACGGAGTGGCCGAAGACGAAGGTG
GGGAAAGAGAATAAAGAAGAGGTGAGTGAAAGGAGGGCAGGATCAGGTGCGTTAGCTTCATTTCAAACTTGGCGTGTGAACGGGACACGGTGTCCAAAAGGGACTGTTCC
AGTGCGACGCACCACAGTCAAGGACGTTCTTAGATCCAAGTCTTTGTTTGACTTTGGCAAGAAGAAACGACCCATTCTCCTTGATCGAAAAATAGACGCTCCTGATGTCG
TCAGTGGGAATGGTCACGAGCATGCGATCGCGTACACTGGATCATCAGAAGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCGTCAATTGAAATGGTCAAC
GAATTCAGTCTCTCTCAGATTTGGATCCTCTCGGGATCATTCGACGGCTCTGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGTGACAGCAG
ACCTAGACTATTCACATATTGGACGAGTGATGCATATCAGGCAACGGGTTGCTATAATCTTTTATGTTCGGGATTTGTACAAACAAACAGCAAAATCGCTATAGGAGCTG
CCATTTCTCCCATTTCTTCCATCGCCGGCAGTCAATATGACATTACCATTCTCATTTGGAAGGATCCAAAGTTGGGGAATTGGTGGATGGGATTTGGGGAGAACACATTG
GTTGGGTATTGGCCAGCAGAGTTATTCACTCACTTAGCCGATCACGCCACAATGGTTGAGTGGGGTGGGGAAGTGGTTAACTCGAGGATCAATGGGCAGCACACTTCCAC
CCAAATGGGCTCTGGCCACTTCCCTGACGATGGCTTTGCCAAAGCCAGCTACTTCCGGAACCTCGAGATCGTTGACAGCGACAACAGCCTCAGCAGTGTTCAAGACATCT
CAATCATGGCTGAAAATACCAACTGTTACAATATTATGAGCTCCTATAACGATCAATGGGGCACTCACTTCTACTATGGTGGTCCTGGTAGAAATCCTAAATGCCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAACAAAAACAGGAGGGGTTTCATTTTCGATTTCAATATCAAATCTTCTTCCTTTTGGTTTGATTTTCTGTTTTGTTATCACTCAAAGATTCACTTTGGTTTGTGG
ACTCAATTATACTTATCAAAAACATCTCAGTAGCTTGAGACTGGACAGGATTCAAAGGCATTTGGATTCCATTAACAAGCCTCCCCTCCTCACCATTCAGAGCCCAGATG
GTGATATCATAGATTGTGTTCATAAAAGAAAACAGCCAGCTCTAGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAACGGAGTGGCCGAAGACGAAGGTG
GGGAAAGAGAATAAAGAAGAGGTGAGTGAAAGGAGGGCAGGATCAGGTGCGTTAGCTTCATTTCAAACTTGGCGTGTGAACGGGACACGGTGTCCAAAAGGGACTGTTCC
AGTGCGACGCACCACAGTCAAGGACGTTCTTAGATCCAAGTCTTTGTTTGACTTTGGCAAGAAGAAACGACCCATTCTCCTTGATCGAAAAATAGACGCTCCTGATGTCG
TCAGTGGGAATGGTCACGAGCATGCGATCGCGTACACTGGATCATCAGAAGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCGTCAATTGAAATGGTCAAC
GAATTCAGTCTCTCTCAGATTTGGATCCTCTCGGGATCATTCGACGGCTCTGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGTGACAGCAG
ACCTAGACTATTCACATATTGGACGAGTGATGCATATCAGGCAACGGGTTGCTATAATCTTTTATGTTCGGGATTTGTACAAACAAACAGCAAAATCGCTATAGGAGCTG
CCATTTCTCCCATTTCTTCCATCGCCGGCAGTCAATATGACATTACCATTCTCATTTGGAAGGATCCAAAGTTGGGGAATTGGTGGATGGGATTTGGGGAGAACACATTG
GTTGGGTATTGGCCAGCAGAGTTATTCACTCACTTAGCCGATCACGCCACAATGGTTGAGTGGGGTGGGGAAGTGGTTAACTCGAGGATCAATGGGCAGCACACTTCCAC
CCAAATGGGCTCTGGCCACTTCCCTGACGATGGCTTTGCCAAAGCCAGCTACTTCCGGAACCTCGAGATCGTTGACAGCGACAACAGCCTCAGCAGTGTTCAAGACATCT
CAATCATGGCTGAAAATACCAACTGTTACAATATTATGAGCTCCTATAACGATCAATGGGGCACTCACTTCTACTATGGTGGTCCTGGTAGAAATCCTAAATGCCAATGA
Protein sequenceShow/hide protein sequence
MGTKTGGVSFSISISNLLPFGLIFCFVITQRFTLVCGLNYTYQKHLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKV
GKENKEEVSERRAGSGALASFQTWRVNGTRCPKGTVPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIEMVN
EFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSIAGSQYDITILIWKDPKLGNWWMGFGENTL
VGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSSVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ