; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy2G039980 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy2G039980
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationchrH02:18073219..18080229
RNA-Seq ExpressionChy2G039980
SyntenyChy2G039980
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6590299.1 hypothetical protein SDJN03_15722, partial [Cucurbita argyrosperma subsp. sororia]9.22e-28588.03Show/hide
Query:  NLLPFGLIFCF--VVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKE
        +LLPF L      ++ QRF LVCGLN++  KQ+SSLRLDRIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPK K  KE
Subjt:  NLLPFGLIFCF--VVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKE

Query:  NKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD
        N+E+V + RAGSGA   FQTWRVNGTRCPKG+IPVRR+TV DVLR+KS+FD+GKKKRPILLDR+IDAPD+VSGNGHEHAIAYT SSE MYGAKATINVWD
Subjt:  NKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD

Query:  PSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITILIW
        PSIQ+VNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTN+KIAIGAAISP+SS SGSQYDITILIW
Subjt:  PSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITILIW

Query:  KDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTNCY
        KDPKLG+WWMGFG+NTLVGYWPAELFTHL DHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASYFRNLEIVDSDNSLS VQDISIMAENTNCY
Subjt:  KDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTNCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPKCQ
        NIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPKCQ

XP_008443756.1 PREDICTED: uncharacterized protein LOC103487273 [Cucumis melo]0.098.63Show/hide
Query:  MGTKTGVSFSISISNLLPFGLIFCFVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRG
        MGTKTGVSFSISISNLLPFGLIFCFVVTQRFTLVCGLNYTYQK LSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRG
Subjt:  MGTKTGVSFSISISNLLPFGLIFCFVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRG

Query:  PTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEE
        PTEWPKTKV KENKEEV ERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKK+RPILLDRKIDAPDVVSGNGHEHAIAYTGSSEE
Subjt:  PTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEE

Query:  MYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSL
        MYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISPISS+
Subjt:  MYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSL

Query:  SGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQ
        SGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQ
Subjt:  SGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQ

Query:  DISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        DISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  DISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

XP_011660253.1 uncharacterized protein LOC101208882 [Cucumis sativus]0.097.95Show/hide
Query:  MGTKTG-VSFSISISNLLPFGLIFCFVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
        MGTKTG VSFSISISNLLPFGLIFCFV+TQRFTLVCGLNYTYQK LSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
Subjt:  MGTKTG-VSFSISISNLLPFGLIFCFVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR

Query:  GPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE
        GPTEWPKTKVGKENKEEVSERRAGSGALA+FQTWRVNGTRCPKGT+PVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE
Subjt:  GPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE

Query:  EMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS
        EMYGAKATINVWDPSI+MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS
Subjt:  EMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS

Query:  LSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGV
        ++GSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLS V
Subjt:  LSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGV

Query:  QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

XP_038878455.1 uncharacterized protein LOC120070684 isoform X1 [Benincasa hispida]2.23e-29287.69Show/hide
Query:  MGTKTGVSFSISIS--------NLLPFGLIFCFVVT--QRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQ-------SPDGDIIDCVHKRK
        MG K GVS SISIS        ++L F L+F  +V   QRFTLVCGLNYTY KQ+SSLRL+RIQRHLDSINKPPLLTIQ       SPDGDIIDCVHKRK
Subjt:  MGTKTGVSFSISIS--------NLLPFGLIFCFVVT--QRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQ-------SPDGDIIDCVHKRK

Query:  QPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVV
        QPALDHPLLKNHKIQRGPTEWPKTK   EN+E  S R AGSGA  A QTWRVNGTRCPKG+IPVRR+TV DVLRSKSLFDFGKKKRPILLDR+IDAPDVV
Subjt:  QPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVV

Query:  SGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQ
        SGNGHEHAIAYT SSEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQ
Subjt:  SGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQ

Query:  TNSKIAIGAAISPISSLSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASY
        TN+KIAIGAAISPISS SGSQYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASY
Subjt:  TNSKIAIGAAISPISSLSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASY

Query:  FRNLEIVDSDNSLSGVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        FRNLEIVDSDNSLS VQDISI+AENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  FRNLEIVDSDNSLSGVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

XP_038878456.1 uncharacterized protein LOC120070684 isoform X2 [Benincasa hispida]3.10e-29589.06Show/hide
Query:  MGTKTGVSFSISIS--------NLLPFGLIFCFVVT--QRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHP
        MG K GVS SISIS        ++L F L+F  +V   QRFTLVCGLNYTY KQ+SSLRL+RIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHP
Subjt:  MGTKTGVSFSISIS--------NLLPFGLIFCFVVT--QRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHP

Query:  LLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEH
        LLKNHKIQRGPTEWPKTK   EN+E  S R AGSGA  A QTWRVNGTRCPKG+IPVRR+TV DVLRSKSLFDFGKKKRPILLDR+IDAPDVVSGNGHEH
Subjt:  LLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEH

Query:  AIAYTGSSEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAI
        AIAYT SSEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAI
Subjt:  AIAYTGSSEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAI

Query:  GAAISPISSLSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIV
        GAAISPISS SGSQYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASYFRNLEIV
Subjt:  GAAISPISSLSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIV

Query:  DSDNSLSGVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        DSDNSLS VQDISI+AENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  DSDNSLSGVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

TrEMBL top hitse value%identityAlignment
A0A0A0LXY9 Uncharacterized protein1.1e-25697.95Show/hide
Query:  MGTKT-GVSFSISISNLLPFGLIFCFVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
        MGTKT GVSFSISISNLLPFGLIFCFV+TQRFTLVCGLNYTYQK LSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
Subjt:  MGTKT-GVSFSISISNLLPFGLIFCFVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR

Query:  GPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE
        GPTEWPKTKVGKENKEEVSERRAGSGALA+FQTWRVNGTRCPKGT+PVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE
Subjt:  GPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSE

Query:  EMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS
        EMYGAKATINVWDPSI+MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS
Subjt:  EMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISS

Query:  LSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGV
        ++GSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLS V
Subjt:  LSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGV

Query:  QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  QDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

A0A1S3B8R5 uncharacterized protein LOC1034872732.4e-25698.63Show/hide
Query:  MGTKTGVSFSISISNLLPFGLIFCFVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRG
        MGTKTGVSFSISISNLLPFGLIFCFVVTQRFTLVCGLNYTYQK LSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRG
Subjt:  MGTKTGVSFSISISNLLPFGLIFCFVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRG

Query:  PTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEE
        PTEWPKTKV KENKEEV ERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKK+RPILLDRKIDAPDVVSGNGHEHAIAYTGSSEE
Subjt:  PTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEE

Query:  MYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSL
        MYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISPISS+
Subjt:  MYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSL

Query:  SGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQ
        SGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQ
Subjt:  SGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQ

Query:  DISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        DISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  DISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

A0A6J1DDR6 uncharacterized protein LOC1110195905.5e-21685.45Show/hide
Query:  LPFGLIFCFVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKE-
        L   L    VV +RF+LV GLNYTY KQ+SSLRLDRIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ K  KE  E 
Subjt:  LPFGLIFCFVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKE-

Query:  ---EVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD
           + SE R GSGA  A+QTWRVNGTRCPKG+IPVRR+TV DVLR+KSLFDFGKK+RPILLDR++DAPDVVSGNGHEHAIAYTGSS+EMYGAKATINVWD
Subjt:  ---EVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD

Query:  PSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITILIW
        PSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISPISS +GSQYD+TILIW
Subjt:  PSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITILIW

Query:  KDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTNCY
        KDPKLGNWWMGFG+NTLVGYWPAELFTHL DHATMVEWGGEVVNS+  G+HTSTQMGSG F ++GF KASYFRNLEIVDSDNSLS VQ+IS +AEN +CY
Subjt:  KDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTNCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPKCQ
        NIMSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPKCQ

A0A6J1HAZ7 uncharacterized protein LOC1114616982.1e-22387.85Show/hide
Query:  ISNLLPFGLIFC--FVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVG
        + +LLPF L      ++ QRF LVCGLN++  KQ+SSLRLDRIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPK K  
Subjt:  ISNLLPFGLIFC--FVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVG

Query:  KENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINV
        KEN+E+V + RAGSGA   FQTWRVNGTRCPKG+IPVRR+TV DVLR+KS+FD+GKKKRPILLDR+IDAPDVVSGNGHEHAIAYT SS EMYGAKATINV
Subjt:  KENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINV

Query:  WDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITIL
        WDPSIQ+VNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTN+KIAIGAAISP+SS SGSQYDITIL
Subjt:  WDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITIL

Query:  IWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTN
        IWKDPKLG+WWMGFG+NTLVGYWPAELFTHL DHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASYFRNLEIVDSDNSLS VQDISIMAENTN
Subjt:  IWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTN

Query:  CYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        CYNIMSSYNDQWGTHFYYGGPGRNPKCQ
Subjt:  CYNIMSSYNDQWGTHFYYGGPGRNPKCQ

A0A6J1JIQ1 uncharacterized protein LOC1114854602.7e-22388.06Show/hide
Query:  ISNLLPFGLIFC--FVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVG
        + +LLPF L F    ++ QRF LVCGLN++  KQ+SSLRLDRIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPK K  
Subjt:  ISNLLPFGLIFC--FVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVG

Query:  KENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINV
        KEN+E+V + RAGSGA   FQTWRVNGTRCPKG+IPVRR+TV DVLR+KS+FD+GKKKRPILLDR+IDAPDVVSGNGHEHAIAYT SS EMYGAKATINV
Subjt:  KENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINV

Query:  WDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITIL
        WDPSIQ+VNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTN+KIAIGAAISP+SS SGSQYDITIL
Subjt:  WDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITIL

Query:  IWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTN
        IWKDPKLG+WWMGFG+NTLVGYWPAELFTHL DHATMVEWGGEVVNSR NGQHTSTQMGSGHFPDDGF KASYFRNLEIVDSDNSLS VQDISIMAENTN
Subjt:  IWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTN

Query:  CYNIMSSYNDQWGTHFYYGGPGRNPKC
        CYNIMSSYNDQWGTHFYYGGPGRNPKC
Subjt:  CYNIMSSYNDQWGTHFYYGGPGRNPKC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)1.5e-13355.02Show/hide
Query:  GLIFCFVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSE
        G + C  +   F+    L+Y  +  +S  + + +++HL+ +NKP + +IQS DGD+IDCV   KQPA DHP LK+HKIQ  P   P+   G  +  +VS 
Subjt:  GLIFCFVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSE

Query:  RRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQMVN
         ++        Q W   G +C +GTIP+RRT   DVLR+ S+  +GKKKR  +   K   PD+++ +GH+HAIAY    ++ YGAKATINVW+P IQ  N
Subjt:  RRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQMVN

Query:  EFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITILIWKDPKLGN
        EFSLSQIW+L GSF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLCSGF+Q NS IA+GA+ISP+S    SQYDI+ILIWKDPK G+
Subjt:  EFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITILIWKDPKLGN

Query:  WWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTNCYNIMSSYN
        WWM FG   ++GYWP+ LF++L + A+M+EWGGEVVNS+ +GQHTSTQMGSG FP++GF+KASYFRN+++VD  N+L   + +    E +NCY++ +  N
Subjt:  WWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTNCYNIMSSYN

Query:  DQWGTHFYYGGPGRNPKC
        D WG +FYYGGPG+N KC
Subjt:  DQWGTHFYYGGPGRNPKC

AT3G13510.1 Protein of Unknown Function (DUF239)1.8e-13457.91Show/hide
Query:  SSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTI
        SS +   +++HL+ +NKPP+ TIQSPDGDIIDC+   KQPA DHP LK+HKIQ  P+  P+   G  +  +VS    G       Q W   G +C +GTI
Subjt:  SSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTI

Query:  PVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGW
        P+RRT   DVLR+ S+  +GKKK   +   K   PD+++ NGH+HAIAY    ++ YGAKAT+NVW+P IQ  NEFSLSQIW+L GSF G DLNSIEAGW
Subjt:  PVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGW

Query:  QVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHA
        QVSP+LYGD+  RLFTYWTSDAYQATGCYNLLCSGF+Q NS IA+GA+ISP+S    SQYDI+ILIWKDPK G+WWM FG   ++GYWP+ LF++L + A
Subjt:  QVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHA

Query:  TMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKC
        +M+EWGGEVVNS+  G HT TQMGSGHFP++GF+KASYFRN+++VD  N+L   + +    E +NCY++ +  ND WG +FYYGGPG+N  C
Subjt:  TMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKC

AT5G18460.1 Protein of Unknown Function (DUF239)2.3e-18269.91Show/hide
Query:  VSFSISISNLLPFGLIFCFVVTQRFTLVCGLNYTY-QKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP
        V+   S  +LL F LI    ++Q+   +   N T   +Q+SSLRL RIQ+HL+ INK P+ TIQSPDGD+IDCV KRKQPALDHPLLK+HKIQ+ P + P
Subjt:  VSFSISISNLLPFGLIFCFVVTQRFTLVCGLNYTY-QKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP

Query:  KTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAK
        K K   ++ +E      G     A+Q W VNGTRCPKGT+P+RR T+ DVLR+KSLFDFGKK+R I LD++ + PD +  NGHEHAIAYT SS E+YGAK
Subjt:  KTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAK

Query:  ATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQY
        ATINVWDP I+ VNEFSLSQIWILSGSF G DLNSIEAGWQVSPELYGD+RPRLFTYWTSD+YQATGCYNLLCSGF+QTN+KIAIGAAISP+S+  G+Q+
Subjt:  ATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQY

Query:  DITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIM
        DITILIWKDPK+GNWWMG G++TLVGYWPAELFTHLADHAT VEWGGEVVN+R +G+HT+TQMGSGHFPD+GF KASYFRNLE+VDSDNSL  V D+ I+
Subjt:  DITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIM

Query:  AENTNCYNIMSSYNDQWGTHFYYGGPGRNPKC
        AENT CY+I SSY+++WGT+FYYGGPG NP+C
Subjt:  AENTNCYNIMSSYNDQWGTHFYYGGPGRNPKC

AT5G56530.1 Protein of Unknown Function (DUF239)3.0e-13760.21Show/hide
Query:  IQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTV
        + +HL+ +NKP + +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP+  P++  G   + +VSE+   S      Q W  NG  C +GTIPVRRT  
Subjt:  IQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTV

Query:  KDVLRSKSLFDFGKKKR-PILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPEL
        +DVLR+ S+  +GKKK   + L R  D PD+++ +GH+HAIAY     + YGAKATINVW+P +Q  NEFSLSQ+WIL GSF G DLNSIEAGWQVSP+L
Subjt:  KDVLRSKSLFDFGKKKR-PILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPEL

Query:  YGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWG
        YGD+  RLFTYWTSDAYQATGCYNLLCSGF+Q NS+IA+GA+ISP+S     QYDI+I IWKDPK G+WWM FG+  ++GYWP+ LF++LAD A++VEWG
Subjt:  YGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWG

Query:  GEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        GEVVN   +G HT+TQMGSG FPD+GF KASYFRN+++VDS N+L   + ++   E +NCY++    ND WG +FYYGGPGRNP CQ
Subjt:  GEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ

AT5G56530.2 Protein of Unknown Function (DUF239)3.0e-13760.21Show/hide
Query:  IQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTV
        + +HL+ +NKP + +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP+  P++  G   + +VSE+   S      Q W  NG  C +GTIPVRRT  
Subjt:  IQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVGKENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTV

Query:  KDVLRSKSLFDFGKKKR-PILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPEL
        +DVLR+ S+  +GKKK   + L R  D PD+++ +GH+HAIAY     + YGAKATINVW+P +Q  NEFSLSQ+WIL GSF G DLNSIEAGWQVSP+L
Subjt:  KDVLRSKSLFDFGKKKR-PILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPEL

Query:  YGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWG
        YGD+  RLFTYWTSDAYQATGCYNLLCSGF+Q NS+IA+GA+ISP+S     QYDI+I IWKDPK G+WWM FG+  ++GYWP+ LF++LAD A++VEWG
Subjt:  YGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITILIWKDPKLGNWWMGFGENTLVGYWPAELFTHLADHATMVEWG

Query:  GEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ
        GEVVN   +G HT+TQMGSG FPD+GF KASYFRN+++VDS N+L   + ++   E +NCY++    ND WG +FYYGGPGRNP CQ
Subjt:  GEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAACAAAAACAGGGGTTTCATTTTCAATTTCAATATCAAATCTTCTTCCTTTTGGTTTGATTTTCTGTTTTGTTGTTACTCAAAGATTCACTTTGGTTTGTGGACT
CAATTATACTTATCAAAAACAACTCAGTAGCTTGAGACTGGACAGGATTCAAAGGCATTTGGATTCCATTAACAAGCCTCCCCTCCTCACCATTCAGAGCCCAGATGGTG
ATATTATAGATTGTGTTCATAAAAGAAAACAGCCAGCTCTAGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAACGGAGTGGCCGAAAACGAAGGTGGGG
AAAGAGAATAAAGAGGAGGTGAGTGAAAGGAGGGCAGGATCAGGTGCGTTAGCTGCATTTCAAACTTGGCGTGTGAACGGGACACGGTGTCCAAAAGGTACTATTCCAGT
GCGACGCACCACAGTAAAGGACGTTCTTAGATCCAAGTCTTTGTTTGACTTTGGCAAGAAAAAACGACCCATTCTCCTTGATCGTAAAATAGACGCTCCTGATGTCGTCA
GTGGGAATGGTCACGAGCATGCGATCGCGTACACTGGATCATCAGAAGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCGTCAATACAAATGGTCAACGAA
TTCAGTCTCTCTCAGATTTGGATCCTCTCGGGATCATTCGACGGCTCTGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGTGACAGCAGACC
TAGACTATTCACATATTGGACGAGTGATGCATATCAGGCAACGGGTTGCTATAATCTTTTATGTTCGGGATTTGTACAAACAAACAGCAAAATCGCGATCGGAGCCGCCA
TTTCTCCCATTTCTTCTCTCTCCGGCAGTCAATATGACATTACCATTCTCATTTGGAAGGATCCAAAGTTGGGGAATTGGTGGATGGGATTTGGGGAGAACACATTGGTT
GGGTATTGGCCAGCAGAGTTATTCACTCACTTAGCCGACCACGCCACAATGGTTGAGTGGGGTGGGGAAGTGGTTAACTCGAGGATCAATGGGCAGCACACTTCCACCCA
AATGGGCTCTGGCCACTTCCCTGACGATGGCTTTGCCAAAGCCAGCTACTTCCGAAACCTCGAGATCGTTGACAGCGACAACAGCCTCAGCGGTGTTCAAGACATCTCAA
TCATGGCTGAAAATACCAACTGTTACAATATTATGAGCTCCTATAACGATCAATGGGGCACTCACTTCTACTATGGTGGTCCTGGTAGAAATCCTAAATGCCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAACAAAAACAGGGGTTTCATTTTCAATTTCAATATCAAATCTTCTTCCTTTTGGTTTGATTTTCTGTTTTGTTGTTACTCAAAGATTCACTTTGGTTTGTGGACT
CAATTATACTTATCAAAAACAACTCAGTAGCTTGAGACTGGACAGGATTCAAAGGCATTTGGATTCCATTAACAAGCCTCCCCTCCTCACCATTCAGAGCCCAGATGGTG
ATATTATAGATTGTGTTCATAAAAGAAAACAGCCAGCTCTAGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAACGGAGTGGCCGAAAACGAAGGTGGGG
AAAGAGAATAAAGAGGAGGTGAGTGAAAGGAGGGCAGGATCAGGTGCGTTAGCTGCATTTCAAACTTGGCGTGTGAACGGGACACGGTGTCCAAAAGGTACTATTCCAGT
GCGACGCACCACAGTAAAGGACGTTCTTAGATCCAAGTCTTTGTTTGACTTTGGCAAGAAAAAACGACCCATTCTCCTTGATCGTAAAATAGACGCTCCTGATGTCGTCA
GTGGGAATGGTCACGAGCATGCGATCGCGTACACTGGATCATCAGAAGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCGTCAATACAAATGGTCAACGAA
TTCAGTCTCTCTCAGATTTGGATCCTCTCGGGATCATTCGACGGCTCTGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGTGACAGCAGACC
TAGACTATTCACATATTGGACGAGTGATGCATATCAGGCAACGGGTTGCTATAATCTTTTATGTTCGGGATTTGTACAAACAAACAGCAAAATCGCGATCGGAGCCGCCA
TTTCTCCCATTTCTTCTCTCTCCGGCAGTCAATATGACATTACCATTCTCATTTGGAAGGATCCAAAGTTGGGGAATTGGTGGATGGGATTTGGGGAGAACACATTGGTT
GGGTATTGGCCAGCAGAGTTATTCACTCACTTAGCCGACCACGCCACAATGGTTGAGTGGGGTGGGGAAGTGGTTAACTCGAGGATCAATGGGCAGCACACTTCCACCCA
AATGGGCTCTGGCCACTTCCCTGACGATGGCTTTGCCAAAGCCAGCTACTTCCGAAACCTCGAGATCGTTGACAGCGACAACAGCCTCAGCGGTGTTCAAGACATCTCAA
TCATGGCTGAAAATACCAACTGTTACAATATTATGAGCTCCTATAACGATCAATGGGGCACTCACTTCTACTATGGTGGTCCTGGTAGAAATCCTAAATGCCAATAA
Protein sequenceShow/hide protein sequence
MGTKTGVSFSISISNLLPFGLIFCFVVTQRFTLVCGLNYTYQKQLSSLRLDRIQRHLDSINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKTKVG
KENKEEVSERRAGSGALAAFQTWRVNGTRCPKGTIPVRRTTVKDVLRSKSLFDFGKKKRPILLDRKIDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQMVNE
FSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCSGFVQTNSKIAIGAAISPISSLSGSQYDITILIWKDPKLGNWWMGFGENTLV
GYWPAELFTHLADHATMVEWGGEVVNSRINGQHTSTQMGSGHFPDDGFAKASYFRNLEIVDSDNSLSGVQDISIMAENTNCYNIMSSYNDQWGTHFYYGGPGRNPKCQ