; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G012040 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G012040
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPHD domain-containing protein
Genome locationchr06:22410814..22422190
RNA-Seq ExpressionLsi06G012040
SyntenyLsi06G012040
Gene Ontology termsGO:0046274 - lignin catabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0048046 - apoplast (cellular component)
GO:0005507 - copper ion binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0052716 - hydroquinone:oxygen oxidoreductase activity (molecular function)
InterPro domainsIPR000949 - ELM2 domain
IPR001965 - Zinc finger, PHD-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR011124 - Zinc finger, CW-type
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587732.1 tRNA-specific adenosine deaminase TAD2, partial [Cucurbita argyrosperma subsp. sororia]1.5e-24677.37Show/hide
Query:  WLGNLRRA-LWGLIHFLPSGR------------VQCFGVGKIGA--------EWIDFGELQGLPIRRKIDT---------------MCPHCDEFSHDGCR
        WLGNLR   L  +IHFLP GR            + C+   K           +W  F +       +                   MCPHCDEF HDGCR
Subjt:  WLGNLRRA-LWGLIHFLPSGR------------VQCFGVGKIGA--------EWIDFGELQGLPIRRKIDT---------------MCPHCDEFSHDGCR

Query:  KAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRNHKSEIVGNAVLSFP
        KAG IIEEKKN+GG RCLNFPRAF QIST+SMMP GSKSNVVY+RKKLRGNSDSRLLANGTDC SLISCD  L EDKEQA  S+  HKSEIVGN +   P
Subjt:  KAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRNHKSEIVGNAVLSFP

Query:  VYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGL
        V  GK QVSE ES+NGCTIGEGHGSDET NNNLQKSLEVDSINDSCSSSKSNME +STS+KVEVDDTGECSSSSI+VMED VEDISGRDLCISILRSNGL
Subjt:  VYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGL

Query:  LYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLANTLSRNGSSKGESNS
        L SM HASE+ESD RS +NCFR CKTCGSS+S LKMLICDHCEDAFHV C NHRMKKVSNDEWYCNSCLKKKHKIL E I+KKLAN  SRNGSSK ESNS
Subjt:  LYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLANTLSRNGSSKGESNS

Query:  IALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGVICGKWRRAPLFEVQ
        IALML DTEPYTTGVRIGKGFQAEVPDWSGPI DDTDA GEPLEMDPS SFLMHEQSTNKPCRLS IGNWLQCQQVIDG+GGVNGVICGKWRRAPLFEVQ
Subjt:  IALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGVICGKWRRAPLFEVQ

Query:  TDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHK
        TDDWECFCSILWDP HADCAVPQELETGQVLKQLKYIEMLRPRL+SKRRKLDE KSRSDVQNL E+TEHK
Subjt:  TDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHK

XP_008453559.1 PREDICTED: uncharacterized protein LOC103494237 isoform X1 [Cucumis melo]3.8e-25387.7Show/hide
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN
        MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPR FP   T  MM EGSKSNVVYRRKKLRG+SDSR LANGTDCISLISCD  L EDKEQAAASQRN
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN

Query:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS
        H+ EIVGNAV  FPV DGKTQVSE ES NGC  GEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMEL+STS+KVEVDDTGECSSSSIQVMEDTVEDIS
Subjt:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS

Query:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN
        GRDLCISILRSNGLL SM H  EEESD RS++NCFR CKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHK+LKE ISKKL N
Subjt:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN

Query:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV
        TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPI DDTDAIGEPLEMD SESFLMHEQSTNK CRLSTIGNWLQCQQV+DG+GG NG 
Subjt:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQ-------------------ELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDT
        ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQ                   ELETGQVLKQLKYIEMLRPRL+SKRRKLDEAKSRSDVQNLTEDT
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQ-------------------ELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDT

Query:  EHKP
        E+KP
Subjt:  EHKP

XP_008453560.1 PREDICTED: uncharacterized protein LOC103494237 isoform X2 [Cucumis melo]1.2e-25691.13Show/hide
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN
        MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPR FP   T  MM EGSKSNVVYRRKKLRG+SDSR LANGTDCISLISCD  L EDKEQAAASQRN
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN

Query:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS
        H+ EIVGNAV  FPV DGKTQVSE ES NGC  GEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMEL+STS+KVEVDDTGECSSSSIQVMEDTVEDIS
Subjt:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS

Query:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN
        GRDLCISILRSNGLL SM H  EEESD RS++NCFR CKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHK+LKE ISKKL N
Subjt:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN

Query:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV
        TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPI DDTDAIGEPLEMD SESFLMHEQSTNK CRLSTIGNWLQCQQV+DG+GG NG 
Subjt:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHKP
        ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRL+SKRRKLDEAKSRSDVQNLTEDTE+KP
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHKP

XP_011656989.1 uncharacterized protein LOC101212408 isoform X1 [Cucumis sativus]3.6e-24888.87Show/hide
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN
        MCPHCDEFSHDGCRKAG  IEEKKN+GGLRCLNFPR FP   TV MMPEGSKSNVVYRRKKLRG+SDSR LANGTDCISLISCD  L EDKEQAAASQ N
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN

Query:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS
        H+ EIVGNAV  FPV DGKTQVSE ES NGC  GEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMEL+S S+KVEVDDTGECSSSSIQVM D +EDIS
Subjt:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS

Query:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN
        GRDLCISILRSNGLL S  HA EEESDFRS++NCFR CKTCGSSESVLKMLICDHCEDAFHVSCCNHRMK+VSNDEW CNSCLKK HKILKE ISKKL N
Subjt:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN

Query:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV
        T SRNGSSKGESNSIALMLKDT+PYTT +RIGKGFQAEVPDWSGPI DDTDAIGEPLEMD SESF MHEQSTNKPCRLSTIGNWLQCQQVIDG+GG NG 
Subjt:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHKP
        ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQV KQLKYIEMLRPRL+SKRRKLDE KSRSDVQNLTEDTEHKP
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHKP

XP_038878482.1 uncharacterized protein LOC120070708 isoform X1 [Benincasa hispida]2.0e-26292.78Show/hide
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN
        MCPHCDEFS DGCRKAGPIIEEKKNNGG RCLNFPRAFPQISTVSMMPE SKSNVVYRRKKLRGNSDSRLLANGTDCISL SCD  LGEDKEQAAASQ N
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN

Query:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS
        HK+EI+GN V  FPVY+GKTQVSE ESVNGC  GEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMEL+STSVKVEVDDTGECSSSSIQVMED VEDIS
Subjt:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS

Query:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN
        GRDLCI ILRSNGLL SM HA EEESDFRS++NCFR CKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN
Subjt:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN

Query:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV
          SRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPI DDTDA GEPLE+DPSESFLMHE+STNKPCRLSTIGNWLQCQQVIDG+GGVNG 
Subjt:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHKP
        ICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQELETGQVLKQLKYIEMLRPRL+SKRRKLDEAKSRSDVQNLTEDTEHKP
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHKP

TrEMBL top hitse value%identityAlignment
A0A1S3BWJ2 uncharacterized protein LOC103494237 isoform X11.8e-25387.7Show/hide
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN
        MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPR FP   T  MM EGSKSNVVYRRKKLRG+SDSR LANGTDCISLISCD  L EDKEQAAASQRN
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN

Query:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS
        H+ EIVGNAV  FPV DGKTQVSE ES NGC  GEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMEL+STS+KVEVDDTGECSSSSIQVMEDTVEDIS
Subjt:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS

Query:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN
        GRDLCISILRSNGLL SM H  EEESD RS++NCFR CKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHK+LKE ISKKL N
Subjt:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN

Query:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV
        TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPI DDTDAIGEPLEMD SESFLMHEQSTNK CRLSTIGNWLQCQQV+DG+GG NG 
Subjt:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQ-------------------ELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDT
        ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQ                   ELETGQVLKQLKYIEMLRPRL+SKRRKLDEAKSRSDVQNLTEDT
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQ-------------------ELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDT

Query:  EHKP
        E+KP
Subjt:  EHKP

A0A1S3BXC2 uncharacterized protein LOC103494237 isoform X26.0e-25791.13Show/hide
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN
        MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPR FP   T  MM EGSKSNVVYRRKKLRG+SDSR LANGTDCISLISCD  L EDKEQAAASQRN
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN

Query:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS
        H+ EIVGNAV  FPV DGKTQVSE ES NGC  GEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMEL+STS+KVEVDDTGECSSSSIQVMEDTVEDIS
Subjt:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS

Query:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN
        GRDLCISILRSNGLL SM H  EEESD RS++NCFR CKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHK+LKE ISKKL N
Subjt:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN

Query:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV
        TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPI DDTDAIGEPLEMD SESFLMHEQSTNK CRLSTIGNWLQCQQV+DG+GG NG 
Subjt:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHKP
        ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRL+SKRRKLDEAKSRSDVQNLTEDTE+KP
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHKP

A0A6J1C1R7 uncharacterized protein LOC1110071733.3e-23186.02Show/hide
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN
        MCPHCDEFSH GCRKAGPII+EKKNN G  CLN PRA  QISTVS MPEGS S VVYRRKKLRGNSDSRL ANGTDCIS ISCD  LGE+ EQAAASQ  
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN

Query:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS
         +S+IVGN V   PVYDGKT VSE ESVNGCTIGEGHGSDET NNNLQK+LEVDSINDSCSSSKSNMEL+STS+KVEVDDTGECSSSSIQVMED +EDIS
Subjt:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS

Query:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN
        GRDLCISILRSNGLL  M HA +EES+F+S+ NCFRSCK CGSSESVLKMLICDHCEDAFH+SCCNHRMKKVSNDEWYCNSCLKKKHK+LKETI+ KLAN
Subjt:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN

Query:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV
          SR+GSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPI DDTDAIGEPLE+DPSESF MHEQSTNKPCRLS IGNWLQCQQVI      NG+
Subjt:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKL
        ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELET QVLKQLKYIEMLRPRL+SKRRK+
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKL

A0A6J1F145 uncharacterized protein LOC111441172 isoform X11.8e-24587.6Show/hide
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN
        MCPHCDEF HDGCRKAG IIEEKKN+GG RCLNFPRAF QIST+SMMP GSKSNVVY+RKKLRGNSDSRLLANGTDC SLISCD  L EDKEQA  SQ  
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN

Query:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS
        HKSEIVGN +   PV  GK QVSE ES+NGCTIGEGHGSDET NNNLQKSLEVDSINDSCSSSKSNME +STS+KVEVDDTGECSSSSI+VMED VEDIS
Subjt:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS

Query:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN
        GRDLCISILRSNGLL SM HASE+ESD RS +NCFR CKTCGSS+S LKMLICDHCEDAFHV C NHRMKKVSNDEWYCNSCLKKKHKIL E I+KKLAN
Subjt:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN

Query:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV
          SRNGSSK ESNSIALML DTEPYTTGVRIGKGFQAEVPDWSGPI DDTDA GEPLEMDPS SFLMHEQSTNKPCRLS IGNWLQCQQVIDG+GGVNGV
Subjt:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHK
        ICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQELETGQVLKQLKYIEMLRPRL+SKRRKLDE KSRSDVQNL E+TEHK
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHK

A0A6J1IGX7 uncharacterized protein LOC111472812 isoform X17.7e-24486.98Show/hide
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN
        MCPHCDEF HDGCRKAG IIEEKKN+GGLRCLNFPRAF QIST+SMMP GSKSNVVY+RKKLRGNSDSRLLANGTDC SLISCD  L EDKEQA  S+  
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRN

Query:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS
        HKSEIVGN +   PV DGK QVS  ES+NGCTIGEGHGSDET NNNLQKSLEVDSINDSCSSSKSNME +STS+KVEVDDTGECSSSSI+VMED VEDIS
Subjt:  HKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDIS

Query:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN
        GRDLCISILRSNGLL SM HASE+ESD RS +NCFR CKTCGSS+S LKMLICDHCEDAFHV C NHRMKKVSNDEWYCNSCLKKKHKIL E I+KKLAN
Subjt:  GRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN

Query:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV
          SRNGSSK ESNSIALML DTEPYTTGVRIGKGFQAEVPDWSG I DDTDA  EPLEMDPS SFLMHEQSTNKPCRLS IGNWLQCQQVIDG+GGVNGV
Subjt:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHK
        ICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQELETGQVLKQLKYIEMLRPRL+SKRRKLDE +SRSDVQNL E+TEHK
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHK

SwissProt top hitse value%identityAlignment
Q63625 PHD and RING finger domain-containing protein 12.7e-0432.2Show/hide
Query:  EESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC
        E+ +    D  F  C+ CG S+   ++L+CD C+  +H+ C +  +++V  DEW+C  C
Subjt:  EESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC

Q9FNE9 Histone-lysine N-methyltransferase ATXR66.0e-0434.85Show/hide
Query:  SDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILK
        SD  S+ +    C+ C S +   K+L+CD C+  FH+ C    +  V    W+C SC   KH+I K
Subjt:  SDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILK

Q9HDV4 Lid2 complex component lid21.6e-0435.29Show/hide
Query:  CKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC------------LKKKHKILKETISKKLANTL-SRNGSSK
        C+ CG  ++   +L+CD CE A+H SC +  +  +  ++WYC++C             K K   LKE  S ++ NTL  RN SSK
Subjt:  CKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC------------LKKKHKILKETISKKLANTL-SRNGSSK

Q9P1Y6 PHD and RING finger domain-containing protein 11.6e-0435.48Show/hide
Query:  ASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC
        ASEEE D          C+ CG S+   ++L+CD C+  +H+ C +  +++V  DEW+C  C
Subjt:  ASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC

Q9SGH2 Methyl-CpG-binding domain-containing protein 91.2e-0439.13Show/hide
Query:  SCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC
        SC  CG  ES+  +++CD CE  FH+SC N  ++   + +W C+ C
Subjt:  SCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC

Arabidopsis top hitse value%identityAlignment
AT1G77250.1 RING/FYVE/PHD-type zinc finger family protein7.3e-0525.2Show/hide
Query:  SKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRNHKSEIVGNAVLSFPVYDGKTQVSEPESVN-GCTIGEGH--GSDE--TPNN
        SK    Y+R+KL G S S    +  D  S+        E +E  +  + + ++ + G            ++ +  E+V  GC     H   S E  + N 
Subjt:  SKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRNHKSEIVGNAVLSFPVYDGKTQVSEPESVN-GCTIGEGH--GSDE--TPNN

Query:  NLQKSLEVDSINDSCSSSKSNMELLSTSVKVEV-DDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLYSMVHASEEESDFRS-------------N
         L ++L+   I+D  S +     L+ T +K  V + +    S+ +Q +   ++D+ G D+  ++L ++ L  S     E+   F +             N
Subjt:  NLQKSLEVDSINDSCSSSKSNMELLSTSVKVEV-DDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLYSMVHASEEESDFRS-------------N

Query:  DNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKK
        D+    CK CG        L CDHCED +HVSC     K +    WYC  C  K
Subjt:  DNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKK

AT2G19260.1 RING/FYVE/PHD zinc finger superfamily protein1.9e-6144.14Show/hide
Query:  DSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLIC
        D  NDSCSS KS+ E+ STS K   DD   C SS   V                                 E+D   + + FR CK C    +V KMLIC
Subjt:  DSINDSCSSSKSNMELLSTSVKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLIC

Query:  DHCEDAFHVSCCNHRMKKVSN-DEWYCNSCLKKKHKILKETISKKLANTLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDA
        D CE+A+H  CC  +MK V+  DEW C SCLK +                S    +KG   S     + T P+  G+RIGK FQA+VPDWSGP   DT  
Subjt:  DHCEDAFHVSCCNHRMKKVSN-DEWYCNSCLKKKHKILKETISKKLANTLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDA

Query:  IGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIE
        +GEPLE+  SE     +++ N   + S + NWLQC++        NGVICGKWRRAP  EVQT DWECFC   WDP+ ADCAVPQELET ++LKQLKYI+
Subjt:  IGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGMGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIE

Query:  MLRPRLSSKRRKL-DEAKSRSDVQ
        MLRPR  +K+RKL  + +SRS ++
Subjt:  MLRPRLSSKRRKL-DEAKSRSDVQ

AT3G01460.1 methyl-CPG-binding domain 98.7e-0639.13Show/hide
Query:  SCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC
        SC  CG  ES+  +++CD CE  FH+SC N  ++   + +W C+ C
Subjt:  SCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC

AT5G24330.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 64.3e-0534.85Show/hide
Query:  SDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILK
        SD  S+ +    C+ C S +   K+L+CD C+  FH+ C    +  V    W+C SC   KH+I K
Subjt:  SDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGAATGGCTTGGGAACTTGCGTCGTGCTCTTTGGGGACTGATTCATTTTCTTCCATCTGGAAGGGTTCAATGTTTTGGAGTTGGGAAGATAGGGGCCGAATGGAT
AGATTTTGGTGAGTTACAGGGGCTTCCAATCAGGAGGAAGATTGACACAATGTGCCCCCATTGTGATGAATTCTCTCATGATGGCTGCAGAAAAGCTGGACCAATCATAG
AGGAAAAGAAGAACAATGGTGGATTGCGTTGCTTAAATTTTCCAAGGGCCTTTCCCCAGATATCAACTGTTAGTATGATGCCTGAAGGTTCAAAATCTAATGTAGTATAT
AGGAGAAAGAAATTGCGAGGCAATTCTGATTCCAGGTTGTTGGCTAATGGGACAGATTGTATATCTTTGATTAGTTGCGATAGTCAGTTGGGAGAAGACAAAGAGCAAGC
TGCAGCTTCTCAACGTAACCACAAAAGTGAAATAGTTGGAAATGCTGTCCTATCTTTTCCTGTTTACGATGGAAAAACTCAAGTTTCAGAACCAGAATCAGTCAATGGTT
GTACCATTGGGGAAGGGCATGGTTCAGACGAAACACCTAATAACAACCTGCAAAAAAGTTTGGAGGTTGACAGCATTAATGATAGCTGCTCCTCATCTAAGTCAAACATG
GAACTTCTTTCAACTTCTGTGAAAGTTGAAGTGGATGACACAGGTGAGTGCTCCTCTTCTAGTATTCAAGTTATGGAGGATACGGTTGAGGATATTTCAGGAAGAGATCT
ATGCATCTCTATCCTTAGAAGCAATGGGCTTCTGTATTCTATGGTTCATGCTTCTGAGGAAGAAAGTGATTTTAGAAGCAACGATAATTGTTTTCGATCATGCAAAACGT
GTGGCTCTTCAGAATCAGTCTTGAAGATGTTAATCTGTGATCATTGTGAAGATGCATTTCATGTCTCATGTTGCAATCATCGCATGAAGAAAGTGTCAAATGATGAGTGG
TATTGCAATTCATGTCTGAAGAAGAAGCATAAAATTTTGAAGGAAACAATCTCAAAGAAATTGGCAAACACCTTGAGTAGAAATGGATCTTCTAAGGGTGAATCAAATTC
CATAGCATTAATGTTAAAGGACACAGAACCTTATACAACTGGTGTTCGGATTGGCAAAGGTTTTCAAGCAGAAGTTCCAGATTGGTCTGGCCCGATTCCAGATGATACCG
ATGCCATTGGAGAGCCACTGGAAATGGATCCTTCAGAATCTTTTCTTATGCATGAGCAGAGTACCAACAAACCTTGTAGATTGAGCACTATTGGAAATTGGCTTCAATGT
CAACAAGTTATAGATGGAATGGGTGGTGTTAATGGAGTCATATGTGGCAAGTGGCGCAGGGCTCCTCTTTTTGAAGTCCAAACTGATGACTGGGAATGCTTCTGCTCTAT
CCTCTGGGATCCGACACATGCAGATTGTGCTGTACCTCAGGAATTGGAAACGGGGCAAGTTTTAAAGCAGTTGAAGTACATTGAGATGCTGAGGCCTCGGTTATCTTCCA
AACGACGGAAATTGGATGAGGCCAAGAGCAGAAGTGATGTGCAGAACCTTACAGAGGATACAGAACACAAACCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTGAATGGCTTGGGAACTTGCGTCGTGCTCTTTGGGGACTGATTCATTTTCTTCCATCTGGAAGGGTTCAATGTTTTGGAGTTGGGAAGATAGGGGCCGAATGGAT
AGATTTTGGTGAGTTACAGGGGCTTCCAATCAGGAGGAAGATTGACACAATGTGCCCCCATTGTGATGAATTCTCTCATGATGGCTGCAGAAAAGCTGGACCAATCATAG
AGGAAAAGAAGAACAATGGTGGATTGCGTTGCTTAAATTTTCCAAGGGCCTTTCCCCAGATATCAACTGTTAGTATGATGCCTGAAGGTTCAAAATCTAATGTAGTATAT
AGGAGAAAGAAATTGCGAGGCAATTCTGATTCCAGGTTGTTGGCTAATGGGACAGATTGTATATCTTTGATTAGTTGCGATAGTCAGTTGGGAGAAGACAAAGAGCAAGC
TGCAGCTTCTCAACGTAACCACAAAAGTGAAATAGTTGGAAATGCTGTCCTATCTTTTCCTGTTTACGATGGAAAAACTCAAGTTTCAGAACCAGAATCAGTCAATGGTT
GTACCATTGGGGAAGGGCATGGTTCAGACGAAACACCTAATAACAACCTGCAAAAAAGTTTGGAGGTTGACAGCATTAATGATAGCTGCTCCTCATCTAAGTCAAACATG
GAACTTCTTTCAACTTCTGTGAAAGTTGAAGTGGATGACACAGGTGAGTGCTCCTCTTCTAGTATTCAAGTTATGGAGGATACGGTTGAGGATATTTCAGGAAGAGATCT
ATGCATCTCTATCCTTAGAAGCAATGGGCTTCTGTATTCTATGGTTCATGCTTCTGAGGAAGAAAGTGATTTTAGAAGCAACGATAATTGTTTTCGATCATGCAAAACGT
GTGGCTCTTCAGAATCAGTCTTGAAGATGTTAATCTGTGATCATTGTGAAGATGCATTTCATGTCTCATGTTGCAATCATCGCATGAAGAAAGTGTCAAATGATGAGTGG
TATTGCAATTCATGTCTGAAGAAGAAGCATAAAATTTTGAAGGAAACAATCTCAAAGAAATTGGCAAACACCTTGAGTAGAAATGGATCTTCTAAGGGTGAATCAAATTC
CATAGCATTAATGTTAAAGGACACAGAACCTTATACAACTGGTGTTCGGATTGGCAAAGGTTTTCAAGCAGAAGTTCCAGATTGGTCTGGCCCGATTCCAGATGATACCG
ATGCCATTGGAGAGCCACTGGAAATGGATCCTTCAGAATCTTTTCTTATGCATGAGCAGAGTACCAACAAACCTTGTAGATTGAGCACTATTGGAAATTGGCTTCAATGT
CAACAAGTTATAGATGGAATGGGTGGTGTTAATGGAGTCATATGTGGCAAGTGGCGCAGGGCTCCTCTTTTTGAAGTCCAAACTGATGACTGGGAATGCTTCTGCTCTAT
CCTCTGGGATCCGACACATGCAGATTGTGCTGTACCTCAGGAATTGGAAACGGGGCAAGTTTTAAAGCAGTTGAAGTACATTGAGATGCTGAGGCCTCGGTTATCTTCCA
AACGACGGAAATTGGATGAGGCCAAGAGCAGAAGTGATGTGCAGAACCTTACAGAGGATACAGAACACAAACCTTGA
Protein sequenceShow/hide protein sequence
MLEWLGNLRRALWGLIHFLPSGRVQCFGVGKIGAEWIDFGELQGLPIRRKIDTMCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAFPQISTVSMMPEGSKSNVVY
RRKKLRGNSDSRLLANGTDCISLISCDSQLGEDKEQAAASQRNHKSEIVGNAVLSFPVYDGKTQVSEPESVNGCTIGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNM
ELLSTSVKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLYSMVHASEEESDFRSNDNCFRSCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEW
YCNSCLKKKHKILKETISKKLANTLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIPDDTDAIGEPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQC
QQVIDGMGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLRPRLSSKRRKLDEAKSRSDVQNLTEDTEHKP