CuGenDBv2

IVF0001376 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0001376
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: PHD domain-containing protein
Genome location: chr02:19554205..19564692
RNA-Seq Expression: IVF0001376
Synteny: IVF0001376
Gene Ontology terms:
GO:0046274 - lignin catabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0048046 - apoplast (cellular component)
GO:0005507 - copper ion binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0052716 - hydroquinone:oxygen oxidoreductase activity (molecular function)
InterPro domains:
IPR000949 - ELM2 domain
IPR001965 - Zinc finger, PHD-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR011124 - Zinc finger, CW-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger
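
As a small illustration of working with the genome location string listed above, the following Python sketch splits `chr02:19554205..19564692` into chromosome, start, and end, and computes the span. It assumes the coordinates are 1-based and inclusive, which this page does not state. The function name `parse_location` is hypothetical and is not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, span)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr02:19554205..19564692")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the gene region spans 10488 bp on chr02.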


Homology
GenBank top hits (e value, % identity, alignment)
XP_008453559.1 PREDICTED: uncharacterized protein LOC103494237 isoform X1 [Cucumis melo]; e value: 0.0; % identity: 100
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH
        MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH

Query:  EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD
        EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD
Subjt:  EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD

Query:  LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS
        LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS
Subjt:  LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS

Query:  RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG
        RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG
Subjt:  RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG

Query:  KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK
        KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK
Subjt:  KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK

Query:  P
        P
Subjt:  P

XP_008453560.1 PREDICTED: uncharacterized protein LOC103494237 isoform X2 [Cucumis melo]; e value: 0.0; % identity: 96.21
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH
        MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH

Query:  EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD
        EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD
Subjt:  EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD

Query:  LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS
        LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS
Subjt:  LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS

Query:  RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG
        RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG
Subjt:  RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG

Query:  KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK
        KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQ                   ELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK
Subjt:  KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK

Query:  P
        P
Subjt:  P

XP_008453561.1 PREDICTED: uncharacterized protein LOC103494237 isoform X3 [Cucumis melo]; e value: 0.0; % identity: 100
Query:  MMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEHEIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNN
        MMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEHEIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNN
Subjt:  MMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEHEIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNN

Query:  NLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSE
        NLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSE
Subjt:  NLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSE

Query:  SVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGP
        SVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGP
Subjt:  SVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGP

Query:  ISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIV
        ISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIV
Subjt:  ISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIV

Query:  AKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYKP
        AKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYKP
Subjt:  AKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYKP

XP_011656989.1 uncharacterized protein LOC101212408 isoform X1 [Cucumis sativus]; e value: 0.0; % identity: 90.22
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH
        MCPHCDEFSHDGCRKAG I EEKKN+GGLRCLNFPRTFPT IMM EGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDG+LAEDKEQAAASQ NHE 
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH

Query:  EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD
        EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVS SLKVEVDDTGECSSSSIQVM D +EDISGRD
Subjt:  EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD

Query:  LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS
        LCISILRSNGLLSS  H PEEESD RSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMK+VSNDEW CNSCLKK HK+LKEAISKKLTNT S
Subjt:  LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS

Query:  RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG
        RNGSSKGESNSIALMLKDT+PYTT +RIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESF MHEQSTNK CRLSTIGNWLQCQQV+DGVGGGNGGICG
Subjt:  RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG

Query:  KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK
        KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQ                   ELETGQV KQLKYIEMLRPRLASKRRKLDE KSRSDVQNLTEDTE+K
Subjt:  KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK

Query:  P
        P
Subjt:  P

XP_038878482.1 uncharacterized protein LOC120070708 isoform X1 [Benincasa hispida]; e value: 0.0; % identity: 87.7
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFP---TAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRN
        MCPHCDEFS DGCRKAGPIIEEKKNNGG RCLNFPR FP   T  MM E SKSNVVYRRKKLRG+SDSR LANGTDCISL SCDGHL EDKEQAAASQ N
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFP---TAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRN

Query:  HEHEIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDIS
        H++EI+GN VPPFPV +GKTQVSELES NGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTS+KVEVDDTGECSSSSIQVMED VEDIS
Subjt:  HEHEIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDIS

Query:  GRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTN
        GRDLCI ILRSNGLLSSMAH PEEESD RSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHK+LKE ISKKL N
Subjt:  GRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTN

Query:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGG
          SRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA GEPLE+D SESFLMHE+STNK CRLSTIGNWLQCQQV+DG+GG NG 
Subjt:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGG

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDT
        ICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQ                   ELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDT
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDT

Query:  EYKP
        E+KP
Subjt:  EYKP

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BWJ2 uncharacterized protein LOC103494237 isoform X1; e value: 3.0e-295; % identity: 100
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH
        MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH

Query:  EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD
        EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD
Subjt:  EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD

Query:  LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS
        LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS
Subjt:  LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS

Query:  RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG
        RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG
Subjt:  RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG

Query:  KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK
        KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK
Subjt:  KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK

Query:  P
        P
Subjt:  P

A0A1S3BXC2 uncharacterized protein LOC103494237 isoform X2; e value: 2.8e-280; % identity: 96.21
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH
        MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEH

Query:  EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD
        EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD
Subjt:  EIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRD

Query:  LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS
        LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS
Subjt:  LCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLS

Query:  RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG
        RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG
Subjt:  RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICG

Query:  KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK
        KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQ                   ELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK
Subjt:  KWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYK

Query:  P
        P
Subjt:  P

A0A1S3BXQ5 uncharacterized protein LOC103494237 isoform X3; e value: 2.3e-266; % identity: 100
Query:  MMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEHEIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNN
        MMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEHEIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNN
Subjt:  MMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEHEIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNN

Query:  NLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSE
        NLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSE
Subjt:  NLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSE

Query:  SVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGP
        SVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGP
Subjt:  SVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGP

Query:  ISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIV
        ISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIV
Subjt:  ISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIV

Query:  AKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYKP
        AKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYKP
Subjt:  AKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYKP

A0A6J1F145 uncharacterized protein LOC111441172 isoform X1; e value: 1.9e-236; % identity: 82.9
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTF---PTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRN
        MCPHCDEF HDGCRKAG IIEEKKN+GG RCLNFPR F    T  MM  GSKSNVVY+RKKLRG+SDSR LANGTDC SLISCDGHL EDKEQA  SQ  
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTF---PTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRN

Query:  HEHEIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDIS
        H+ EIVGN +PP PV  GK QVSELES NGC  GEGHGSDET NNNLQKSLEVDSINDSCSSSKSNME VSTSLKVEVDDTGECSSSSI+VMED VEDIS
Subjt:  HEHEIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDIS

Query:  GRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTN
        GRDLCISILRSNGLLSSMAH  E+ESD RS+NNCFRLCKTCGSS+S LKMLICDHCEDAFHV C NHRMKKVSNDEWYCNSCLKKKHK+L EAI+KKL N
Subjt:  GRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTN

Query:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGG
          SRNGSSK ESNSIALML DTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA GEPLEMD S SFLMHEQSTNK CRLS IGNWLQCQQV+DGVGG NG 
Subjt:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGG

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDT
        ICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQ                   ELETGQVLKQLKYIEMLRPRLASKRRKLDE KSRSDVQNL E+T
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDT

Query:  EYK
        E+K
Subjt:  EYK

A0A6J1IGX7 uncharacterized protein LOC111472812 isoform X1; e value: 4.2e-236; % identity: 82.5
Query:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTF---PTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRN
        MCPHCDEF HDGCRKAG IIEEKKN+GGLRCLNFPR F    T  MM  GSKSNVVY+RKKLRG+SDSR LANGTDC SLISCDGHL EDKEQA  S+  
Subjt:  MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTF---PTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRN

Query:  HEHEIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDIS
        H+ EIVGN +PP PVCDGK QVS LES NGC  GEGHGSDET NNNLQKSLEVDSINDSCSSSKSNME VSTSLKVEVDDTGECSSSSI+VMED VEDIS
Subjt:  HEHEIVGNAVPPFPVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDIS

Query:  GRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTN
        GRDLCISILRSNGLLSSMAH  E+ESD RS+NNCFRLCKTCGSS+S LKMLICDHCEDAFHV C NHRMKKVSNDEWYCNSCLKKKHK+L EAI+KKL N
Subjt:  GRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTN

Query:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGG
          SRNGSSK ESNSIALML DTEPYTTGVRIGKGFQAEVPDWSG ISDDTDA  EPLEMD S SFLMHEQSTNK CRLS IGNWLQCQQV+DGVGG NG 
Subjt:  TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGG

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDT
        ICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQ                   ELETGQVLKQLKYIEMLRPRLASKRRKLDE +SRSDVQNL E+T
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDT

Query:  EYK
        E+K
Subjt:  EYK

SwissProt top hits (e value, % identity, alignment)
A6H619 PHD and RING finger domain-containing protein 1; e value: 3.3e-04; % identity: 35.56
Query:  CKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC
        C+ CG S+   ++L+CD C+  +H+ C +  +++V  DEW+C  C
Subjt:  CKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC

Q9FNE9 Histone-lysine N-methyltransferase ATXR6; e value: 7.9e-06; % identity: 34.29
Query:  PEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLK
        P+  SD  SD++   +C+ C S +   K+L+CD C+  FH+ C    +  V    W+C SC   KH++ K
Subjt:  PEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLK

Q9HDV4 Lid2 complex component lid2; e value: 1.5e-04; % identity: 35.29
Query:  CKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC------------LKKKHKVLKEAISKKLTNTL-SRNGSSK
        C+ CG  ++   +L+CD CE A+H SC +  +  +  ++WYC++C             K K   LKE  S ++ NTL  RN SSK
Subjt:  CKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC------------LKKKHKVLKEAISKKLTNTL-SRNGSSK

Q9P1Y6 PHD and RING finger domain-containing protein 1; e value: 6.7e-05; % identity: 29.03
Query:  VPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC
        +P E + +  +      C+ CG S+   ++L+CD C+  +H+ C +  +++V  DEW+C  C
Subjt:  VPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC

Q9SGH2 Methyl-CpG-binding domain-containing protein 9; e value: 3.3e-04; % identity: 28.42
Query:  AHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLK--KKHKVLKEAISKKLTNTLSRNGSSKGE
        A VPE + D+         C  CG  ES+  +++CD CE  FH+SC N  ++   + +W C+ C    ++ K+    +  KL   ++ +  S  E
Subjt:  AHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLK--KKHKVLKEAISKKLTNTLSRNGSSKGE

Arabidopsis top hits (e value, % identity, alignment)
AT1G77250.1 RING/FYVE/PHD-type zinc finger family protein; e value: 1.8e-05; % identity: 24.51
Query:  SEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEHEIVGN-AVPPFPVCDGKTQVSELESANGCIFGEGH--GSDE--T
        S  SK    Y+R+KL G S S    +  D  S+       +E +E  +  + +    + G    PP P        +      GC     H   S E  +
Subjt:  SEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEHEIVGN-AVPPFPVCDGKTQVSELESANGCIFGEGH--GSDE--T

Query:  PNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEV-DDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEE-------------ESDS
         N  L ++L+   I+D  S +     L+ T +K  V + +    S+ +Q +   ++D+ G D+  ++L ++ L  S     E+              +++
Subjt:  PNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEV-DDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEE-------------ESDS

Query:  RSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKK
          +++   +CK CG        L CDHCED +HVSC     K +    WYC  C  K
Subjt:  RSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKK

AT2G19260.1 RING/FYVE/PHD zinc finger superfamily protein; e value: 1.3e-56; % identity: 41.4
Query:  DSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLIC
        D  NDSCSS KS+ E+ STS K   DD   C SS   V                                 E+D+   ++ FR CK C    +V KMLIC
Subjt:  DSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLIC

Query:  DHCEDAFHVSCCNHRMKKVSN-DEWYCNSCLKKKHKVLKEAISKKLTNTLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA
        D CE+A+H  CC  +MK V+  DEW C SCLK +                S    +KG   S     + T P+  G+RIGK FQA+VPDWSGP   DT  
Subjt:  DHCEDAFHVSCCNHRMKKVSN-DEWYCNSCLKKKHKVLKEAISKKLTNTLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA

Query:  IGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLK
        +GEPLE+  SE     +++ N   + S + NWLQC++        NG ICGKWRRAP  EVQT DWECFC   WDP+ ADCAVPQ               
Subjt:  IGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFCIVAKMRNLK

Query:  VHIKELETGQVLKQLKYIEMLRPRLASKRRKL-DEAKSRSDVQ
            ELET ++LKQLKYI+MLRPR  +K+RKL  + +SRS ++
Subjt:  VHIKELETGQVLKQLKYIEMLRPRLASKRRKL-DEAKSRSDVQ

AT3G01460.1 methyl-CPG-binding domain 9; e value: 2.3e-05; % identity: 28.42
Query:  AHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLK--KKHKVLKEAISKKLTNTLSRNGSSKGE
        A VPE + D+         C  CG  ES+  +++CD CE  FH+SC N  ++   + +W C+ C    ++ K+    +  KL   ++ +  S  E
Subjt:  AHVPEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLK--KKHKVLKEAISKKLTNTLSRNGSSKGE

AT5G24330.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 6; e value: 5.6e-07; % identity: 34.29
Query:  PEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLK
        P+  SD  SD++   +C+ C S +   K+L+CD C+  FH+ C    +  V    W+C SC   KH++ K
Subjt:  PEEESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLK


Sequences
CDS sequence
ATGTGCCCCCATTGTGATGAATTCTCTCATGATGGCTGCAGAAAAGCTGGACCGATCATTGAGGAAAAGAAGAACAATGGTGGATTGCGTTGCTTAAATTTTCCTAGGAC
CTTTCCAACTGCTATTATGATGTCTGAAGGTTCAAAATCTAATGTAGTATATAGGAGAAAGAAACTGCGGGGCAGTTCTGATTCCAGGTTTTTGGCTAATGGGACAGATT
GTATATCTTTAATTAGTTGTGATGGTCATTTGGCAGAAGACAAAGAGCAAGCTGCAGCTTCTCAACGTAACCACGAGCATGAAATTGTTGGAAATGCTGTCCCTCCTTTT
CCTGTTTGCGATGGAAAAACTCAAGTTTCAGAACTAGAATCAGCCAACGGTTGTATATTTGGGGAAGGGCATGGTTCAGACGAAACACCTAATAATAACCTGCAAAAAAG
TTTGGAGGTTGACAGCATTAATGATAGCTGCTCCTCATCTAAGTCAAACATGGAACTTGTTTCAACTTCCCTGAAAGTTGAAGTGGATGACACAGGTGAGTGCTCCTCTT
CTAGCATTCAAGTTATGGAGGATACGGTCGAGGATATTTCTGGAAGAGATCTATGCATCTCTATCCTTAGAAGCAATGGGCTGTTATCTTCTATGGCTCATGTTCCTGAG
GAAGAAAGTGATTCTAGAAGCGACAATAATTGTTTTCGATTGTGCAAAACTTGTGGCTCTTCAGAATCAGTCTTGAAGATGTTAATCTGTGATCATTGCGAAGATGCATT
TCATGTCTCATGTTGCAATCATCGCATGAAGAAAGTGTCAAATGATGAGTGGTATTGCAATTCATGTCTGAAAAAAAAGCACAAAGTTTTGAAGGAAGCTATCTCAAAGA
AATTGACAAACACCTTGAGTAGAAATGGATCTTCCAAGGGTGAATCAAATTCTATAGCATTAATGTTAAAGGACACAGAACCTTATACAACTGGTGTTCGGATTGGCAAA
GGTTTTCAAGCAGAAGTTCCAGATTGGTCTGGCCCGATTTCAGATGATACTGATGCCATCGGAGAGCCACTGGAAATGGATTCTTCAGAATCTTTTCTTATGCATGAGCA
GAGCACCAATAAAGCTTGTAGATTGAGCACTATTGGAAATTGGCTTCAATGTCAACAAGTTGTAGATGGAGTGGGTGGTGGTAATGGAGGCATATGTGGCAAGTGGCGCA
GGGCTCCTCTTTTTGAAGTCCAAACTGATGACTGGGAATGCTTCTGCTCTATCCTCTGGGATCCGACACATGCAGATTGTGCTGTACCTCAGAAATTCTACAAATTTTGC
ATTGTTGCAAAGATGAGAAATCTCAAAGTCCACATAAAGGAATTGGAGACGGGTCAAGTTTTAAAGCAGTTGAAGTACATTGAGATGCTGAGGCCTCGGTTAGCTTCCAA
AAGACGGAAACTGGATGAGGCGAAGAGCAGAAGTGATGTGCAGAACCTTACAGAGGATACAGAATACAAACCTTGA
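
The relationship between the CDS above and the protein sequence listed on this page can be checked with a short translation sketch. The Python below uses the standard genetic code (NCBI translation table 1), which is an assumption for this nuclear gene. The function name `translate` is hypothetical, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS, then its last five codons (including the stop).
print(translate("ATGTGCCCCCATTGTGATGAATTC"))
print(translate("GAATACAAACCTTGA"))
```

The two calls print `MCPHCDEF` and `EYKP`, which match the start and end of the protein sequence given on this page. Passing the full CDS to the same function would be the natural next check.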
mRNA sequence
ATCATCAATTATTTTTGGTCATTTTGCCTTTTCTCTCTTTGTAACACTTTTGTGTATTACGAAGAAAAAAGATAAATCAAAAACAAAAACTTGGTGAAAATTTGTAATAT
GCAAAATGAGTGTCTACAAAGGCAAAGTGAAAAGTTTGCGTCTATTGTAGTGAGGGAACCCCCCAATGAATCAATTTGAGTTCATAAACACGGACGTTGTCTTTTCTTCA
ATATATAAAAGAGAAAAAAAGAGAAATAAAAACGCATAATTTTGAATTAAGATGAATCTGGGATTCTGAGTGGTAAAGGAGCAGAAAGGAAAGCATTTTTTTTTTCTTCC
ACAACTGGGTTGATCATTCGGAAGAACTGGCTGATTGTACGCTTCTTTTGATGTGCCCCCATTGTGATGAATTCTCTCATGATGGCTGCAGAAAAGCTGGACCGATCATT
GAGGAAAAGAAGAACAATGGTGGATTGCGTTGCTTAAATTTTCCTAGGACCTTTCCAACTGCTATTATGATGTCTGAAGGTTCAAAATCTAATGTAGTATATAGGAGAAA
GAAACTGCGGGGCAGTTCTGATTCCAGGTTTTTGGCTAATGGGACAGATTGTATATCTTTAATTAGTTGTGATGGTCATTTGGCAGAAGACAAAGAGCAAGCTGCAGCTT
CTCAACGTAACCACGAGCATGAAATTGTTGGAAATGCTGTCCCTCCTTTTCCTGTTTGCGATGGAAAAACTCAAGTTTCAGAACTAGAATCAGCCAACGGTTGTATATTT
GGGGAAGGGCATGGTTCAGACGAAACACCTAATAATAACCTGCAAAAAAGTTTGGAGGTTGACAGCATTAATGATAGCTGCTCCTCATCTAAGTCAAACATGGAACTTGT
TTCAACTTCCCTGAAAGTTGAAGTGGATGACACAGGTGAGTGCTCCTCTTCTAGCATTCAAGTTATGGAGGATACGGTCGAGGATATTTCTGGAAGAGATCTATGCATCT
CTATCCTTAGAAGCAATGGGCTGTTATCTTCTATGGCTCATGTTCCTGAGGAAGAAAGTGATTCTAGAAGCGACAATAATTGTTTTCGATTGTGCAAAACTTGTGGCTCT
TCAGAATCAGTCTTGAAGATGTTAATCTGTGATCATTGCGAAGATGCATTTCATGTCTCATGTTGCAATCATCGCATGAAGAAAGTGTCAAATGATGAGTGGTATTGCAA
TTCATGTCTGAAAAAAAAGCACAAAGTTTTGAAGGAAGCTATCTCAAAGAAATTGACAAACACCTTGAGTAGAAATGGATCTTCCAAGGGTGAATCAAATTCTATAGCAT
TAATGTTAAAGGACACAGAACCTTATACAACTGGTGTTCGGATTGGCAAAGGTTTTCAAGCAGAAGTTCCAGATTGGTCTGGCCCGATTTCAGATGATACTGATGCCATC
GGAGAGCCACTGGAAATGGATTCTTCAGAATCTTTTCTTATGCATGAGCAGAGCACCAATAAAGCTTGTAGATTGAGCACTATTGGAAATTGGCTTCAATGTCAACAAGT
TGTAGATGGAGTGGGTGGTGGTAATGGAGGCATATGTGGCAAGTGGCGCAGGGCTCCTCTTTTTGAAGTCCAAACTGATGACTGGGAATGCTTCTGCTCTATCCTCTGGG
ATCCGACACATGCAGATTGTGCTGTACCTCAGAAATTCTACAAATTTTGCATTGTTGCAAAGATGAGAAATCTCAAAGTCCACATAAAGGAATTGGAGACGGGTCAAGTT
TTAAAGCAGTTGAAGTACATTGAGATGCTGAGGCCTCGGTTAGCTTCCAAAAGACGGAAACTGGATGAGGCGAAGAGCAGAAGTGATGTGCAGAACCTTACAGAGGATAC
AGAATACAAACCTTGATATGGTGAGAACTGCTTGATACTATTCAAGTGCCGTAACTCCATATTTCTTACAAAATCAAGTCATTCAGATTAAAATTGGGTCTTATATTCTC
TACCTTAAGAAGCATTGTAATAGTAATATCTTGTATGTTGCAAATGAAGGACTAAATATGATGTGAGCCATTTACCATCTTCTTCTGTTGGCTTTTTGTCTCTTTTTCTC
TTGGAGAACGTCTGTCAAGAGAAAAGTGGGTAAATGGAAATTGCTTTTTGTTAGTGAGTAGAGAAATTAAATTGGAATAGTTCAAGATCTTTTGCAGTTTTGTATTTGTT
TTTGTCCCTGAGATTTGATTGAACACTGATAGCTAGTCAAGCACTTTATCGCTATCGTTTAATTTCAAAATTTC
Protein sequence
MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFPTAIMMSEGSKSNVVYRRKKLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEHEIVGNAVPPF
PVCDGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPE
EESDSRSDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTNTLSRNGSSKGESNSIALMLKDTEPYTTGVRIGK
GFQAEVPDWSGPISDDTDAIGEPLEMDSSESFLMHEQSTNKACRLSTIGNWLQCQQVVDGVGGGNGGICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQKFYKFC
IVAKMRNLKVHIKELETGQVLKQLKYIEMLRPRLASKRRKLDEAKSRSDVQNLTEDTEYKP