; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012398 (gene) of Snake gourd v1 genome

Gene IDTan0012398
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPHD domain-containing protein
Genome locationLG09:68829321..68839712
RNA-Seq ExpressionTan0012398
SyntenyTan0012398
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001965 - Zinc finger, PHD-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR011124 - Zinc finger, CW-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587732.1 tRNA-specific adenosine deaminase TAD2, partial [Cucurbita argyrosperma subsp. sororia]2.8e-25589.69Show/hide
Query:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN
        MCPHCDEF  DGCRKAG IIEEKKNDGG RCLNFPRAFSQIST+  MP GSKSNVVY+RKKLRGNSDSRLLANGTDC SLISCDGHL+EDKEQA  SRH 
Subjt:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN

Query:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS
        HKSEIVGN++PP P   GK QVSELES+NGCTIGEGHGSDETL NNLQKSLEVDSINDSCSSSKSNME VSTSLKVEVDDTGECSSSSIR+MEDMVEDIS
Subjt:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS

Query:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN
        GRDLCISILRSNGLLSSMAHA E+ SD RS+NNCFRLCKTCG S+S LKMLICDHCEDAFHV CGNHRMKKVSNDEWYCNSCLKKKHK+L E ITKKLAN
Subjt:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN

Query:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV
         SSRNGSSK E +SIALML DTEPYTTGVRIGKGFQAEVPDWSGPI DDTDA G PLEMDPS SFLMHEQSTNKPCRLS IGNWLQCQQVIDGVGGVNGV
Subjt:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHKT
        ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETK RSDVQNL E+TEHKT
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHKT

XP_022933897.1 uncharacterized protein LOC111441172 isoform X1 [Cucurbita moschata]1.0e-25489.48Show/hide
Query:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN
        MCPHCDEF  DGCRKAG IIEEKKNDGG RCLNFPRAFSQIST+  MP GSKSNVVY+RKKLRGNSDSRLLANGTDC SLISCDGHL+EDKEQA  S+H 
Subjt:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN

Query:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS
        HKSEIVGN++PP P   GK QVSELES+NGCTIGEGHGSDETL NNLQKSLEVDSINDSCSSSKSNME VSTSLKVEVDDTGECSSSSIR+MEDMVEDIS
Subjt:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS

Query:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN
        GRDLCISILRSNGLLSSMAHA E+ SD RS+NNCFRLCKTCG S+S LKMLICDHCEDAFHV CGNHRMKKVSNDEWYCNSCLKKKHK+L E ITKKLAN
Subjt:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN

Query:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV
         SSRNGSSK E +SIALML DTEPYTTGVRIGKGFQAEVPDWSGPI DDTDA G PLEMDPS SFLMHEQSTNKPCRLS IGNWLQCQQVIDGVGGVNGV
Subjt:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHKT
        ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETK RSDVQNL E+TEHKT
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHKT

XP_022974204.1 uncharacterized protein LOC111472812 isoform X1 [Cucurbita maxima]5.2e-25489.28Show/hide
Query:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN
        MCPHCDEF  DGCRKAG IIEEKKNDGGLRCLNFPRAFSQIST+  MP GSKSNVVY+RKKLRGNSDSRLLANGTDC SLISCDGHL+EDKEQA  SRH 
Subjt:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN

Query:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS
        HKSEIVGN++PP P  DGK QVS LES+NGCTIGEGHGSDETL NNLQKSLEVDSINDSCSSSKSNME VSTSLKVEVDDTGECSSSSIR+MEDMVEDIS
Subjt:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS

Query:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN
        GRDLCISILRSNGLLSSMAHA E+ SD RS+NNCFRLCKTCG S+S LKMLICDHCEDAFHV CGNHRMKKVSNDEWYCNSCLKKKHK+L E ITKKLAN
Subjt:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN

Query:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV
         SSRNGSSK E +SIALML DTEPYTTGVRIGKGFQAEVPDWSG I DDTDA   PLEMDPS SFLMHEQSTNKPCRLS IGNWLQCQQVIDGVGGVNGV
Subjt:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHKT
        ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDET+ RSDVQNL E+TEHKT
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHKT

XP_023531970.1 uncharacterized protein LOC111794074 isoform X1 [Cucurbita pepo subsp. pepo]4.3e-25689.69Show/hide
Query:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN
        MCPHCDEF  DGCRKAG IIEEKKNDGGLRCLNFPRAFSQIST+  MP GSKSNVVY+RKKLRGNSDSRLLANGTDC SLISCDGHL+EDKEQA  SRH 
Subjt:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN

Query:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS
        HKSEIVGN++PP P  DGK QVSELES+NGCTIGEGHGSDETL NNLQKSLEVDS+NDSCSSSKSNME VSTSLKVEVDDTGECSSSSIR+MEDMVEDIS
Subjt:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS

Query:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN
        GRDLCISILRSNGLLSS+AHA E+ SD RS+NNCFRLCKTCG S+S LKMLICDHCEDAFHV CGNHRMKKVSNDEWYCNSCLKKKHK+L E ITKKLAN
Subjt:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN

Query:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV
         SSRNGSSK E +SIALML DTEPYTTGVRIGKGFQAEVPDWSGPI DDTDA G PLEMDPS SFLMHEQSTNKPCRLS IGNWLQCQQVIDGVGGVNGV
Subjt:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHKT
        ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETK RSDVQNL E+TEHKT
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHKT

XP_038878482.1 uncharacterized protein LOC120070708 isoform X1 [Benincasa hispida]1.4e-25991.53Show/hide
Query:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN
        MCPHCDEFSRDGCRKAGPIIEEKKN+GG RCLNFPRAF QISTV  MPE SKSNVVYRRKKLRGNSDSRLLANGTDCISL SCDGHL EDKEQAA S+HN
Subjt:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN

Query:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDET-LNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS
        HK+EI+GN VPP P Y+GKTQVSELESVNGC  GEGHGSDET  NNLQKSLEVDSINDSCSSSKSNMELVSTS+KVEVDDTGECSSSSI++MEDMVEDIS
Subjt:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDET-LNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS

Query:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN
        GRDLCI ILRSNGLLSSMAHAPEE SDFRSDNNCFRLCKTCG SESVLKMLICDHCEDAFHVSC NHRMKKVSNDEWYCNSCLKKKHK+LKETI+KKLAN
Subjt:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN

Query:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV
         SSRNGSSKGE +SIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPI DDTDA G PLE+DPSESFLMHE+STNKPCRLSTIGNWLQCQQVIDG+GGVNG 
Subjt:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHK
        ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDE K RSDVQNLTEDTEHK
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHK

TrEMBL top hitse value%identityAlignment
A0A1S3BWJ2 uncharacterized protein LOC103494237 isoform X14.3e-24685.69Show/hide
Query:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN
        MCPHCDEFS DGCRKAGPIIEEKKN+GGLRCLNFPR F    T   M EGSKSNVVYRRKKLRG+SDSR LANGTDCISLISCDGHL EDKEQAA S+ N
Subjt:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN

Query:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDET-LNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS
        H+ EIVGN VPP P  DGKTQVSELES NGC  GEGHGSDET  NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSI++MED VEDIS
Subjt:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDET-LNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS

Query:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN
        GRDLCISILRSNGLLSSMAH PEE SD RSDNNCFRLCKTCG SESVLKMLICDHCEDAFHVSC NHRMKKVSNDEWYCNSCLKKKHKVLKE I+KKL N
Subjt:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN

Query:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV
        T SRNGSSKGE +SIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPI DDTDAIG PLEMD SESFLMHEQSTNK CRLSTIGNWLQCQQV+DGVGG NG 
Subjt:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQ-------------------ELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDT
        ICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQ                   ELETGQVLKQLKYIEMLRPRLASKRRKLDE K RSDVQNLTEDT
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQ-------------------ELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDT

Query:  EHK
        E+K
Subjt:  EHK

A0A1S3BXC2 uncharacterized protein LOC103494237 isoform X21.4e-24989.05Show/hide
Query:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN
        MCPHCDEFS DGCRKAGPIIEEKKN+GGLRCLNFPR F    T   M EGSKSNVVYRRKKLRG+SDSR LANGTDCISLISCDGHL EDKEQAA S+ N
Subjt:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN

Query:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDET-LNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS
        H+ EIVGN VPP P  DGKTQVSELES NGC  GEGHGSDET  NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSI++MED VEDIS
Subjt:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDET-LNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS

Query:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN
        GRDLCISILRSNGLLSSMAH PEE SD RSDNNCFRLCKTCG SESVLKMLICDHCEDAFHVSC NHRMKKVSNDEWYCNSCLKKKHKVLKE I+KKL N
Subjt:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN

Query:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV
        T SRNGSSKGE +SIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPI DDTDAIG PLEMD SESFLMHEQSTNK CRLSTIGNWLQCQQV+DGVGG NG 
Subjt:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHK
        ICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDE K RSDVQNLTEDTE+K
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHK

A0A6J1C1R7 uncharacterized protein LOC1110071737.9e-23286.02Show/hide
Query:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN
        MCPHCDEFS  GCRKAGPII+EKKN+ G  CLN PRA SQISTV TMPEGS S VVYRRKKLRGNSDSRL ANGTDCIS ISCDG L E+ EQAA S+H 
Subjt:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN

Query:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS
         +S+IVGNIVP  P YDGKT VSELESVNGCTIGEGHGSDETL NNLQK+LEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSI++MEDM+EDIS
Subjt:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS

Query:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN
        GRDLCISILRSNGLLS MAHAP+E S+F+SD+NCFR CK CG SESVLKMLICDHCEDAFH+SC NHRMKKVSNDEWYCNSCLKKKHK+LKETIT KLAN
Subjt:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN

Query:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV
         SSR+GSSKGE +SIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPI DDTDAIG PLE+DPSESF MHEQSTNKPCRLS IGNWLQCQQVI      NG+
Subjt:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKL
        ICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQELET QVLKQLKYIEMLRPRLASKRRK+
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKL

A0A6J1F145 uncharacterized protein LOC111441172 isoform X15.1e-25589.48Show/hide
Query:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN
        MCPHCDEF  DGCRKAG IIEEKKNDGG RCLNFPRAFSQIST+  MP GSKSNVVY+RKKLRGNSDSRLLANGTDC SLISCDGHL+EDKEQA  S+H 
Subjt:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN

Query:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS
        HKSEIVGN++PP P   GK QVSELES+NGCTIGEGHGSDETL NNLQKSLEVDSINDSCSSSKSNME VSTSLKVEVDDTGECSSSSIR+MEDMVEDIS
Subjt:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS

Query:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN
        GRDLCISILRSNGLLSSMAHA E+ SD RS+NNCFRLCKTCG S+S LKMLICDHCEDAFHV CGNHRMKKVSNDEWYCNSCLKKKHK+L E ITKKLAN
Subjt:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN

Query:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV
         SSRNGSSK E +SIALML DTEPYTTGVRIGKGFQAEVPDWSGPI DDTDA G PLEMDPS SFLMHEQSTNKPCRLS IGNWLQCQQVIDGVGGVNGV
Subjt:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHKT
        ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETK RSDVQNL E+TEHKT
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHKT

A0A6J1IGX7 uncharacterized protein LOC111472812 isoform X12.5e-25489.28Show/hide
Query:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN
        MCPHCDEF  DGCRKAG IIEEKKNDGGLRCLNFPRAFSQIST+  MP GSKSNVVY+RKKLRGNSDSRLLANGTDC SLISCDGHL+EDKEQA  SRH 
Subjt:  MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHN

Query:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS
        HKSEIVGN++PP P  DGK QVS LES+NGCTIGEGHGSDETL NNLQKSLEVDSINDSCSSSKSNME VSTSLKVEVDDTGECSSSSIR+MEDMVEDIS
Subjt:  HKSEIVGNIVPPLPAYDGKTQVSELESVNGCTIGEGHGSDETL-NNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDIS

Query:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN
        GRDLCISILRSNGLLSSMAHA E+ SD RS+NNCFRLCKTCG S+S LKMLICDHCEDAFHV CGNHRMKKVSNDEWYCNSCLKKKHK+L E ITKKLAN
Subjt:  GRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLAN

Query:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV
         SSRNGSSK E +SIALML DTEPYTTGVRIGKGFQAEVPDWSG I DDTDA   PLEMDPS SFLMHEQSTNKPCRLS IGNWLQCQQVIDGVGGVNGV
Subjt:  TSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHKT
        ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDET+ RSDVQNL E+TEHKT
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHKT

SwissProt top hitse value%identityAlignment
A6H619 PHD and RING finger domain-containing protein 13.2e-0426.58Show/hide
Query:  LCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSC
        +CI    +  +L  +     +A +   ++  F  C+ CG S+   ++L+CD C+  +H+ C +  +++V  DEW+C  C
Subjt:  LCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSC

Q9FNE9 Histone-lysine N-methyltransferase ATXR61.3e-0532.86Show/hide
Query:  PEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLK
        P+  SD  SD++   +C+ C   +   K+L+CD C+  FH+ C    +  V    W+C SC   KH++ K
Subjt:  PEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLK

Q9HDV4 Lid2 complex component lid21.9e-0430.95Show/hide
Query:  CKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSC------------LKKKHKVLKETITKKLANTSSRNGSSK
        C+ CG  ++   +L+CD CE A+H SC +  +  +  ++WYC++C             K K   LKE   +       RN SSK
Subjt:  CKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSC------------LKKKHKVLKETITKKLANTSSRNGSSK

Q9P1Y6 PHD and RING finger domain-containing protein 13.2e-0427.85Show/hide
Query:  LCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSC
        +CI       +L  +     +AS+   D      C+ CG S+   ++L+CD C+  +H+ C +  +++V  DEW+C  C
Subjt:  LCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSC

Q9SGH2 Methyl-CpG-binding domain-containing protein 95.4e-0437.78Show/hide
Query:  CKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSC
        C  CG  ES+  +++CD CE  FH+SC N  ++   + +W C+ C
Subjt:  CKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSC

Arabidopsis top hitse value%identityAlignment
AT1G77250.1 RING/FYVE/PHD-type zinc finger family protein7.0e-0725.2Show/hide
Query:  SKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHNHKSEIVGNI-VPPLPAYDGKTQVSELESVNGCTIGEGH--GSDE--TLNN
        SK    Y+R+KL G S S    +  D  S+        E +E  +  R + ++ + G +  PP P        +      GC     H   S E  +LN 
Subjt:  SKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHNHKSEIVGNI-VPPLPAYDGKTQVSELESVNGCTIGEGH--GSDE--TLNN

Query:  -LQKSLEVDSINDSCSSSKSNMELVSTSLKVEV-DDTGECSSSSIRLMEDMVEDISGRDLCISILRSNGLLSSMAHAPEEASDFRS-------------D
         L ++L+   I+D  S +     L+ T +K  V + +    S+ ++ +   ++D+ G D+  ++L ++ L  S     E+   F +             +
Subjt:  -LQKSLEVDSINDSCSSSKSNMELVSTSLKVEV-DDTGECSSSSIRLMEDMVEDISGRDLCISILRSNGLLSSMAHAPEEASDFRS-------------D

Query:  NNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKK
        ++   +CK CG        L CDHCED +HVSC     K +    WYC  C  K
Subjt:  NNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKK

AT2G19260.1 RING/FYVE/PHD zinc finger superfamily protein4.2e-6044.41Show/hide
Query:  DSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDISGRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLIC
        D  NDSCSS KS+ E+ STS K   DD   C SS   + E                                +D    ++ FR CK C    +V KMLIC
Subjt:  DSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDISGRDLCISILRSNGLLSSMAHAPEEASDFRSDNNCFRLCKTCGCSESVLKMLIC

Query:  DHCEDAFHVSCGNHRMKKVSN-DEWYCNSCLKKKHKVLKETITKKLANTSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDA
        D CE+A+H  C   +MK V+  DEW C SCLK               N SS+   +KG +S      + T P+  G+RIGK FQA+VPDWSGP   DT  
Subjt:  DHCEDAFHVSCGNHRMKKVSN-DEWYCNSCLKKKHKVLKETITKKLANTSSRNGSSKGELSSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPIFDDTDA

Query:  IGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIE
        +G PLE+  SE     +++ N   + S + NWLQC++        NGVICGKWRRAP  EVQT DWECFC   WDP+ ADCAVPQELET ++LKQLKYI+
Subjt:  IGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELETGQVLKQLKYIE

Query:  MLRPRLASKRRKL
        MLRPR  +K+RKL
Subjt:  MLRPRLASKRRKL

AT3G01460.1 methyl-CPG-binding domain 93.9e-0537.78Show/hide
Query:  CKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSC
        C  CG  ES+  +++CD CE  FH+SC N  ++   + +W C+ C
Subjt:  CKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSC

AT5G24330.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 69.2e-0732.86Show/hide
Query:  PEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLK
        P+  SD  SD++   +C+ C   +   K+L+CD C+  FH+ C    +  V    W+C SC   KH++ K
Subjt:  PEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCCCCCATTGTGATGAATTCTCTCGTGATGGCTGCAGAAAAGCTGGACCAATCATAGAGGAAAAGAAGAATGATGGTGGCTTGCGTTGCTTAAATTTTCCAAGGGC
CTTTTCCCAGATATCAACTGTTGGTACGATGCCTGAAGGTTCAAAATCTAATGTAGTATATAGGAGAAAGAAACTGCGAGGCAATTCTGATTCCAGGTTGTTGGCTAATG
GGACAGATTGTATTTCTTTGATTAGTTGTGATGGTCATTTGGTAGAAGACAAAGAGCAAGCTGCAGGTTCTCGGCATAACCACAAGAGTGAAATTGTTGGAAATATTGTC
CCTCCGCTTCCTGCTTACGATGGAAAAACTCAAGTTTCAGAACTAGAATCAGTCAATGGTTGTACTATAGGGGAGGGGCATGGTTCTGACGAAACACTTAATAACCTGCA
AAAAAGTTTGGAGGTTGACAGCATAAATGATAGCTGCTCCTCATCCAAGTCAAACATGGAACTTGTTTCAACTTCCTTGAAGGTTGAAGTGGATGACACAGGTGAGTGCT
CCTCTTCTAGTATTCGACTTATGGAGGATATGGTCGAGGATATATCAGGAAGAGATCTATGCATCTCTATCCTTAGAAGCAATGGGCTTCTGTCTTCTATGGCTCATGCT
CCTGAGGAAGCAAGTGATTTTAGAAGCGACAATAATTGTTTTCGATTGTGCAAAACTTGCGGCTGTTCAGAATCAGTCTTGAAGATGTTAATTTGTGATCATTGTGAAGA
TGCATTTCATGTCTCATGTGGCAATCATCGCATGAAGAAAGTGTCAAATGATGAGTGGTATTGCAATTCATGTTTGAAGAAGAAACATAAAGTTTTGAAGGAAACAATTA
CAAAAAAATTGGCAAACACCTCGAGTAGAAATGGATCTTCTAAGGGTGAATTAAGTTCCATAGCATTAATGTTAAAGGACACAGAACCTTATACAACTGGCGTTCGGATT
GGCAAAGGTTTTCAAGCAGAAGTTCCGGATTGGTCTGGCCCAATTTTTGATGATACTGATGCCATCGGTGGGCCACTGGAAATGGATCCTTCAGAATCTTTTCTTATGCA
TGAGCAGAGTACCAATAAACCTTGTAGATTGAGCACTATTGGAAATTGGCTTCAATGTCAACAAGTTATAGATGGAGTGGGTGGTGTTAACGGAGTCATATGTGGCAAGT
GGCGCAGGGCTCCTCTTTTTGAAGTCCAAACTGATGACTGGGAATGCTTCTGCTCCATCCTCTGGGATCCAGCACATGCTGATTGTGCTGTACCTCAGGAATTGGAAACA
GGTCAAGTTTTAAAGCAGTTGAAGTACATTGAGATGTTGAGGCCTCGGTTAGCTTCCAAGAGACGGAAATTGGACGAGACTAAGTGCAGAAGTGATGTGCAGAACCTTAC
AGAGGATACAGAACACAAAACTTGA
mRNA sequenceShow/hide mRNA sequence
CAATTTTCTTAGGCTTTGTCCACCCGTACAAGAGCCCAAGATATTATAGCTGTCAATACACCTATTCGGAATCATCAATTATTTTTGGTCTGCTTTTTCTTTTTTTTCCT
TGTAACACAACTCTTGTATTTTACAAACGAAACTAGTCTAAAATGCAAATTGGTTGATCTTCAAAGGAAAAATGAAAAGTTGTTTCTATTCTAGTGATCGTAAGGGAGCC
CCCTATGAATCAATTTGAGTGCAAAAACACGGACGTTGTCTTTTCTGTAATATAAAAGAAAAAGGGTAATAAAAACGCATAATTTTGAAATAGATGAATGAATCTCGGAT
TCTGAGTGATAAGAAGCAGAAGAGAAGCATTTTCACAACTGGGTTGATCAATCAGAAGAACTGGCAGATTGTGCGCTTCTTTTGATGTGCCCCCATTGTGATGAATTCTC
TCGTGATGGCTGCAGAAAAGCTGGACCAATCATAGAGGAAAAGAAGAATGATGGTGGCTTGCGTTGCTTAAATTTTCCAAGGGCCTTTTCCCAGATATCAACTGTTGGTA
CGATGCCTGAAGGTTCAAAATCTAATGTAGTATATAGGAGAAAGAAACTGCGAGGCAATTCTGATTCCAGGTTGTTGGCTAATGGGACAGATTGTATTTCTTTGATTAGT
TGTGATGGTCATTTGGTAGAAGACAAAGAGCAAGCTGCAGGTTCTCGGCATAACCACAAGAGTGAAATTGTTGGAAATATTGTCCCTCCGCTTCCTGCTTACGATGGAAA
AACTCAAGTTTCAGAACTAGAATCAGTCAATGGTTGTACTATAGGGGAGGGGCATGGTTCTGACGAAACACTTAATAACCTGCAAAAAAGTTTGGAGGTTGACAGCATAA
ATGATAGCTGCTCCTCATCCAAGTCAAACATGGAACTTGTTTCAACTTCCTTGAAGGTTGAAGTGGATGACACAGGTGAGTGCTCCTCTTCTAGTATTCGACTTATGGAG
GATATGGTCGAGGATATATCAGGAAGAGATCTATGCATCTCTATCCTTAGAAGCAATGGGCTTCTGTCTTCTATGGCTCATGCTCCTGAGGAAGCAAGTGATTTTAGAAG
CGACAATAATTGTTTTCGATTGTGCAAAACTTGCGGCTGTTCAGAATCAGTCTTGAAGATGTTAATTTGTGATCATTGTGAAGATGCATTTCATGTCTCATGTGGCAATC
ATCGCATGAAGAAAGTGTCAAATGATGAGTGGTATTGCAATTCATGTTTGAAGAAGAAACATAAAGTTTTGAAGGAAACAATTACAAAAAAATTGGCAAACACCTCGAGT
AGAAATGGATCTTCTAAGGGTGAATTAAGTTCCATAGCATTAATGTTAAAGGACACAGAACCTTATACAACTGGCGTTCGGATTGGCAAAGGTTTTCAAGCAGAAGTTCC
GGATTGGTCTGGCCCAATTTTTGATGATACTGATGCCATCGGTGGGCCACTGGAAATGGATCCTTCAGAATCTTTTCTTATGCATGAGCAGAGTACCAATAAACCTTGTA
GATTGAGCACTATTGGAAATTGGCTTCAATGTCAACAAGTTATAGATGGAGTGGGTGGTGTTAACGGAGTCATATGTGGCAAGTGGCGCAGGGCTCCTCTTTTTGAAGTC
CAAACTGATGACTGGGAATGCTTCTGCTCCATCCTCTGGGATCCAGCACATGCTGATTGTGCTGTACCTCAGGAATTGGAAACAGGTCAAGTTTTAAAGCAGTTGAAGTA
CATTGAGATGTTGAGGCCTCGGTTAGCTTCCAAGAGACGGAAATTGGACGAGACTAAGTGCAGAAGTGATGTGCAGAACCTTACAGAGGATACAGAACACAAAACTTGAT
ATGGTGAGAACTGCTTGATAGTATTCAAGTGTTCGTAACTCTCATCTCTTACAAAATCGAGTCATTCAGATTAAAATTGAGTCTTATTCTCTACTTTAAGAAGCATTGTA
ATAGTAATATCTTGTATGATGCAAATGAAGGACTAAATATGATGTGTGCGCATTTACCGGCTTCTGCTGGCTTTTTTATTCTTTTTTTGGTCTCTTTTTCTCTTGGAGAA
CGTCTGTCAAGAGAAAAGTGGGTAGATGGTAATTGCCTTTTGTTGGTGAATAGAGAAATGAAATTGGAATAGTTCAAGATCTTTTATCGTTCTGTATTTGTTTTCTTTGC
TGAAATTTGATTGAACACTGAT
Protein sequenceShow/hide protein sequence
MCPHCDEFSRDGCRKAGPIIEEKKNDGGLRCLNFPRAFSQISTVGTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEDKEQAAGSRHNHKSEIVGNIV
PPLPAYDGKTQVSELESVNGCTIGEGHGSDETLNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRLMEDMVEDISGRDLCISILRSNGLLSSMAHA
PEEASDFRSDNNCFRLCKTCGCSESVLKMLICDHCEDAFHVSCGNHRMKKVSNDEWYCNSCLKKKHKVLKETITKKLANTSSRNGSSKGELSSIALMLKDTEPYTTGVRI
GKGFQAEVPDWSGPIFDDTDAIGGPLEMDPSESFLMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPAHADCAVPQELET
GQVLKQLKYIEMLRPRLASKRRKLDETKCRSDVQNLTEDTEHKT