CuGenDBv2

Lag0033876 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0033876
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: PHD domain-containing protein
Genome location: chr3:2581747..2600541
RNA-Seq Expression: Lag0033876
Synteny: Lag0033876
Gene Ontology terms:
GO:0046274 - lignin catabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0048046 - apoplast (cellular component)
GO:0052716 - hydroquinone:oxygen oxidoreductase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0005507 - copper ion binding (molecular function)
InterPro domains:
IPR019787 - Zinc finger, PHD-finger
IPR019786 - Zinc finger, PHD-type, conserved site
IPR016193 - Cytidine deaminase-like
IPR016192 - APOBEC/CMP deaminase, zinc-binding
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR011124 - Zinc finger, CW-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR002125 - Cytidine and deoxycytidylate deaminase domain
IPR001965 - Zinc finger, PHD-type
IPR000949 - ELM2 domain


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6587732.1 tRNA-specific adenosine deaminase TAD2, partial [Cucurbita argyrosperma subsp. sororia]; e-value 0.0e+00; identity 81.18%
Query:  EMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASEA
        EMEAIDILIEAWQRDGLSTSEVA K SKCKLYVTCEPCIMCASALSILGI EVYYGCANDKFGGCGSILSLHLGSREAHTSGN QG+GFKCTAGIMASEA
Subjt:  EMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASEA

Query:  VALFRSFYEQGNPNAPKPHRPLVHHQAVSFMQNYG-----VPNLKRKTNRMKPLERKVDGSVTFWPRGRAIGQLQSLCIVPFYISACPSISCHIEWKAKQ
        VALFRSFYEQGNPNAP PHRPLVH++A +  Q        V  L     R  PL   +     F P GR +  L S         ACP ISC+IEWKA+Q
Subjt:  VALFRSFYEQGNPNAPKPHRPLVHHQAVSFMQNYG-----VPNLKRKTNRMKPLERKVDGSVTFWPRGRAIGQLQSLCIVPFYISACPSISCHIEWKAKQ

Query:  ------------------GAEGKHFHNWVDHSEELADCTLLLMCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYR
                           AE KHFHNWV HSEELADCTLLLMCPHCDEF HDGCRKAG IIEEKKNDGG RCLNF  +FSQIST+S MP GSKSNVVY+
Subjt:  ------------------GAEGKHFHNWVDHSEELADCTLLLMCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYR

Query:  RKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMND
        RKKLRGNSDSRLLANGTDC SLISCDGHL+E +EQA  SRH HKSEIVG+V+PP PV  GK  VSEL+S++GCTIGEG GSDETLNNNLQKSLEVDS+ND
Subjt:  RKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMND

Query:  SCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDD
        SCSSSKSNME VSTSLKVEVDDTGECSSSSIRVMEDM EDISGRDLCISILRS+GLLSSMAHA  +ESD RS+NNCFR CKTCGSS+S LKMLICDHC+D
Subjt:  SCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDD

Query:  AFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLANISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLE
        AFHV CGNHRMKKV NDEWYCNSCLKKKHKIL E ITKKLANISSRNGSSK ESNSIALML DTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA GEPLE
Subjt:  AFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLANISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLE

Query:  MDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML
        MDPS SFL+HEQSTNK  RLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQELETGQVLKQLKYIEML
Subjt:  MDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML

KAG6589690.1 tRNA-specific adenosine deaminase TAD2, partial [Cucurbita argyrosperma subsp. sororia]; e-value 7.7e-293; identity 74.93%
Query:  LVAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAH
        +V A  R +   +R +T  HAEMEAIDILIEAWQRDGLSTSEVA+KFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGS+LSLHLGSREA 
Subjt:  LVAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAH

Query:  TSGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPKPHRPLVHHQAVSFMQNYGVPNLKRKTNRMKPLERKVDGSVTFWPRGRAIGQLQSLCIVPFYI
        TS N QG+GFKCTAGIMASEAVALFRSFYEQGNPNAPKPHR L     ++          K  ++ +K    +++ S     R  +  Q+ +  +   +I
Subjt:  TSGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPKPHRPLVHHQAVSFMQNYGVPNLKRKTNRMKPLERKVDGSVTFWPRGRAIGQLQSLCIVPFYI

Query:  SACPSISCHIEWKAKQGAEGKHFHNWVDHSEELADCTLLLMCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRK
            S             E KHFHNW DHS+ELAD  LLLMC HCD FSH+GCRKAG I+EE KNDG   CLN S +F QISTVS MPE SK NVVY R+
Subjt:  SACPSISCHIEWKAKQGAEGKHFHNWVDHSEELADCTLLLMCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRK

Query:  KLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSC
        KLRGNSDSRL A  TDCISLISCDG      +QAAASRHNHK +IVG+VVP  PVY GKT VSE +SVDGCTIG+G GS+  LNN LQKSLEVDS+NDSC
Subjt:  KLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSC

Query:  SSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAF
        SSSKSNMELVSTS KVEVDDTGECSSSSIRVMEDM EDISGRDLCISILRS+GLLSSMAHA  EESDFRS+NNCFR CKTCGSS+SVLKMLICDHC+DAF
Subjt:  SSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAF

Query:  HVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLANISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD
        HVSC NHRMKKV+NDEWYCNSCLKK HKILK+ I KKLAN SSRNG SKGESNS+ALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDD DAIGE LEM 
Subjt:  HVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLANISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD

Query:  PSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML
        PSE  L+HE STNK  R S IGNWLQCQQV++GVGGVN  ICGKWRRAPLFEVQTD+WECFCS+LWDPTHADCAVPQELET QVLKQLKYIEML
Subjt:  PSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML

KAG7023369.1 tRNA-specific adenosine deaminase 2 [Cucurbita argyrosperma subsp. argyrosperma]; e-value 6.1e-250; identity 66.71%
Query:  LVAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAH
        +V A  R +   +R +T  HAEMEAIDILIEAWQRDGLSTSEVA+KFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGS+LSLHLGSREA 
Subjt:  LVAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAH

Query:  TSGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPKPHRPLVHHQAVSFMQNYGVPNLKRKTNRMKPLERKVDGSVTFWPRGRAIGQLQSLCIVPFYI
        T                                                      G+P L+  T  +  + +                            
Subjt:  TSGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPKPHRPLVHHQAVSFMQNYGVPNLKRKTNRMKPLERKVDGSVTFWPRGRAIGQLQSLCIVPFYI

Query:  SACPSISCHIEWKAKQGAEGKHFHNWVDHSEELADCTLLLMCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRK
                                  V+   E        MC HCD FSH+GCRKAG I+EE KNDG   CLN S +F QISTVS MPE SK NVVY R+
Subjt:  SACPSISCHIEWKAKQGAEGKHFHNWVDHSEELADCTLLLMCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRK

Query:  KLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSC
        KLRGNSDSRL A  TDCISLISCDG      +QAAASRHNHK +IVG+VVP  PVY GKT VSE +SVDGCTIG+G GS+  LNN LQKSLEVDS+NDSC
Subjt:  KLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSC

Query:  SSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAF
        SSSKSNMELVSTS KVEVDDTGECSSSSIRVMEDM EDISGRDLCISILRS+GLLSS AHA  EESDFRS+NNCFR CKTCGSS+SVLKMLICDHC+DAF
Subjt:  SSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAF

Query:  HVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLANISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD
        HVSC NHRMKKV+NDEWYCNSCLKK HKILK+ I KKLAN SSRNG SKGESNS+ALML+DTEPYTTGVRIGKGFQAEVPDWSGPISDD DAIGE LEM 
Subjt:  HVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLANISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD

Query:  PSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML
        PSE  L+HE STNK  R S IGNWLQCQQV++GVGGVN  ICGKWRRAPLFEVQTDDWECFCS+LWDPTHADCAVPQELET QVLKQLKYIEML
Subjt:  PSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML

XP_023531970.1 uncharacterized protein LOC111794074 isoform X1 [Cucurbita pepo subsp. pepo]; e-value 3.9e-236; identity 88.55%
Query:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN
        MCPHCDEF HDGCRKAG IIEEKKNDGGLRCLNF  +FSQIST+S MP GSKSNVVY+RKKLRGNSDSRLLANGTDC SLISCDGHL+E +EQA  SRH 
Subjt:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN

Query:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS
        HKSEIVG+V+PP PV +GK  VSEL+S++GCTIGEG GSDETLNNNLQKSLEVDSMNDSCSSSKSNME VSTSLKVEVDDTGECSSSSIRVMEDM EDIS
Subjt:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS

Query:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN
        GRDLCISILRS+GLLSS+AHA  +ESD RS+NNCFR CKTCGSS+S LKMLICDHC+DAFHV CGNHRMKKV NDEWYCNSCLKKKHKIL E ITKKLAN
Subjt:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN

Query:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV
        ISSRNGSSK ESNSIALML DTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA GEPLEMDPS SFL+HEQSTNK  RLSAIGNWLQCQQVIDGVGGVNGV
Subjt:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML
        ICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQELETGQVLKQLKYIEML
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML

XP_038878482.1 uncharacterized protein LOC120070708 isoform X1 [Benincasa hispida]; e-value 2.3e-236; identity 89.21%
Query:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN
        MCPHCDEFS DGCRKAG IIEEKKN+GG RCLNF  +F QISTVS MPE SKSNVVYRRKKLRGNSDSRLLANGTDCISL SCDGHL E +EQAAAS+HN
Subjt:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN

Query:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS
        HK+EI+G+ VPP PVYNGKT VSEL+SV+GC  GEG GSDET NNNLQKSLEVDS+NDSCSSSKSNMELVSTS+KVEVDDTGECSSSSI+VMEDM EDIS
Subjt:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS

Query:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN
        GRDLCI ILRS+GLLSSMAHAP EESDFRSDNNCFR CKTCGSSESVLKMLICDHC+DAFHVSC NHRMKKV NDEWYCNSCLKKKHKILKETI+KKLAN
Subjt:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN

Query:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV
        ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA GEPLE+DPSESFL+HE+STNK  RLS IGNWLQCQQVIDG+GGVNG 
Subjt:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML
        ICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQELETGQVLKQLKYIEML
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3BWJ2 uncharacterized protein LOC103494237 isoform X1; e-value 7.4e-225; identity 83.09%
Query:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN
        MCPHCDEFSHDGCRKAG IIEEKKN+GGLRCLNF  +F    T   M EGSKSNVVYRRKKLRG+SDSR LANGTDCISLISCDGHL E +EQAAAS+ N
Subjt:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN

Query:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS
        H+ EIVG+ VPP PV +GKT VSEL+S +GC  GEG GSDET NNNLQKSLEVDS+NDSCSSSKSNMELVSTSLKVEVDDTGECSSSSI+VMED  EDIS
Subjt:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS

Query:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN
        GRDLCISILRS+GLLSSMAH P EESD RSDNNCFR CKTCGSSESVLKMLICDHC+DAFHVSC NHRMKKV NDEWYCNSCLKKKHK+LKE I+KKL N
Subjt:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN

Query:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV
          SRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD SESFL+HEQSTNK+ RLS IGNWLQCQQV+DGVGG NG 
Subjt:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQ-------------------ELETGQVLKQLKYIEML
        ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQ                   ELETGQVLKQLKYIEML
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQ-------------------ELETGQVLKQLKYIEML

A0A1S3BXC2 uncharacterized protein LOC103494237 isoform X2; e-value 2.5e-228; identity 86.56%
Query:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN
        MCPHCDEFSHDGCRKAG IIEEKKN+GGLRCLNF  +F    T   M EGSKSNVVYRRKKLRG+SDSR LANGTDCISLISCDGHL E +EQAAAS+ N
Subjt:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN

Query:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS
        H+ EIVG+ VPP PV +GKT VSEL+S +GC  GEG GSDET NNNLQKSLEVDS+NDSCSSSKSNMELVSTSLKVEVDDTGECSSSSI+VMED  EDIS
Subjt:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS

Query:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN
        GRDLCISILRS+GLLSSMAH P EESD RSDNNCFR CKTCGSSESVLKMLICDHC+DAFHVSC NHRMKKV NDEWYCNSCLKKKHK+LKE I+KKL N
Subjt:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN

Query:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV
          SRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMD SESFL+HEQSTNK+ RLS IGNWLQCQQV+DGVGG NG 
Subjt:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML
        ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML

A0A6J1C1R7 uncharacterized protein LOC111007173; e-value 1.4e-223; identity 85.46%
Query:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN
        MCPHCDEFSH GCRKAG II+EKKN+ G  CLN   + SQISTVSTMPEGS S VVYRRKKLRGNSDSRL ANGTDCIS ISCDG L E  EQAAAS+H 
Subjt:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN

Query:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS
         +S+IVG++VP  PVY+GKTHVSEL+SV+GCTIGEG GSDETLNNNLQK+LEVDS+NDSCSSSKSNMELVSTSLKVEVDDTGECSSSSI+VMEDM EDIS
Subjt:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS

Query:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN
        GRDLCISILRS+GLLS MAHAP EES+F+SD+NCFR CK CGSSESVLKMLICDHC+DAFH+SC NHRMKKV NDEWYCNSCLKKKHK+LKETIT KLAN
Subjt:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN

Query:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV
        ISSR+GSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPI+DDTDAIGEPLE+DPSESF +HEQSTNK  RLSAIGNWLQCQQVI      NG+
Subjt:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML
        ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELET QVLKQLKYIEML
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML

A0A6J1F145 uncharacterized protein LOC111441172 isoform X1; e-value 6.0e-235; identity 88.11%
Query:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN
        MCPHCDEF HDGCRKAG IIEEKKNDGG RCLNF  +FSQIST+S MP GSKSNVVY+RKKLRGNSDSRLLANGTDC SLISCDGHL+E +EQA  S+H 
Subjt:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN

Query:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS
        HKSEIVG+V+PP PV  GK  VSEL+S++GCTIGEG GSDETLNNNLQKSLEVDS+NDSCSSSKSNME VSTSLKVEVDDTGECSSSSIRVMEDM EDIS
Subjt:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS

Query:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN
        GRDLCISILRS+GLLSSMAHA  +ESD RS+NNCFR CKTCGSS+S LKMLICDHC+DAFHV CGNHRMKKV NDEWYCNSCLKKKHKIL E ITKKLAN
Subjt:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN

Query:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV
        ISSRNGSSK ESNSIALML DTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA GEPLEMDPS SFL+HEQSTNK  RLSAIGNWLQCQQVIDGVGGVNGV
Subjt:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML
        ICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQELETGQVLKQLKYIEML
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML

A0A6J1IGX7 uncharacterized protein LOC111472812 isoform X1; e-value 6.7e-234; identity 87.89%
Query:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN
        MCPHCDEF HDGCRKAG IIEEKKNDGGLRCLNF  +FSQIST+S MP GSKSNVVY+RKKLRGNSDSRLLANGTDC SLISCDGHL+E +EQA  SRH 
Subjt:  MCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHN

Query:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS
        HKSEIVG+V+PP PV +GK  VS L+S++GCTIGEG GSDETLNNNLQKSLEVDS+NDSCSSSKSNME VSTSLKVEVDDTGECSSSSIRVMEDM EDIS
Subjt:  HKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDIS

Query:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN
        GRDLCISILRS+GLLSSMAHA  +ESD RS+NNCFR CKTCGSS+S LKMLICDHC+DAFHV CGNHRMKKV NDEWYCNSCLKKKHKIL E ITKKLAN
Subjt:  GRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLAN

Query:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV
        ISSRNGSSK ESNSIALML DTEPYTTGVRIGKGFQAEVPDWSG ISDDTDA  EPLEMDPS SFL+HEQSTNK  RLSAIGNWLQCQQVIDGVGGVNGV
Subjt:  ISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGV

Query:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML
        ICGKWRRAPLFEVQTDDWECFCSILWDP HADCAVPQELETGQVLKQLKYIEML
Subjt:  ICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEML

SwissProt top hits (e-value, % identity, alignment)
Q5E9J7 tRNA-specific adenosine deaminase 2; e-value 8.3e-24; identity 41.48%
Query:  YPVVAQNTYGKSKSIETLSAQCLDT--VPSG--MLAGLLVAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMC
        Y V A+ T    +    ++   LD   VP G  M+    V    R +   ++ +T  HAEM AID  ++  +R G S SEV   F    LYVT EPCIMC
Subjt:  YPVVAQNTYGKSKSIETLSAQCLDT--VPSG--MLAGLLVAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMC

Query:  ASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPK
        A+AL ++ I  V YGC N++FGGCGS+L +      A       GK F+CT G  A EAV + ++FY+Q NPNAPK
Subjt:  ASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPK

Q5RIV4 tRNA-specific adenosine deaminase 2; e-value 1.3e-21; identity 48.33%
Query:  HAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMAS
        HAEM A+D +++ W R  L   +      +  LYVT EPCIMCA+AL +L I  V YGC N++FGGCGS+L +       HT     G  FKC AG  A 
Subjt:  HAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNVQGKGFKCTAGIMAS

Query:  EAVALFRSFYEQGNPNAPKP
        EAV + ++FY+Q NPNAPKP
Subjt:  EAVALFRSFYEQGNPNAPKP

Q6IDB6 tRNA-specific adenosine deaminase TAD2; e-value 5.0e-53; identity 71.33%
Query:  ASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGN
        AS R +    R+   HAEMEAID L+  WQ+DGLS S+VA KFSKC LYVTCEPCIMCASALS LGIKEVYYGC NDKFGGCGSILSLHLGS EA     
Subjt:  ASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGN

Query:  VQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPKPHRPLVHHQ
         +GKG+KC  GIMA EAV+LF+ FYEQGNPNAPKPHRP+V  +
Subjt:  VQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPKPHRPLVHHQ

Q6P6J0 tRNA-specific adenosine deaminase 2; e-value 2.3e-21; identity 43.8%
Query:  VAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHT
        V    R +   ++ +T  HAEM AID +++   + G S S V   F    LYVT EPCIMCA+AL ++ I  V YGC N++FGGCGS+L++      A  
Subjt:  VAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHT

Query:  SGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPK
             G+ F+C  G  A EAV L ++FY+Q NPNAPK
Subjt:  SGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPK

Q7Z6V5 tRNA-specific adenosine deaminase 2; e-value 4.6e-22; identity 43.8%
Query:  VAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHT
        V    R +   ++ +T  HAEM AID +++  ++ G S SEV   F    LYVT EPCIMCA+AL ++ I  V YGC N++FGGCGS+L++      A  
Subjt:  VAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHT

Query:  SGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPK
             G+ F+C  G  A EAV + ++FY+Q NPNAPK
Subjt:  SGNVQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPK

Arabidopsis top hits (e-value, % identity, alignment)
AT1G48175.1 Cytidine/deoxycytidylate deaminase family protein; e-value 3.6e-54; identity 71.33%
Query:  ASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGN
        AS R +    R+   HAEMEAID L+  WQ+DGLS S+VA KFSKC LYVTCEPCIMCASALS LGIKEVYYGC NDKFGGCGSILSLHLGS EA     
Subjt:  ASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGN

Query:  VQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPKPHRPLVHHQ
         +GKG+KC  GIMA EAV+LF+ FYEQGNPNAPKPHRP+V  +
Subjt:  VQGKGFKCTAGIMASEAVALFRSFYEQGNPNAPKPHRPLVHHQ

AT1G77250.1 RING/FYVE/PHD-type zinc finger family protein; e-value 6.8e-05; identity 23.72%
Query:  SKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSV-VPPLPVYNGKTHVSELDSVDGC--TIGEGQGSDE--TLNN
        SK    Y+R+KL G S S    +  D  S+        E  E  +  R + ++ + G +  PP P        +      GC   +     S E  +LN 
Subjt:  SKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLVEGEEQAAASRHNHKSEIVGSV-VPPLPVYNGKTHVSELDSVDGC--TIGEGQGSDE--TLNN

Query:  NLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEV-DDTGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAH------------APGEESDFRSDN
         L ++L+   ++D  S +     L+ T +K  V + +    S+ ++ +    +D+ G D+ + +  S   LS  ++             P   ++   ++
Subjt:  NLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEV-DDTGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAH------------APGEESDFRSDN

Query:  NCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKK
        +    CK CG        L CDHC+D +HVSC     K +    WYC  C  K
Subjt:  NCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKK

AT2G19260.1 RING/FYVE/PHD zinc finger superfamily protein; e-value 7.7e-57; identity 43.71%
Query:  DSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLIC
        D  NDSCSS KS+ E+ STS K   DD   C SS   V                                 E+D    ++ FR CK C    +V KMLIC
Subjt:  DSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDMAEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLIC

Query:  DHCDDAFHVSCGNHRMKKVLN-DEWYCNSCLKKKHKILKETITKKLANISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA
        D C++A+H  C   +MK V   DEW C SCLK      + + TK    IS                 + T P+  G+RIGK FQA+VPDWSGP   DT  
Subjt:  DHCDDAFHVSCGNHRMKKVLN-DEWYCNSCLKKKHKILKETITKKLANISSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDA

Query:  IGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIE
        +GEPLE+  SE     +++ N   + SA+ NWLQC++        NGVICGKWRRAP  EVQT DWECFC   WDP+ ADCAVPQELET ++LKQLKYI+
Subjt:  IGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQTDDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIE

Query:  ML
        ML
Subjt:  ML

AT3G01460.1 methyl-CPG-binding domain 9; e-value 8.9e-05; identity 35.56%
Query:  CKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSC
        C  CG  ES+  +++CD C+  FH+SC N  ++   + +W C+ C
Subjt:  CKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSC

AT5G24330.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 6; e-value 7.3e-07; identity 37.14%
Query:  PGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILK
        P   SD  SD++    C+ C S +   K+L+CD CD  FH+ C    +  V    W+C SC   KH+I K
Subjt:  PGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILK
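
The %identity values reported for the hits above appear consistent with the usual definition: identical positions divided by alignment length, where a letter on the middle line of an alignment marks an identity and `+` marks a conservative substitution. The following is a minimal Python sketch under that assumption; the function name `percent_identity` and the constant `ATXR6_MIDLINE` are illustrative and not part of CuGenDB. It is applied to the middle line of the AT5G24330.1 alignment, copied with its internal spacing preserved.

```python
def percent_identity(midline: str) -> float:
    """Return identities / alignment length * 100 for a BLAST-style midline.

    Assumption: letters mark identical residues, '+' marks a conservative
    substitution, and a space marks a mismatch or gap. The midline must be
    padded to the full alignment length.
    """
    if not midline:
        raise ValueError("midline must not be empty")
    identities = sum(ch.isalpha() for ch in midline)
    return 100.0 * identities / len(midline)


# Middle line of the AT5G24330.1 alignment (70 aligned columns).
ATXR6_MIDLINE = (
    "P   SD  SD++    C+ C S +   K+L+CD CD  FH+ C    +  V    W+C SC   KH+I K"
)

print(f"{percent_identity(ATXR6_MIDLINE):.2f}")
```

With 26 identical positions over 70 aligned columns, this gives 37.14, which agrees with the value listed for that hit. Longer alignments that span several blocks would need their middle lines concatenated first.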


Sequences
CDS sequence
ATGTCTCACTGCTCTCTATCTCCATCTTGGTGTGTTATGTGTTCGAATGACAACGAGAACCCAGGCCACTTATTTGGGTGTTGCTCATTTGCTCGTTGTTACTGGTCTTA
TATCTTGGAAGCTTTTGAGTGGAATTTGGTCGTACCTAACAATGTGTTTGATCTCATATCGTTGATATTTATGGGTCATCCTTTCCATGGTTCAAAGAAGGTTTTGTGGA
AATATTGGACAACGTTTCGACGCTACCCTGTTGTTGCTCAAAATACATATGGAAAATCCAAGAGCATTGAGACGCTCTCAGCCCAGTGTCTCGACACTGTGCCTTCGGGA
ATGCTAGCTGGGTTGTTGGTTGCAGCAGCATCGAGACGCCAAAAGGGAGCGTCGAGACGCTCTACTCTCGGACACGCAGAAATGGAAGCTATTGACATCCTGATTGAGGC
ATGGCAGAGGGATGGACTTTCAACCTCAGAAGTTGCTAATAAATTCTCGAAGTGCAAACTTTATGTTACCTGTGAACCATGTATTATGTGTGCTTCTGCCCTATCAATAC
TTGGTATAAAGGAAGTATATTATGGTTGTGCAAACGATAAATTTGGTGGATGTGGATCTATATTGTCACTTCACTTGGGTAGCCGGGAGGCACATACAAGTGGTAATGTG
CAAGGAAAGGGGTTCAAATGCACTGCAGGAATAATGGCATCAGAAGCAGTTGCTCTTTTTCGAAGTTTTTACGAACAGGGGAATCCCAATGCTCCAAAACCTCACAGGCC
CCTTGTTCACCATCAGGCAGTTTCATTTATGCAAAACTATGGTGTTCCAAATCTCAAAAGAAAGACGAATAGAATGAAGCCATTGGAGAGGAAAGTTGATGGAAGTGTTA
CTTTTTGGCCAAGAGGCAGAGCAATTGGCCAACTGCAAAGCTTGTGCATTGTCCCATTTTACATTTCTGCGTGTCCTTCAATTAGTTGTCATATTGAATGGAAAGCAAAA
CAAGGAGCAGAAGGGAAGCATTTTCACAACTGGGTTGATCATTCAGAAGAACTGGCAGATTGTACGCTTCTTTTGATGTGCCCCCATTGTGATGAATTCTCTCATGATGG
CTGCAGAAAAGCTGGATCAATCATAGAGGAAAAGAAGAACGATGGTGGCTTGCGTTGCTTAAATTTTTCAAGCTCCTTTTCCCAGATATCAACTGTTAGTACGATGCCTG
AAGGTTCAAAATCTAATGTAGTATATAGGAGAAAGAAGCTTCGAGGCAATTCTGATTCCAGGTTGTTGGCTAATGGGACAGATTGTATTTCTTTGATTAGTTGCGATGGT
CATTTGGTAGAAGGCGAAGAGCAAGCTGCAGCTTCTCGCCATAACCACAAGAGTGAAATTGTTGGAAGTGTTGTCCCTCCTCTTCCTGTTTACAATGGAAAAACTCATGT
CTCTGAACTAGATTCAGTCGATGGTTGTACCATTGGGGAAGGACAAGGTTCTGACGAAACACTCAATAATAACCTGCAAAAAAGTTTGGAGGTTGACAGCATGAATGATA
GCTGCTCCTCATCAAAGTCAAACATGGAACTTGTTTCAACTTCCTTGAAGGTTGAAGTGGATGACACAGGTGAGTGCTCCTCTTCTAGTATTCGAGTTATGGAGGATATG
GCAGAGGATATATCAGGAAGAGATCTATGCATCTCTATCCTTAGAAGCCATGGGCTTCTGTCTTCTATGGCTCATGCTCCTGGGGAAGAAAGTGATTTTAGGAGTGACAA
TAATTGTTTTCGATTCTGCAAAACTTGTGGCTCTTCGGAATCAGTCTTGAAGATGTTAATTTGTGATCACTGTGACGATGCATTTCATGTCTCATGTGGCAATCATCGCA
TGAAGAAAGTGTTAAATGATGAGTGGTATTGCAATTCATGTTTGAAGAAGAAGCATAAAATTTTGAAGGAAACAATTACAAAGAAATTGGCAAACATCTCGAGTAGAAAT
GGATCTTCTAAGGGTGAATCAAATTCCATAGCATTAATGTTAAAGGACACAGAACCATATACAACTGGTGTTCGGATTGGCAAAGGTTTTCAAGCAGAAGTTCCAGATTG
GTCTGGCCCGATTTCTGATGATACTGATGCCATCGGTGAGCCACTGGAAATGGATCCTTCAGAATCTTTTCTTGTGCATGAGCAGAGTACCAATAAATCTAGTAGACTGA
GCGCTATTGGAAATTGGCTTCAATGTCAACAAGTTATAGATGGAGTGGGTGGTGTTAATGGAGTCATATGTGGCAAGTGGCGCAGGGCTCCTCTTTTTGAAGTCCAAACT
GATGACTGGGAATGCTTCTGCTCCATCCTCTGGGATCCAACTCATGCTGATTGTGCTGTACCTCAGGAATTGGAGACGGGTCAAGTTTTGAAGCAGTTGAAGTACATTGA
GATGTTATTTGCAATAATGTGGAAATATGCAGTGTTGGAGTCGCCTACATTCATCCAATCCACCTTGGTTATTTGCATCCGTCTCATATCCTCCCTGAGAAAATTATTGA
TTGAAACTGGAACTGGGGATTTGAATTGA
mRNA sequence
ATGTCTCACTGCTCTCTATCTCCATCTTGGTGTGTTATGTGTTCGAATGACAACGAGAACCCAGGCCACTTATTTGGGTGTTGCTCATTTGCTCGTTGTTACTGGTCTTA
TATCTTGGAAGCTTTTGAGTGGAATTTGGTCGTACCTAACAATGTGTTTGATCTCATATCGTTGATATTTATGGGTCATCCTTTCCATGGTTCAAAGAAGGTTTTGTGGA
AATATTGGACAACGTTTCGACGCTACCCTGTTGTTGCTCAAAATACATATGGAAAATCCAAGAGCATTGAGACGCTCTCAGCCCAGTGTCTCGACACTGTGCCTTCGGGA
ATGCTAGCTGGGTTGTTGGTTGCAGCAGCATCGAGACGCCAAAAGGGAGCGTCGAGACGCTCTACTCTCGGACACGCAGAAATGGAAGCTATTGACATCCTGATTGAGGC
ATGGCAGAGGGATGGACTTTCAACCTCAGAAGTTGCTAATAAATTCTCGAAGTGCAAACTTTATGTTACCTGTGAACCATGTATTATGTGTGCTTCTGCCCTATCAATAC
TTGGTATAAAGGAAGTATATTATGGTTGTGCAAACGATAAATTTGGTGGATGTGGATCTATATTGTCACTTCACTTGGGTAGCCGGGAGGCACATACAAGTGGTAATGTG
CAAGGAAAGGGGTTCAAATGCACTGCAGGAATAATGGCATCAGAAGCAGTTGCTCTTTTTCGAAGTTTTTACGAACAGGGGAATCCCAATGCTCCAAAACCTCACAGGCC
CCTTGTTCACCATCAGGCAGTTTCATTTATGCAAAACTATGGTGTTCCAAATCTCAAAAGAAAGACGAATAGAATGAAGCCATTGGAGAGGAAAGTTGATGGAAGTGTTA
CTTTTTGGCCAAGAGGCAGAGCAATTGGCCAACTGCAAAGCTTGTGCATTGTCCCATTTTACATTTCTGCGTGTCCTTCAATTAGTTGTCATATTGAATGGAAAGCAAAA
CAAGGAGCAGAAGGGAAGCATTTTCACAACTGGGTTGATCATTCAGAAGAACTGGCAGATTGTACGCTTCTTTTGATGTGCCCCCATTGTGATGAATTCTCTCATGATGG
CTGCAGAAAAGCTGGATCAATCATAGAGGAAAAGAAGAACGATGGTGGCTTGCGTTGCTTAAATTTTTCAAGCTCCTTTTCCCAGATATCAACTGTTAGTACGATGCCTG
AAGGTTCAAAATCTAATGTAGTATATAGGAGAAAGAAGCTTCGAGGCAATTCTGATTCCAGGTTGTTGGCTAATGGGACAGATTGTATTTCTTTGATTAGTTGCGATGGT
CATTTGGTAGAAGGCGAAGAGCAAGCTGCAGCTTCTCGCCATAACCACAAGAGTGAAATTGTTGGAAGTGTTGTCCCTCCTCTTCCTGTTTACAATGGAAAAACTCATGT
CTCTGAACTAGATTCAGTCGATGGTTGTACCATTGGGGAAGGACAAGGTTCTGACGAAACACTCAATAATAACCTGCAAAAAAGTTTGGAGGTTGACAGCATGAATGATA
GCTGCTCCTCATCAAAGTCAAACATGGAACTTGTTTCAACTTCCTTGAAGGTTGAAGTGGATGACACAGGTGAGTGCTCCTCTTCTAGTATTCGAGTTATGGAGGATATG
GCAGAGGATATATCAGGAAGAGATCTATGCATCTCTATCCTTAGAAGCCATGGGCTTCTGTCTTCTATGGCTCATGCTCCTGGGGAAGAAAGTGATTTTAGGAGTGACAA
TAATTGTTTTCGATTCTGCAAAACTTGTGGCTCTTCGGAATCAGTCTTGAAGATGTTAATTTGTGATCACTGTGACGATGCATTTCATGTCTCATGTGGCAATCATCGCA
TGAAGAAAGTGTTAAATGATGAGTGGTATTGCAATTCATGTTTGAAGAAGAAGCATAAAATTTTGAAGGAAACAATTACAAAGAAATTGGCAAACATCTCGAGTAGAAAT
GGATCTTCTAAGGGTGAATCAAATTCCATAGCATTAATGTTAAAGGACACAGAACCATATACAACTGGTGTTCGGATTGGCAAAGGTTTTCAAGCAGAAGTTCCAGATTG
GTCTGGCCCGATTTCTGATGATACTGATGCCATCGGTGAGCCACTGGAAATGGATCCTTCAGAATCTTTTCTTGTGCATGAGCAGAGTACCAATAAATCTAGTAGACTGA
GCGCTATTGGAAATTGGCTTCAATGTCAACAAGTTATAGATGGAGTGGGTGGTGTTAATGGAGTCATATGTGGCAAGTGGCGCAGGGCTCCTCTTTTTGAAGTCCAAACT
GATGACTGGGAATGCTTCTGCTCCATCCTCTGGGATCCAACTCATGCTGATTGTGCTGTACCTCAGGAATTGGAGACGGGTCAAGTTTTGAAGCAGTTGAAGTACATTGA
GATGTTATTTGCAATAATGTGGAAATATGCAGTGTTGGAGTCGCCTACATTCATCCAATCCACCTTGGTTATTTGCATCCGTCTCATATCCTCCCTGAGAAAATTATTGA
TTGAAACTGGAACTGGGGATTTGAATTGA
Protein sequence
MSHCSLSPSWCVMCSNDNENPGHLFGCCSFARCYWSYILEAFEWNLVVPNNVFDLISLIFMGHPFHGSKKVLWKYWTTFRRYPVVAQNTYGKSKSIETLSAQCLDTVPSG
MLAGLLVAAASRRQKGASRRSTLGHAEMEAIDILIEAWQRDGLSTSEVANKFSKCKLYVTCEPCIMCASALSILGIKEVYYGCANDKFGGCGSILSLHLGSREAHTSGNV
QGKGFKCTAGIMASEAVALFRSFYEQGNPNAPKPHRPLVHHQAVSFMQNYGVPNLKRKTNRMKPLERKVDGSVTFWPRGRAIGQLQSLCIVPFYISACPSISCHIEWKAK
QGAEGKHFHNWVDHSEELADCTLLLMCPHCDEFSHDGCRKAGSIIEEKKNDGGLRCLNFSSSFSQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDG
HLVEGEEQAAASRHNHKSEIVGSVVPPLPVYNGKTHVSELDSVDGCTIGEGQGSDETLNNNLQKSLEVDSMNDSCSSSKSNMELVSTSLKVEVDDTGECSSSSIRVMEDM
AEDISGRDLCISILRSHGLLSSMAHAPGEESDFRSDNNCFRFCKTCGSSESVLKMLICDHCDDAFHVSCGNHRMKKVLNDEWYCNSCLKKKHKILKETITKKLANISSRN
GSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISDDTDAIGEPLEMDPSESFLVHEQSTNKSSRLSAIGNWLQCQQVIDGVGGVNGVICGKWRRAPLFEVQT
DDWECFCSILWDPTHADCAVPQELETGQVLKQLKYIEMLFAIMWKYAVLESPTFIQSTLVICIRLISSLRKLLIETGTGDLN
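
The protein sequence above looks like the standard-genetic-code translation of the CDS sequence. The following is a minimal Python sketch for checking that relationship, assuming the standard nuclear code and an in-frame CDS that starts at the first base; the names `CODON_TABLE` and `translate` are illustrative and do not come from CuGenDB. It is shown on the first seven codons and the last four codons of the CDS.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGTCTCACTGCTCTCTATCT"))  # start of the CDS
print(translate("GATTTGAATTGA"))           # end of the CDS, including the stop
```

The two calls print MSHCSLS and DLN, matching the first and last residues of the listed protein. To check the whole record, the CDS lines would be joined into a single string before calling `translate`.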