CuGenDBv2

Sgr023746 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023746
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Zinc finger protein
Genome location: tig00000892:6248648..6252861
RNA-Seq Expression: Sgr023746
Synteny: Sgr023746
Gene Ontology terms:
  GO:0006351 - transcription, DNA-templated (biological process)
  GO:0070176 - DRM complex (cellular component)
InterPro domains:
  IPR013087 - Zinc finger C2H2-type
  IPR036236 - Zinc finger C2H2 superfamily
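The genome location above uses a `contig:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the snippet below parses it and reports the span length. The `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the coordinates are assumed to be 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location."""
    contig: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        # Span length under the 1-based inclusive assumption.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into contig, start and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00000892:6248648..6252861")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under that assumption the gene spans 4214 bp on the contig, which is longer than the coding sequence listed further down, as would be expected if the gene model contains introns.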


Homology
GenBank top hits (e value, % identity, alignment)
KAG6570979.1 hypothetical protein SDJN03_29894, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.7e-116 | % identity: 73.19
Query:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL
        MVQKSIDSKFSEYGHGNSGKD P  EKQLQISAKKTALRDLQNDN VTASNCTGSSPLLKERGPSSDFIKVSGN       P +P HLHSSTSNA+NGHL
Subjt:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL

Query:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS
        VYV                                 ET HLKSQVKEL+  CFPAFAPFP+VSPMNASGKPSVPHH+GKYGIN ATAESN H APSTVPS
Subjt:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS

Query:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSETFSEF
             GWKNLQWEDRY QLQLLLNKLDQ+DQQDYLQVLRSLSSVELSRHAVELE+RSIQLSLEEAKELQRVGVLNVLGNPVK IK PLTH DGSET    
Subjt:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSETFSEF

Query:  FAMDDTDSDPLFPFNVF
          +      P+ PF++F
Subjt:  FAMDDTDSDPLFPFNVF

XP_022140529.1 uncharacterized protein LOC111011167 [Momordica charantia] | e value: 6.0e-124 | % identity: 81.42
Query:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL
        MVQK IDSKFSEYGHGNSGKD  PHEKQLQISAKKTALRDLQN+N VTASNCTGS PLLKE GP SDFIKVS NKRPS VCPTSPPHLHSSTSNAANGHL
Subjt:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL

Query:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS
        VYV                                 ETVHLKSQVKELKNHCFPAFAPFPVV PMNASG PSVPHHIGKYGINLATAESN HSA STVPS
Subjt:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS

Query:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET
        VGIP GWKNLQWEDRY QLQLLLNKLDQ+DQQDYLQVLRSLSSVELSRHAV LEKRSIQLSLEEAKELQRVGVLNVLGNP K+IK PL HQDGSET
Subjt:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET

XP_022943750.1 uncharacterized protein LOC111448407 [Cucurbita moschata] | e value: 3.9e-115 | % identity: 77.03
Query:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL
        MVQKSIDSKFSEYGHGNSGKD P  EKQLQISAKKTALRDLQNDN VTASNCTGSSPLLKERGPSSDFIKVSGN       P +P HLHSSTSNA+NGHL
Subjt:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL

Query:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS
        VYV                                 ET HLKSQVKEL+  CFPAFAPFP+VSPMNASGKPSVPHH+GKYGIN ATAESN H APSTVPS
Subjt:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS

Query:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET
             GWKNLQWEDRY QLQLLLNKLDQ+DQQDYLQVLRSLSSVELSRHAVELE+RSIQLSLEEAKELQRVGVLNVLGNPVK IK PLTH DGSET
Subjt:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET

XP_022986425.1 uncharacterized protein LOC111484175 [Cucurbita maxima] | e value: 2.7e-116 | % identity: 77.36
Query:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL
        MVQKSIDSKFSEYGHGNSGKD P  EKQLQISAKKTALRDLQNDN VTASNCTGSSPLLKERGPSSDFIKVSGN       P +P HLHSSTSNA+NGHL
Subjt:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL

Query:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS
        VYV                                 ET HLKSQVKEL+NHCFPAFAPFP+VSPMNASGKPSVPHH+GKYGIN  TAESN H APSTVPS
Subjt:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS

Query:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET
             GWKNLQWEDRY QLQLLLNKLDQ+DQQDYLQVLRSLSSVELSRHAVELE+RSIQLSLEEAKELQRVGVLNVLGNPVK IK PLTHQ+GSET
Subjt:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET

XP_038878250.1 uncharacterized protein LOC120070536 [Benincasa hispida] | e value: 3.3e-122 | % identity: 79.73
Query:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL
        MVQKSIDSKFSEYGHGNSGKD    EKQLQISAKKTALRDLQNDN +TASNC GSSPLLKERG SSD IKVSGNKR SPVCP SP HLHSS SNAANGHL
Subjt:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL

Query:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS
        VYV                                 ET HLKSQVKEL+NHCF AFAPFP+VSPMNA GKPSVPHH+GK G NLATAESN  SAPST PS
Subjt:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS

Query:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET
        VGIPTGWKNLQWEDRY QLQLLLNKLDQ+DQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVK+IKAPLTHQDGSET
Subjt:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CFB7 zinc finger protein ZAT3 | e value: 1.6e-109 | % identity: 73.79
Query:  DTDSDPLFPFNVF-PATAFVAAASASASASIAAGQNSSRRKRTKLIKMSDHLLAPSSSGGIAKPKYGKKPDPSAPKITRPCSECGKKFWSWKALFGHMRC
        DTDSD LF FNVF PA+AFVAAA+AS SA   A QN  RRKRTKLIKMSDH+LAPSSSGGIAKPKYGKKPDPSAPKITRPCSECGKKFWSWKALFGHMRC
Subjt:  DTDSDPLFPFNVF-PATAFVAAASASASASIAAGQNSSRRKRTKLIKMSDHLLAPSSSGGIAKPKYGKKPDPSAPKITRPCSECGKKFWSWKALFGHMRC

Query:  HPERQWRGINPPPNFRRSTCPMESETTEEDNEIAACLIMLANGPNAIDQ-TEAVTETDCREIQASEAA--ALGRFTECTGPPSWCRFECSSCRKVFASHQ
        HPERQWRGINPPPN RRS+    +  TEED+EIAACLIMLANGPNAID+ TEAVTETD    Q SE A  AL RF E TGPP + RFECS C+K F SHQ
Subjt:  HPERQWRGINPPPNFRRSTCPMESETTEEDNEIAACLIMLANGPNAIDQ-TEAVTETDCREIQASEAA--ALGRFTECTGPPSWCRFECSSCRKVFASHQ

Query:  ALGGHRASHKNVKGCFAIARTDGCGFADAGDENTGTEQECSICMKVSSRGQEQKRRHWEKEEDDQPYSMGKEEGSGYLDLNLPPPMEDIEESSSSFCSSG
        ALGGHRASHKNVKGCFA  RTDG  F+DA +EN  T  EC +CMKV       KRRHWEKE+++Q  S     G  YLDLNLPPPMED+EE+SS F  SG
Subjt:  ALGGHRASHKNVKGCFAIARTDGCGFADAGDENTGTEQECSICMKVSSRGQEQKRRHWEKEEDDQPYSMGKEEGSGYLDLNLPPPMEDIEESSSSFCSSG

Query:  LVLDLRLGL
        +VLDLRLGL
Subjt:  LVLDLRLGL
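In these alignments the unlabeled middle line between `Query:` and `Subjt:` follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how identities and positives could be tallied from that line, the snippet below counts them for the short final segment shown above. The helper name `midline_counts` is hypothetical, and the per-segment counts are not the same thing as the whole-hit % identity reported in the hit header.

```python
from typing import Tuple


def midline_counts(midline: str) -> Tuple[int, int]:
    """Return (identities, positives) for one BLAST-style midline segment."""
    # Letters in the midline are identical residues.
    identities = sum(ch.isalpha() for ch in midline)
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives


# Final alignment segment of the A0A6J1CFB7 hit, copied from the record.
query = "LVLDLRLGL"
midline = "+VLDLRLGL"

identities, positives = midline_counts(midline)
print(f"{identities}/{len(query)} identical, {positives}/{len(query)} positive")
```

The midline must keep its original column alignment for counts over longer segments to be meaningful, since spaces carry information.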

A0A6J1CFY6 uncharacterized protein LOC111011167 | e value: 2.9e-124 | % identity: 81.42
Query:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL
        MVQK IDSKFSEYGHGNSGKD  PHEKQLQISAKKTALRDLQN+N VTASNCTGS PLLKE GP SDFIKVS NKRPS VCPTSPPHLHSSTSNAANGHL
Subjt:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL

Query:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS
        VYV                                 ETVHLKSQVKELKNHCFPAFAPFPVV PMNASG PSVPHHIGKYGINLATAESN HSA STVPS
Subjt:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS

Query:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET
        VGIP GWKNLQWEDRY QLQLLLNKLDQ+DQQDYLQVLRSLSSVELSRHAV LEKRSIQLSLEEAKELQRVGVLNVLGNP K+IK PL HQDGSET
Subjt:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET

A0A6J1FY79 uncharacterized protein LOC111448407 | e value: 1.9e-115 | % identity: 77.03
Query:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL
        MVQKSIDSKFSEYGHGNSGKD P  EKQLQISAKKTALRDLQNDN VTASNCTGSSPLLKERGPSSDFIKVSGN       P +P HLHSSTSNA+NGHL
Subjt:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL

Query:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS
        VYV                                 ET HLKSQVKEL+  CFPAFAPFP+VSPMNASGKPSVPHH+GKYGIN ATAESN H APSTVPS
Subjt:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS

Query:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET
             GWKNLQWEDRY QLQLLLNKLDQ+DQQDYLQVLRSLSSVELSRHAVELE+RSIQLSLEEAKELQRVGVLNVLGNPVK IK PLTH DGSET
Subjt:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET

A0A6J1G8C0 uncharacterized protein LOC111451757 | e value: 9.5e-99 | % identity: 70.65
Query:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL
        MVQKSIDSK S     NSGK++P HEKQLQISAKKTALRDLQNDN V ASNCTGSSPLLKERGPSSDFIKVSGN +PSPV  TSPP L SSTSN   GHL
Subjt:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL

Query:  VYVH------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPSVG
        VY+                               ETVHLKSQVKEL++HCFPAFAPF +VSPMNASGKPSVPH   KYGINLATAES+  SA        
Subjt:  VYVH------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPSVG

Query:  IPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSE
            WKNLQWE RY QL+LLLNKL+Q+DQQDYLQVLRSLSSVELSRHAVELEKRSI LS EEAKELQRVGVLNVLGNPV +IK PL HQDGS+
Subjt:  IPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSE

A0A6J1JE12 uncharacterized protein LOC111484175 | e value: 1.3e-116 | % identity: 77.36
Query:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL
        MVQKSIDSKFSEYGHGNSGKD P  EKQLQISAKKTALRDLQNDN VTASNCTGSSPLLKERGPSSDFIKVSGN       P +P HLHSSTSNA+NGHL
Subjt:  MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHL

Query:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS
        VYV                                 ET HLKSQVKEL+NHCFPAFAPFP+VSPMNASGKPSVPHH+GKYGIN  TAESN H APSTVPS
Subjt:  VYVH--------------------------------ETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPS

Query:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET
             GWKNLQWEDRY QLQLLLNKLDQ+DQQDYLQVLRSLSSVELSRHAVELE+RSIQLSLEEAKELQRVGVLNVLGNPVK IK PLTHQ+GSET
Subjt:  VGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSET

SwissProt top hits (e value, % identity, alignment)
O65499 Zinc finger protein ZAT3 | e value: 9.2e-59 | % identity: 46.26
Query:  NSSRRKRTKLIKMSDHLLAPSSSGGIAKPKYGKKPDPSAPKITRPCSECGKKFWSWKALFGHMRCHPERQWRGINPPPNFRRSTCPMESE----------
        +S ++KRTK +  S    + SS    +KPKY KKPDP+APKITRPC+ECG+KFWSWKALFGHMRCHPERQWRGINPPPN+R  T     +          
Subjt:  NSSRRKRTKLIKMSDHLLAPSSSGGIAKPKYGKKPDPSAPKITRPCSECGKKFWSWKALFGHMRCHPERQWRGINPPPNFRRSTCPMESE----------

Query:  -TTEEDNEIAACLIMLANGPNAIDQTEAVTETDCREIQASEAAALGRFTECTGPPSWCRFECSSCRKVFASHQALGGHRASHKNVKGCFAIAR-TDG--C
          +EED+E+A+CL+ML+NG  +    E                               RFEC  C+KVF SHQALGGHRASHKNVKGCFAI   TD    
Subjt:  -TTEEDNEIAACLIMLANGPNAIDQTEAVTETDCREIQASEAAALGRFTECTGPPSWCRFECSSCRKVFASHQALGGHRASHKNVKGCFAIAR-TDG--C

Query:  GFADAGDENTGT------EQECSICMKVSSRGQE---QKRRHWEKEEDDQPYSMGKEEGSGYLDLNLPPPMEDIEESSSSFCSSGLVLDLRLGL
            +G ++ G         +C+IC +V S GQ      R HWEKEE+           SG LDLN+PP ++D+  S +S C     LDLRLGL
Subjt:  GFADAGDENTGT------EQECSICMKVSSRGQE---QKRRHWEKEEDDQPYSMGKEEGSGYLDLNLPPPMEDIEESSSSFCSSGLVLDLRLGL

Q39092 Zinc finger protein ZAT1 | e value: 1.0e-09 | % identity: 40.52
Query:  FECSSCRKVFASHQALGGHRASHKNVKGCFAIARTDGCG---FADAGDENTGTEQECSICMKVSSRGQE---QKRRHWEKEEDDQPYSMGKEEGSGYLDL
        FEC +C KVF S+QALGGHRASHK       IA TD  G         ++T +  EC IC KV + GQ     KR H     +   ++         +DL
Subjt:  FECSSCRKVFASHQALGGHRASHKNVKGCFAIARTDGCG---FADAGDENTGTEQECSICMKVSSRGQE---QKRRHWEKEEDDQPYSMGKEEGSGYLDL

Query:  NLPPPMEDIEESSSSF
        NLP P E+ E +SS F
Subjt:  NLPPPMEDIEESSSSF

Q9M202 Zinc finger protein ZAT9 | e value: 5.3e-06 | % identity: 32.76
Query:  TTEEDNEIAACLIMLANGPNAIDQT--EAVTETDCREIQASEAAALGRFTECTGPPSWCRFECSSCRKVFASHQALGGHRASHKNVKGCFAIARTDGCGF
        TTEED  +A CL+ML+      +++  E V E +  E    E+    +    T   +  R++C +C KVF S+QALGGHRASHK  K   +  +T+    
Subjt:  TTEEDNEIAACLIMLANGPNAIDQT--EAVTETDCREIQASEAAALGRFTECTGPPSWCRFECSSCRKVFASHQALGGHRASHKNVKGCFAIARTDGCGF

Query:  ADAGDENTGTEQ--ECSICMKVSSRGQE---QKRRHWEKE-EDDQPYSMGKEEG--SGYLDLNLPPPMEDIEES
         +  +     ++  EC IC++V + GQ     KR H       +Q   + + E      +DLNLP P E+ E S
Subjt:  ADAGDENTGTEQ--ECSICMKVSSRGQE---QKRRHWEKE-EDDQPYSMGKEEG--SGYLDLNLPPPMEDIEES

Q9SHD0 Zinc finger protein ZAT4 | e value: 2.5e-08 | % identity: 32.79
Query:  TTEEDNEIAACLIMLANG--------PNAIDQTEAVTETDCREIQASEAAALGRFTECTGPPSWCRFECSSCRKVFASHQALGGHRASHKNVKGCFAIAR
        TTEED  +A CLIML+             +++ E  T+ D  + ++S++                RF+C +C KVF S+QALGGHRASHK  K C  + +
Subjt:  TTEEDNEIAACLIMLANG--------PNAIDQTEAVTETDCREIQASEAAALGRFTECTGPPSWCRFECSSCRKVFASHQALGGHRASHKNVKGCFAIAR

Query:  TDGCGFADAGDENTGTEQECSICMKVSSRGQE---QKRRHWEKEEDDQPYSMG-----KEEGS---GYLDLNLPPPMEDIEES
        T+                EC IC +V + GQ     KR H       +  S+      +EE S     +DLNLP P E+ E S
Subjt:  TDGCGFADAGDENTGTEQECSICMKVSSRGQE---QKRRHWEKEEDDQPYSMG-----KEEGS---GYLDLNLPPPMEDIEES

Q9SIJ0 Zinc finger protein ZAT2 | e value: 2.7e-42 | % identity: 39.27
Query:  ASASASIAAGQNSSRRKRTKLIKMSDHLLAPSSSGGIAKPKYGKKPDPSAPKITRPCSECGKKFWSWKALFGHMRCHPERQWRGINPPPNFRR------S
        AS++ ++ +   + RRKRTKL           SS    +PK   +PDP A +I RPC+ECGK+F S KALFGHMRCHPERQWRGINPP NF+R      +
Subjt:  ASASASIAAGQNSSRRKRTKLIKMSDHLLAPSSSGGIAKPKYGKKPDPSAPKITRPCSECGKKFWSWKALFGHMRCHPERQWRGINPPPNFRR------S

Query:  TCPMESETTEEDNEIAACLIMLANGPNAIDQTEAVTETDCREIQASEAAALGRFTECTGPPSWCRFECSSCRKVFASHQALGGHRASHKNVKGCFAIAR-
        +     + +EE++ IA+CL+M+ANG           +   R  +  E                 RFEC  C+KVF SHQALGGHRA+HK+VKGCFA    
Subjt:  TCPMESETTEEDNEIAACLIMLANGPNAIDQTEAVTETDCREIQASEAAALGRFTECTGPPSWCRFECSSCRKVFASHQALGGHRASHKNVKGCFAIAR-

Query:  --------TDGCGFADAGDE---NTGTEQECSICMKVSSRGQE---QKRRHWEKEEDDQPYSMGKEEGSGYLDLNLPPPMEDIEESSSSFCSSGLVLDLR
                       D G      +G    C+IC +V S GQ      R HWEK+++        E     +DLN+P        ++SS  + G  LDLR
Subjt:  --------TDGCGFADAGDE---NTGTEQECSICMKVSSRGQE---QKRRHWEKEEDDQPYSMGKEEGSGYLDLNLPPPMEDIEESSSSFCSSGLVLDLR

Query:  LGL
        LGL
Subjt:  LGL

Arabidopsis top hits (e value, % identity, alignment)
AT2G17180.1 C2H2-like zinc finger protein | e value: 1.9e-43 | % identity: 39.27
Query:  ASASASIAAGQNSSRRKRTKLIKMSDHLLAPSSSGGIAKPKYGKKPDPSAPKITRPCSECGKKFWSWKALFGHMRCHPERQWRGINPPPNFRR------S
        AS++ ++ +   + RRKRTKL           SS    +PK   +PDP A +I RPC+ECGK+F S KALFGHMRCHPERQWRGINPP NF+R      +
Subjt:  ASASASIAAGQNSSRRKRTKLIKMSDHLLAPSSSGGIAKPKYGKKPDPSAPKITRPCSECGKKFWSWKALFGHMRCHPERQWRGINPPPNFRR------S

Query:  TCPMESETTEEDNEIAACLIMLANGPNAIDQTEAVTETDCREIQASEAAALGRFTECTGPPSWCRFECSSCRKVFASHQALGGHRASHKNVKGCFAIAR-
        +     + +EE++ IA+CL+M+ANG           +   R  +  E                 RFEC  C+KVF SHQALGGHRA+HK+VKGCFA    
Subjt:  TCPMESETTEEDNEIAACLIMLANGPNAIDQTEAVTETDCREIQASEAAALGRFTECTGPPSWCRFECSSCRKVFASHQALGGHRASHKNVKGCFAIAR-

Query:  --------TDGCGFADAGDE---NTGTEQECSICMKVSSRGQE---QKRRHWEKEEDDQPYSMGKEEGSGYLDLNLPPPMEDIEESSSSFCSSGLVLDLR
                       D G      +G    C+IC +V S GQ      R HWEK+++        E     +DLN+P        ++SS  + G  LDLR
Subjt:  --------TDGCGFADAGDE---NTGTEQECSICMKVSSRGQE---QKRRHWEKEEDDQPYSMGKEEGSGYLDLNLPPPMEDIEESSSSFCSSGLVLDLR

Query:  LGL
        LGL
Subjt:  LGL

AT2G45250.1 Integral membrane protein hemolysin-III homolog | e value: 1.7e-23 | % identity: 40
Query:  SSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHLVYVHETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLAT
        SS +    G   D  K       S +    PP    +T+NAA+G LVYV   V + +           A A     +P      P +P            
Subjt:  SSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHLVYVHETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLAT

Query:  AESNVHSAPSTVPSVGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKA
          S+   A +  P+   PT  K L WE+RY  LQ+LLNKL+Q+D+ D++Q+L SLSS ELS+HAV+LEKRSIQ SLEEA+E+QRV  LNVLG  V  IK+
Subjt:  AESNVHSAPSTVPSVGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKA

AT2G45250.2 Integral membrane protein hemolysin-III homolog | e value: 4.5e-16 | % identity: 37.64
Query:  SSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHLVYVHETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLAT
        SS +    G   D  K       S +    PP    +T+NAA+G LVYV   V + +           A A     +P      P +P            
Subjt:  SSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHLVYVHETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLAT

Query:  AESNVHSAPSTVPSVGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEE
          S+   A +  P+   PT  K L WE+RY  LQ+LLNKL+Q+D+ D++Q+L SLSS ELS+HAV+LEKRSIQ SLEE
Subjt:  AESNVHSAPSTVPSVGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEE

AT4G35280.1 C2H2-like zinc finger protein | e value: 6.6e-60 | % identity: 46.26
Query:  NSSRRKRTKLIKMSDHLLAPSSSGGIAKPKYGKKPDPSAPKITRPCSECGKKFWSWKALFGHMRCHPERQWRGINPPPNFRRSTCPMESE----------
        +S ++KRTK +  S    + SS    +KPKY KKPDP+APKITRPC+ECG+KFWSWKALFGHMRCHPERQWRGINPPPN+R  T     +          
Subjt:  NSSRRKRTKLIKMSDHLLAPSSSGGIAKPKYGKKPDPSAPKITRPCSECGKKFWSWKALFGHMRCHPERQWRGINPPPNFRRSTCPMESE----------

Query:  -TTEEDNEIAACLIMLANGPNAIDQTEAVTETDCREIQASEAAALGRFTECTGPPSWCRFECSSCRKVFASHQALGGHRASHKNVKGCFAIAR-TDG--C
          +EED+E+A+CL+ML+NG  +    E                               RFEC  C+KVF SHQALGGHRASHKNVKGCFAI   TD    
Subjt:  -TTEEDNEIAACLIMLANGPNAIDQTEAVTETDCREIQASEAAALGRFTECTGPPSWCRFECSSCRKVFASHQALGGHRASHKNVKGCFAIAR-TDG--C

Query:  GFADAGDENTGT------EQECSICMKVSSRGQE---QKRRHWEKEEDDQPYSMGKEEGSGYLDLNLPPPMEDIEESSSSFCSSGLVLDLRLGL
            +G ++ G         +C+IC +V S GQ      R HWEKEE+           SG LDLN+PP ++D+  S +S C     LDLRLGL
Subjt:  GFADAGDENTGT------EQECSICMKVSSRGQE---QKRRHWEKEEDDQPYSMGKEEGSGYLDLNLPPPMEDIEESSSSFCSSGLVLDLRLGL

AT4G38280.1 BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog (TAIR:AT2G45250.1) | e value: 9.9e-24 | % identity: 39.58
Query:  GPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHLVYVHETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSA
        G S D  K +     S +    PP    +T+NAA+G LVYV   V + +                   S  N +  P+              A   + S+
Subjt:  GPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHLVYVHETVHLKSQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSA

Query:  PSTVPSVGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKA
        P+  P+   PT  K L WE+RY  LQ+LLNKL+Q+D+ D++Q+L SLSS ELS+HAV+LEKRSIQ SLEEA+E+QRV  LN+LG  V  +K+
Subjt:  PSTVPSVGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVELEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKA


Sequences
CDS sequence
ATGGTTCAAAAATCCATAGACTCCAAATTCAGTGAATATGGGCATGGAAATTCTGGGAAGGACGCACCTCCTCATGAAAAGCAACTGCAGATTTCTGCAAAGAAGACAGC
ATTAAGGGATTTGCAAAATGATAATATGGTCACAGCTTCCAATTGTACGGGAAGCTCCCCTCTTTTGAAGGAAAGAGGTCCCAGTAGTGACTTCATTAAAGTTTCTGGTA
ACAAGAGACCCTCACCTGTCTGCCCAACGAGTCCGCCTCATCTCCATTCTTCAACCTCTAATGCTGCAAATGGGCATCTTGTTTACGTCCATGAAACCGTGCATCTCAAA
TCCCAGGTTAAGGAGCTAAAGAATCATTGCTTTCCGGCATTTGCTCCATTTCCAGTGGTTTCTCCTATGAATGCATCTGGAAAACCTTCAGTTCCTCACCACATTGGAAA
GTATGGCATTAATTTAGCCACAGCAGAATCAAACGTTCATTCTGCACCTTCTACTGTCCCTTCAGTAGGCATTCCAACAGGATGGAAAAATTTGCAGTGGGAAGACAGAT
ATCGTCAGTTGCAGTTGTTATTGAATAAATTGGACCAAGCAGATCAACAAGATTATCTTCAGGTGCTTCGATCGCTGTCATCAGTTGAACTTAGCAGACATGCAGTTGAA
TTGGAAAAGAGATCCATTCAGCTCTCGCTCGAGGAAGCAAAAGAGTTGCAGCGAGTTGGGGTTTTGAATGTGCTAGGGAATCCTGTGAAGCATATCAAAGCGCCATTGAC
TCATCAAGACGGATCAGAGACGTTTTCTGAATTCTTCGCCATGGACGACACTGATTCCGATCCTCTCTTTCCCTTTAACGTTTTTCCTGCTACTGCTTTTGTCGCCGCTG
CTTCTGCTTCTGCTTCTGCTTCCATTGCGGCCGGTCAAAATTCTTCTCGCCGGAAGCGTACCAAACTGATCAAAATGAGTGATCATCTTCTTGCGCCGTCGAGTTCCGGC
GGCATTGCGAAGCCGAAATATGGAAAGAAGCCGGACCCGAGCGCTCCGAAGATTACTCGGCCGTGTAGTGAGTGCGGGAAGAAGTTCTGGTCGTGGAAGGCTCTGTTCGG
CCACATGCGGTGCCATCCAGAGCGTCAATGGCGGGGAATCAATCCGCCGCCGAACTTCCGACGGTCCACTTGCCCTATGGAATCGGAAACGACGGAGGAAGATAACGAAA
TAGCGGCGTGTTTGATCATGTTGGCCAACGGCCCCAATGCGATCGATCAAACCGAAGCAGTTACCGAAACTGACTGTCGAGAAATCCAAGCATCCGAGGCGGCGGCGTTG
GGGAGGTTCACGGAGTGTACCGGTCCGCCGTCGTGGTGCCGGTTCGAGTGCTCGAGTTGCAGGAAAGTGTTTGCGTCGCATCAGGCTCTCGGGGGACACAGAGCAAGCCA
CAAGAACGTGAAGGGCTGCTTCGCCATTGCGAGAACGGACGGCTGTGGGTTCGCCGACGCCGGCGACGAGAACACGGGAACGGAGCAGGAGTGCAGCATTTGCATGAAGG
TGTCGTCGAGGGGACAGGAGCAGAAGAGGCGGCATTGGGAGAAGGAAGAAGATGATCAGCCATATTCAATGGGGAAGGAAGAAGGGTCTGGTTATTTGGACTTGAATCTG
CCTCCTCCAATGGAAGACATTGAAGAATCCTCCTCCTCATTCTGTTCTTCAGGGCTTGTTTTGGATCTAAGATTGGGTCTTAATTGA
mRNA sequence
ATGGTTCAAAAATCCATAGACTCCAAATTCAGTGAATATGGGCATGGAAATTCTGGGAAGGACGCACCTCCTCATGAAAAGCAACTGCAGATTTCTGCAAAGAAGACAGC
ATTAAGGGATTTGCAAAATGATAATATGGTCACAGCTTCCAATTGTACGGGAAGCTCCCCTCTTTTGAAGGAAAGAGGTCCCAGTAGTGACTTCATTAAAGTTTCTGGTA
ACAAGAGACCCTCACCTGTCTGCCCAACGAGTCCGCCTCATCTCCATTCTTCAACCTCTAATGCTGCAAATGGGCATCTTGTTTACGTCCATGAAACCGTGCATCTCAAA
TCCCAGGTTAAGGAGCTAAAGAATCATTGCTTTCCGGCATTTGCTCCATTTCCAGTGGTTTCTCCTATGAATGCATCTGGAAAACCTTCAGTTCCTCACCACATTGGAAA
GTATGGCATTAATTTAGCCACAGCAGAATCAAACGTTCATTCTGCACCTTCTACTGTCCCTTCAGTAGGCATTCCAACAGGATGGAAAAATTTGCAGTGGGAAGACAGAT
ATCGTCAGTTGCAGTTGTTATTGAATAAATTGGACCAAGCAGATCAACAAGATTATCTTCAGGTGCTTCGATCGCTGTCATCAGTTGAACTTAGCAGACATGCAGTTGAA
TTGGAAAAGAGATCCATTCAGCTCTCGCTCGAGGAAGCAAAAGAGTTGCAGCGAGTTGGGGTTTTGAATGTGCTAGGGAATCCTGTGAAGCATATCAAAGCGCCATTGAC
TCATCAAGACGGATCAGAGACGTTTTCTGAATTCTTCGCCATGGACGACACTGATTCCGATCCTCTCTTTCCCTTTAACGTTTTTCCTGCTACTGCTTTTGTCGCCGCTG
CTTCTGCTTCTGCTTCTGCTTCCATTGCGGCCGGTCAAAATTCTTCTCGCCGGAAGCGTACCAAACTGATCAAAATGAGTGATCATCTTCTTGCGCCGTCGAGTTCCGGC
GGCATTGCGAAGCCGAAATATGGAAAGAAGCCGGACCCGAGCGCTCCGAAGATTACTCGGCCGTGTAGTGAGTGCGGGAAGAAGTTCTGGTCGTGGAAGGCTCTGTTCGG
CCACATGCGGTGCCATCCAGAGCGTCAATGGCGGGGAATCAATCCGCCGCCGAACTTCCGACGGTCCACTTGCCCTATGGAATCGGAAACGACGGAGGAAGATAACGAAA
TAGCGGCGTGTTTGATCATGTTGGCCAACGGCCCCAATGCGATCGATCAAACCGAAGCAGTTACCGAAACTGACTGTCGAGAAATCCAAGCATCCGAGGCGGCGGCGTTG
GGGAGGTTCACGGAGTGTACCGGTCCGCCGTCGTGGTGCCGGTTCGAGTGCTCGAGTTGCAGGAAAGTGTTTGCGTCGCATCAGGCTCTCGGGGGACACAGAGCAAGCCA
CAAGAACGTGAAGGGCTGCTTCGCCATTGCGAGAACGGACGGCTGTGGGTTCGCCGACGCCGGCGACGAGAACACGGGAACGGAGCAGGAGTGCAGCATTTGCATGAAGG
TGTCGTCGAGGGGACAGGAGCAGAAGAGGCGGCATTGGGAGAAGGAAGAAGATGATCAGCCATATTCAATGGGGAAGGAAGAAGGGTCTGGTTATTTGGACTTGAATCTG
CCTCCTCCAATGGAAGACATTGAAGAATCCTCCTCCTCATTCTGTTCTTCAGGGCTTGTTTTGGATCTAAGATTGGGTCTTAATTGA
Protein sequence
MVQKSIDSKFSEYGHGNSGKDAPPHEKQLQISAKKTALRDLQNDNMVTASNCTGSSPLLKERGPSSDFIKVSGNKRPSPVCPTSPPHLHSSTSNAANGHLVYVHETVHLK
SQVKELKNHCFPAFAPFPVVSPMNASGKPSVPHHIGKYGINLATAESNVHSAPSTVPSVGIPTGWKNLQWEDRYRQLQLLLNKLDQADQQDYLQVLRSLSSVELSRHAVE
LEKRSIQLSLEEAKELQRVGVLNVLGNPVKHIKAPLTHQDGSETFSEFFAMDDTDSDPLFPFNVFPATAFVAAASASASASIAAGQNSSRRKRTKLIKMSDHLLAPSSSG
GIAKPKYGKKPDPSAPKITRPCSECGKKFWSWKALFGHMRCHPERQWRGINPPPNFRRSTCPMESETTEEDNEIAACLIMLANGPNAIDQTEAVTETDCREIQASEAAAL
GRFTECTGPPSWCRFECSSCRKVFASHQALGGHRASHKNVKGCFAIARTDGCGFADAGDENTGTEQECSICMKVSSRGQEQKRRHWEKEEDDQPYSMGKEEGSGYLDLNL
PPPMEDIEESSSSFCSSGLVLDLRLGLN
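
The CDS and protein sequences above can be cross-checked by translating the nucleotides with the standard genetic code. The snippet below is a minimal sketch of that check, applied only to the first 24 and last 12 nucleotides of the CDS as copied from this record. The `translate` helper and the hand-built codon table are illustrative, and the use of the standard nuclear code is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed applicable here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 24 and last 12 nucleotides of the CDS listed in this record.
cds_start = "ATGGTTCAAAAATCCATAGACTCC"
cds_end = "GGTCTTAATTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MVQKSIDS` and `GLN*`, matching the first and last residues of the protein sequence followed by the stop codon. Passing the full CDS (with its line breaks) to `translate` would extend the same check to the whole sequence.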