CuGenDBv2

Sgr016498 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016498
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: AT-hook motif nuclear-localized protein
Genome location: tig00152936:559663..565831
RNA-Seq Expression: Sgr016498
Synteny: Sgr016498
Gene Ontology terms: GO:0003680 - AT DNA binding (molecular function)
InterPro domains:
IPR000727 - Target SNARE coiled-coil homology domain
IPR005175 - PPC domain
IPR014476 - AT-hook motif nuclear-localized protein 15-29
IPR039899 - BET1, SNARE domain
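
The genome location field follows a `contig:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span computed. The helper name `parse_location` and the `Location` type are hypothetical, and the code assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A contig name with start and end coordinates (assumed 1-based, inclusive)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Inclusive coordinates: both end points count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string (hypothetical helper, not a CuGenDB API)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00152936:559663..565831")
print(loc.contig, loc.span)  # tig00152936 6169
```

Under the inclusive-coordinate assumption the gene spans 6169 bp on the contig, which is much longer than the CDS listed further down and would be consistent with the gene containing introns.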


Homology
GenBank top hits (e value, % identity, alignment)
CAG1861232.1 unnamed protein product, partial [Musa acuminata subsp. malaccensis] (e value: 4.4e-87, % identity: 52.11)
Query:  REHRASRSALFDDLEEGGLRTS--SSVEIKEHDNDRALHTLEDRVSILKRLTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRTC
        R+HR++R+ALFD +EEGG+R S  SS EI EHDND A+  L+DRV+ILKRLTGDIHEEVESH             SR     ++    M+    S  +T 
Subjt:  REHRASRSALFDDLEEGGLRTS--SSVEIKEHDNDRALHTLEDRVSILKRLTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRTC

Query:  RLAVYFVLFFLLLFYLIRTLQHRPHFPAIQDLPKYMVFNEG-GTGTVFWKDFQIVYLQLSASLANPWWVNRVGFPA-----------------IHPTATT
              V   L+    +  L ++    AI   P  ++ +EG G+  +  +  +    +     AN WW  ++G P                    P A  
Subjt:  RLAVYFVLFFLLLFYLIRTLQHRPHFPAIQDLPKYMVFNEG-GTGTVFWKDFQIVYLQLSASLANPWWVNRVGFPA-----------------IHPTATT

Query:  INGGENHEDEED--EPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAA
          GG N ED ++  EP+EG +VV  RRPRGRPPGSKNKPKPP+ V RDSP+ALR+HVME+A G+D+A+SI+QFARRRQRGVCVLS +G+V+NV LRQ +A
Subjt:  INGGENHEDEED--EPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAA

Query:  SAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDD
          AV+ LHGRFEILSLTG FLPG +PPG+TGLT+YLAGGQGQVVGGSVVGSL+AAGPVM+IA+TFANATYERLPL++ D+
Subjt:  SAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDD

KAF8401845.1 hypothetical protein HHK36_012792 [Tetracentron sinense] (e value: 7.7e-100, % identity: 55.84)
Query:  EHRASRSALFDDLEEGGLRTSSSV--EIKEHDNDRALHTLEDRVSILKRLTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRTCR
        E+R +R+ALFD +EEGG+R SSS   EI EHDND A+  L+DRV++LKRL+GDIHEEVE+HN +LD+MGN MD+SRGI+S TMDRFKMV +  S      
Subjt:  EHRASRSALFDDLEEGGLRTSSSV--EIKEHDNDRALHTLEDRVSILKRLTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRTCR

Query:  LAVYFVLFFLLLFY-----LIRTLQHRPHFPAIQD------LPKYMVFNEG---GTGTVFWKDFQIVYLQLSASLANPWWVNRVGFPAIHPTA-------
         A+ F+LF   + +     ++ T++      ++ +          + F  G       V    F  ++ Q  A LAN WW   +G P I P A       
Subjt:  LAVYFVLFFLLLFY-----LIRTLQHRPHFPAIQD------LPKYMVFNEG---GTGTVFWKDFQIVYLQLSASLANPWWVNRVGFPAIHPTA-------

Query:  ----TTIN-------GGENHEDEE---DEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSG
            T+IN       GG + E+E    DE K+G V V NRRPRGRP GSKNKPKPPI V RDSP+ALR+HVME+A GAD+ +S++QFARRRQRGVCVLSG
Subjt:  ----TTIN-------GGENHEDEE---DEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSG

Query:  SGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDD
        SG+VANVTLRQ A+S AV+ LHGRFEILSLTGAFLPG APPGSTGLT+Y+AGG GQV+GG VVG+L+AAGPVM+IAA FANAT+ERLPL++ DD
Subjt:  SGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDD

KAF9842193.1 hypothetical protein H0E87_011232, partial [Populus deltoides] (e value: 2.5e-82, % identity: 56.5)
Query:  LTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRTCRLAVYFVLFFLLLFYLIRTLQHRPHFPAIQDLPKYMVFNEGGTGTVFWKD
        LTGDIHEEVES N LLDRMGN MD SRG                                                        Y V N G         
Subjt:  LTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRTCRLAVYFVLFFLLLFYLIRTLQHRPHFPAIQDLPKYMVFNEGGTGTVFWKD

Query:  FQIVYLQLSASLANPWWVNRVGFPAI-----HPTATTIN------------GGENHEDEE----DEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDS
                 A LANPWW  +VG P +      P+   IN            GG + +D++    DE KEG V VGNRRPRGRPPGSKNKPKPPI V RDS
Subjt:  FQIVYLQLSASLANPWWVNRVGFPAI-----HPTATTIN------------GGENHEDEE----DEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDS

Query:  PHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVV
        P+ALR+HVMEIA GADVA+S++QFARRRQRGVCVLSGSGSVANVTLRQ AA  AV+ LHGRFEILSLTGAFLPG APPGSTGLT+YLAGGQGQVVGGSVV
Subjt:  PHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVV

Query:  GSLLAAGPVMLIAATFANATYERLPLQDNDD
        GSL+AAGPVM+IAATFANATYERLPL+D+++
Subjt:  GSLLAAGPVMLIAATFANATYERLPLQDNDD

KAG6759256.1 hypothetical protein POTOM_035728 [Populus tomentosa] (e value: 3.2e-122, % identity: 66.49)
Query:  REHRASRSALFDD-LEEGGLRTSSSV--EIKEHDNDRALHTLEDRVSILKRLTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRT
        REHR SR+ALFDD LEEGGLR SSS   E  +HDND+A+HTL+DRV  LKRLTGDIHEEVES NHLLDRMGN MD SRGIMS TMDRFKMVFE+KS  RT
Subjt:  REHRASRSALFDD-LEEGGLRTSSSV--EIKEHDNDRALHTLEDRVSILKRLTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRT

Query:  CRLAVYFVLFFLLLFYLIRTLQHRPHFPAIQDLPKYMVFNEGGTGTVFWKDFQIVYLQLSASLANPWWVNRVGFPAI-----HPTATTIN----------
        C LA +F+L FL+L+YLIR L                                 VYL+  A LANPWW  +VG P +      P+   IN          
Subjt:  CRLAVYFVLFFLLLFYLIRTLQHRPHFPAIQDLPKYMVFNEGGTGTVFWKDFQIVYLQLSASLANPWWVNRVGFPAI-----HPTATTIN----------

Query:  --GGENHEDEE----DEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQS
          GG + +D++    DE KEG V VGNRRPRGRPPGSKNKPKPPI V RDSP+ALR+HVMEIA GADVA+S++QFARRRQRGVCVLSGSGSVANVTLRQ 
Subjt:  --GGENHEDEE----DEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQS

Query:  AASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDD
        AA  AV+ LHGRFEILSLTGAFLPG APPGSTGLT+YLAGGQGQVVGGSVVGSL+AAGPVM+IAATFANATYERLPL+D++D
Subjt:  AASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDD

KAG8072950.1 hypothetical protein GUJ93_ZPchr0006g44397 [Zizania palustris] (e value: 1.2e-89, % identity: 49.41)
Query:  HRASRSALFDDLEEGGLRTSSSVEIKEHDNDRALHTLEDRVSILKRLTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRTCRLAV
        +R++R++LFD +EE                          VSILK+L+GDIHEEVE+HNH+LDRMGN MD+SRG +S T+D+FKMVFE KS  R   L  
Subjt:  HRASRSALFDDLEEGGLRTSSSVEIKEHDNDRALHTLEDRVSILKRLTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRTCRLAV

Query:  YFVLFFLLLFYLI-----RTLQHRPHFPAIQDLPKYMVFNEG----------GTGTVFWK---------------------------DFQIVYLQ-LSAS
         F++ FLL++YL      + ++ +     I  L  Y +   G          G   + W+                           D Q++ +  +   
Subjt:  YFVLFFLLLFYLI-----RTLQHRPHFPAIQDLPKYMVFNEG----------GTGTVFWK---------------------------DFQIVYLQ-LSAS

Query:  LANPWWVNRVGFP---------------AIHPTATTING------GENHEDEEDEPKEGG---VVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHV
        LAN WW   VG P               A  P A+   G       EN+E    EP EG    V   NRRPRGRPPGSKNKPKPPI V RDSP+ALR+HV
Subjt:  LANPWWVNRVGFP---------------AIHPTATTING------GENHEDEEDEPKEGG---VVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHV

Query:  MEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGP
        ME+A GADVAD+I+QF+RRRQRGVCVLSG+G+VANV LRQ +A  AV+ LHGRFEILSLTG FLPG APPGSTGLT+YLAGGQGQVVGGSVVGSL+AAGP
Subjt:  MEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGP

Query:  VMLIAATFANATYERLPLQDNDDHS
        VM+IA+TFANATYERLPL++ ++ S
Subjt:  VMLIAATFANATYERLPLQDNDDHS

TrEMBL top hits (e value, % identity, alignment)
A0A4S4DGU4 Uncharacterized protein (e value: 8.6e-81, % identity: 44.83)
Query:  REHRASRSALFDDLEEGGLRTSSSV--EIKEHDNDRALHTLEDRVSILKRLTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRTC
        R++R S++ALFD +EEG +R SSS   EI E  ND A+++L+DRV  LKR                   GN MDASRGIMS T DRFKM       W+  
Subjt:  REHRASRSALFDDLEEGGLRTSSSV--EIKEHDNDRALHTLEDRVSILKRLTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRTC

Query:  RLAVYFVLFFL--------------LLFYLIRTLQ---HRP------HFPAIQDL---------------PKYMVF------------------------
         +     L  +               LFY+  +     H P      H+ A  +L                KY+ +                        
Subjt:  RLAVYFVLFFL--------------LLFYLIRTLQ---HRP------HFPAIQDL---------------PKYMVF------------------------

Query:  ----------------------NEGGTGTVFWKDFQ-----------------IVYLQLSASLANPWWVN-RVGFPAIHPTATT----------------
                               E     V  K+F+                  V+++  ASLAN WW   ++G P I P ATT                
Subjt:  ----------------------NEGGTGTVFWKDFQ-----------------IVYLQLSASLANPWWVN-RVGFPAIHPTATT----------------

Query:  --INGG----ENHEDEEDEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLR
          I+ G    +N  D EDEPKEG V VGNRRPRGRPPGSKNKPKPPI V +DSP+ALR+HVME+A G+DVA+SI+Q+ARRRQRGV VLSGSG+VANVTLR
Subjt:  --INGG----ENHEDEEDEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLR

Query:  QSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDD
        Q AA  AV+ L GRFEILSLTG FLPG APPGSTGLT+YLAGGQGQVVGGSVVGSL+AAGPVM+IAATF+NATYERLPL+D+D+
Subjt:  QSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDD

A0A4U5Q2H6 Putative DNA-binding protein ESCAROLA (e value: 1.5e-122, % identity: 66.49)
Query:  REHRASRSALFDD-LEEGGLRTSSSV--EIKEHDNDRALHTLEDRVSILKRLTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRT
        REHR SR+ALFDD LEEGGLR SSS   E  +HDND+A+HTL+DRV  LKRLTGDIHEEVES NHLLDRMGN MD SRGIMS TMDRFKMVFE+KS  RT
Subjt:  REHRASRSALFDD-LEEGGLRTSSSV--EIKEHDNDRALHTLEDRVSILKRLTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKWRT

Query:  CRLAVYFVLFFLLLFYLIRTLQHRPHFPAIQDLPKYMVFNEGGTGTVFWKDFQIVYLQLSASLANPWWVNRVGFPAI-----HPTATTIN----------
        C LA +F+L FL+L+YLIR L                                 VYL+  A LANPWW  +VG P +      P+   IN          
Subjt:  CRLAVYFVLFFLLLFYLIRTLQHRPHFPAIQDLPKYMVFNEGGTGTVFWKDFQIVYLQLSASLANPWWVNRVGFPAI-----HPTATTIN----------

Query:  --GGENHEDEE----DEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQS
          GG + +D++    DE KEG V VGNRRPRGRPPGSKNKPKPPI V RDSP+ALR+HVMEIA GADVA+S++QFARRRQRGVCVLSGSGSVANVTLRQ 
Subjt:  --GGENHEDEE----DEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQS

Query:  AASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDD
        AA  AV+ LHGRFEILSLTGAFLPG APPGSTGLT+YLAGGQGQVVGGSVVGSL+AAGPVM+IAATFANATYERLPL+D++D
Subjt:  AASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDD

A0A6A6KAK2 PPC domain-containing protein (e value: 2.6e-77, % identity: 70.97)
Query:  SLANPWWVNRVGFPAIHPTAT-----------TIN------GGENHEDEEDEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARG
        +LANPWW  +VG P + P++            +IN      G E   D  DEPKEG V +G RRPRGRPPGSKNKPKPPI V RDSP+ALR+HV+E+  G
Subjt:  SLANPWWVNRVGFPAIHPTAT-----------TIN------GGENHEDEEDEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARG

Query:  ADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAA
        ADVA+S++QFARRRQRGVCVLSGSGSVANVTLRQ AA  AV+ LHGRFEILSLTGAFLPG APPGSTGLT+YLAGGQGQVVGGSVVGSL+AAGPVM+IAA
Subjt:  ADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAA

Query:  TFANATYERLPLQDNDD
        TFANATYERLPL+D+++
Subjt:  TFANATYERLPLQDNDD

A0A6J1H4F5 AT-hook motif nuclear-localized protein 20-like (e value: 2.0e-77, % identity: 69.83)
Query:  LQLSASLA--NPWWVNRVGFPAIHPTATTING-GENHEDEEDEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQ
        L+ S +LA  NPWW +RVGFPA  P +TT+ G G  H+++EDE KEG V+VGNRR RGRPPG+KNKPKPPIIV RDSPHALRTHV+EIA GADVADSI+Q
Subjt:  LQLSASLA--NPWWVNRVGFPAIHPTATTING-GENHEDEEDEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQ

Query:  FARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYER
        F+ RRQRGVCVLSG+G+VA+VTLRQ   SA VIQLHG F+ILSL+G+FLPG A P ST LT+YLAGGQGQVVGG+VVG LLAAGPV+LIAATFANA YER
Subjt:  FARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYER

Query:  LPLQDNDDHSQKELAL-----GKKMEKVLRPP
        LPLQD+ D+ Q E++      G+  E  L PP
Subjt:  LPLQDNDDHSQKELAL-----GKKMEKVLRPP

A0A6N2MBW2 PPC domain-containing protein (e value: 4.0e-78, % identity: 70.18)
Query:  LSASLANPWWVNRVGFPAIHPTAT-----------TIN--------GGENHEDEE------DEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHA
        ++A LANPWW  +VG P + P++            +IN         GE+ EDE+      DEPKEG V  GNRRPRGRPPGSKNKPKPPI V RDSP+A
Subjt:  LSASLANPWWVNRVGFPAIHPTAT-----------TIN--------GGENHEDEE------DEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHA

Query:  LRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSL
        LR+HVMEIA GADVA+S++QFAR+RQRGVCVLSGSGSVANVTLRQ AA  AV+ LHGRFEILSLTGAFLPG APPGSTGLT+YLAGGQGQVVGGSVVGSL
Subjt:  LRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSL

Query:  LAAGPVMLIAATFANATYERLPLQDNDD
        +AAGPVM+IAATFANATYERLPL+D ++
Subjt:  LAAGPVMLIAATFANATYERLPLQDNDD

SwissProt top hits (e value, % identity, alignment)
O22130 AT-hook motif nuclear-localized protein 22 (e value: 1.9e-56, % identity: 63.95)
Query:  RRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAA----SAAVIQLHGRFEILSLTGAFL
        RRPRGRP GSKNKPKPPII+ RDS +AL++HVME+A G DV +S++ FARRRQRG+CVLSG+G+V NVT+RQ A+     ++V+ LHGRFEILSL+G+FL
Subjt:  RRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAA----SAAVIQLHGRFEILSLTGAFL

Query:  PGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDDHSQKELALGKKME
        P  APP ++GLTIYLAGGQGQVVGGSVVG L+A+GPV+++AA+F NA YERLPL+++D   Q   A+   ++
Subjt:  PGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDDHSQKELALGKKME

O23620 AT-hook motif nuclear-localized protein 23 (e value: 5.4e-56, % identity: 65.45)
Query:  GGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTG
        GG VVG RRPRGRPPGSKNKPKPP+I+ R+S + LR H++E+  G DV D ++ +ARRRQRG+CVLSGSG+V NV++RQ +A+ AV+ L G FEILSL+G
Subjt:  GGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTG

Query:  AFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDDHSQ
        +FLP  APPG+T LTI+LAGGQGQVVGGSVVG L AAGPV++IAA+F N  YERLPL++++   Q
Subjt:  AFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDDHSQ

O49662 AT-hook motif nuclear-localized protein 24 (e value: 1.3e-54, % identity: 66.46)
Query:  GGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSA---ASAAVIQLHGRFEILS
        GG     RRPRGRP GSKNKPKPPII+ RDS +ALRTHVMEI  G D+ +S++ FARRRQRGVCV+SG+G+V NVT+RQ     +  +V+ LHGRFEILS
Subjt:  GGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSA---ASAAVIQLHGRFEILS

Query:  LTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDND
        L+G+FLP  APP +TGL++YLAGGQGQVVGGSVVG LL AGPV+++AA+F+NA YERLPL++++
Subjt:  LTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDND

Q8GWQ2 AT-hook motif nuclear-localized protein 20 (e value: 3.0e-70, % identity: 62.9)
Query:  LANPWWVNRVGFPAIHPTATTINGGENH----------------------EDEEDEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVME
        +ANPWW N+ G   +   + +    +NH                      +DEED+P+EG V V NRRPRGRPPGSKNKPK PI V RDSP+ALR+HV+E
Subjt:  LANPWWVNRVGFPAIHPTATTINGGENH----------------------EDEEDEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVME

Query:  IARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVM
        I+ G+DVAD+I+ F+RRRQRGVCVLSG+GSVANVTLRQ+AA   V+ L GRFEILSLTGAFLPG +PPGSTGLT+YLAG QGQVVGGSVVG LLA G VM
Subjt:  IARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVM

Query:  LIAATFANATYERLPLQDNDD
        +IAATF+NATYERLP+++ +D
Subjt:  LIAATFANATYERLPLQDNDD

Q9SR17 AT-hook motif nuclear-localized protein 19 (e value: 4.1e-64, % identity: 56.85)
Query:  LANPWWVNRVGFPAIHPT-----------------ATTINGGENH-----------EDEED-------EPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVA
        +ANPWW  +V    +  T                     +G  NH           +D+ D       EP+EG V    RRPRGRP GSKNKPKPPI V 
Subjt:  LANPWWVNRVGFPAIHPT-----------------ATTINGGENH-----------EDEED-------EPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVA

Query:  RDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAAS--------AAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAG
        RDSP+AL++HVMEIA G DV ++++ FARRRQRG+C+LSG+G+VANVTLRQ + +        AAV+ L GRFEILSLTG+FLPG APPGSTGLTIYLAG
Subjt:  RDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAAS--------AAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAG

Query:  GQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDND
        GQGQVVGGSVVG L+AAGPVMLIAATF+NATYERLPL++ +
Subjt:  GQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDND

Arabidopsis top hits (e value, % identity, alignment)
AT2G45430.1 AT-hook motif nuclear-localized protein 22 (e value: 1.3e-57, % identity: 63.95)
Query:  RRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAA----SAAVIQLHGRFEILSLTGAFL
        RRPRGRP GSKNKPKPPII+ RDS +AL++HVME+A G DV +S++ FARRRQRG+CVLSG+G+V NVT+RQ A+     ++V+ LHGRFEILSL+G+FL
Subjt:  RRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAA----SAAVIQLHGRFEILSLTGAFL

Query:  PGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDDHSQKELALGKKME
        P  APP ++GLTIYLAGGQGQVVGGSVVG L+A+GPV+++AA+F NA YERLPL+++D   Q   A+   ++
Subjt:  PGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDDHSQKELALGKKME

AT3G04570.1 AT-hook motif nuclear-localized protein 19 (e value: 2.9e-65, % identity: 56.85)
Query:  LANPWWVNRVGFPAIHPT-----------------ATTINGGENH-----------EDEED-------EPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVA
        +ANPWW  +V    +  T                     +G  NH           +D+ D       EP+EG V    RRPRGRP GSKNKPKPPI V 
Subjt:  LANPWWVNRVGFPAIHPT-----------------ATTINGGENH-----------EDEED-------EPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVA

Query:  RDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAAS--------AAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAG
        RDSP+AL++HVMEIA G DV ++++ FARRRQRG+C+LSG+G+VANVTLRQ + +        AAV+ L GRFEILSLTG+FLPG APPGSTGLTIYLAG
Subjt:  RDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAAS--------AAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAG

Query:  GQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDND
        GQGQVVGGSVVG L+AAGPVMLIAATF+NATYERLPL++ +
Subjt:  GQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDND

AT4G14465.1 AT-hook motif nuclear-localized protein 20 (e value: 2.1e-71, % identity: 62.9)
Query:  LANPWWVNRVGFPAIHPTATTINGGENH----------------------EDEEDEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVME
        +ANPWW N+ G   +   + +    +NH                      +DEED+P+EG V V NRRPRGRPPGSKNKPK PI V RDSP+ALR+HV+E
Subjt:  LANPWWVNRVGFPAIHPTATTINGGENH----------------------EDEEDEPKEGGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVME

Query:  IARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVM
        I+ G+DVAD+I+ F+RRRQRGVCVLSG+GSVANVTLRQ+AA   V+ L GRFEILSLTGAFLPG +PPGSTGLT+YLAG QGQVVGGSVVG LLA G VM
Subjt:  IARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVM

Query:  LIAATFANATYERLPLQDNDD
        +IAATF+NATYERLP+++ +D
Subjt:  LIAATFANATYERLPLQDNDD

AT4G17800.1 Predicted AT-hook DNA-binding family protein (e value: 3.8e-57, % identity: 65.45)
Query:  GGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTG
        GG VVG RRPRGRPPGSKNKPKPP+I+ R+S + LR H++E+  G DV D ++ +ARRRQRG+CVLSGSG+V NV++RQ +A+ AV+ L G FEILSL+G
Subjt:  GGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTG

Query:  AFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDDHSQ
        +FLP  APPG+T LTI+LAGGQGQVVGGSVVG L AAGPV++IAA+F N  YERLPL++++   Q
Subjt:  AFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDDHSQ

AT4G22810.1 Predicted AT-hook DNA-binding family protein (e value: 9.5e-56, % identity: 66.46)
Query:  GGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSA---ASAAVIQLHGRFEILS
        GG     RRPRGRP GSKNKPKPPII+ RDS +ALRTHVMEI  G D+ +S++ FARRRQRGVCV+SG+G+V NVT+RQ     +  +V+ LHGRFEILS
Subjt:  GGVVVGNRRPRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSA---ASAAVIQLHGRFEILS

Query:  LTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDND
        L+G+FLP  APP +TGL++YLAGGQGQVVGGSVVG LL AGPV+++AA+F+NA YERLPL++++
Subjt:  LTGAFLPGQAPPGSTGLTIYLAGGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDND
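
The alignments above use the usual BLAST pairwise layout: in the middle line a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch under that convention, identity can be counted for a single alignment block. The function name `block_identity` is hypothetical, and the figure it yields covers one block only, so it will generally differ from the whole-alignment % identity reported for each hit.

```python
from typing import Tuple


def block_identity(query: str, midline: str) -> Tuple[int, int]:
    """Return (identical positions, aligned columns) for one alignment block.

    Hypothetical helper: assumes `query` and `midline` have already had the
    'Query:  ' prefix and matching indentation removed.
    """
    # Trailing spaces in the middle line are often lost in copied text,
    # so pad it back to the width of the query line.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    return identities, len(query)


# Last block of the KAF9842193.1 (Populus deltoides) alignment above.
query = "GSLLAAGPVMLIAATFANATYERLPLQDNDD"
midline = "GSL+AAGPVM+IAATFANATYERLPL+D+++"
identical, columns = block_identity(query, midline)
print(identical, columns)  # 25 31
```

Summing the two counts over every block of a hit and dividing gives a percentage comparable to the reported one, provided gap columns are treated the same way the search tool treats them.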


Sequences
CDS sequence
ATGCATATTGTCTCATTTGAAACTACTGGTATTGTTTTTGTTCATAGGGAGCACCGTGCTTCCAGATCTGCTCTCTTTGATGACCTTGAGGAAGGTGGTCTCAGGACAAG
CTCTTCCGTTGAAATTAAAGAGCATGACAATGACAGAGCCCTTCACACGTTGGAAGATAGAGTTTCTATTCTGAAGAGGTTGACAGGTGACATTCATGAGGAAGTGGAGA
GTCATAATCATTTACTTGACCGAATGGGAAATGGCATGGATGCTTCAAGGGGTATAATGTCAAGAACCATGGATCGATTCAAGATGGTTTTTGAGCAAAAATCAAAGTGG
AGAACTTGTAGACTTGCAGTTTATTTTGTGTTGTTCTTCTTACTTCTTTTCTATCTCATCAGAACATTGCAACACAGACCACACTTTCCTGCAATACAAGATTTGCCAAA
ATACATGGTCTTCAATGAAGGTGGAACTGGAACAGTGTTTTGGAAGGATTTTCAGATTGTTTACCTGCAACTATCTGCTAGTCTGGCTAATCCGTGGTGGGTCAATCGGG
TGGGCTTTCCGGCCATCCACCCAACAGCTACCACCATCAACGGCGGAGAAAATCATGAGGATGAAGAAGACGAGCCCAAGGAAGGTGGTGTCGTGGTCGGAAACCGCCGA
CCCAGAGGACGACCTCCCGGATCTAAGAACAAACCGAAACCTCCGATAATTGTAGCTCGCGACAGCCCCCACGCGCTTCGCACCCACGTGATGGAGATCGCCAGAGGAGC
CGACGTCGCCGACAGCATAAGCCAGTTTGCTCGACGGCGGCAGCGCGGGGTTTGTGTACTCAGCGGGAGTGGCTCCGTGGCCAACGTCACTCTCAGACAGTCCGCCGCTT
CTGCCGCTGTAATTCAACTTCATGGGAGGTTCGAGATTCTATCTCTGACCGGAGCTTTTCTTCCGGGCCAGGCCCCTCCTGGCTCAACCGGCTTGACCATATACCTCGCC
GGCGGTCAGGGACAGGTGGTCGGAGGAAGTGTGGTGGGTTCACTGTTAGCAGCTGGACCAGTAATGTTAATAGCTGCAACTTTTGCTAATGCAACGTATGAGAGATTGCC
CTTGCAGGACAACGATGACCATAGTCAGAAAGAGTTAGCGTTGGGGAAGAAGATGGAGAAGGTTCTCCGGCCACCGCAGGCGGAGGACATGTGGAGGCTGCGCTGGGGAT
CCTCCGACATCTTCAATTTACGACATGACACCAAATCATCATATGGTTCAAAACGGGGGACACCTCGAAATTGA
mRNA sequence
ATGCATATTGTCTCATTTGAAACTACTGGTATTGTTTTTGTTCATAGGGAGCACCGTGCTTCCAGATCTGCTCTCTTTGATGACCTTGAGGAAGGTGGTCTCAGGACAAG
CTCTTCCGTTGAAATTAAAGAGCATGACAATGACAGAGCCCTTCACACGTTGGAAGATAGAGTTTCTATTCTGAAGAGGTTGACAGGTGACATTCATGAGGAAGTGGAGA
GTCATAATCATTTACTTGACCGAATGGGAAATGGCATGGATGCTTCAAGGGGTATAATGTCAAGAACCATGGATCGATTCAAGATGGTTTTTGAGCAAAAATCAAAGTGG
AGAACTTGTAGACTTGCAGTTTATTTTGTGTTGTTCTTCTTACTTCTTTTCTATCTCATCAGAACATTGCAACACAGACCACACTTTCCTGCAATACAAGATTTGCCAAA
ATACATGGTCTTCAATGAAGGTGGAACTGGAACAGTGTTTTGGAAGGATTTTCAGATTGTTTACCTGCAACTATCTGCTAGTCTGGCTAATCCGTGGTGGGTCAATCGGG
TGGGCTTTCCGGCCATCCACCCAACAGCTACCACCATCAACGGCGGAGAAAATCATGAGGATGAAGAAGACGAGCCCAAGGAAGGTGGTGTCGTGGTCGGAAACCGCCGA
CCCAGAGGACGACCTCCCGGATCTAAGAACAAACCGAAACCTCCGATAATTGTAGCTCGCGACAGCCCCCACGCGCTTCGCACCCACGTGATGGAGATCGCCAGAGGAGC
CGACGTCGCCGACAGCATAAGCCAGTTTGCTCGACGGCGGCAGCGCGGGGTTTGTGTACTCAGCGGGAGTGGCTCCGTGGCCAACGTCACTCTCAGACAGTCCGCCGCTT
CTGCCGCTGTAATTCAACTTCATGGGAGGTTCGAGATTCTATCTCTGACCGGAGCTTTTCTTCCGGGCCAGGCCCCTCCTGGCTCAACCGGCTTGACCATATACCTCGCC
GGCGGTCAGGGACAGGTGGTCGGAGGAAGTGTGGTGGGTTCACTGTTAGCAGCTGGACCAGTAATGTTAATAGCTGCAACTTTTGCTAATGCAACGTATGAGAGATTGCC
CTTGCAGGACAACGATGACCATAGTCAGAAAGAGTTAGCGTTGGGGAAGAAGATGGAGAAGGTTCTCCGGCCACCGCAGGCGGAGGACATGTGGAGGCTGCGCTGGGGAT
CCTCCGACATCTTCAATTTACGACATGACACCAAATCATCATATGGTTCAAAACGGGGGACACCTCGAAATTGA
Protein sequence
MHIVSFETTGIVFVHREHRASRSALFDDLEEGGLRTSSSVEIKEHDNDRALHTLEDRVSILKRLTGDIHEEVESHNHLLDRMGNGMDASRGIMSRTMDRFKMVFEQKSKW
RTCRLAVYFVLFFLLLFYLIRTLQHRPHFPAIQDLPKYMVFNEGGTGTVFWKDFQIVYLQLSASLANPWWVNRVGFPAIHPTATTINGGENHEDEEDEPKEGGVVVGNRR
PRGRPPGSKNKPKPPIIVARDSPHALRTHVMEIARGADVADSISQFARRRQRGVCVLSGSGSVANVTLRQSAASAAVIQLHGRFEILSLTGAFLPGQAPPGSTGLTIYLA
GGQGQVVGGSVVGSLLAAGPVMLIAATFANATYERLPLQDNDDHSQKELALGKKMEKVLRPPQAEDMWRLRWGSSDIFNLRHDTKSSYGSKRGTPRN
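
The protein sequence corresponds to the CDS read codon by codon. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1). The page does not say which table was used, and `translate` is a hypothetical helper rather than part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks inside the sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons and last six codons of the CDS listed above.
print(translate("ATGCATATTGTCTCATTTGAAACTACTGGT"))  # MHIVSFETTG
print(translate("GGGACACCTCGAAATTGA"))              # GTPRN
```

Both fragments reproduce the start (`MHIVSFETTG`) and end (`GTPRN`) of the protein sequence shown above. The full CDS is 1284 nt, that is 427 codons plus a stop codon, which matches the 427-residue protein.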