; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10006865 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10006865
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptiontrihelix transcription factor ASR3-like
Genome locationChr07:22718243..22724880
RNA-Seq ExpressionHG10006865
SyntenyHG10006865
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140413.2 trihelix transcription factor PTL isoform X3 [Cucumis sativus]1.6e-18488.5Show/hide
Query:  MSDPPTTSSEPPQHHHHHQQLLHLPVIHGGA---TRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWK
        MSDPPTTSSEPP HHHH Q L  LPVIH GA   TRMNTAAATSSS+VIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPS  DPAARKGGELRWK
Subjt:  MSDPPTTSSEPPQHHHHHQQLLHLPVIHGGA---TRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWK

Query:  WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLLLLPP
        WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQ   QIPSYWKMEKHERKDKNLPSNMAFEVYQAL DVVQRKFSQ+PSNS+ TG+LLLP 
Subjt:  WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLLLLPP

Query:  PAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRNHIHR
        PA  PPPSA+ PPP  TATNSP LSESSSSGTESSEKKEK+EAKRRKM DNIGR IERS+SAL QTLH+CEEQREI+HQQLMELRKRRLQIEETRNHIHR
Subjt:  PAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRNHIHR

Query:  QGIADLVAAVANLSAGI-NNRTSRSEGY-GCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPPTSSMDPSGSGGDA
        QGIADLVAAVANLSAGI N+R  RSEGY  CLY+GEE+RILKEQNEAMQAELMNVK+ELSQLRDQMPSLMQTMMHNM+HNIPPPPP TSSMDPSGSGGDA
Subjt:  QGIADLVAAVANLSAGI-NNRTSRSEGY-GCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPPTSSMDPSGSGGDA

XP_008456886.1 PREDICTED: trihelix transcription factor PTL-like [Cucumis melo]1.1e-18288.03Show/hide
Query:  MSDPPTTSSEPPQHHHHHQQLLHLPVIHG---GATRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWK
        MSDPPTTSSEPP H    Q L  LPVIHG   GATRMNTAAATSSS+VIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPS  DPAARKGGELRWK
Subjt:  MSDPPTTSSEPPQHHHHHQQLLHLPVIHG---GATRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWK

Query:  WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLLLLPP
        WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQ   QIPSYWKMEKHERKDKNLPSNMAFEVYQAL DVVQRKFSQ+PSNS+ TG+LLLP 
Subjt:  WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLLLLPP

Query:  PAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRNHIHR
        PA  PPPS + PPP  TATNSP LSESSSSGTESSEKKEKMEAKRRKM DNIGR IERS+SAL QTLH+CEEQREI+HQQLMELRKRRLQIEETRNHIHR
Subjt:  PAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRNHIHR

Query:  QGIADLVAAVANLSAGI-NNRTSRSEGY-GCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPP-TSSMDPSGSGGD
        QGIADLVAAVANLSAGI NNR  RSEGY  CLY+GEE+RILKEQNEAMQAELMNVK+ELSQLRDQMPSLMQTMMH+MIHNIPPPPPP TSSMDPSGSG D
Subjt:  QGIADLVAAVANLSAGI-NNRTSRSEGY-GCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPP-TSSMDPSGSGGD

Query:  A
        A
Subjt:  A

XP_031742150.1 trihelix transcription factor PTL isoform X1 [Cucumis sativus]2.0e-17988.24Show/hide
Query:  MSDPPTTSSEPPQHHHHHQQLLHLPVIHGGA---TRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWK
        MSDPPTTSSEPP HHHH Q L  LPVIH GA   TRMNTAAATSSS+VIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPS  DPAARKGGELRWK
Subjt:  MSDPPTTSSEPPQHHHHHQQLLHLPVIHGGA---TRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWK

Query:  WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLLLLPP
        WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQ   QIPSYWKMEKHERKDKNLPSNMAFEVYQAL DVVQRKFSQ+PSNS+ TG+LLLP 
Subjt:  WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLLLLPP

Query:  PAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRNHIHR
        PA  PPPSA+ PPP  TATNSP LSESSSSGTESSEKKEK+EAKRRKM DNIGR IERS+SAL QTLH+CEEQREI+HQQLMELRKRRLQIEETRNHIHR
Subjt:  PAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRNHIHR

Query:  QGIADLVAAVANLSAGI-NNRTSRSEGY-GCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPPTSSM
        QGIADLVAAVANLSAGI N+R  RSEGY  CLY+GEE+RILKEQNEAMQAELMNVK+ELSQLRDQMPSLMQTMMHNM+HNIPPPPP TSSM
Subjt:  QGIADLVAAVANLSAGI-NNRTSRSEGY-GCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPPTSSM

XP_031742151.1 trihelix transcription factor PTL isoform X2 [Cucumis sativus]1.4e-17586.96Show/hide
Query:  MSDPPTTSSEPPQHHHHHQQLLHLPVIHGGA---TRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWK
        MSDPPTTSSEPP HHHH Q L  LPVIH GA   TRMNTAAATSSS+VIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPS  DPAARKGGELRWK
Subjt:  MSDPPTTSSEPPQHHHHHQQLLHLPVIHGGA---TRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWK

Query:  WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLLLLPP
        WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQ   QIPSYWKMEKHERKDKNLPSNMAFEVYQAL DVVQRKFSQ+PSNS+ TG+LLLP 
Subjt:  WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLLLLPP

Query:  PAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRNHIHR
        PA  PPPSA+ PPP  TATNSP L     SGTESSEKKEK+EAKRRKM DNIGR IERS+SAL QTLH+CEEQREI+HQQLMELRKRRLQIEETRNHIHR
Subjt:  PAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRNHIHR

Query:  QGIADLVAAVANLSAGI-NNRTSRSEGY-GCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPPTSSM
        QGIADLVAAVANLSAGI N+R  RSEGY  CLY+GEE+RILKEQNEAMQAELMNVK+ELSQLRDQMPSLMQTMMHNM+HNIPPPPP TSSM
Subjt:  QGIADLVAAVANLSAGI-NNRTSRSEGY-GCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPPTSSM

XP_038893036.1 uncharacterized protein LOC120081925 [Benincasa hispida]1.6e-19289.85Show/hide
Query:  MSDPPTTSSEPPQH-HHHHQQLLHLPVIHG------GATRMNT--AAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKG
        MSDPP+TSSEPPQH HHHHQQ+LHLPVIHG      GATRMNT  AAA SSS+VIVREYRKGNWTLQETMILITAKKLDDERRNKANLGP+PADPAARKG
Subjt:  MSDPPTTSSEPPQH-HHHHQQLLHLPVIHG------GATRMNT--AAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKG

Query:  GELRWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTG
        GELRWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQ  QIPSYWKMEKHERKDKNLPSN+AFEVYQAL DVVQRKFSQ+PSNS   G
Subjt:  GELRWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTG

Query:  LLLLPPPAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEET
        LLLLPPPA PPPPSA+PPPPPT+AT SP LS+SSSSGTESSEKKEK+EAKRRKMGDNIGRSIERSISAL QTLH+CEEQREIQHQQLMELRKRRLQIEET
Subjt:  LLLLPPPAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEET

Query:  RNHIHRQGIADLVAAVANLSAGINNRTSRSEGYGCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPPTSSMDPSGS
        RNHIHRQGIADLVAAVANLSAG+NNRT+R E YGCLY+GEE+RILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNI PPPP  +SMDPSGS
Subjt:  RNHIHRQGIADLVAAVANLSAGINNRTSRSEGYGCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPPTSSMDPSGS

Query:  GGDA
        GGDA
Subjt:  GGDA

TrEMBL top hitse value%identityAlignment
A0A1S3C482 trihelix transcription factor PTL-like5.5e-18388.03Show/hide
Query:  MSDPPTTSSEPPQHHHHHQQLLHLPVIHG---GATRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWK
        MSDPPTTSSEPP H    Q L  LPVIHG   GATRMNTAAATSSS+VIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPS  DPAARKGGELRWK
Subjt:  MSDPPTTSSEPPQHHHHHQQLLHLPVIHG---GATRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWK

Query:  WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLLLLPP
        WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQ   QIPSYWKMEKHERKDKNLPSNMAFEVYQAL DVVQRKFSQ+PSNS+ TG+LLLP 
Subjt:  WVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLLLLPP

Query:  PAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRNHIHR
        PA  PPPS + PPP  TATNSP LSESSSSGTESSEKKEKMEAKRRKM DNIGR IERS+SAL QTLH+CEEQREI+HQQLMELRKRRLQIEETRNHIHR
Subjt:  PAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRNHIHR

Query:  QGIADLVAAVANLSAGI-NNRTSRSEGY-GCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPP-TSSMDPSGSGGD
        QGIADLVAAVANLSAGI NNR  RSEGY  CLY+GEE+RILKEQNEAMQAELMNVK+ELSQLRDQMPSLMQTMMH+MIHNIPPPPPP TSSMDPSGSG D
Subjt:  QGIADLVAAVANLSAGI-NNRTSRSEGY-GCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPP-TSSMDPSGSGGD

Query:  A
        A
Subjt:  A

A0A6J1DSG1 trihelix transcription factor ASR3-like4.1e-15476.6Show/hide
Query:  MSDPPTTSSEPP---QH-HHHHQQLLHLPVIHGGATRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRW
        MSDPPTTSSEPP   QH H HHQQLLHLP+IHGGA    T + T S +   REYRKGNWTLQETMILI AKKLDDERR+KANL  +P DPAARKGGELRW
Subjt:  MSDPPTTSSEPP---QH-HHHHQQLLHLPVIHGGATRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRW

Query:  KWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRAC---DQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLL
        KWVENYCWS GC RSQNQCNDKWDNLLRDYKKVREY+SRAC    +QPS  PSYWKMEKHERKD NLPSNM FEVYQAL DVVQRK+S    +  T   +
Subjt:  KWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRAC---DQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLL

Query:  LLPPPAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRN
        L       P PS+ PPPP    T SP  SE SSSGTESSEK+E ME KRRKMGD IG SIERS SALAQ L +CEEQREI+HQQLMEL+KRRL IEETRN
Subjt:  LLPPPAPPPPPSAVPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRN

Query:  HIHRQGIADLVAAVANLSAGINNRTSR-SEGYG---CLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPPTSSMDPS
        H+HRQGIADLVAAVANLS G NNR+SR SEGYG   CLY+GEE+R+LKEQNEAMQAELM VKSELSQLRDQMPSLMQTMMHNMIHNIPPPP P SSMDP+
Subjt:  HIHRQGIADLVAAVANLSAGINNRTSR-SEGYG---CLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPPTSSMDPS

Query:  GSGGDA
        GSGGDA
Subjt:  GSGGDA

A0A6J1GZY0 trihelix transcription factor ASR3-like isoform X29.2e-16278.97Show/hide
Query:  MSDPPTTSSEPP-----QHHHHHQQLLHLPVIHGGATRMNT---AAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGG
        +S   TT   PP      HHH  QQLLHLP+IHGGA R+NT   AAATSSS+VIVREYRKGNWTLQETMILITAKKLDDERRNK  L P P DP ARKGG
Subjt:  MSDPPTTSSEPP-----QHHHHHQQLLHLPVIHGGATRMNT---AAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGG

Query:  ELRWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRP--SNSTTT
        ELRWKWVENYCWSHGC RSQNQCNDKWDNLLRDYKKVREYESRACDQQ SQIPSYWKMEKHERKD NLPSNMAFEVYQAL DVVQRKFSQRP  SN+T T
Subjt:  ELRWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRP--SNSTTT

Query:  GLLLLPPPAPPPPPS--AVPPPPPTTATN-SPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQ
         L+ L PPAPPP  +  A+PPPPPTTATN SP +SESSSSGTESSEKKEK EAKRRKM DN    IERS + LAQTL  CEEQREI+HQ++ME++KR LQ
Subjt:  GLLLLPPPAPPPPPS--AVPPPPPTTATN-SPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQ

Query:  IEETRNHIHRQGIADLVAAVANLSAGINNR-TSRSEGYGCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPPTSSM
        IEE RNHIHRQGI+D+VAA+ANLSA I++R   RSEGY C YNGEE+R+LK+QNEAMQAE+MNVK+ELSQLRDQMPSLMQTMMHNM+HNIPPPPPP  SM
Subjt:  IEETRNHIHRQGIADLVAAVANLSAGINNR-TSRSEGYGCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPPTSSM

Query:  DPSGSGGDA
        DPSGSGGDA
Subjt:  DPSGSGGDA

A0A6J1H1B6 trihelix transcription factor ASR3-like isoform X13.0e-16076.79Show/hide
Query:  MSDPPTTSSEPP-----QHHHHHQQLLHLPVIHGGATRMNT---AAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGG
        +S   TT   PP      HHH  QQLLHLP+IHGGA R+NT   AAATSSS+VIVREYRKGNWTLQETMILITAKKLDDERRNK  L P P DP ARKGG
Subjt:  MSDPPTTSSEPP-----QHHHHHQQLLHLPVIHGGATRMNT---AAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGG

Query:  ELRWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRP--------
        ELRWKWVENYCWSHGC RSQNQCNDKWDNLLRDYKKVREYESRACDQQ SQIPSYWKMEKHERKD NLPSNMAFEVYQAL DVVQRKFSQRP        
Subjt:  ELRWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRP--------

Query:  ---SNSTTTGLLLLPPPAPPPPPS--AVPPPPPTTATN-SPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQL
           + +TTT  L+  PPAPPP  +  A+PPPPPTTATN SP +SESSSSGTESSEKKEK EAKRRKM DN    IERS + LAQTL  CEEQREI+HQ++
Subjt:  ---SNSTTTGLLLLPPPAPPPPPS--AVPPPPPTTATN-SPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQL

Query:  MELRKRRLQIEETRNHIHRQGIADLVAAVANLSAGINNR-TSRSEGYGCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIP
        ME++KR LQIEE RNHIHRQGI+D+VAA+ANLSA I++R   RSEGY C YNGEE+R+LK+QNEAMQAE+MNVK+ELSQLRDQMPSLMQTMMHNM+HNIP
Subjt:  MELRKRRLQIEETRNHIHRQGIADLVAAVANLSAGINNR-TSRSEGYGCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIP

Query:  PPPPPTSSMDPSGSGGDA
        PPPPP  SMDPSGSGGDA
Subjt:  PPPPPTSSMDPSGSGGDA

A0A6J1KBG1 trihelix transcription factor ASR3-like1.9e-16779.24Show/hide
Query:  MSDPPTTSSEPPQ---------HHHHH---------QQLLHLPVIHGGATRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPS
        MSDPPTTSSEPP          HHHHH         QQLLHLP+IHGGA R+NTAAATSSS+VIVREYRKGNWTLQETMILITAKKLDDERRNK  L P 
Subjt:  MSDPPTTSSEPPQ---------HHHHH---------QQLLHLPVIHGGATRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPS

Query:  PADPAARKGGELRWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQ
        PADP ARKGGELRWKWVENYCWSHGC RSQNQCNDKWDNLLRDYKKVREYESRACDQQ SQIPSYWKMEKHERKD NLPSNMAFEVYQAL DVVQRKFSQ
Subjt:  PADPAARKGGELRWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQ

Query:  RP--SNSTTTGLLLLPPPAPPPPPS--AVPPPPPTTATN-SPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQ
        RP  S++T T L+ L PPAPPP  +  A+PPPPPTTATN SP +SESSSSGTESSEKKEK EAKRRKM DN    IERS + LAQTL +CEEQREI+HQ+
Subjt:  RP--SNSTTTGLLLLPPPAPPPPPS--AVPPPPPTTATN-SPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQ

Query:  LMELRKRRLQIEETRNHIHRQGIADLVAAVANLSAGINNR-TSRSEGYGCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNI
        +ME++KR LQIEE RNHIHRQGI+D+VAA+ANLSA I++R   RSEGY C YNGEE+R+LK+QNEAMQAE+MNVK+ELSQLRDQMPSLMQTMMHNM+HNI
Subjt:  LMELRKRRLQIEETRNHIHRQGIADLVAAVANLSAGINNR-TSRSEGYGCLYNGEEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNI

Query:  PPPPPPTSSMDPSGSGGDA
        PPPPPP  SMDPSGSGGDA
Subjt:  PPPPPPTSSMDPSGSGGDA

SwissProt top hitse value%identityAlignment
Q8VZ20 Trihelix transcription factor ASR31.5e-1232.67Show/hide
Query:  LHLPVIHGGATRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWKW--VENYCWSHGCQRSQNQCNDKW
        L +  + GG    ++A +       V+  R   WT QE ++LI  K++ + R  +       A   A   G++  KW  V +YC  HG  R   QC  +W
Subjt:  LHLPVIHGGATRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWKW--VENYCWSHGCQRSQNQCNDKW

Query:  DNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVY
         NL  DYKK++E+ES    Q   +  SYW M    R++K LP     EVY
Subjt:  DNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVY

Arabidopsis top hitse value%identityAlignment
AT1G31310.1 hydroxyproline-rich glycoprotein family protein6.9e-5338.07Show/hide
Query:  AATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSP----ADPAARKGGELRWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYE
        A  S   V++REYRKGNWTL ETM+LI AK++DDERR + ++G  P     D  + K  ELRWKW+E+YCW  GC RSQNQCNDKWDNL+RDYKKVREYE
Subjt:  AATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSP----ADPAARKGGELRWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYE

Query:  SR-------ACDQQPSQIP-----SYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTG------------------------------
         R       A +   S  P     SYWKMEK ERK+++LPSNM  + YQAL +VV+ K    PS++  T                               
Subjt:  SR-------ACDQQPSQIP-----SYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTG------------------------------

Query:  -----------------LLLLPPPAPPPP------PSAVPPPPPTTATNSPLL-SESSSSGTESSEKKEKMEAKRRKM---------------GDNIGRS
                         LL L PP PPPP      P  +PPPPP +    P+L ++ SS+ +++SE  +   AKRR+                 + +GRS
Subjt:  -----------------LLLLPPPAPPPP------PSAVPPPPPTTATNSPLL-SESSSSGTESSEKKEKMEAKRRKM---------------GDNIGRS

Query:  -----------IERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRNHIHRQGIADLVAAVANLSAGI
                   + RS+S +A  +   EE+++ +H+++M +++RRL+IEE+   ++R+G+  LV A+  L++ I
Subjt:  -----------IERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRNHIHRQGIADLVAAVANLSAGI

AT2G33550.1 Homeodomain-like superfamily protein1.1e-1332.67Show/hide
Query:  LHLPVIHGGATRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWKW--VENYCWSHGCQRSQNQCNDKW
        L +  + GG    ++A +       V+  R   WT QE ++LI  K++ + R  +       A   A   G++  KW  V +YC  HG  R   QC  +W
Subjt:  LHLPVIHGGATRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWKW--VENYCWSHGCQRSQNQCNDKW

Query:  DNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVY
         NL  DYKK++E+ES    Q   +  SYW M    R++K LP     EVY
Subjt:  DNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVY

AT2G35640.1 Homeodomain-like superfamily protein2.1e-4938.25Show/hide
Query:  MNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAAR-KGGELRWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREY
        M  A  +S   +++RE RKGNWT+ ET++LI AKK+DD+RR +     S   P  R K  ELRWKW+E YCW  GC R+QNQCNDKWDNL+RDYKK+REY
Subjt:  MNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAAR-KGGELRWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREY

Query:  ESRACDQQPSQI--PSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNST----------------------------------TTGLLLLP
        E    +   + +   SYWKM+K ERK+KNLPSNM  ++Y  L ++V RK     S++                                   TT +L LP
Subjt:  ESRACDQQPSQI--PSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNST----------------------------------TTGLLLLP

Query:  PPAP------------PPPPSA--VPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELR
        PP P            PPP S+    P PPT  T+S     ++   T +  ++E  E       D +G ++ R  S + Q +   EE +E +H++++ L+
Subjt:  PPAP------------PPPPSA--VPPPPPTTATNSPLLSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELR

Query:  KRRLQIEETRNHIHRQGIADLVAAVANLSAGI
        +RRL+IEE++  I+RQG+  LV A+  L++ I
Subjt:  KRRLQIEETRNHIHRQGIADLVAAVANLSAGI

AT4G31270.1 sequence-specific DNA binding transcription factors6.8e-0832.93Show/hide
Query:  RWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVV
        +W  +   C +    R+ NQC  KWD+L+ DY +++++ES    Q      SYW +   +RK  NLP ++  E+++A+  VV
Subjt:  RWKWVENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVV

AT5G51800.1 Protein kinase superfamily protein8.3e-0632.14Show/hide
Query:  VENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLLLLPPP
        V  Y   HG  R       KWDN+L +++KV E+E   C  Q     SY+++  +ERK   LP++   EVYQ L   +  +      N    G   +   
Subjt:  VENYCWSHGCQRSQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLLLLPPP

Query:  APPPPPSAVPPP
        + PP   A+PPP
Subjt:  APPPPPSAVPPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGATCCTCCGACAACATCATCGGAGCCACCGCAGCACCACCACCACCACCAACAACTACTACATTTACCCGTAATCCACGGCGGCGCTACCCGAATGAATACTGC
AGCAGCAACCTCATCTTCGAGCGTTATAGTCCGAGAGTATCGGAAAGGAAACTGGACACTCCAAGAGACGATGATTTTAATAACGGCGAAGAAGTTGGACGACGAGCGGC
GGAACAAGGCGAACTTAGGCCCTAGTCCGGCGGATCCGGCGGCGAGGAAGGGCGGCGAGCTGCGGTGGAAGTGGGTGGAAAATTACTGCTGGAGCCATGGTTGTCAACGG
AGCCAAAATCAGTGCAATGACAAGTGGGATAACCTTCTCCGCGACTACAAAAAAGTCCGCGAGTATGAATCCCGCGCGTGTGATCAACAACCTTCTCAAATTCCTTCTTA
CTGGAAAATGGAGAAACATGAGCGAAAAGACAAGAATCTCCCTTCTAATATGGCCTTTGAGGTTTATCAAGCCTTAAAGGACGTCGTTCAGAGGAAGTTCTCTCAAAGAC
CTTCTAATTCTACTACTACCGGCTTGCTTCTACTTCCTCCTCCTGCTCCGCCTCCTCCTCCTTCCGCCGTCCCCCCGCCGCCGCCCACTACCGCCACCAATTCTCCGCTG
CTTTCCGAGTCATCGTCTTCGGGAACAGAGTCAAGCGAGAAGAAAGAAAAAATGGAGGCAAAGAGAAGGAAAATGGGAGATAATATTGGAAGAAGCATTGAGAGAAGCAT
TTCAGCGTTGGCTCAAACGCTGCACAATTGCGAGGAGCAAAGAGAAATTCAACACCAACAACTTATGGAACTTCGAAAACGCCGCCTTCAAATTGAAGAAACCCGCAACC
ACATTCACCGCCAAGGCATCGCCGACCTCGTTGCCGCCGTCGCCAACCTCTCCGCCGGGATAAATAATAGAACAAGCAGATCAGAAGGATATGGATGTTTATACAATGGA
GAAGAGATGAGAATATTGAAAGAGCAAAATGAAGCAATGCAAGCTGAGCTTATGAATGTGAAGAGTGAGCTTTCCCAACTTAGAGACCAAATGCCTTCTCTCATGCAAAC
TATGATGCACAATATGATCCATAATATCCCACCACCTCCTCCTCCTACTTCTTCCATGGACCCATCTGGATCAGGTGGAGATGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGGATCCTCCGACAACATCATCGGAGCCACCGCAGCACCACCACCACCACCAACAACTACTACATTTACCCGTAATCCACGGCGGCGCTACCCGAATGAATACTGC
AGCAGCAACCTCATCTTCGAGCGTTATAGTCCGAGAGTATCGGAAAGGAAACTGGACACTCCAAGAGACGATGATTTTAATAACGGCGAAGAAGTTGGACGACGAGCGGC
GGAACAAGGCGAACTTAGGCCCTAGTCCGGCGGATCCGGCGGCGAGGAAGGGCGGCGAGCTGCGGTGGAAGTGGGTGGAAAATTACTGCTGGAGCCATGGTTGTCAACGG
AGCCAAAATCAGTGCAATGACAAGTGGGATAACCTTCTCCGCGACTACAAAAAAGTCCGCGAGTATGAATCCCGCGCGTGTGATCAACAACCTTCTCAAATTCCTTCTTA
CTGGAAAATGGAGAAACATGAGCGAAAAGACAAGAATCTCCCTTCTAATATGGCCTTTGAGGTTTATCAAGCCTTAAAGGACGTCGTTCAGAGGAAGTTCTCTCAAAGAC
CTTCTAATTCTACTACTACCGGCTTGCTTCTACTTCCTCCTCCTGCTCCGCCTCCTCCTCCTTCCGCCGTCCCCCCGCCGCCGCCCACTACCGCCACCAATTCTCCGCTG
CTTTCCGAGTCATCGTCTTCGGGAACAGAGTCAAGCGAGAAGAAAGAAAAAATGGAGGCAAAGAGAAGGAAAATGGGAGATAATATTGGAAGAAGCATTGAGAGAAGCAT
TTCAGCGTTGGCTCAAACGCTGCACAATTGCGAGGAGCAAAGAGAAATTCAACACCAACAACTTATGGAACTTCGAAAACGCCGCCTTCAAATTGAAGAAACCCGCAACC
ACATTCACCGCCAAGGCATCGCCGACCTCGTTGCCGCCGTCGCCAACCTCTCCGCCGGGATAAATAATAGAACAAGCAGATCAGAAGGATATGGATGTTTATACAATGGA
GAAGAGATGAGAATATTGAAAGAGCAAAATGAAGCAATGCAAGCTGAGCTTATGAATGTGAAGAGTGAGCTTTCCCAACTTAGAGACCAAATGCCTTCTCTCATGCAAAC
TATGATGCACAATATGATCCATAATATCCCACCACCTCCTCCTCCTACTTCTTCCATGGACCCATCTGGATCAGGTGGAGATGCTTAA
Protein sequenceShow/hide protein sequence
MSDPPTTSSEPPQHHHHHQQLLHLPVIHGGATRMNTAAATSSSSVIVREYRKGNWTLQETMILITAKKLDDERRNKANLGPSPADPAARKGGELRWKWVENYCWSHGCQR
SQNQCNDKWDNLLRDYKKVREYESRACDQQPSQIPSYWKMEKHERKDKNLPSNMAFEVYQALKDVVQRKFSQRPSNSTTTGLLLLPPPAPPPPPSAVPPPPPTTATNSPL
LSESSSSGTESSEKKEKMEAKRRKMGDNIGRSIERSISALAQTLHNCEEQREIQHQQLMELRKRRLQIEETRNHIHRQGIADLVAAVANLSAGINNRTSRSEGYGCLYNG
EEMRILKEQNEAMQAELMNVKSELSQLRDQMPSLMQTMMHNMIHNIPPPPPPTSSMDPSGSGGDA