CuGenDBv2

Tan0001039 (gene) of Snake gourd v1 genome

Gene ID: Tan0001039
Organism: Trichosanthes anguina (Snake gourd v1)
Description: trihelix transcription factor ASR3-like
Genome location: LG04:40910570..40917476
RNA-Seq Expression: Tan0001039
Synteny: Tan0001039
Gene Ontology terms: NA
InterPro domains: IPR001005 - SANT/Myb domain; IPR044822 - Myb/SANT-like DNA-binding domain 4
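
The genome location string can be split into its linkage group and coordinates with a short script. This is only a sketch: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'sequence:start..end' string into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)


seq_id, start, end = parse_location("LG04:40910570..40917476")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seq_id, start, end, span)
```

Under that assumption the gene spans 6907 bp on LG04.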


Homology
GenBank top hits (e value, % identity, alignment)
KAG6601648.1 Trihelix transcription factor ASR3, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.3e-154; identity: 74.65%)
Query:  TTSSEPPQHHHHHSHHHH---QLLHLPLIHGGASTAAAATTTARINT--GAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDPAA
        ++S+ P   HHHH HHHH   QLLHLPLIHGG          ARINT   AA SSSTVIVREYRKGNWTLQETMILITAKKLDDERRNK  LAPP DP A
Subjt:  TTSSEPPQHHHHHSHHHH---QLLHLPLIHGGASTAAAATTTARINT--GAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDPAA

Query:  RKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKL---------
        RKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQ SQI SYWKMEKHERKD NLPSNMAFEVYQALNDVVQRK          
Subjt:  RKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKL---------

Query:  -----RVAVSVAVSTPTSVVA--AAPPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALAETLRS
                 +   +TPT +VA   APPPSSAV  LPPP        TTAT+ SPAVSESSSSGTESSEK+EKTEAKRRKM D   +IERSA+ LA+TLRS
Subjt:  -----RVAVSVAVSTPTSVVA--AAPPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALAETLRS

Query:  CEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLSG-INNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSL
        CEEQREIRHQ++ME++KR LQIEEARNHIHRQGI+D+VAA+ANLS  I++R     SEGY C Y+GEEV MLK+QNEAMQAE+M+VKTELSQLRDQMPSL
Subjt:  CEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLSG-INNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSL

Query:  MQTMMHNMIHNI--PPPPPSMDPSGSGGDA
        MQTMMHNM+HNI  PPPPPSMDPSGSGGDA
Subjt:  MQTMMHNMIHNI--PPPPPSMDPSGSGGDA

KAG7032408.1 Trihelix transcription factor ASR3, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 4.7e-155; identity: 71.71%)
Query:  MSDPPTTSSEPP-----QH----HHHHSHHHH----QLLHLPLIHGGASTAAAATTTARINTGAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERR
        MSDPPTTSSEPP     QH    H HH H HH    QLLHLPLIHGG          ARINT AA SSSTVIVREYRKGNWTLQETMILITAKKLDDERR
Subjt:  MSDPPTTSSEPP-----QH----HHHHSHHHH----QLLHLPLIHGGASTAAAATTTARINTGAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERR

Query:  NKANLAPPVDPAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVV
        NK  LAPP DP ARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQ SQI SYWKMEKHERKD NLPSNMAFEVYQALNDVV
Subjt:  NKANLAPPVDPAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVV

Query:  QRKL--------RVAVSVAVSTPTSVVA--AAPPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASA
        QRK             +   +TPT +VA   APPPSSAV ALPPP        TTAT+ SPAVSESSSSGTESSEK+EKTEAKRRKM D   +IERSA+ 
Subjt:  QRKL--------RVAVSVAVSTPTSVVA--AAPPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASA

Query:  LAETLRSCEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLSG---------------------------INNRTTTTTSEGYGCLYSGE
        LA+TLRSCEEQREIRHQ++ME++KR LQIEEARNHIHRQGI+D+VAA+ANLS                            I++R     SEGY C Y+GE
Subjt:  LAETLRSCEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLSG---------------------------INNRTTTTTSEGYGCLYSGE

Query:  EVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLMQTMMHNMIHNI--PPPPPSMDPSGSGGDA
        EV MLK+QNEAMQAE+M+VKTELSQLRDQMPSLMQTMMHNM+HNI  PPPPPSMDPSGSGGDA
Subjt:  EVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLMQTMMHNMIHNI--PPPPPSMDPSGSGGDA

XP_022997995.1 trihelix transcription factor ASR3-like [Cucurbita maxima] (e value: 4.0e-162; identity: 77.1%)
Query:  MSDPPTTSSEPPQ---------HHHHHSHHH-----HQLLHLPLIHGGASTAAAATTTARINTGAATSSSTVIVREYRKGNWTLQETMILITAKKLDDER
        MSDPPTTSSEPP          HHHHH HHH      QLLHLPLIHGGA         ARINT AATSSSTVIVREYRKGNWTLQETMILITAKKLDDER
Subjt:  MSDPPTTSSEPPQ---------HHHHHSHHH-----HQLLHLPLIHGGASTAAAATTTARINTGAATSSSTVIVREYRKGNWTLQETMILITAKKLDDER

Query:  RNKANLAPPVDPAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDV
        RNK  LAPP DP ARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQ SQI SYWKMEKHERKD NLPSNMAFEVYQALNDV
Subjt:  RNKANLAPPVDPAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDV

Query:  VQRKLRVAVSVAVSTPTSVVA--AAPPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALAETLRS
        VQRK      ++ +TPT +VA   APPPSSA+ ALPPP        TTAT+ SPAVSESSSSGTESSEK+EK EAKRRKM D   +IERSA+ LA+TLRS
Subjt:  VQRKLRVAVSVAVSTPTSVVA--AAPPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALAETLRS

Query:  CEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLSGINNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLM
        CEEQREIRHQ++ME++KR LQIEEARNHIHRQGI+D+VAA+ANLS   +      SEGY C Y+GEEV MLK+QNEAMQAE+M+VKTELSQLRDQMPSLM
Subjt:  CEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLSGINNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLM

Query:  QTMMHNMIHNI-PPPPPSMDPSGSGGDA
        QTMMHNM+HNI PPPPPSMDPSGSGGDA
Subjt:  QTMMHNMIHNI-PPPPPSMDPSGSGGDA

XP_023513279.1 trihelix transcription factor ASR3-like [Cucurbita pepo subsp. pepo] (e value: 6.6e-157; identity: 74.42%)
Query:  MSDPPTTSSEPP-----QHHHHHSHHHHQLLHLPLIHGGASTAAAATTTARINTGAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAPP
        MSDPPTTSSEPP     QH HH+ H   QLLHLPLIHGG          ARINT AA SSSTVIVREYRKGNWTLQETMILITAKKLDDERRNK  LAPP
Subjt:  MSDPPTTSSEPP-----QHHHHHSHHHHQLLHLPLIHGGASTAAAATTTARINTGAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAPP

Query:  VDPAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAV
         DP ARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVR+YESRACDQ SQI SYWKMEKHERKD NLPSNMAFEVYQALNDVV RK     
Subjt:  VDPAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAV

Query:  SVAVSTPTSVVAA----------------APPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALA
         ++ +TPT+                    APPPSS V ALPPP        TTAT+ SPAVSESSSSGTESSEK+EKT+AKRRKM D   +I RSA+ LA
Subjt:  SVAVSTPTSVVAA----------------APPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALA

Query:  ETLRSCEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLSG-INNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRD
        +TLRSCEEQREIRHQ++ME++KR LQIEEARNHIHRQGI+D+VAA+ANLS  I++R     SEGY C Y+GEEV MLKEQNEAMQAE+M+VKTELSQLRD
Subjt:  ETLRSCEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLSG-INNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRD

Query:  QMPSLMQTMMHNMIHNI-PPPPPSMDPSGSGGDA
        QMPSLMQTMMHNM+HNI PPPPPSMDPSGSGGDA
Subjt:  QMPSLMQTMMHNMIHNI-PPPPPSMDPSGSGGDA

XP_038893036.1 uncharacterized protein LOC120081925 [Benincasa hispida] (e value: 2.7e-158; identity: 79.14%)
Query:  MSDPPTTSSEPPQHHHHHSHHHHQLLHLPLIHGGASTAAAATTTARINT--GAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAP-PVD
        MSDPP+TSSEPPQH H   HHH Q+LHLP+IHGGAS  A AT   R+NT   AA SSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANL P P D
Subjt:  MSDPPTTSSEPPQHHHHHSHHHHQLLHLPLIHGGASTAAAATTTARINT--GAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAP-PVD

Query:  PAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACD-QISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVS
        PAARKGGELRWKWVENYCWSHGC RSQNQCNDKWDNLLRDYKKVREYESRACD Q+ QI SYWKMEKHERKDKNLPSN+AFEVYQALNDVVQRK     S
Subjt:  PAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACD-QISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVS

Query:  VAVSTPTSVVAAAPPPSSAVAALPPPGEAAAAAATTATDSPAVSESSSSGTESSEKEEKTEAKRRKMGD-IGRSIERSASALAETLRSCEEQREIRHQQL
               S +   PPP     A PPP        T+AT SP +S+SSSSGTESSEK+EK EAKRRKMGD IGRSIERS SAL +TL SCEEQREI+HQQL
Subjt:  VAVSTPTSVVAAAPPPSSAVAALPPPGEAAAAAATTATDSPAVSESSSSGTESSEKEEKTEAKRRKMGD-IGRSIERSASALAETLRSCEEQREIRHQQL

Query:  MELRKRRLQIEEARNHIHRQGIADLVAAVANLS-GINNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLMQTMMHNMIHNI
        MELRKRRLQIEE RNHIHRQGIADLVAAVANLS G+NNR  T   E YGCLYSGEEV +LKEQNEAMQAELM+VK+ELSQLRDQMPSLMQTMMHNMIHNI
Subjt:  MELRKRRLQIEEARNHIHRQGIADLVAAVANLS-GINNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLMQTMMHNMIHNI

Query:  -PPPPPSMDPSGSGGDA
         PPPP SMDPSGSGGDA
Subjt:  -PPPPPSMDPSGSGGDA

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C482 trihelix transcription factor PTL-like (e value: 7.6e-151; identity: 76.79%)
Query:  MSDPPTTSSEPPQHHHHHSHHHHQLLHLPLIHGGASTAAAATTTARINTGAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAP-PVDPA
        MSDPPTTSSEPP    HH      L  LP+IHGGAS A       R+NT AATSSS VIVREYRKGNWTLQETMILITAKKLDDERRNKANL P  VDPA
Subjt:  MSDPPTTSSEPPQHHHHHSHHHHQLLHLPLIHGGASTAAAATTTARINTGAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAP-PVDPA

Query:  ARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVSVAV
        ARKGGELRWKWVENYCWSHGC RSQNQCNDKWDNLLRDYKKVREYESRACDQ  QI SYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRK     S + 
Subjt:  ARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVSVAV

Query:  STPTSVV-AAAPPPSSAVAALPPPGEAAAAAATTATDSPAVSESSSSGTESSEKEEKTEAKRRKMGD-IGRSIERSASALAETLRSCEEQREIRHQQLME
        +T   ++   APPPS+    LPPP         TAT+SP +SESSSSGTESSEK+EK EAKRRKM D IGR IERS SAL +TL SCEEQREIRHQQLME
Subjt:  STPTSVV-AAAPPPSSAVAALPPPGEAAAAAATTATDSPAVSESSSSGTESSEKEEKTEAKRRKMGD-IGRSIERSASALAETLRSCEEQREIRHQQLME

Query:  LRKRRLQIEEARNHIHRQGIADLVAAVANLSGINNRTTTTTSEGY-GCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLMQTMMHNMIHNIPP
        LRKRRLQIEE RNHIHRQGIADLVAAVANLS   +      SEGY  CLYSGEEV +LKEQNEAMQAELM+VK ELSQLRDQMPSLMQTMMH+MIHNIPP
Subjt:  LRKRRLQIEEARNHIHRQGIADLVAAVANLSGINNRTTTTTSEGY-GCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLMQTMMHNMIHNIPP

Query:  PPP----SMDPSGSGGDA
        PPP    SMDPSGSG DA
Subjt:  PPP----SMDPSGSGGDA

A0A6J1DSG1 trihelix transcription factor ASR3-like (e value: 1.7e-147; identity: 74.52%)
Query:  MSDPPTTSSEPPQHHHHHSHHHHQLLHLPLIHGGASTAAAATTTARINTGAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDPAA
        MSDPPTTSSEPP  H H   HH QLLHLPLIHGGA+T    +TT  IN  A         REYRKGNWTLQETMILI AKKLDDERR+KANLAPP DPAA
Subjt:  MSDPPTTSSEPPQHHHHHSHHHHQLLHLPLIHGGASTAAAATTTARINTGAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDPAA

Query:  RKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRAC----DQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVS
        RKGGELRWKWVENYCWS GCHRSQNQCNDKWDNLLRDYKKVREY+SRAC    +Q S   SYWKMEKHERKD NLPSNM FEVYQALNDVVQRK     S
Subjt:  RKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRAC----DQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVS

Query:  VAVSTPTSVVAAAPPPSSAVAALPPPGEAAAAAATTATDSPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALAETLRSCEEQREIRHQQLM
         +  +  +  A  P PSSA    PPP           T SPA SE SSSGTESSEK E  E KRRKMGDIG SIERSASALA+ LRSCEEQREIRHQQLM
Subjt:  VAVSTPTSVVAAAPPPSSAVAALPPPGEAAAAAATTATDSPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALAETLRSCEEQREIRHQQLM

Query:  ELRKRRLQIEEARNHIHRQGIADLVAAVANLSGINNRTTTTTSEGYG---CLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLMQTMMHNMIHN
        EL+KRRL IEE RNH+HRQGIADLVAAVANLSG NNR ++ +SEGYG   CLYSGEEV +LKEQNEAMQAELM VK+ELSQLRDQMPSLMQTMMHNMIHN
Subjt:  ELRKRRLQIEEARNHIHRQGIADLVAAVANLSGINNRTTTTTSEGYG---CLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLMQTMMHNMIHN

Query:  IPPPP---PSMDPSGSGGDA
        IPPPP    SMDP+GSGGDA
Subjt:  IPPPP---PSMDPSGSGGDA

A0A6J1GZY0 trihelix transcription factor ASR3-like isoform X2 (e value: 1.2e-153; identity: 76.2%)
Query:  SDPPTTSSEPPQHHHHHSHHHHQLLHLPLIHGGASTAAAATTTARINT---GAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDP
        S   T    PP    HH H   QLLHLPLIHGGA         ARINT    AATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNK  LAPP DP
Subjt:  SDPPTTSSEPPQHHHHHSHHHHQLLHLPLIHGGASTAAAATTTARINT---GAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDP

Query:  AARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVSVA
         ARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQ SQI SYWKMEKHERKD NLPSNMAFEVYQALNDVVQRK      ++
Subjt:  AARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVSVA

Query:  VSTPTSVVA--AAPPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALAETLRSCEEQREIRHQQL
         +TPT +VA   APPPSSAV ALPPP        TTAT+ SPAVSESSSSGTESSEK+EKTEAKRRKM D   +IERSA+ LA+TL+ CEEQREIRHQ++
Subjt:  VSTPTSVVA--AAPPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALAETLRSCEEQREIRHQQL

Query:  MELRKRRLQIEEARNHIHRQGIADLVAAVANLSGINNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLMQTMMHNMIHNI-
        ME++KR LQIEEARNHIHRQGI+D+VAA+ANLS   +      SEGY C Y+GEEV MLK+QNEAMQAE+M+VKTELSQLRDQMPSLMQTMMHNM+HNI 
Subjt:  MELRKRRLQIEEARNHIHRQGIADLVAAVANLSGINNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLMQTMMHNMIHNI-

Query:  PPPPPSMDPSGSGGDA
        PPPPPSMDPSGSGGDA
Subjt:  PPPPPSMDPSGSGGDA

A0A6J1H1B6 trihelix transcription factor ASR3-like isoform X1 (e value: 8.9e-152; identity: 73.94%)
Query:  SDPPTTSSEPPQHHHHHSHHHHQLLHLPLIHGGASTAAAATTTARINT---GAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDP
        S   T    PP    HH H   QLLHLPLIHGGA         ARINT    AATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNK  LAPP DP
Subjt:  SDPPTTSSEPPQHHHHHSHHHHQLLHLPLIHGGASTAAAATTTARINT---GAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDP

Query:  AARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVSVA
         ARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQ SQI SYWKMEKHERKD NLPSNMAFEVYQALNDVVQRK      ++
Subjt:  AARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVSVA

Query:  VSTPTSVVAA------------APPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALAETLRSCE
         +TPT+                APPPSSAV ALPPP        TTAT+ SPAVSESSSSGTESSEK+EKTEAKRRKM D   +IERSA+ LA+TL+ CE
Subjt:  VSTPTSVVAA------------APPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALAETLRSCE

Query:  EQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLSGINNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLMQT
        EQREIRHQ++ME++KR LQIEEARNHIHRQGI+D+VAA+ANLS   +      SEGY C Y+GEEV MLK+QNEAMQAE+M+VKTELSQLRDQMPSLMQT
Subjt:  EQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLSGINNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLMQT

Query:  MMHNMIHNI-PPPPPSMDPSGSGGDA
        MMHNM+HNI PPPPPSMDPSGSGGDA
Subjt:  MMHNMIHNI-PPPPPSMDPSGSGGDA

A0A6J1KBG1 trihelix transcription factor ASR3-like (e value: 1.9e-162; identity: 77.1%)
Query:  MSDPPTTSSEPPQ---------HHHHHSHHH-----HQLLHLPLIHGGASTAAAATTTARINTGAATSSSTVIVREYRKGNWTLQETMILITAKKLDDER
        MSDPPTTSSEPP          HHHHH HHH      QLLHLPLIHGGA         ARINT AATSSSTVIVREYRKGNWTLQETMILITAKKLDDER
Subjt:  MSDPPTTSSEPPQ---------HHHHHSHHH-----HQLLHLPLIHGGASTAAAATTTARINTGAATSSSTVIVREYRKGNWTLQETMILITAKKLDDER

Query:  RNKANLAPPVDPAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDV
        RNK  LAPP DP ARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQ SQI SYWKMEKHERKD NLPSNMAFEVYQALNDV
Subjt:  RNKANLAPPVDPAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDV

Query:  VQRKLRVAVSVAVSTPTSVVA--AAPPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALAETLRS
        VQRK      ++ +TPT +VA   APPPSSA+ ALPPP        TTAT+ SPAVSESSSSGTESSEK+EK EAKRRKM D   +IERSA+ LA+TLRS
Subjt:  VQRKLRVAVSVAVSTPTSVVA--AAPPPSSAVAALPPPGEAAAAAATTATD-SPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALAETLRS

Query:  CEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLSGINNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLM
        CEEQREIRHQ++ME++KR LQIEEARNHIHRQGI+D+VAA+ANLS   +      SEGY C Y+GEEV MLK+QNEAMQAE+M+VKTELSQLRDQMPSLM
Subjt:  CEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLSGINNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLM

Query:  QTMMHNMIHNI-PPPPPSMDPSGSGGDA
        QTMMHNM+HNI PPPPPSMDPSGSGGDA
Subjt:  QTMMHNMIHNI-PPPPPSMDPSGSGGDA

SwissProt top hits (e value, % identity, alignment)
Q8VZ20 Trihelix transcription factor ASR3 (e value: 4.6e-04; identity: 25.93%)
Query:  VREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDPAARKGGELRWKW--VENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIAS
        V+  R   WT QE ++LI  K++ + R  +   A      A   G++  KW  V +YC  HG +R   QC  +W NL  DYKK++E+ES+  ++     S
Subjt:  VREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDPAARKGGELRWKW--VENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIAS

Query:  YWKMEKHERKDKNLPSNMAFEVYQ-------------------------ALNDVVQRKLRVAV-SVAVSTPTSVVAAAPPPSSAVAALPPPGEAAAAAAT
        YW M    R++K LP     EVY                           L+D+ +R+    + S  V+   + V       + VA      E    AA 
Subjt:  YWKMEKHERKDKNLPSNMAFEVYQ-------------------------ALNDVVQRKLRVAV-SVAVSTPTSVVAAAPPPSSAVAALPPPGEAAAAAAT

Query:  TATDSPAVSESSSSGTESSEKEEKTE--AKRRKMGDIGRSIERSASALAETLRSCEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVAN
            S +  E     T   EKEE+ E    ++    +   +ER+   LA        Q E+++  L   R++R    ++   +  + +AD VA +A+
Subjt:  TATDSPAVSESSSSGTESSEKEEKTE--AKRRKMGDIGRSIERSASALAETLRSCEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVAN

Arabidopsis top hits (e value, % identity, alignment)
AT1G31310.1 hydroxyproline-rich glycoprotein family protein (e value: 3.8e-46; identity: 37.17%)
Query:  AATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKAN--LAPP---VDPAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYE
        A  S   V++REYRKGNWTL ETM+LI AK++DDERR + +  L PP    D  + K  ELRWKW+E+YCW  GC RSQNQCNDKWDNL+RDYKKVREYE
Subjt:  AATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKAN--LAPP---VDPAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYE

Query:  SRACDQ-------------ISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVSVAVSTPTSVVAAA-----------------------
         R  +                + ASYWKMEK ERK+++LPSNM  + YQAL +VV+ K  +  S AV+  T+ VAAA                       
Subjt:  SRACDQ-------------ISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVSVAVSTPTSVVAAA-----------------------

Query:  -------------------------PPPSSAV---AALPPPGEAAAAAATTATDSPAVSESSSSGTESSEKEEKTEAKRRK----------------MGD
                                 PPPS  +     LPPP   +  A          ++ SS+ +++SE  + + AKRR+                + +
Subjt:  -------------------------PPPSSAV---AALPPPGEAAAAAATTATDSPAVSESSSSGTESSEKEEKTEAKRRK----------------MGD

Query:  IGRS-----------IERSASALAETLRSCEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLS
        +GRS           + RS S +A  +R  EE+++ RH+++M +++RRL+IEE+   ++R+G+  LV A+  L+
Subjt:  IGRS-----------IERSASALAETLRSCEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLS

AT2G33550.1 Homeodomain-like superfamily protein (e value: 3.3e-05; identity: 25.93%)
Query:  VREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDPAARKGGELRWKW--VENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIAS
        V+  R   WT QE ++LI  K++ + R  +   A      A   G++  KW  V +YC  HG +R   QC  +W NL  DYKK++E+ES+  ++     S
Subjt:  VREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDPAARKGGELRWKW--VENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIAS

Query:  YWKMEKHERKDKNLPSNMAFEVYQ-------------------------ALNDVVQRKLRVAV-SVAVSTPTSVVAAAPPPSSAVAALPPPGEAAAAAAT
        YW M    R++K LP     EVY                           L+D+ +R+    + S  V+   + V       + VA      E    AA 
Subjt:  YWKMEKHERKDKNLPSNMAFEVYQ-------------------------ALNDVVQRKLRVAV-SVAVSTPTSVVAAAPPPSSAVAALPPPGEAAAAAAT

Query:  TATDSPAVSESSSSGTESSEKEEKTE--AKRRKMGDIGRSIERSASALAETLRSCEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVAN
            S +  E     T   EKEE+ E    ++    +   +ER+   LA        Q E+++  L   R++R    ++   +  + +AD VA +A+
Subjt:  TATDSPAVSESSSSGTESSEKEEKTE--AKRRKMGDIGRSIERSASALAETLRSCEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVAN

AT2G35640.1 Homeodomain-like superfamily protein (e value: 2.0e-47; identity: 38.54%)
Query:  TSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDPAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYE-SRACDQ
        +S   +++RE RKGNWT+ ET++LI AKK+DD+RR + +   P      K  ELRWKW+E YCW  GC+R+QNQCNDKWDNL+RDYKK+REYE SR    
Subjt:  TSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDPAARKGGELRWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYE-SRACDQ

Query:  ISQI--ASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVSVAVST-------------------------------PTSVVAA--APPPSSAV
         + +  +SYWKM+K ERK+KNLPSNM  ++Y  L+++V RK   + S A +                                PT++V +   PPP S  
Subjt:  ISQI--ASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVSVAVST-------------------------------PTSVVAA--APPPSSAV

Query:  AALPPPGEAAAAAATTATDSPAVSESSSSGTESSEKEEKTEAKRRKMGD--IGRSIERSASALAETLRSCEEQREIRHQQLMELRKRRLQIEEARNHIHR
         +LP P +   +++  A   P    +SS+    +   E T    R++ +  +G ++ R  S + + +R  EE +E RH++++ L++RRL+IEE++  I+R
Subjt:  AALPPPGEAAAAAATTATDSPAVSESSSSGTESSEKEEKTEAKRRKMGD--IGRSIERSASALAETLRSCEEQREIRHQQLMELRKRRLQIEEARNHIHR

Query:  QGIADLVAAVANLS
        QG+  LV A+  L+
Subjt:  QGIADLVAAVANLS

AT4G31270.1 sequence-specific DNA binding transcription factors (e value: 4.1e-08; identity: 33.33%)
Query:  RWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVV
        +W  +   C +    R+ NQC  KWD+L+ DY +++++ES+         SYW +   +RK  NLP ++  E+++A+N VV
Subjt:  RWKWVENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVV


Sequences
CDS sequence
ATGTCGGATCCTCCGACGACATCATCGGAGCCACCGCAACACCACCACCACCACTCCCATCACCACCACCAACTCCTACATTTACCCCTAATTCACGGCGGCGCATCCAC
CGCCGCCGCGGCCACCACCACCGCACGAATCAACACTGGAGCAGCAACCTCATCCTCGACAGTAATAGTCCGAGAGTACCGCAAAGGAAACTGGACTCTCCAAGAGACGA
TGATTCTAATAACCGCCAAAAAGCTGGACGACGAGCGGCGGAATAAGGCGAACTTAGCCCCTCCGGTGGATCCCGCCGCCAGAAAGGGCGGCGAACTACGGTGGAAATGG
GTCGAAAATTACTGCTGGAGCCACGGCTGTCACCGGAGCCAAAATCAGTGCAACGACAAGTGGGATAACCTCCTCCGCGACTACAAAAAAGTCCGCGAGTACGAATCCCG
CGCGTGCGATCAAATATCCCAAATTGCCTCTTACTGGAAAATGGAAAAGCACGAGCGTAAGGACAAGAATCTTCCTTCTAATATGGCCTTTGAGGTCTATCAGGCCTTAA
ACGACGTCGTTCAGAGGAAGCTCAGAGTTGCTGTTTCTGTTGCTGTTTCCACTCCTACTAGTGTTGTTGCTGCTGCTCCTCCTCCTTCTTCCGCCGTCGCCGCCCTTCCG
CCTCCCGGTGAAGCTGCAGCGGCGGCGGCTACGACGGCGACCGATTCCCCGGCGGTTTCAGAGTCGTCATCATCGGGGACGGAGTCGAGCGAGAAGGAAGAGAAGACGGA
GGCAAAGAGGAGAAAAATGGGAGATATTGGGAGAAGCATAGAGAGAAGCGCATCGGCTTTAGCTGAAACGTTGCGGAGCTGCGAGGAGCAAAGGGAGATTCGACACCAAC
AACTTATGGAGCTTCGAAAACGCCGCCTTCAAATCGAAGAAGCCCGCAACCACATTCACCGTCAAGGCATCGCCGACCTCGTCGCCGCCGTCGCCAACCTCTCCGGTATA
AATAATAGAACAACAACAACAACATCAGAAGGGTATGGATGTTTATACAGTGGAGAAGAGGTGACAATGTTGAAAGAGCAAAATGAGGCAATGCAAGCTGAGCTTATGAG
TGTGAAGACTGAGCTTTCTCAACTTAGAGACCAAATGCCCTCTCTTATGCAAACCATGATGCACAATATGATCCACAACATCCCTCCTCCTCCTCCTTCCATGGACCCAT
CTGGATCAGGTGGAGATGCTTAG
mRNA sequence
ATGTCGGATCCTCCGACGACATCATCGGAGCCACCGCAACACCACCACCACCACTCCCATCACCACCACCAACTCCTACATTTACCCCTAATTCACGGCGGCGCATCCAC
CGCCGCCGCGGCCACCACCACCGCACGAATCAACACTGGAGCAGCAACCTCATCCTCGACAGTAATAGTCCGAGAGTACCGCAAAGGAAACTGGACTCTCCAAGAGACGA
TGATTCTAATAACCGCCAAAAAGCTGGACGACGAGCGGCGGAATAAGGCGAACTTAGCCCCTCCGGTGGATCCCGCCGCCAGAAAGGGCGGCGAACTACGGTGGAAATGG
GTCGAAAATTACTGCTGGAGCCACGGCTGTCACCGGAGCCAAAATCAGTGCAACGACAAGTGGGATAACCTCCTCCGCGACTACAAAAAAGTCCGCGAGTACGAATCCCG
CGCGTGCGATCAAATATCCCAAATTGCCTCTTACTGGAAAATGGAAAAGCACGAGCGTAAGGACAAGAATCTTCCTTCTAATATGGCCTTTGAGGTCTATCAGGCCTTAA
ACGACGTCGTTCAGAGGAAGCTCAGAGTTGCTGTTTCTGTTGCTGTTTCCACTCCTACTAGTGTTGTTGCTGCTGCTCCTCCTCCTTCTTCCGCCGTCGCCGCCCTTCCG
CCTCCCGGTGAAGCTGCAGCGGCGGCGGCTACGACGGCGACCGATTCCCCGGCGGTTTCAGAGTCGTCATCATCGGGGACGGAGTCGAGCGAGAAGGAAGAGAAGACGGA
GGCAAAGAGGAGAAAAATGGGAGATATTGGGAGAAGCATAGAGAGAAGCGCATCGGCTTTAGCTGAAACGTTGCGGAGCTGCGAGGAGCAAAGGGAGATTCGACACCAAC
AACTTATGGAGCTTCGAAAACGCCGCCTTCAAATCGAAGAAGCCCGCAACCACATTCACCGTCAAGGCATCGCCGACCTCGTCGCCGCCGTCGCCAACCTCTCCGGTATA
AATAATAGAACAACAACAACAACATCAGAAGGGTATGGATGTTTATACAGTGGAGAAGAGGTGACAATGTTGAAAGAGCAAAATGAGGCAATGCAAGCTGAGCTTATGAG
TGTGAAGACTGAGCTTTCTCAACTTAGAGACCAAATGCCCTCTCTTATGCAAACCATGATGCACAATATGATCCACAACATCCCTCCTCCTCCTCCTTCCATGGACCCAT
CTGGATCAGGTGGAGATGCTTAG
Protein sequence
MSDPPTTSSEPPQHHHHHSHHHHQLLHLPLIHGGASTAAAATTTARINTGAATSSSTVIVREYRKGNWTLQETMILITAKKLDDERRNKANLAPPVDPAARKGGELRWKW
VENYCWSHGCHRSQNQCNDKWDNLLRDYKKVREYESRACDQISQIASYWKMEKHERKDKNLPSNMAFEVYQALNDVVQRKLRVAVSVAVSTPTSVVAAAPPPSSAVAALP
PPGEAAAAAATTATDSPAVSESSSSGTESSEKEEKTEAKRRKMGDIGRSIERSASALAETLRSCEEQREIRHQQLMELRKRRLQIEEARNHIHRQGIADLVAAVANLSGI
NNRTTTTTSEGYGCLYSGEEVTMLKEQNEAMQAELMSVKTELSQLRDQMPSLMQTMMHNMIHNIPPPPPSMDPSGSGGDA
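
As a consistency check, the protein sequence can be compared with a translation of the CDS. The sketch below is not part of the database record: the function name `translate` is hypothetical, and it assumes the standard nuclear genetic code and a CDS that starts in frame at its first base. Only the first 36 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest). Assumption: the standard nuclear code applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases of the CDS sequence listed above.
print(translate("ATGTCGGATCCTCCGACGACATCATCGGAGCCACCG"))
```

The printed peptide should match the first 12 residues of the protein sequence (MSDPPTTSSEPP). Passing the full CDS with its line breaks intact works the same way, since whitespace is removed before translation.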