; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g0899 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g0899
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptiontrihelix transcription factor ASR3-like
Genome locationMC09:12228550..12234077
RNA-Seq ExpressionMC09g0899
SyntenyMC09g0899
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456886.1 PREDICTED: trihelix transcription factor PTL-like [Cucumis melo]8.62e-17374.12Show/hide
Query:  MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAA-------REYRKGNWTLQETMILIAAKKLDDERRSKANLAPP--DPAARKGGE
        MSDPPTTSSEPPH HQ Q QH   L  LP+IHGGA+ +T  +  AA        REYRKGNWTLQETMILI AKKLDDERR+KANL P   DPAARKGGE
Subjt:  MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAA-------REYRKGNWTLQETMILIAAKKLDDERRSKANLAPP--DPAARKGGE

Query:  LRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNS-HRSGAT
        LRWKWVENYCWS GC RSQNQCNDKWDNLLRDYKKVREY+SRAC          PSYWKMEKHERKD NLPSNM FEVYQALNDVVQRK+S     S  T
Subjt:  LRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNS-HRSGAT

Query:  AAAVLPSPSSAPPP----PLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGD-IGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETR
           +LP P  APPP    P P  T SP  SESSS GTESSEK+E ME KRRKM D IG  IERS SAL Q L SCEEQREIRHQQLMEL+KRRL IEETR
Subjt:  AAAVLPSPSSAPPP----PLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGD-IGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETR

Query:  NHLHRQGIADLVAAVANLSG--KNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPPNPHSS
        NH+HRQGIADLVAAVANLS    NNR  RS EGY   SCLYSGEEVR+LKEQNEAMQAELM VK ELSQLRDQMPSLMQTMMH+MIHNIPPPP P +S
Subjt:  NHLHRQGIADLVAAVANLSG--KNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPPNPHSS

XP_022157078.1 trihelix transcription factor ASR3-like [Momordica charantia]1.33e-26699.74Show/hide
Query:  MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAAREYRKGNWTLQETMILIAAKKLDDERRSKANLAPPDPAARKGGELRWKWVENY
        MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAAREYRKGNWTLQETMILIAAKKLDDERRSKANLAPPDPAARKGGELRWKWVENY
Subjt:  MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAAREYRKGNWTLQETMILIAAKKLDDERRSKANLAPPDPAARKGGELRWKWVENY

Query:  CWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHRSGATAAAVLPSPSS
        CWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHRSGATAAAVLPSPSS
Subjt:  CWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHRSGATAAAVLPSPSS

Query:  APPPPLPPTTTSPAASESSSGTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETRNHLHRQGIADLVAAVA
        APPPPLPPTTTSPAASESSSGTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETRNHLHRQGIADLVAAVA
Subjt:  APPPPLPPTTTSPAASESSSGTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETRNHLHRQGIADLVAAVA

Query:  NLSGKNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPPNPHSSM
        NLSGKNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVK+ELSQLRDQMPSLMQTMMHNMIHNIPPPPNPHSSM
Subjt:  NLSGKNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPPNPHSSM

XP_022997995.1 trihelix transcription factor ASR3-like [Cucurbita maxima]1.61e-17470.66Show/hide
Query:  MSDPPTTSSEPPHQ--------HQHQHQHH------QQLLHLPLIHGGA----TTSTTRSINAAAREYRKGNWTLQETMILIAAKKLDDERRSKANLAPP
        MSDPPTTSSEPPHQ        H H H HH      QQLLHLPLIHGGA    T + T S     REYRKGNWTLQETMILI AKKLDDERR+K  LAPP
Subjt:  MSDPPTTSSEPPHQ--------HQHQHQHH------QQLLHLPLIHGGA----TTSTTRSINAAAREYRKGNWTLQETMILIAAKKLDDERRSKANLAPP

Query:  -DPAARKGGELRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKY
         DP ARKGGELRWKWVENYCWS GCHRSQNQCNDKWDNLLRDYKKVREY+SRAC    +Q S  PSYWKMEKHERKDNNLPSNM FEVYQALNDVVQRK+
Subjt:  -DPAARKGGELRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKY

Query:  SN----SHRSGATAAAVLPSP--SSA-----PPPPLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQ
        S     SH +     A+ P+P  SSA     PPPP   T +SPA SESSS GTESSEK+E  E KRRKM D   +IERSA+ LAQ LRSCEEQREIRHQ+
Subjt:  SN----SHRSGATAAAVLPSP--SSA-----PPPPLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQ

Query:  LMELQKRRLHIEETRNHLHRQGIADLVAAVANLSGK-NNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMI
        +ME+QKR L IEE RNH+HRQGI+D+VAA+ANLS + ++R  R SEGY    C Y+GEEVR+LK+QNEAMQAE+M VK ELSQLRDQMPSLMQTMMHNM+
Subjt:  LMELQKRRLHIEETRNHLHRQGIADLVAAVANLSGK-NNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMI

Query:  HNIPPPPNP
        HNIPPPP P
Subjt:  HNIPPPPNP

XP_031742150.1 trihelix transcription factor PTL isoform X1 [Cucumis sativus]4.34e-17272.41Show/hide
Query:  MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAA-------REYRKGNWTLQETMILIAAKKLDDERRSKANLAPP--DPAARKGGE
        MSDPPTTSSEPPH     H H Q L  LP+IH GAT  T  +  AA        REYRKGNWTLQETMILI AKKLDDERR+KANL P   DPAARKGGE
Subjt:  MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAA-------REYRKGNWTLQETMILIAAKKLDDERRSKANLAPP--DPAARKGGE

Query:  LRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNS-HRSGAT
        LRWKWVENYCWS GC RSQNQCNDKWDNLLRDYKKVREY+SRAC          PSYWKMEKHERKD NLPSNM FEVYQALNDVVQRK+S     S  T
Subjt:  LRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNS-HRSGAT

Query:  AAAVLPSPSSAPPP----PLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGD-IGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETR
           +LP P  APPP    P P  T SP  SESSS GTESSEK+E +E KRRKM D IG  IERS SAL Q L SCEEQREIRHQQLMEL+KRRL IEETR
Subjt:  AAAVLPSPSSAPPP----PLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGD-IGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETR

Query:  NHLHRQGIADLVAAVANLSG--KNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPPNPHSSMV
        NH+HRQGIADLVAAVANLS    N+R  RS EGY   SCLYSGEEVR+LKEQNEAMQAELM VK ELSQLRDQMPSLMQTMMHNM+HNIPPPP   SSM 
Subjt:  NHLHRQGIADLVAAVANLSG--KNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPPNPHSSMV

Query:  LSLSSL
        + L  L
Subjt:  LSLSSL

XP_038893036.1 uncharacterized protein LOC120081925 [Benincasa hispida]4.52e-17673.05Show/hide
Query:  MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTST--TRSINAAA----------REYRKGNWTLQETMILIAAKKLDDERRSKANL--APPDPAA
        MSDPP+TSSEPP   QH+H HHQQ+LHLP+IHGGA+     TR    AA          REYRKGNWTLQETMILI AKKLDDERR+KANL  AP DPAA
Subjt:  MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTST--TRSINAAA----------REYRKGNWTLQETMILIAAKKLDDERRSKANL--APPDPAA

Query:  RKGGELRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHR
        RKGGELRWKWVENYCWS GC RSQNQCNDKWDNLLRDYKKVREY+SRAC    +Q    PSYWKMEKHERKD NLPSN+ FEVYQALNDVVQRK+S   +
Subjt:  RKGGELRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHR

Query:  SGATAAAVLPSPSSAPP---PPLPPTTTS---PAASESSSGTESSEKRESMETKRRKMGD-IGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLH
           +   +LP P+  PP   PP PPT+ +   P +  SSSGTESSEK+E +E KRRKMGD IG SIERS SAL Q L SCEEQREI+HQQLMEL+KRRL 
Subjt:  SGATAAAVLPSPSSAPP---PPLPPTTTS---PAASESSSGTESSEKRESMETKRRKMGD-IGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLH

Query:  IEETRNHLHRQGIADLVAAVANLS-GKNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPP
        IEETRNH+HRQGIADLVAAVANLS G NNR++R  E YG   CLYSGEEVR+LKEQNEAMQAELM VK+ELSQLRDQMPSLMQTMMHNMIHNI PPP
Subjt:  IEETRNHLHRQGIADLVAAVANLS-GKNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPP

TrEMBL top hitse value%identityAlignment
A0A1S3C482 trihelix transcription factor PTL-like4.18e-17374.12Show/hide
Query:  MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAA-------REYRKGNWTLQETMILIAAKKLDDERRSKANLAPP--DPAARKGGE
        MSDPPTTSSEPPH HQ Q QH   L  LP+IHGGA+ +T  +  AA        REYRKGNWTLQETMILI AKKLDDERR+KANL P   DPAARKGGE
Subjt:  MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAA-------REYRKGNWTLQETMILIAAKKLDDERRSKANLAPP--DPAARKGGE

Query:  LRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNS-HRSGAT
        LRWKWVENYCWS GC RSQNQCNDKWDNLLRDYKKVREY+SRAC          PSYWKMEKHERKD NLPSNM FEVYQALNDVVQRK+S     S  T
Subjt:  LRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNS-HRSGAT

Query:  AAAVLPSPSSAPPP----PLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGD-IGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETR
           +LP P  APPP    P P  T SP  SESSS GTESSEK+E ME KRRKM D IG  IERS SAL Q L SCEEQREIRHQQLMEL+KRRL IEETR
Subjt:  AAAVLPSPSSAPPP----PLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGD-IGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETR

Query:  NHLHRQGIADLVAAVANLSG--KNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPPNPHSS
        NH+HRQGIADLVAAVANLS    NNR  RS EGY   SCLYSGEEVR+LKEQNEAMQAELM VK ELSQLRDQMPSLMQTMMH+MIHNIPPPP P +S
Subjt:  NHLHRQGIADLVAAVANLSG--KNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPPNPHSS

A0A6J1DSG1 trihelix transcription factor ASR3-like6.44e-26799.74Show/hide
Query:  MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAAREYRKGNWTLQETMILIAAKKLDDERRSKANLAPPDPAARKGGELRWKWVENY
        MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAAREYRKGNWTLQETMILIAAKKLDDERRSKANLAPPDPAARKGGELRWKWVENY
Subjt:  MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAAREYRKGNWTLQETMILIAAKKLDDERRSKANLAPPDPAARKGGELRWKWVENY

Query:  CWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHRSGATAAAVLPSPSS
        CWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHRSGATAAAVLPSPSS
Subjt:  CWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHRSGATAAAVLPSPSS

Query:  APPPPLPPTTTSPAASESSSGTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETRNHLHRQGIADLVAAVA
        APPPPLPPTTTSPAASESSSGTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETRNHLHRQGIADLVAAVA
Subjt:  APPPPLPPTTTSPAASESSSGTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETRNHLHRQGIADLVAAVA

Query:  NLSGKNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPPNPHSSM
        NLSGKNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVK+ELSQLRDQMPSLMQTMMHNMIHNIPPPPNPHSSM
Subjt:  NLSGKNNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPPNPHSSM

A0A6J1GZY0 trihelix transcription factor ASR3-like isoform X28.23e-16469.21Show/hide
Query:  TTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAA-------REYRKGNWTLQETMILIAAKKLDDERRSKANLAPP-DPAARKGGELRWKWV
        T    PP    H H   QQLLHLPLIHGGA    T +  AA        REYRKGNWTLQETMILI AKKLDDERR+K  LAPP DP ARKGGELRWKWV
Subjt:  TTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAA-------REYRKGNWTLQETMILIAAKKLDDERRSKANLAPP-DPAARKGGELRWKWV

Query:  ENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYS------NSHRSGATA
        ENYCWS GCHRSQNQCNDKWDNLLRDYKKVREY+SRAC    +Q S  PSYWKMEKHERKDNNLPSNM FEVYQALNDVVQRK+S      N+  +   A
Subjt:  ENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYS------NSHRSGATA

Query:  AAVLPSPSSA-----PPPPLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETRN
            P PSSA     PPPP   T +SPA SESSS GTESSEK+E  E KRRKM D   +IERSA+ LAQ L+ CEEQREIRHQ++ME+QKR L IEE RN
Subjt:  AAVLPSPSSA-----PPPPLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETRN

Query:  HLHRQGIADLVAAVANLSGK-NNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPPNP
        H+HRQGI+D+VAA+ANLS + ++R  R SEGY    C Y+GEEVR+LK+QNEAMQAE+M VK ELSQLRDQMPSLMQTMMHNM+HNIPPPP P
Subjt:  HLHRQGIADLVAAVANLSGK-NNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPPNP

A0A6J1H1B6 trihelix transcription factor ASR3-like isoform X15.44e-16267.74Show/hide
Query:  TTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAA-------REYRKGNWTLQETMILIAAKKLDDERRSKANLAPP-DPAARKGGELRWKWV
        T    PP    H H   QQLLHLPLIHGGA    T +  AA        REYRKGNWTLQETMILI AKKLDDERR+K  LAPP DP ARKGGELRWKWV
Subjt:  TTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAA-------REYRKGNWTLQETMILIAAKKLDDERRSKANLAPP-DPAARKGGELRWKWV

Query:  ENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYS------NSHRSGATA
        ENYCWS GCHRSQNQCNDKWDNLLRDYKKVREY+SRAC    +Q S  PSYWKMEKHERKDNNLPSNM FEVYQALNDVVQRK+S      N+  +  T 
Subjt:  ENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYS------NSHRSGATA

Query:  AAVL----------PSPSSA-----PPPPLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQLMELQK
         A            P PSSA     PPPP   T +SPA SESSS GTESSEK+E  E KRRKM D   +IERSA+ LAQ L+ CEEQREIRHQ++ME+QK
Subjt:  AAVL----------PSPSSA-----PPPPLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQLMELQK

Query:  RRLHIEETRNHLHRQGIADLVAAVANLSGK-NNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPP
        R L IEE RNH+HRQGI+D+VAA+ANLS + ++R  R SEGY    C Y+GEEVR+LK+QNEAMQAE+M VK ELSQLRDQMPSLMQTMMHNM+HNIPPP
Subjt:  RRLHIEETRNHLHRQGIADLVAAVANLSGK-NNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPP

Query:  PNP
        P P
Subjt:  PNP

A0A6J1KBG1 trihelix transcription factor ASR3-like7.80e-17570.66Show/hide
Query:  MSDPPTTSSEPPHQ--------HQHQHQHH------QQLLHLPLIHGGA----TTSTTRSINAAAREYRKGNWTLQETMILIAAKKLDDERRSKANLAPP
        MSDPPTTSSEPPHQ        H H H HH      QQLLHLPLIHGGA    T + T S     REYRKGNWTLQETMILI AKKLDDERR+K  LAPP
Subjt:  MSDPPTTSSEPPHQ--------HQHQHQHH------QQLLHLPLIHGGA----TTSTTRSINAAAREYRKGNWTLQETMILIAAKKLDDERRSKANLAPP

Query:  -DPAARKGGELRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKY
         DP ARKGGELRWKWVENYCWS GCHRSQNQCNDKWDNLLRDYKKVREY+SRAC    +Q S  PSYWKMEKHERKDNNLPSNM FEVYQALNDVVQRK+
Subjt:  -DPAARKGGELRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKY

Query:  SN----SHRSGATAAAVLPSP--SSA-----PPPPLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQ
        S     SH +     A+ P+P  SSA     PPPP   T +SPA SESSS GTESSEK+E  E KRRKM D   +IERSA+ LAQ LRSCEEQREIRHQ+
Subjt:  SN----SHRSGATAAAVLPSP--SSA-----PPPPLPPTTTSPAASESSS-GTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQ

Query:  LMELQKRRLHIEETRNHLHRQGIADLVAAVANLSGK-NNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMI
        +ME+QKR L IEE RNH+HRQGI+D+VAA+ANLS + ++R  R SEGY    C Y+GEEVR+LK+QNEAMQAE+M VK ELSQLRDQMPSLMQTMMHNM+
Subjt:  LMELQKRRLHIEETRNHLHRQGIADLVAAVANLSGK-NNRSSRSSEGYGSSSCLYSGEEVRVLKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMI

Query:  HNIPPPPNP
        HNIPPPP P
Subjt:  HNIPPPPNP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G31310.1 hydroxyproline-rich glycoprotein family protein5.4e-4837.15Show/hide
Query:  REYRKGNWTLQETMILIAAKKLDDERRSKANLAPPDP------AARKGGELRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSR--------
        REYRKGNWTL ETM+LI AK++DDERR + ++  P P       + K  ELRWKW+E+YCW +GC RSQNQCNDKWDNL+RDYKKVREY+ R        
Subjt:  REYRKGNWTLQETMILIAAKKLDDERRSKANLAPPDP------AARKGGELRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSR--------

Query:  -ACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHRS--------GATAAAV------------------------------
           +S+S     + SYWKMEK ERK+ +LPSNM  + YQAL +VV+ K   S  +         A AAA+                              
Subjt:  -ACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHRS--------GATAAAV------------------------------

Query:  --------------------LPSPSSAPPPPLP--------PTTTSPAASESS--SGTESSEKRESMET-------------------KRRKMGDIGSSI
                            LP P   PPPP P        PT  S   S++S  S T  +++R +M T                   KR +   + +++
Subjt:  --------------------LPSPSSAPPPPLP--------PTTTSPAASESS--SGTESSEKRESMET-------------------KRRKMGDIGSSI

Query:  ERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETRNHLHRQGIADLVAAVANLS
         RS S +A A+R  EE+++ RH+++M +Q+RRL IEE+   ++R+G+  LV A+  L+
Subjt:  ERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETRNHLHRQGIADLVAAVANLS

AT2G33550.1 Homeodomain-like superfamily protein1.3e-0424.01Show/hide
Query:  LHLPLIHGGATTSTTRSINA--AAREYRKGNWTLQETMILIAAKKLDDERRSKANLAPPDPAARKGGELRWKW--VENYCWSQGCHRSQNQCNDKWDNLL
        L +  + GG  +S   +       +  R   WT QE ++LI  K++ + R  +   A     A   G++  KW  V +YC   G +R   QC  +W NL 
Subjt:  LHLPLIHGGATTSTTRSINA--AAREYRKGNWTLQETMILIAAKKLDDERRSKANLAPPDPAARKGGELRWKW--VENYCWSQGCHRSQNQCNDKWDNLL

Query:  RDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHRSGATAAAVLPSPSSAPPPPLPPTTTSPAASESSSG
         DYKK++E++S+           + SYW M    R++  LP     EVY  ++  V          G   A+     S       P    S   ++S + 
Subjt:  RDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHRSGATAAAVLPSPSSAPPPPLPPTTTSPAASESSSG

Query:  TESSEKRESMETKRRKMGD---IGSSIERSASALAQALRS----CEEQREIRHQQLMELQKRRLHIEETRNHLHRQGIADLVAA---VANLSGKNNRSSR
            EK+E+    + ++ +     +++E  +++  +  R      E++ E    +  ++Q + + I      L R G   L+AA   V NL+ K +R  R
Subjt:  TESSEKRESMETKRRKMGD---IGSSIERSASALAQALRS----CEEQREIRHQQLMELQKRRLHIEETRNHLHRQGIADLVAA---VANLSGKNNRSSR

Query:  SSEG
           G
Subjt:  SSEG

AT2G35640.1 Homeodomain-like superfamily protein9.2e-4838.89Show/hide
Query:  REYRKGNWTLQETMILIAAKKLDDERRSKANLAPPDPAARKGGELRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPS
        RE RKGNWT+ ET++LI AKK+DD+RR + +   P+    K  ELRWKW+E YCW +GC+R+QNQCNDKWDNL+RDYKK+REY+ R+   +S     S S
Subjt:  REYRKGNWTLQETMILIAAKKLDDERRSKANLAPPDPAARKGGELRWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPS

Query:  YWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHRSGATAAA-----------------------------------VLPSPS------SAPPPPL
        YWKM+K ERK+ NLPSNM  ++Y  L+++V RK   S  S A A                                      LP P       S P PP 
Subjt:  YWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHRSGATAAA-----------------------------------VLPSPS------SAPPPPL

Query:  PPTTTSPAAS--ESSSGTESSEKRESMETKRRKMGD-------IGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETRNHLHRQGIADLVA
        PP ++S  A     + GT S+++R +   +    G+       +G ++ R  S + Q +R  EE +E RH++++ LQ+RRL IEE++  ++RQG+  LV 
Subjt:  PPTTTSPAAS--ESSSGTESSEKRESMETKRRKMGD-------IGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETRNHLHRQGIADLVA

Query:  AVANLSGKNNRSSRSSEGYGSSSC
        A+       N+ + S     SSSC
Subjt:  AVANLSGKNNRSSRSSEGYGSSSC

AT4G31270.1 sequence-specific DNA binding transcription factors9.3e-0830.59Show/hide
Query:  RWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVV
        +W  +   C +    R+ NQC  KWD+L+ DY ++++++       S+      SYW +   +RK  NLP ++  E+++A+N VV
Subjt:  RWKWVENYCWSQGCHRSQNQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGATCCTCCGACGACATCATCGGAGCCGCCGCACCAGCACCAGCACCAGCATCAGCACCACCAGCAACTCCTACATTTGCCCCTAATCCACGGCGGCGCCACCAC
CTCCACCACCAGATCAATCAATGCCGCCGCCCGTGAGTACCGCAAGGGCAACTGGACTCTCCAGGAGACCATGATTTTAATAGCCGCGAAGAAGCTTGACGACGAGCGGC
GGAGCAAGGCGAACCTAGCGCCACCGGACCCGGCGGCGCGTAAGGGCGGGGAGCTCCGGTGGAAGTGGGTGGAGAACTACTGCTGGAGCCAAGGTTGCCACCGGAGCCAG
AACCAGTGCAACGACAAGTGGGACAATCTCCTCCGCGACTACAAAAAAGTCCGCGAGTACGACTCCCGCGCGTGCGCCTCCGCCTCCGAACAACCCTCCCCCTCCCCCTC
CTACTGGAAAATGGAGAAGCACGAGCGAAAAGACAATAATCTCCCTTCCAACATGCCCTTCGAAGTCTACCAAGCGCTCAACGACGTCGTTCAGAGGAAGTACTCCAACT
CCCACAGATCAGGCGCCACCGCCGCCGCCGTTCTCCCGTCTCCGTCCTCCGCCCCGCCGCCGCCTCTCCCGCCGACAACCACTTCTCCGGCTGCCTCAGAATCATCATCT
GGGACGGAGTCGAGCGAGAAGAGAGAGAGCATGGAAACGAAGAGGAGAAAAATGGGAGACATAGGGTCAAGCATAGAGAGGAGCGCCTCGGCGCTGGCTCAAGCGCTGCG
GAGCTGCGAGGAGCAGAGGGAGATTCGACACCAACAGCTAATGGAGCTTCAAAAACGGAGGCTTCACATTGAAGAAACCCGCAACCACCTCCACCGCCAAGGAATCGCGG
ATCTCGTCGCCGCCGTCGCCAACCTCTCCGGGAAAAACAATAGAAGTAGCAGATCATCAGAAGGGTATGGATCATCATCATGTTTATACAGTGGAGAAGAGGTGAGGGTT
TTGAAAGAGCAAAATGAGGCAATGCAAGCTGAGCTGATGGGTGTCAAGGCTGAGCTCTCCCAACTTAGGGACCAAATGCCCTCTCTCATGCAGACCATGATGCACAATAT
GATCCACAACATCCCTCCTCCTCCCAATCCTCATTCTTCCATGGTACTCTCTCTCTCTTCTCTCTCTCTCATGCATCATATCGAGCATATATGCACGTACCGACTGTTTT
GGGGTTGGAGATTCAAACCCACGACCTTTTAG
mRNA sequenceShow/hide mRNA sequence
AAATAATTAAGAAAAAAAATAATAAGACGGAGAGAGAGGGAGAGAGAAAGAAAGAGACTTGTTCACACACTCTCTCTTTGTGTTCTTCTTTATCCATCTGTGAGTCTCTG
AGATGCCAAAGCTATGTCGGATCCTCCGACGACATCATCGGAGCCGCCGCACCAGCACCAGCACCAGCATCAGCACCACCAGCAACTCCTACATTTGCCCCTAATCCACG
GCGGCGCCACCACCTCCACCACCAGATCAATCAATGCCGCCGCCCGTGAGTACCGCAAGGGCAACTGGACTCTCCAGGAGACCATGATTTTAATAGCCGCGAAGAAGCTT
GACGACGAGCGGCGGAGCAAGGCGAACCTAGCGCCACCGGACCCGGCGGCGCGTAAGGGCGGGGAGCTCCGGTGGAAGTGGGTGGAGAACTACTGCTGGAGCCAAGGTTG
CCACCGGAGCCAGAACCAGTGCAACGACAAGTGGGACAATCTCCTCCGCGACTACAAAAAAGTCCGCGAGTACGACTCCCGCGCGTGCGCCTCCGCCTCCGAACAACCCT
CCCCCTCCCCCTCCTACTGGAAAATGGAGAAGCACGAGCGAAAAGACAATAATCTCCCTTCCAACATGCCCTTCGAAGTCTACCAAGCGCTCAACGACGTCGTTCAGAGG
AAGTACTCCAACTCCCACAGATCAGGCGCCACCGCCGCCGCCGTTCTCCCGTCTCCGTCCTCCGCCCCGCCGCCGCCTCTCCCGCCGACAACCACTTCTCCGGCTGCCTC
AGAATCATCATCTGGGACGGAGTCGAGCGAGAAGAGAGAGAGCATGGAAACGAAGAGGAGAAAAATGGGAGACATAGGGTCAAGCATAGAGAGGAGCGCCTCGGCGCTGG
CTCAAGCGCTGCGGAGCTGCGAGGAGCAGAGGGAGATTCGACACCAACAGCTAATGGAGCTTCAAAAACGGAGGCTTCACATTGAAGAAACCCGCAACCACCTCCACCGC
CAAGGAATCGCGGATCTCGTCGCCGCCGTCGCCAACCTCTCCGGGAAAAACAATAGAAGTAGCAGATCATCAGAAGGGTATGGATCATCATCATGTTTATACAGTGGAGA
AGAGGTGAGGGTTTTGAAAGAGCAAAATGAGGCAATGCAAGCTGAGCTGATGGGTGTCAAGGCTGAGCTCTCCCAACTTAGGGACCAAATGCCCTCTCTCATGCAGACCA
TGATGCACAATATGATCCACAACATCCCTCCTCCTCCCAATCCTCATTCTTCCATGGTACTCTCTCTCTCTTCTCTCTCTCTCATGCATCATATCGAGCATATATGCACG
TACCGACTGTTTTGGGGTTGGAGATTCAAACCCACGACCTTTTAGTCGAGGATATGTACCTTAACTAGTTGAGTTATGTTGTGTTCATGTTGGTCAATCTGGTTCATTTT
AGTCTTTACTAAAGACTACAATTGACGACGTTGATTGTGGTTTGTGTAGTATATATGAGTAGAAAATCGTGGACA
Protein sequenceShow/hide protein sequence
MSDPPTTSSEPPHQHQHQHQHHQQLLHLPLIHGGATTSTTRSINAAAREYRKGNWTLQETMILIAAKKLDDERRSKANLAPPDPAARKGGELRWKWVENYCWSQGCHRSQ
NQCNDKWDNLLRDYKKVREYDSRACASASEQPSPSPSYWKMEKHERKDNNLPSNMPFEVYQALNDVVQRKYSNSHRSGATAAAVLPSPSSAPPPPLPPTTTSPAASESSS
GTESSEKRESMETKRRKMGDIGSSIERSASALAQALRSCEEQREIRHQQLMELQKRRLHIEETRNHLHRQGIADLVAAVANLSGKNNRSSRSSEGYGSSSCLYSGEEVRV
LKEQNEAMQAELMGVKAELSQLRDQMPSLMQTMMHNMIHNIPPPPNPHSSMVLSLSSLSLMHHIEHICTYRLFWGWRFKPTTF