CuGenDBv2

Lag0029255 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0029255
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: GRAS domain-containing protein
Genome location: chr8:36950841..36952424
RNA-Seq Expression: Lag0029255
Synteny: Lag0029255
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0003700 - DNA-binding transcription factor activity (molecular function)
  GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
  IPR005202 - Transcription factor GRAS
  IPR030028 - Scarecrow-like protein 26/nodulation signalling pathway 2
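The genome location field uses a `seqid:start..end` notation. As a minimal sketch of how such a string could be split into its parts and turned into a span length, the snippet below assumes the coordinates are 1-based and inclusive (a common convention for this notation, but not stated on this page). The names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count toward the length.
        return self.end - self.start + 1


# Matches strings such as "chr8:36950841..36952424".
_LOCATION = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse 'seqid:start..end' into a Location (hypothetical helper)."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        # Normalise so that start <= end regardless of input order.
        start, end = end, start
    return Location(match["seqid"], start, end)


loc = parse_location("chr8:36950841..36952424")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene span is `end - start + 1` bases; with half-open coordinates the `+ 1` would be dropped.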


Homology
GenBank top hits | e value | % identity
KAG6578977.1 Protein NODULATION SIGNALING PATHWAY 2, partial [Cucurbita argyrosperma subsp. sororia] | 2.6e-214 | 76.62
Query:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA
        MA+A+DN F  +A+SDYSTSTNNSDD                         DFHDLFDS+M++DA PY P+  E+D G+NCNS S+PAE +EE       
Subjt:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA

Query:  EDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDH
           EEEE++ LRLYHLLMA ADA+FGDHKSR+LA VIL+RLNELVS SHGTNLERLTAYYAQAFQ LLDCAAVS   GGG NKP        HHLHRDDH
Subjt:  EDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDH

Query:  SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRL
        SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDY+IMEG QWASLMQA VSRKD PPAPHLRITAISR   G + RR I TVQETGRRL
Subjt:  SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRL

Query:  VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYS
        VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNC+LHLPHF   +PESIASFL+GAKTLNPRLVTLVEEEI HGPT+DGDYKVQFLDSLERYS
Subjt:  VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYS

Query:  AIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQ-------ANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWK
        AIYDSL AG P KNRAR LVE+VFLGPRISATL RIGQPQ        NC WGERLEK+G KAAAISFANHCQARLL+DLFNDGYRVEELG+NKLVLGWK
Subjt:  AIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQ-------ANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWK

Query:  SKRLLSTSIWVSSSCSSSLCSSSDSE
        SKRLLS SIW SSS SSS  SSSDSE
Subjt:  SKRLLSTSIWVSSSCSSSLCSSSDSE

KAG7016501.1 Nodulation-signaling pathway 2 protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | 4.4e-214 | 76.43
Query:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA
        MA+A+DN F  +A+SDYSTSTNNSDD                         DFHDLFDS+M++DA PY P+  E+D G+NCNS S+PAE +EE       
Subjt:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA

Query:  EDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDH
           EEEE++ LRLYHLLMA ADA+FGDHKSR+LA VIL+RLNELVS SHGTNLER+TAYYAQAFQ LLDCAAVS   GGG NKP        HHLHRDDH
Subjt:  EDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDH

Query:  SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRL
        SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDY+IMEG QWASLMQA VSRKD PPAPHLRITAISR   G + RR I TVQETGRRL
Subjt:  SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRL

Query:  VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYS
        VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNC+LHLPHF   +PESIASFL+GAKTLNPRLVTLVEEEI HGPT+DGDYKVQFLDSLERYS
Subjt:  VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYS

Query:  AIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQ-------ANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWK
        AIYDSL AG P KNRAR LVE+VFLGPRISATL RIGQPQ        NC WGERLEK+G KAAAISFANHCQARLL+DLFNDGYRVEELG+NKLVLGWK
Subjt:  AIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQ-------ANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWK

Query:  SKRLLSTSIWVSSSCSSSLCSSSDSE
        SKRLLS SIW SSS SSS  SSSDSE
Subjt:  SKRLLSTSIWVSSSCSSSLCSSSDSE

XP_008445682.1 PREDICTED: nodulation-signaling pathway 2 protein [Cucumis melo] | 1.5e-214 | 76.09
Query:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA
        MALAI+NPF D+ALS+YSTSTNNSDD  HLAGNWNY SP+VDWE F GTH DF D+FDS + ++ PP+                                
Subjt:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA

Query:  EDR-EEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDD
        EDR EE+E+KGLRLYHLL+AAADA+FGDH+S DLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQ LLD A V       SNK HHH+HH H   HRDD
Subjt:  EDR-EEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDD

Query:  HSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRR
        H+PTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVA+DRRVHIVDYDIMEG QWASLMQAFVS    P APHLRITAISRG  G   RRSI TVQETGRR
Subjt:  HSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRR

Query:  LVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERY
        LVAFAASIGQPFSFHQCKLDSDESFRPSGLKLV+GEALVVNCMLHLPHF YR+PESIASFLSGAK+L+PR+VTLVEEEIGHGPT+DGDYKVQFLDSLERY
Subjt:  LVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERY

Query:  SAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQ-------PQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGW
        SAIYDSL A  P KNRARALVERVFLGPRISATL RIGQ        + NC WGE+LEK+G K   ISFANHCQARLLL LFNDGYRVEELGNNKLVLGW
Subjt:  SAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQ-------PQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGW

Query:  KSKRLLSTSIWVSSSCSSSLCSSSDSE
        KSKRLLS SIW SS+ SSS  S SDSE
Subjt:  KSKRLLSTSIWVSSSCSSSLCSSSDSE

XP_023551167.1 nodulation-signaling pathway 2 protein isoform X1 [Cucurbita pepo subsp. pepo] | 5.2e-215 | 77.19
Query:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA
        MA+A+DN F  +A+SDYSTSTNNSDD                         DFHDLFDS+M++DA PY P+  EVD G+NCNS S+PAED+EE       
Subjt:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA

Query:  EDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDH
           EEEE+K LRLYHLLMA ADA+FGDHKSRDLA VIL+RLNELVS SHGTNLERLTAYYAQ+FQ LLD AAVS   GGGSNKP        HHLHRDDH
Subjt:  EDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDH

Query:  SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRL
        SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDY+IMEG QWASLMQA VSRKD PPAPHLRITAISR   G + RR I TVQETGRRL
Subjt:  SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRL

Query:  VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYS
        VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNC+LHLPHF   +PESI SFL+GAKTLNPRLVTLVEEEI HGPT+DGDYKVQFLDSLERYS
Subjt:  VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYS

Query:  AIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQ-------ANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWK
        AIYDSL AG P KNRAR LVE+VFLGPRISATL RIGQPQ        NCSWGERLEK+G KAAAISFANHCQARLL+DLFNDGYRVEELG+NKLVLGWK
Subjt:  AIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQ-------ANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWK

Query:  SKRLLSTSIWVSSSCSSSLCSSSDSE
        SKRLLS SIW SSS SSS  SSSDSE
Subjt:  SKRLLSTSIWVSSSCSSSLCSSSDSE

XP_038884439.1 protein NODULATION SIGNALING PATHWAY 2 [Benincasa hispida] | 8.6e-218 | 76.17
Query:  IQTVNSPMALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEE
        I ++NSPMALA+DN F ++ LS+YSTSTNNSDD  HLAGNWNY SP+VDWE FP TH DFHDL DSM+ +D PP                  SP ED+E 
Subjt:  IQTVNSPMALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEE

Query:  EEEEAKAEDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHH
                  EEE++KGLRLYHLL+AAADA+FGDHKSRDLAHVILVRLNELVSPSHGTNL+RLTAYYAQAFQ LLD   V       SNK HHH    +H
Subjt:  EEEEAKAEDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHH

Query:  HLHRDDHSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATV
        HLHRDDH+PTDVLAAFQLLQEMSPYVKF HFTANQAILEAVA+DRRVHIVDYDIMEG QWASLMQAFVS    P APHLRITAISRG  G   RRSI TV
Subjt:  HLHRDDHSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATV

Query:  QETGRRLVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFL
        QETGRRLVAFAASIGQPFSFHQCKLDSDESFRPSGLKLV+GEALVVNC+LHLPHF YR+PESI SFLSG K+LNPR+VTLVEEEIGHGPT+D DYKVQFL
Subjt:  QETGRRLVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFL

Query:  DSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQ-------PQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNN
        DSLERYSAIYDSL A  P KNRARALVERVFLGPRISATL RIGQ        + NC WGE+LEK+G K A ISFANHCQARLLL LFNDGYRVEELGNN
Subjt:  DSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQ-------PQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNN

Query:  KLVLGWKSKRLLSTSIWVSSSCSSSLCSSSDSE
        KLVLGWKSKRLLS SIWVSSS SSS  S SDSE
Subjt:  KLVLGWKSKRLLSTSIWVSSSCSSSLCSSSDSE
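In these pairwise alignments, the unlabeled middle line marks each column: a letter means the query and subject residues are identical, `+` means a conservative substitution, and a space means a mismatch or gap. The sketch below shows how identities and positives could be tallied for one alignment block from the query line and the middle line. The function name `midline_stats` is hypothetical, and the inputs are assumed to have had the 8-character label column (`Query:  ` or the equivalent blanks) removed first. It covers a single block, whereas the % identity reported for a hit is computed over the whole alignment.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    Hypothetical helper: `query` and `midline` are the sequence portions
    of the 'Query:' line and the line directly beneath it.
    """
    width = len(query)
    # Trailing blanks of the midline are often lost in extraction,
    # so pad (or trim) it to the width of the query line.
    padded = midline.ljust(width)[:width]
    identities = sum(1 for ch in padded if ch.isalpha())
    # Positives include identities plus conservative substitutions ('+').
    positives = identities + padded.count("+")
    return identities, positives, width


identities, positives, columns = midline_stats(
    "KLVLGWKSKRLLSTSIWVSSSCSSSLCSSSDSE",
    "KLVLGWKSKRLLS SIWVSSS SSS  S SDSE",
)
print(f"{identities}/{columns} identical ({100 * identities / columns:.2f}%)")
```

Summing the identities and columns over every block of a hit, then dividing, gives a figure comparable to the % identity column.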

TrEMBL top hits | e value | % identity
A0A0A0KDX4 GRAS domain-containing protein | 8.4e-211 | 75.71
Query:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA
        MALAI+NP  D+ALS+YSTSTNNSDD  HLAGNWNY SP+VDWE F GTH DF D+FDS + ++ PP+                                
Subjt:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA

Query:  EDR-EEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDD
        EDR EE+E+KGLRLYHLL AAADA+ GDHKS DLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQ LLD A V       +NK HHH+HH H    RDD
Subjt:  EDR-EEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDD

Query:  HSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRR
        H+PTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVA+DRRVHIVDYDIMEG QWASLMQAFVS    P APHLRITAISRG  G   RRSI TVQETGRR
Subjt:  HSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRR

Query:  LVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERY
        LVAFAASIGQPFSFHQCKLDSDESFRPSGLKLV+GEALVVNCMLHLPHF YR+PESIASFLSGAK+L+PR+VTLVEEEIGHGPT+DGDYKVQFLDSLERY
Subjt:  LVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERY

Query:  SAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQ-------PQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGW
        SAIYDSL A  P KNRARALVERVFLGPRISATL RIGQ        + NC WGE+LEK+G K   ISFANHCQARLLL LFNDGYRVEELGNNKLVLGW
Subjt:  SAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQ-------PQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGW

Query:  KSKRLLSTSIWVSSSCSSSLCSSSDSE
        KSKRLLS SIW SSS SSSL   SDSE
Subjt:  KSKRLLSTSIWVSSSCSSSLCSSSDSE

A0A1S3BE54 nodulation-signaling pathway 2 protein | 7.3e-215 | 76.09
Query:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA
        MALAI+NPF D+ALS+YSTSTNNSDD  HLAGNWNY SP+VDWE F GTH DF D+FDS + ++ PP+                                
Subjt:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA

Query:  EDR-EEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDD
        EDR EE+E+KGLRLYHLL+AAADA+FGDH+S DLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQ LLD A V       SNK HHH+HH H   HRDD
Subjt:  EDR-EEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDD

Query:  HSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRR
        H+PTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVA+DRRVHIVDYDIMEG QWASLMQAFVS    P APHLRITAISRG  G   RRSI TVQETGRR
Subjt:  HSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRR

Query:  LVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERY
        LVAFAASIGQPFSFHQCKLDSDESFRPSGLKLV+GEALVVNCMLHLPHF YR+PESIASFLSGAK+L+PR+VTLVEEEIGHGPT+DGDYKVQFLDSLERY
Subjt:  LVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERY

Query:  SAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQ-------PQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGW
        SAIYDSL A  P KNRARALVERVFLGPRISATL RIGQ        + NC WGE+LEK+G K   ISFANHCQARLLL LFNDGYRVEELGNNKLVLGW
Subjt:  SAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQ-------PQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGW

Query:  KSKRLLSTSIWVSSSCSSSLCSSSDSE
        KSKRLLS SIW SS+ SSS  S SDSE
Subjt:  KSKRLLSTSIWVSSSCSSSLCSSSDSE

A0A5A7V767 Nodulation-signaling pathway 2 protein | 7.3e-215 | 76.09
Query:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA
        MALAI+NPF D+ALS+YSTSTNNSDD  HLAGNWNY SP+VDWE F GTH DF D+FDS + ++ PP+                                
Subjt:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA

Query:  EDR-EEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDD
        EDR EE+E+KGLRLYHLL+AAADA+FGDH+S DLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQ LLD A V       SNK HHH+HH H   HRDD
Subjt:  EDR-EEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDD

Query:  HSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRR
        H+PTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVA+DRRVHIVDYDIMEG QWASLMQAFVS    P APHLRITAISRG  G   RRSI TVQETGRR
Subjt:  HSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRR

Query:  LVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERY
        LVAFAASIGQPFSFHQCKLDSDESFRPSGLKLV+GEALVVNCMLHLPHF YR+PESIASFLSGAK+L+PR+VTLVEEEIGHGPT+DGDYKVQFLDSLERY
Subjt:  LVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERY

Query:  SAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQ-------PQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGW
        SAIYDSL A  P KNRARALVERVFLGPRISATL RIGQ        + NC WGE+LEK+G K   ISFANHCQARLLL LFNDGYRVEELGNNKLVLGW
Subjt:  SAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQ-------PQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGW

Query:  KSKRLLSTSIWVSSSCSSSLCSSSDSE
        KSKRLLS SIW SS+ SSS  S SDSE
Subjt:  KSKRLLSTSIWVSSSCSSSLCSSSDSE

A0A6J1FK48 nodulation-signaling pathway 2 protein | 6.8e-213 | 76.05
Query:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA
        MA+A+DN F  +A+SDYSTSTNNS+D                         DFHDLFDS+M++DA PY P+  E+D G+NCNS S+PAE +EE       
Subjt:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA

Query:  EDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDH
           EEEE++ LRLYHLLMA ADA+FGDHKSR+LA VIL+RLNELVS SHGTNLER+TAYYAQAFQ LLDCAAVS   GGG NKP        HHLHRDDH
Subjt:  EDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDH

Query:  SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRL
        SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDY+IMEG QWASLMQA VSRKD PPAPHLRITAISR   G + RR I TVQETGRRL
Subjt:  SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRL

Query:  VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYS
        VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNC+LHLPHF   +PESIASFL+GAKTLNPRLVTLVEEEI HGPT+DGDYKVQFLDSLERYS
Subjt:  VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYS

Query:  AIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQ-------ANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWK
        AIYDSL AG P KNRAR LVE+VFLGPRISATL RI QPQ        NC WGERLEK+G KAAAISFANHCQARLL+DLFNDGYRVEELG+NKLVLGWK
Subjt:  AIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQ-------ANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWK

Query:  SKRLLSTSIWVSSSCSSSLCSSSDSE
        SKRLLS SIW SSS SSS  SSSDSE
Subjt:  SKRLLSTSIWVSSSCSSSLCSSSDSE

A0A6J1JUC4 nodulation-signaling pathway 2 protein | 8.1e-214 | 76.43
Query:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA
        MA+A+DN F  +A+S+YSTSTNNSDD                         DFHDLFDS+M++DA PY P+  EVD G+NCNS S+PAED+EE       
Subjt:  MALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKA

Query:  EDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDH
           EEEE+K LRLYHLLMA ADA+FGD+KSR+LA VIL+RLNELVS SHGTNLERLTAYYAQAFQ LLDCAAVSGG     NKP        HHLHRDDH
Subjt:  EDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDH

Query:  SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRL
        SPTDVLAAFQLLQ+MSPYVKFGHFTANQAILEAVADDRRVHIVDY+IMEG QWASLMQAFVSRKD PPAPHLRIT ISR   G + RR I TVQETGRRL
Subjt:  SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRL

Query:  VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYS
        VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNC+LHLPHF   +PESIASFL+GAKTLNPRLVTLVEEEI HGPT+DGDYKVQFLDSLERYS
Subjt:  VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYS

Query:  AIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQ-------ANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWK
        AIYDSL AG P KNRAR LVE+VFLGPRISATL RIGQPQ        NC WGERLEK+G KAAAISFANHCQARLL+DLFNDGYRVEELG+NKLVLGWK
Subjt:  AIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQ-------ANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWK

Query:  SKRLLSTSIWVSSSCSSSLCSSSDSE
        SKRLLS SIW SSS SSS  SSSDSE
Subjt:  SKRLLSTSIWVSSSCSSSLCSSSDSE

SwissProt top hits | e value | % identity
A2ZHL0 Protein SCARECROW 2 | 2.5e-50 | 33.56
Query:  PYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKAEDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQG
        P PP A E  + +   + ++ A  +E +EE+     R++ + +GL L  LL+  A+++  D  + D AH  L+ + EL +P  GT+ +R+ AY+A+A   
Subjt:  PYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKAEDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQG

Query:  LLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDHSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDG
         L  + +   A   S  P     H              V AAFQ+   +SP+VKF HFTANQAI EA   + RVHI+D DIM+G QW  L     SR  G
Subjt:  LLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDHSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDG

Query:  PPAPHLRITAISRGGGGATCRRSIATVQETGRRLVAFAASIGQPFSFHQCKL-DSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKT
        P  P +R+T +           S+  ++ TG+RL  FA ++G PF F  C + D   +  P  L + R EA+ V+    L H +Y    S ++ L   + 
Subjt:  PPAPHLRITAISRGGGGATCRRSIATVQETGRRLVAFAASIGQPFSFHQCKL-DSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKT

Query:  LNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQAN-----CSWGERLEKLGFKAAAIS
        L P++VT+VE+++ H     G +  +F++++  YSA++DSL A + + +  R +VE+  L   I   L  +G P         SW E+L + GF+ ++++
Subjt:  LNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQAN-----CSWGERLEKLGFKAAAIS

Query:  FANHCQARLLLDLF-NDGYRVEELGNNKLVLGWKSKRLLSTSIW
         +   QA LLL +F +DGY + E  N  L LGWK   LL+ S W
Subjt:  FANHCQARLLLDLF-NDGYRVEELGNNKLVLGWKSKRLLSTSIW

Q2PEG7 Protein NODULATION SIGNALING PATHWAY 2 | 3.8e-144 | 56.7
Query:  LALSDYSTSTNN-SDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEA------------
        L  S +ST TN  S D  +   +WN+ SPVV+W+ F G   DFH L DSM+            + ++G   +  ++    +EEEEEEA            
Subjt:  LALSDYSTSTNN-SDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEA------------

Query:  -KAEDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHR
            +  ++++KGLRL HLLMA A+AL G +K+R+LA VILVRL ELVS + GTN+ERL AY+ +A QGLL+      GAGG  N    HH     H   
Subjt:  -KAEDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHR

Query:  DDHSP-TDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQET
          H P  D LAAFQLLQ+MSPYVKFGHFTANQAI+EAVA +RRVHIVDYDIMEG QWASLMQA  S  +G   PHLRITA+SR G G   RRS+ATVQET
Subjt:  DDHSP-TDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQET

Query:  GRRLVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSL
        GRRL AFA S+GQPFSFH  +L+SDE+FRP+GLKLVRGEALV NCML+LPH  YR+P S+ASFL+ AK L PRLVT+VEEE+G   +  G +  +F+DSL
Subjt:  GRRLVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSL

Query:  ERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRI----GQPQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGW
          +SA++DSL AGFP + RARALVERVFLGPRI  +L RI    G  +   SW E L   GF   A+S ANHCQ+ LLL LFNDGYRVEELG+NKLVL W
Subjt:  ERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRI----GQPQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGW

Query:  KSKRLLSTSIWVSSS
        K++RLLS S+W  SS
Subjt:  KSKRLLSTSIWVSSS

Q5NE24 Protein NODULATION SIGNALING PATHWAY 2 | 1.4e-146 | 55.53
Query:  DLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEE--------AKAED
        DL  S +S+ TN           WN+ SP+V+W+ F G   DFH L D+++          +  + +     + ++   D+EEEE E        A    
Subjt:  DLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEE--------AKAED

Query:  REEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSP-SHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDH-
           ++ KGL+L HLLMA A+AL G  K+RDLA VIL+RL ELVS  ++G+N+ERL A++ +A  GLL+      GAGG  N  HHH+++ H+      H 
Subjt:  REEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSP-SHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDH-

Query:  SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRL
        +  D LAAFQLLQ+MSPYVKFGHFTANQAI+EAVA +RRVH++DYDIMEG QWASL+Q+  S  +G   PHLRITA+SR G G   RRSIATVQETGRRL
Subjt:  SPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRL

Query:  VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYS
         +FAAS+GQPFSFH C+LDSDE+FRPS LKLVRGEALV NCML+LPH  YR PES+ASFL+GAKTLNP+LVTLVEEE+G   ++ G +  +F+DSL  YS
Subjt:  VAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYS

Query:  AIYDSLGAGFPKKNRARALVERVFLGPRISATLTRI---GQPQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEE--LGNNKLVLGWKSK
        A++DSL AGFP +NRAR LVERVF GPRI+ +L RI   G  +   SWGE L ++GF+   +SFANHCQA+LLL LFNDGYRVEE  +G+NKLVL WKS+
Subjt:  AIYDSLGAGFPKKNRARALVERVFLGPRISATLTRI---GQPQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEE--LGNNKLVLGWKSK

Query:  RLLSTSIWVSSSCSSSLCSSSDSE
        RLLS S+W         CSSSDS+
Subjt:  RLLSTSIWVSSSCSSSLCSSSDSE

Q84Q92 Protein NODULATION SIGNALING PATHWAY 2 | 2.5e-119 | 48.34
Query:  LAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPF-----PGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNC---------------N
        +A D  F+    +  ++S ++ DD   +   W  LSPV DW  F      G H D H L +SM+  D          VD                     
Subjt:  LAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPF-----PGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNC---------------N

Query:  SGSSPAEDQEEEEEEAKAEDREEE-EYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVS-----PSHGTNLERLTAYYAQAFQGLLDCAAVSGG
        +GS+P+            +D  +    KGLRL HLLMAAA+AL G HKSR+LA VILVRL E+VS      +  +N+ERL A++  A QGLLD +   GG
Subjt:  SGSSPAEDQEEEEEEAKAEDREEE-EYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVS-----PSHGTNLERLTAYYAQAFQGLLDCAAVSGG

Query:  AGGGSNKPHHHHHHSHHHLHRDDHSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITA
        +G  +     HHH              DVL AFQ+LQ+MSPY+KFGHFTANQAILEAV+ DRRVHIVDYDI EG QWASLMQA  SR DG PAPHLRITA
Subjt:  AGGGSNKPHHHHHHSHHHLHRDDHSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITA

Query:  ISRGGGGATCRRSIATVQETGRRLVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLH---LPHFIYRTPESIASFLSGAKTLNPRLVTL
        +SR GGG         VQE GRRL AFAASIGQPFSF QC+LDSDE FRP+ +++V+GEALV NC+LH       I R   S+ASFLSG   L  +LVT+
Subjt:  ISRGGGGATCRRSIATVQETGRRLVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLH---LPHFIYRTPESIASFLSGAKTLNPRLVTL

Query:  VEEEIGHGPTMDGD---------YKVQFLDSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQ---PQANCSWGERLEKLGFKAAAISF
        VEEE       DGD         +  QF++ L RYSA++DSL AGFP ++R R LVERV L P I+  ++R  +    +  C WG+ +   GF A  +S 
Subjt:  VEEEIGHGPTMDGD---------YKVQFLDSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQ---PQANCSWGERLEKLGFKAAAISF

Query:  ANHCQARLLLDLFNDGYRVEELGNNKLVLGWKSKRLLSTSIW
         NH QARLLL LFNDGY VEE G NK+VLGWK++RL+S S+W
Subjt:  ANHCQARLLLDLFNDGYRVEELGNNKLVLGWKSKRLLSTSIW

Q9SUF5 Scarecrow-like protein 26 | 8.8e-117 | 49.41
Query:  PFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPV-VDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEV----DSGSNCNSGSSPAEDQEEEEEEAKAED
        P+ D     +ST T+    A   + N   L+ + +DW+       DF D+ +S+M  +     P +  V    D    CNS S+         +     +
Subjt:  PFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPV-VDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEV----DSGSNCNSGSSPAEDQEEEEEEAKAED

Query:  REEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDD-HS
         + +E KGLRL HLL+AAADA  G +KSR+L  VIL RL +LVSP   TN+ERL A++      LL+  +V                      HRDD + 
Subjt:  REEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDD-HS

Query:  PTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRLV
          DV++AF+LLQ MSPYV FG+ TA QAILEAV  +RR+HIVDYDI EG QWASLMQA VSR  GP A HLRITA+SR   G   ++S+A VQETGRRL 
Subjt:  PTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRLV

Query:  AFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYSA
        AFA SIGQPFS+  CKLD++ +F  S LKLVRGEA+V+NCMLHLP F ++TP S+ SFLS AKTLNP+LVTLV EE+G        Y+  F+D L ++SA
Subjt:  AFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYSA

Query:  IYDSLGAGFPKKNRARALVERVFLGPRISATLTRI----GQPQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWKSKRL
        I+DSL AG    N AR  VERVF+GP ++  LTRI     + ++  SW + LE  GFK   +SF N CQA+LLL LFNDG+RVEELG N LVLGWKS+RL
Subjt:  IYDSLGAGFPKKNRARALVERVFLGPRISATLTRI----GQPQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWKSKRL

Query:  LSTSIWVS
        +S S W S
Subjt:  LSTSIWVS

Arabidopsis top hits | e value | % identity
AT1G55580.1 GRAS family transcription factor | 3.2e-45 | 35.98
Query:  LHRDDHSPTDVLAAFQL-LQEMSPYVKFGHFTANQAILEAVA--DDRRVHIVDYDIMEGSQWASLMQAFVSRKDGP--PAPHLRITAISRGGGGATCRRS
        L R  ++ +D  + + L L +++P+++FGH TANQAIL+A    D+  +HI+D DI +G QW  LMQA   R   P  P P LRIT          C R 
Subjt:  LHRDDHSPTDVLAAFQL-LQEMSPYVKFGHFTANQAILEAVA--DDRRVHIVDYDIMEGSQWASLMQAFVSRKDGP--PAPHLRITAISRGGGGATCRRS

Query:  IATVQETGRRLVAFAASIGQPFSFHQCKLDSDE------SFRPSGLKLVRGEALVVNCMLHLPHFIYRTP-ESIASFLSGAKTLNPRLVTLVEEEIGHGP
        +  +  TG RL  FA S+G  F FH   +  ++        R   L  V+GE + VNC +H  H I+    + I  FLS  K+LN R+VT+ E E  HG 
Subjt:  IATVQETGRRLVAFAASIGQPFSFHQCKLDSDE------SFRPSGLKLVRGEALVVNCMLHLPHFIYRTP-ESIASFLSGAKTLNPRLVTLVEEEIGHGP

Query:  TMDGDYKVQFLDSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRI----SATLTRIGQPQANCS-WGERLEKLGFKAAAISFANHCQARLLLDLF--N
          D  +  +F ++++ Y AI+DSL A  P  +R R  +E+ + G  I    +A  T   Q       W E +++ GF    I      QA+LLL L   +
Subjt:  TMDGDYKVQFLDSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRI----SATLTRIGQPQANCS-WGERLEKLGFKAAAISFANHCQARLLLDLF--N

Query:  DGYRVEELGNNKLVLGWKSKRLLSTSIW
        +GY ++ L NN L LGW+++ L S S W
Subjt:  DGYRVEELGNNKLVLGWKSKRLLSTSIW

AT3G03450.1 RGA-like 2 | 6.6e-43 | 30.54
Query:  EEEEEEAKAEDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHS
        E  +E  ++    + +  G+RL H L+A A+A+    ++ +LA  ++ R+  L     G  + ++  Y+AQA                            
Subjt:  EEEEEEAKAEDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHS

Query:  HHHLHRDDHSPTDVLAA----FQLLQEM-----SPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGG
           ++RD  + TDV AA    F+ + EM      PY+KF HFTANQAILEAV   RRVH++D  + +G QW +LMQA   R  GPP+   R+T I     
Subjt:  HHHLHRDDHSPTDVLAA----FQLLQEM-----SPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGG

Query:  GATCRRSIATVQETGRRLVAFAASIGQPFSFHQCKLDSDESFRPSGLKL-VRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHG
        G     +  ++Q+ G +L  FA ++G  F F     +S     P   +     E LVVN +  L   + R+  SI   L+  K + P +VT+VE+E  H 
Subjt:  GATCRRSIATVQETGRRLVAFAASIGQPFSFHQCKLDSDESFRPSGLKL-VRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHG

Query:  PTMDGDYKVQFLDSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLT-----RIGQPQANCSWGERLEKLGFKAAAISFANHCQARLLLDLF--
          +  D   +F ++L  YS+++DSL   +   ++ R + E V+LG +I   +      R+ + +    W  R++  GF    +  +   QA +LL L+  
Subjt:  PTMDGDYKVQFLDSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLT-----RIGQPQANCSWGERLEKLGFKAAAISFANHCQARLLLDLF--

Query:  NDGYRVEELGNNKLVLGWKSKRLLSTSIW
         DGYRVEE  +  L++GW+++ L++TS W
Subjt:  NDGYRVEELGNNKLVLGWKSKRLLSTSIW

AT3G54220.1 GRAS family transcription factor | 2.1e-49 | 32.67
Query:  AVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKAEDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYA
        + DAPP P            N+    AE   E +EE K + ++EE   GL L  LL+  A+A+  D+     A+ +L+ +++L +P +GT+ +R+ AY++
Subjt:  AVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKAEDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYA

Query:  QAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDHSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFV
        +A    L    ++   G  +  P      +H            +++AFQ+   +SP VKF HFTANQAI EA   +  VHI+D DIM+G QW  L     
Subjt:  QAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDHSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFV

Query:  SRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRLVAFAASIGQPFSFHQCKL-DSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFL
        SR  GP  PH+R+T +           S+  +Q TG+RL  FA  +G PF F  C L +   +     L + + EA+ V+    L H +Y    S A  L
Subjt:  SRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRLVAFAASIGQPFSFHQCKL-DSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFL

Query:  SGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQANC-----SWGERLEKLGFK
           + L P++VT+VE+++ H     G +  +F++++  YSA++DSLGA + +++  R +VE+  L   I   L  +G P  +      SW E++++ GFK
Subjt:  SGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQANC-----SWGERLEKLGFK

Query:  AAAISFANHCQARLLLDLF-NDGYRVEELGNNKLVLGWKSKRLLSTSIWVSSS
          +++     QA LLL +F +DGY + +  N  L LGWK   LL+ S W   S
Subjt:  AAAISFANHCQARLLLDLF-NDGYRVEELGNNKLVLGWKSKRLLSTSIWVSSS

AT4G08250.1 GRAS family transcription factor    e value: 6.3e-118    %identity: 49.41
Query:  PFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPV-VDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEV----DSGSNCNSGSSPAEDQEEEEEEAKAED
        P+ D     +ST T+    A   + N   L+ + +DW+       DF D+ +S+M  +     P +  V    D    CNS S+         +     +
Subjt:  PFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPV-VDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEV----DSGSNCNSGSSPAEDQEEEEEEAKAED

Query:  REEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDD-HS
         + +E KGLRL HLL+AAADA  G +KSR+L  VIL RL +LVSP   TN+ERL A++      LL+  +V                      HRDD + 
Subjt:  REEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDD-HS

Query:  PTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRLV
          DV++AF+LLQ MSPYV FG+ TA QAILEAV  +RR+HIVDYDI EG QWASLMQA VSR  GP A HLRITA+SR   G   ++S+A VQETGRRL 
Subjt:  PTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRLV

Query:  AFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYSA
        AFA SIGQPFS+  CKLD++ +F  S LKLVRGEA+V+NCMLHLP F ++TP S+ SFLS AKTLNP+LVTLV EE+G        Y+  F+D L ++SA
Subjt:  AFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYSA

Query:  IYDSLGAGFPKKNRARALVERVFLGPRISATLTRI----GQPQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWKSKRL
        I+DSL AG    N AR  VERVF+GP ++  LTRI     + ++  SW + LE  GFK   +SF N CQA+LLL LFNDG+RVEELG N LVLGWKS+RL
Subjt:  IYDSLGAGFPKKNRARALVERVFLGPRISATLTRI----GQPQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWKSKRL

Query:  LSTSIWVS
        +S S W S
Subjt:  LSTSIWVS

AT5G41920.1 GRAS family transcription factor    e value: 2.4e-45    %identity: 31.52
Query:  SSPAEDQEEEEEEAKAEDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKP
        SS      +   E   E  E +    ++L  LL+  A+ +  DH     A  +L  ++E+ SP  G++ ER+ AY+AQA Q  +  + +SG     S KP
Subjt:  SSPAEDQEEEEEEAKAEDREEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKP

Query:  HHHHHHSHHHLHRDDHSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGA
                            + +A Q    +SP +KF HFTANQAI +A+  +  VHI+D D+M+G QW +L     SR         ++ +I   G G+
Subjt:  HHHHHHSHHHLHRDDHSPTDVLAAFQLLQEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGA

Query:  TCRRSIATVQETGRRLVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTM
            S   +  TGRRL  FA+S+  PF FH  +        PS L   +GEA+VV+ M    H +Y    +    L   + L P L+T+VE+E+ +    
Subjt:  TCRRSIATVQETGRRLVAFAASIGQPFSFHQCKLDSDESFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTM

Query:  DGDYKVQFLDSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFN-DGYRVEEL
         G +  +F+++L  YSA++D+LG G  +++  R  VE++ LG  I   +   G  +    W E L ++GF+  ++      QA LLL +   +GY + E 
Subjt:  DGDYKVQFLDSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRISATLTRIGQPQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFN-DGYRVEEL

Query:  GNNKLVLGWKSKRLLSTSIWVS
         N  L LGWK   LL+ S W S
Subjt:  GNNKLVLGWKSKRLLSTSIWVS


Sequences
CDS sequence
ATGATCCAAACTGTAAACTCTCCGATGGCTTTGGCCATCGACAACCCTTTTACCGACCTCGCCCTCTCCGATTACAGCACCTCCACCAACAACTCCGACGACGCCCACCA
CCTCGCCGGAAACTGGAACTACTTGTCGCCGGTCGTCGACTGGGAGCCCTTTCCGGGCACCCACGTCGACTTCCACGACCTCTTCGACTCCATGATGGCCGTCGATGCTC
CGCCGTATCCGCCGCGTGCGGCGGAGGTCGACAGCGGAAGCAACTGCAACTCTGGGTCCAGTCCGGCGGAGGACCAGGAGGAAGAGGAAGAAGAGGCGAAAGCAGAGGAT
CGGGAGGAGGAGGAATATAAAGGGCTTCGGCTCTACCACCTCCTGATGGCGGCTGCCGACGCCTTGTTCGGCGACCACAAGAGCCGCGATTTGGCTCATGTGATATTGGT
TCGGCTCAATGAATTGGTTTCCCCTTCACACGGGACTAACCTCGAACGCCTCACCGCGTATTACGCTCAAGCTTTTCAGGGCTTGCTCGATTGCGCCGCCGTCTCCGGCG
GCGCCGGAGGTGGTTCGAATAAACCCCATCACCATCACCATCACAGCCACCATCATCTCCACCGCGACGACCATAGTCCGACGGACGTTCTGGCGGCGTTTCAGTTGCTG
CAGGAGATGTCCCCTTATGTGAAATTCGGCCATTTCACTGCAAATCAGGCGATTCTGGAGGCGGTGGCTGATGACCGGAGAGTCCACATAGTGGATTACGATATAATGGA
AGGGAGTCAATGGGCGTCGTTGATGCAGGCTTTTGTGTCGAGAAAGGACGGCCCACCGGCCCCACATCTGAGAATCACCGCCATTTCCAGAGGCGGCGGTGGAGCCACTT
GCCGGAGATCGATTGCGACGGTTCAGGAGACAGGGCGGCGATTGGTGGCGTTTGCGGCTTCGATTGGGCAACCCTTTTCGTTTCATCAGTGTAAGTTGGATTCCGATGAG
AGTTTTCGTCCTTCTGGGTTGAAATTGGTCAGAGGGGAAGCGCTTGTGGTGAACTGTATGCTCCATCTCCCTCATTTCATTTACCGTACGCCGGAATCCATCGCTTCGTT
TCTCTCCGGCGCCAAGACGTTGAATCCGAGGCTGGTGACTTTGGTCGAAGAGGAAATCGGACACGGACCCACCATGGACGGCGATTACAAGGTCCAATTCCTCGATTCCT
TGGAGCGTTACTCGGCGATTTACGATTCACTCGGAGCAGGGTTTCCGAAGAAAAACAGAGCAAGGGCATTGGTGGAGAGGGTTTTCCTCGGGCCGAGAATCTCGGCCACT
CTGACCCGAATCGGACAGCCGCAGGCGAATTGCTCGTGGGGGGAGCGATTGGAGAAGTTGGGATTCAAGGCGGCGGCGATCAGCTTTGCGAATCACTGCCAGGCGAGGCT
GTTGCTGGATTTGTTCAACGATGGGTACAGAGTTGAAGAATTGGGAAACAATAAGCTGGTTTTGGGATGGAAATCGAAGCGTTTGCTTTCGACTTCCATTTGGGTTTCTT
CATCTTGCTCTTCTTCTTTATGTTCTTCATCGGATTCCGAGTAG
mRNA sequence
ATGATCCAAACTGTAAACTCTCCGATGGCTTTGGCCATCGACAACCCTTTTACCGACCTCGCCCTCTCCGATTACAGCACCTCCACCAACAACTCCGACGACGCCCACCA
CCTCGCCGGAAACTGGAACTACTTGTCGCCGGTCGTCGACTGGGAGCCCTTTCCGGGCACCCACGTCGACTTCCACGACCTCTTCGACTCCATGATGGCCGTCGATGCTC
CGCCGTATCCGCCGCGTGCGGCGGAGGTCGACAGCGGAAGCAACTGCAACTCTGGGTCCAGTCCGGCGGAGGACCAGGAGGAAGAGGAAGAAGAGGCGAAAGCAGAGGAT
CGGGAGGAGGAGGAATATAAAGGGCTTCGGCTCTACCACCTCCTGATGGCGGCTGCCGACGCCTTGTTCGGCGACCACAAGAGCCGCGATTTGGCTCATGTGATATTGGT
TCGGCTCAATGAATTGGTTTCCCCTTCACACGGGACTAACCTCGAACGCCTCACCGCGTATTACGCTCAAGCTTTTCAGGGCTTGCTCGATTGCGCCGCCGTCTCCGGCG
GCGCCGGAGGTGGTTCGAATAAACCCCATCACCATCACCATCACAGCCACCATCATCTCCACCGCGACGACCATAGTCCGACGGACGTTCTGGCGGCGTTTCAGTTGCTG
CAGGAGATGTCCCCTTATGTGAAATTCGGCCATTTCACTGCAAATCAGGCGATTCTGGAGGCGGTGGCTGATGACCGGAGAGTCCACATAGTGGATTACGATATAATGGA
AGGGAGTCAATGGGCGTCGTTGATGCAGGCTTTTGTGTCGAGAAAGGACGGCCCACCGGCCCCACATCTGAGAATCACCGCCATTTCCAGAGGCGGCGGTGGAGCCACTT
GCCGGAGATCGATTGCGACGGTTCAGGAGACAGGGCGGCGATTGGTGGCGTTTGCGGCTTCGATTGGGCAACCCTTTTCGTTTCATCAGTGTAAGTTGGATTCCGATGAG
AGTTTTCGTCCTTCTGGGTTGAAATTGGTCAGAGGGGAAGCGCTTGTGGTGAACTGTATGCTCCATCTCCCTCATTTCATTTACCGTACGCCGGAATCCATCGCTTCGTT
TCTCTCCGGCGCCAAGACGTTGAATCCGAGGCTGGTGACTTTGGTCGAAGAGGAAATCGGACACGGACCCACCATGGACGGCGATTACAAGGTCCAATTCCTCGATTCCT
TGGAGCGTTACTCGGCGATTTACGATTCACTCGGAGCAGGGTTTCCGAAGAAAAACAGAGCAAGGGCATTGGTGGAGAGGGTTTTCCTCGGGCCGAGAATCTCGGCCACT
CTGACCCGAATCGGACAGCCGCAGGCGAATTGCTCGTGGGGGGAGCGATTGGAGAAGTTGGGATTCAAGGCGGCGGCGATCAGCTTTGCGAATCACTGCCAGGCGAGGCT
GTTGCTGGATTTGTTCAACGATGGGTACAGAGTTGAAGAATTGGGAAACAATAAGCTGGTTTTGGGATGGAAATCGAAGCGTTTGCTTTCGACTTCCATTTGGGTTTCTT
CATCTTGCTCTTCTTCTTTATGTTCTTCATCGGATTCCGAGTAG
Protein sequence
MIQTVNSPMALAIDNPFTDLALSDYSTSTNNSDDAHHLAGNWNYLSPVVDWEPFPGTHVDFHDLFDSMMAVDAPPYPPRAAEVDSGSNCNSGSSPAEDQEEEEEEAKAED
REEEEYKGLRLYHLLMAAADALFGDHKSRDLAHVILVRLNELVSPSHGTNLERLTAYYAQAFQGLLDCAAVSGGAGGGSNKPHHHHHHSHHHLHRDDHSPTDVLAAFQLL
QEMSPYVKFGHFTANQAILEAVADDRRVHIVDYDIMEGSQWASLMQAFVSRKDGPPAPHLRITAISRGGGGATCRRSIATVQETGRRLVAFAASIGQPFSFHQCKLDSDE
SFRPSGLKLVRGEALVVNCMLHLPHFIYRTPESIASFLSGAKTLNPRLVTLVEEEIGHGPTMDGDYKVQFLDSLERYSAIYDSLGAGFPKKNRARALVERVFLGPRISAT
LTRIGQPQANCSWGERLEKLGFKAAAISFANHCQARLLLDLFNDGYRVEELGNNKLVLGWKSKRLLSTSIWVSSSCSSSLCSSSDSE