; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh08G007670 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh08G007670
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionnodulation-signaling pathway 1 protein
Genome locationCmo_Chr08:4842017..4843645
RNA-Seq ExpressionCmoCh08G007670
SyntenyCmoCh08G007670
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0042446 - hormone biosynthetic process (biological process)
GO:2000032 - regulation of secondary shoot formation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR005202 - Transcription factor GRAS
IPR030015 - Scarecrow-like protein 29/nodulation signalling pathway 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593549.1 Protein NODULATION SIGNALING PATHWAY 1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0099.45Show/hide
Query:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR
        MTIEDHILDWLADSVPFFSSPFPDDSYNS+SINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR
Subjt:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR

Query:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
        KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
Subjt:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA

Query:  LAHYLSSNSSC-SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        LAHYLSSNSSC SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  LAHYLSSNSSC-SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
        PKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
Subjt:  PKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS

Query:  MRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMRV+EKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  MRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

KAG7025891.1 Nodulation-signaling pathway 1 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0099.26Show/hide
Query:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR
        MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQ QQNQR
Subjt:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR

Query:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
        KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
Subjt:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA

Query:  LAHYLSSNSSC-SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        LAHYLSSNSSC SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  LAHYLSSNSSC-SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
        PKAVILSENNMACSCTNCGNFD GFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
Subjt:  PKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS

Query:  MRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMRV+EKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  MRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_022964298.1 nodulation-signaling pathway 1 protein [Cucurbita moschata]0.0e+00100Show/hide
Query:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR
        MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR
Subjt:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR

Query:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
        KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
Subjt:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA

Query:  LAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI
        LAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI
Subjt:  LAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI

Query:  RLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEP
        RLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEP
Subjt:  RLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEP

Query:  KAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASM
        KAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASM
Subjt:  KAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASM

Query:  RRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        RRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  RRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_023000031.1 nodulation-signaling pathway 1 protein [Cucurbita maxima]9.2e-30096.1Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQN
        DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQN
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQN

Query:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY
        NQS+NGADK SGAV GVTV+KKSVGNKRNSSKATGNNN+NG+NKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY
Subjt:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY

Query:  LSSNSSC-SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT
        LSSNSSC SSSST+APVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT
Subjt:  LSSNSSC-SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT

Query:  VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAV
        VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL S N+QVIGK PDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAV
Subjt:  VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAV

Query:  ILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY
        ILSENNMACSC NCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERK+MEGEAA+ L N+GEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY
Subjt:  ILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY

Query:  DNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        DNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  DNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_023514155.1 nodulation-signaling pathway 1 protein [Cucurbita pepo subsp. pepo]5.9e-30797.61Show/hide
Query:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR
        MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPN STSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR
Subjt:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR

Query:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
        KNQNNQS+NGADKSSGAVAGVTVMKKSVGNKRNSSKATG+NNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
Subjt:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA

Query:  LAHYLSSNSSC-SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        LAHYLSSNSSC SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  LAHYLSSNSSC-SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVIAPT+EHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL SLN+QVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRK+E
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
        PKAVILSENNMACSC NCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERK+MEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
Subjt:  PKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS

Query:  MRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLG+KSNGG
Subjt:  MRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

TrEMBL top hitse value%identityAlignment
A0A0A0KCK6 GRAS domain-containing protein2.8e-26285.61Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTT-VSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQ
        DHILDWL DSVPFFSS F D++ NSSSINCYQWWDENQD G+DLINGCLS+SPTT VST+ PNT TSH LTPSDLTKKRKAPDD+VHK SQT QN RKNQ
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTT-VSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQ

Query:  NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAH
        NNQS+N ADK SGAV GVTVMKKSVGNK+N+SK+TGNN N+G+NKEGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHRLA +GLRALA+
Subjt:  NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAH

Query:  YLSSNSSCSS----SSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        +LSSNSS SS    SST+AP TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NR RNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  YLSSNSSCSS----SSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVIAPT+EHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD  SL SLN+Q I K  DEILIVCAQFRLHQLKH APDER EFL+NLRK+E
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRN-EGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARA
        PKAVILSENNM CSC+ CGNF+ GF R VEY+W+FLDSTS+AFKGRESEER++MEGEAAKALRN +GEMNEE  KWCERMRN GF RK FGEDTIDTARA
Subjt:  PKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRN-EGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARA

Query:  SMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSN
        SMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCSFWKLG+KSN
Subjt:  SMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSN

A0A1S3CGQ3 nodulation-signaling pathway 1 protein5.3e-26185.24Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTT-VSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQ
        DHILDWL DSVPFFSS F D++ NSSSINCYQWWDENQD G+DLINGCLS+SPTT VST+ PNT TSHHL PSDLTKKRKAPDD+VHK SQT QN RKNQ
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTT-VSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQ

Query:  NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAH
        NNQS+N ADK SGAV GVTV+KKSVGNK+N+SK+TGNN NNG+NKEGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHRLA +GLRALA+
Subjt:  NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAH

Query:  YLSSNSSCSS----SSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        +LSSNSS SS    SST++P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NR RNLH+LDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  YLSSNSSCSS----SSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD  SL SLN+Q I K  DEILIVC+QFRLHQLKH APDER EFLQNLRK+E
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRN-EGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARA
        PKAVILSENNM CSC+ C NF+ GF R VEY+W+FLDSTS+AFKGRESEER++MEGEAAKALRN EGEMNEE  KWCERMRN GF RK FGEDTIDTARA
Subjt:  PKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRN-EGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARA

Query:  SMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSN
        SMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCS WKLG+KSN
Subjt:  SMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSN

A0A5A7V101 Nodulation-signaling pathway 1 protein1.5e-26085.24Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTT-VSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQ
        DHILDWL DSVPFFSS F D++ NSSSINCYQWWDENQD G+DLINGCLS+SPTT VST+ PNT TSHHL PSDLTKKRKAPDD+VHK SQT QN RKNQ
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTT-VSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQ

Query:  NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAH
        NNQS+N ADK SGAV GVTV+KKSVGNK+N+SK+TGNN NNG+NKEGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHRLA +GLRALA+
Subjt:  NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAH

Query:  YLSSNSSCSS----SSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        +LSSNSS SS    SST++P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NR RNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  YLSSNSSCSS----SSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVI PTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD  SL SLN+Q I K  DEILIVC+QFRLHQLKH APDER EFLQNLRK+E
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRN-EGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARA
        PKAVILSENNM CSC+ C NF+ GF R VEY+W+FLDSTS+AFKGRESEER++MEGEAAKALRN EGEMNEE  KWCERMRN GF RK FGEDTIDTARA
Subjt:  PKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRN-EGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARA

Query:  SMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSN
        SMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCS WKLG+KSN
Subjt:  SMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSN

A0A6J1HKE2 nodulation-signaling pathway 1 protein0.0e+00100Show/hide
Query:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR
        MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR
Subjt:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQR

Query:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
        KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
Subjt:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA

Query:  LAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI
        LAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI
Subjt:  LAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI

Query:  RLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEP
        RLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEP
Subjt:  RLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEP

Query:  KAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASM
        KAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASM
Subjt:  KAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASM

Query:  RRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        RRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  RRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

A0A6J1KER4 nodulation-signaling pathway 1 protein4.4e-30096.1Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQN
        DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQN
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQN

Query:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY
        NQS+NGADK SGAV GVTV+KKSVGNKRNSSKATGNNN+NG+NKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY
Subjt:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY

Query:  LSSNSSC-SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT
        LSSNSSC SSSST+APVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT
Subjt:  LSSNSSC-SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT

Query:  VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAV
        VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL S N+QVIGK PDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAV
Subjt:  VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAV

Query:  ILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY
        ILSENNMACSC NCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERK+MEGEAA+ L N+GEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY
Subjt:  ILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY

Query:  DNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        DNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  DNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

SwissProt top hitse value%identityAlignment
A1DQP9 Protein NODULATION SIGNALING PATHWAY 14.0e-17360.44Show/hide
Query:  TIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQ--DLINGCLSSSPTTVSTKPPNTST-----SHHLTP-SDLTKKRKAPDDTVHKPS
        T  DHILDWL  SV FF S   +   NS  I  Y  WD+ Q         N   S++ T V+T   +T++     S++  P SDL KKR A D++  KP 
Subjt:  TIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQ--DLINGCLSSSPTTVSTKPPNTST-----SHHLTP-SDLTKKRKAPDDTVHKPS

Query:  QTQQNQRKNQ-NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHR
        Q +  + K +  N+  NG             ++K   NK+  +KA G+N N+GN+KEGRWAEQLLNPCA AI  G+  RV HLL VL ELASPTGD NHR
Subjt:  QTQQNQRKNQ-NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHR

Query:  LAAYGLRALAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPN-RSRNLHILDIGVSHGVQWPTLLEALTR
        LAA+GLRAL H+LSS+SS  +SS    +TFAST+PRFFQ+SL+KF+EVSPWF+FPNNIAN+SIL +L+EE N  SR LHILDIGVSHGVQWPTLL+AL+R
Subjt:  LAAYGLRALAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPN-RSRNLHILDIGVSHGVQWPTLLEALTR

Query:  RSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEF
        RSGGPP ++RLTV+  T E+DQN ETPFS  PPG N   RLL +A+S+NINLQINR+++ SL +LNAQ I   PDEILIVCAQFRLH L H +PDER EF
Subjt:  RSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEF

Query:  LQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGED
        L+ LR +EP+ VILSENN  C C+ CGNF  GFTR+VEYLWRFLDSTSSAFKGRES+ER++MEGEAAKAL N+ EMNEE EKWC RM+ AGFA ++FGED
Subjt:  LQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGED

Query:  TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
         +D  RA +R+YD+NWEM+VEEK+  VGLWWKGQPVSFCS WKL     GG
Subjt:  TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

Q4VYC8 Protein NODULATION SIGNALING PATHWAY 11.6e-17761.54Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDI-GQDLINGCLSSS------------PTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHK
        DHILDWL  SV FF S F DD YN+  I+ Y+ W++NQDI  Q  I+   +SS             TT ST     ++ +++  SDL KKR A D+   K
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDI-GQDLINGCLSSS------------PTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHK

Query:  PSQTQQNQRKNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANH
          Q Q  + K   ++  N +D    A+ G TV++KS GNK+ ++KA G+N+NNGNNK+GRWAEQLLNPCA AI  G+  RV HLL VL ELAS TGDANH
Subjt:  PSQTQQNQRKNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANH

Query:  RLAAYGLRALAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTR
        RLAA+GLRAL H+LSS+SS + S T   +TFAST+PRFFQ+SL+KF+E SPWF+FPNNIAN+SIL +L+EEPN  R LHILDIGVSHGVQWPT LEAL+R
Subjt:  RLAAYGLRALAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTR

Query:  RSGGPPPLIRLTVI--APTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERF
        R GGPPPL+RLTV+  + + E+DQN ETPFSIGP GD  SS LL +A+SLN+NLQI +LD+  L +LNA+ +    DE LIVCAQFRLH L H  PDER 
Subjt:  RSGGPPPLIRLTVI--APTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERF

Query:  EFLQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFG
        EFL+ LR +EPK VILSENNM C C++CG+F TGF+R+VEYLWRFLDSTSSAFK R+S+ERK+MEGEAAKAL N+ EMNE  EKWCERM+ AGFA ++FG
Subjt:  EFLQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFG

Query:  EDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKL
        ED ID  RA +R+YDNNWEM+VEE    V LWWK QPVSFCS WKL
Subjt:  EDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKL

Q84MQ9 Protein NODULATION SIGNALING PATHWAY 11.5e-8239.88Show/hide
Query:  WWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSK
        WW  +    QD I   ++      +  PP+T+     +PS  +    +P D    PS + + +        ++ A ++ G   G    KK  G       
Subjt:  WWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSK

Query:  ATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSTL-------APVT-FASTDPR
          G     G++++ RWAEQLLNPCA A+  G+ +RV HL  VL EL S +GDANHRLAA+GLRALA +L +    ++++ +        P T FA+ +PR
Subjt:  ATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSTL-------APVT-FASTDPR

Query:  FFQRSLIKFHEVSPWFAFPNNIANSSILH--ILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVIAPTVEHDQNAETPFSIGPPG
         F+ SLI+FHEVSPWFA PN +AN++I             R LH++D+GVSHGVQWPTLLE+LTR+ GG  PP +RLTV+ P      +   PFS  PPG
Subjt:  FFQRSLIKFHEVSPWFAFPNNIANSSILH--ILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVIAPTVEHDQNAETPFSIGPPG

Query:  DNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCT--NCGNFDTG
         + S  LL +AKS+N++L+I+R  +L         +     E L+VC QFR   L H A +ER E L+  R + P+ V+LSE +        + G+    
Subjt:  DNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCT--NCGNFDTG

Query:  FTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAK--ALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRV-EEKDGCVGL
        F  ++E LWRFL+STS+AFKG++ EER+L+E EA    A  +     E  E W ERM  AGF    FG + +++AR+ +R+YD+ WEM         V L
Subjt:  FTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAK--ALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRV-EEKDGCVGL

Query:  WWKGQPVSFCSFWK
         WKGQPVSFCS W+
Subjt:  WWKGQPVSFCSFWK

Q9LRW3 Scarecrow-like protein 293.6e-12950.94Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQN
        DH+L WL DSV     P  DDSY     +  Q W+ +Q   QD  +G + S    +S              ++L    +AP   +  P + QQ      N
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQN

Query:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY
        +QSR    +S         +KKS  +KR + K++  ++ +G NKEGRWAE+LLNPCA AI   +++RV H LCVL ELAS +GDAN RLAA+GLRAL H+
Subjt:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY

Query:  LSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTV
        LSS+S   SSS     TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG+QWPTLLEAL+ R  GPPP +R+TV
Subjt:  LSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTV

Query:  IAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVI
        I+     D  A+ PFS+GPPG N  S+LL FA+SL INLQI+ LD L       Q+I   P E LIVCAQFRLH LKH   DER E L+ +R + PK V+
Subjt:  IAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVI

Query:  LSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYD
        L ENN  CS  +  +F  GF++K+EY+W+FLDSTSS FK   SEERKLMEGEA K L N G+MNE  EKW ERMR AGF  + F ED +D A++ +R+YD
Subjt:  LSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYD

Query:  NNWEMRVEEKDGCVGLWWKGQPVSFCSFWK
        NNWE+R+E+ D   GL WKG+ VSFCS WK
Subjt:  NNWEMRVEEKDGCVGLWWKGQPVSFCSFWK

Q9SN22 Scarecrow-like protein 323.0e-3529.73Show/hide
Query:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPN
        +  + EQLL  CA AI   DA   H +L VL  +A P GD+  RL +  LRAL     S +  + SST++ +  A    RF    L  F +++PW  F  
Subjt:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPN

Query:  NIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRL
          AN++IL  +         +HI+D+ ++H +Q PTL++A+  R   PPPL++LTV++       +   P  I    + + S+L++FA + NI ++   +
Subjt:  NIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRL

Query:  -----DSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDE---------RFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRF
             D  S L    ++     +E L+V      H +  Y P+E         R  FL+ LR + P+ V L E ++  +  N  N          Y W  
Subjt:  -----DSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDE---------RFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRF

Query:  LDSTSSAFKGRESEERKLMEGEAAKALRN----EG----EMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQP
         D+T +      SE+R+  E E +  + N    EG    E  E   +W ERMR A F      ED +   +A +  +   W M+ E+ D  + L WKG  
Subjt:  LDSTSSAFKGRESEERKLMEGEAAKALRN----EG----EMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQP

Query:  VSFCSFW
        V F + W
Subjt:  VSFCSFW

Arabidopsis top hitse value%identityAlignment
AT3G03450.1 RGA-like 21.7e-2527.01Show/hide
Query:  CANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHIL
        CA AI + +      L+  +  LA     A  ++A Y  +ALA  +  + +  +         A+ +P F +   + F+E  P+  F +  AN +IL   
Subjt:  CANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHIL

Query:  SEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQ
         E    +R +H++D+G++ G+QWP L++AL  R GGPP   RLT I P     +N+++   +G        +L  FA+++ +  +   L + SL  L  +
Subjt:  SEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQ

Query:  VIGKFPD-EILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAA
        +    P+ E L+V + F LH+L   +     + L  ++ I+P  V + E     +  N   F   F   + Y     DS   ++    S++R + E    
Subjt:  VIGKFPD-EILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAA

Query:  KALRN----EG----EMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKL
        + + N    EG    E +E   +W  RM++AGF     G      A   +  Y      RVEE DGC+ + W+ +P+   S WKL
Subjt:  KALRN----EG----EMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKL

AT3G13840.1 GRAS family transcription factor2.5e-13050.94Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQN
        DH+L WL DSV     P  DDSY     +  Q W+ +Q   QD  +G + S    +S              ++L    +AP   +  P + QQ      N
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQN

Query:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY
        +QSR    +S         +KKS  +KR + K++  ++ +G NKEGRWAE+LLNPCA AI   +++RV H LCVL ELAS +GDAN RLAA+GLRAL H+
Subjt:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY

Query:  LSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTV
        LSS+S   SSS     TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG+QWPTLLEAL+ R  GPPP +R+TV
Subjt:  LSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTV

Query:  IAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVI
        I+     D  A+ PFS+GPPG N  S+LL FA+SL INLQI+ LD L       Q+I   P E LIVCAQFRLH LKH   DER E L+ +R + PK V+
Subjt:  IAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVI

Query:  LSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYD
        L ENN  CS  +  +F  GF++K+EY+W+FLDSTSS FK   SEERKLMEGEA K L N G+MNE  EKW ERMR AGF  + F ED +D A++ +R+YD
Subjt:  LSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYD

Query:  NNWEMRVEEKDGCVGLWWKGQPVSFCSFWK
        NNWE+R+E+ D   GL WKG+ VSFCS WK
Subjt:  NNWEMRVEEKDGCVGLWWKGQPVSFCSFWK

AT3G49950.1 GRAS family transcription factor2.1e-3629.73Show/hide
Query:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPN
        +  + EQLL  CA AI   DA   H +L VL  +A P GD+  RL +  LRAL     S +  + SST++ +  A    RF    L  F +++PW  F  
Subjt:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPN

Query:  NIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRL
          AN++IL  +         +HI+D+ ++H +Q PTL++A+  R   PPPL++LTV++       +   P  I    + + S+L++FA + NI ++   +
Subjt:  NIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRL

Query:  -----DSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDE---------RFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRF
             D  S L    ++     +E L+V      H +  Y P+E         R  FL+ LR + P+ V L E ++  +  N  N          Y W  
Subjt:  -----DSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDE---------RFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRF

Query:  LDSTSSAFKGRESEERKLMEGEAAKALRN----EG----EMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQP
         D+T +      SE+R+  E E +  + N    EG    E  E   +W ERMR A F      ED +   +A +  +   W M+ E+ D  + L WKG  
Subjt:  LDSTSSAFKGRESEERKLMEGEAAKALRN----EG----EMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQP

Query:  VSFCSFW
        V F + W
Subjt:  VSFCSFW

AT4G37650.1 GRAS family transcription factor8.1e-3629.93Show/hide
Query:  RWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLS-SNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNN
        +WA+ +L   A A    D  R   +L  L EL+SP GD   +LA+Y L+AL + ++ S   C  +   A  T  +      +++++KF EVSPW  F + 
Subjt:  RWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLS-SNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNN

Query:  IANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINR
         AN +IL  +  E      +HI+DI  +   QWPTLLEAL  RS   P L RLT  V+A    +DQ A            I +R+  FA+ + +  + N 
Subjt:  IANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINR

Query:  LDSLSLLS-LNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRF----LDSTSSAF
        +  +  LS  +   +   PDE+L +     +H +       R   + + R++ P+ V + E          G FD  F R      R+     +S   +F
Subjt:  LDSLSLLS-LNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRF----LDSTSSAF

Query:  KGRESEERKLMEGEAAKAL--------RNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY-DNNWEMRVEEKDGCVGLWWKGQPVSFCSFW
          R S ER ++E  A +A+         +  E  E   KW  RMRN+GF    + ++  D  RA +RRY +  W M        + L W+ QPV + S W
Subjt:  KGRESEERKLMEGEAAKAL--------RNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY-DNNWEMRVEEKDGCVGLWWKGQPVSFCSFW

Query:  K
        +
Subjt:  K

AT5G66770.1 GRAS family transcription factor1.7e-2529.16Show/hide
Query:  CANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHIL
        CA  I   D       L  ++E  S  GD   R+A Y   AL++ LS NS  +SSS+      +ST+      S    ++  P+  F +  AN +IL   
Subjt:  CANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHIL

Query:  SEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVI-APTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQ-INRLDSLSLLSLN
         E   +S  +HI+D G+  G+QWP LL+AL  R+ G P  IR++ I AP++      E+P    P      +RL  FAK L++N   I  L  + L  LN
Subjt:  SEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVI-APTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQ-INRLDSLSLLSLN

Query:  AQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFK---GRESEERKLME
               PDE+L V    +L++L    P      L+  + + P+ V L E  ++ +         GF  +V+   +F  +   + +   GR+SEER  +E
Subjt:  AQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSSAFK---GRESEERKLME

Query:  GE----------AAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYD-NNWEMRVEEKDGCVGLWWKGQPVSFCSFWK
         E            +      E  EE E+W   M NAGF         +  A+  +  Y+ +N    VE K G + L W   P+   S W+
Subjt:  GE----------AAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYD-NNWEMRVEEKDGCVGLWWKGQPVSFCSFWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCATTGAAGATCATATATTGGATTGGTTAGCGGATTCAGTTCCTTTTTTTTCTTCCCCGTTCCCGGATGATTCCTACAACTCTAGCTCTATAAACTGCTATCAATG
GTGGGATGAGAACCAAGACATAGGCCAAGATCTGATTAATGGCTGTCTAAGCAGTTCCCCCACCACTGTCAGTACTAAACCACCTAACACTTCCACTTCCCATCACTTGA
CGCCATCTGATTTGACTAAGAAAAGAAAAGCTCCGGATGATACAGTTCATAAACCATCGCAAACCCAACAGAATCAGAGGAAGAACCAGAACAATCAGAGCAGAAATGGT
GCAGATAAAAGCAGTGGAGCTGTTGCAGGTGTGACTGTGATGAAGAAATCAGTGGGGAACAAAAGGAATTCATCAAAAGCCACAGGGAATAACAATAATAATGGGAATAA
CAAGGAAGGAAGGTGGGCAGAGCAATTGCTAAATCCTTGTGCTAATGCTATCATAAAAGGGGATGCAACAAGAGTACATCACCTTCTTTGTGTTCTTCAAGAGCTCGCCT
CGCCCACCGGCGATGCTAACCACCGGCTCGCCGCCTATGGTCTCCGAGCTTTGGCCCATTACCTGTCCTCCAATTCTTCATGTTCTTCTTCTTCCACACTTGCGCCGGTT
ACTTTTGCTTCGACGGACCCTCGATTCTTTCAGAGGTCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGCCTTTCCAAATAACATCGCAAATTCTTCGATTCTCCA
CATTCTCTCTGAAGAACCAAATCGCTCGCGCAATCTTCACATTCTTGACATCGGGGTTTCTCATGGTGTGCAATGGCCGACGCTGCTCGAGGCCTTAACTCGCCGTTCCG
GTGGACCCCCGCCGCTAATTCGCCTCACAGTCATCGCTCCAACCGTCGAACATGATCAAAATGCAGAGACGCCGTTTTCGATTGGTCCACCGGGAGACAACATCTCCTCT
CGGCTTCTTAGCTTCGCGAAATCATTAAACATCAATTTACAGATCAACCGCCTCGATAGTCTCTCGCTACTGAGTTTAAATGCGCAAGTAATCGGCAAGTTTCCGGACGA
AATCTTAATCGTTTGCGCACAGTTCAGACTCCACCAATTGAAACACTACGCTCCAGACGAAAGATTCGAGTTCTTACAAAACCTAAGAAAGATAGAACCGAAAGCTGTGA
TTCTAAGCGAAAACAACATGGCATGTAGCTGTACCAACTGCGGAAATTTCGACACCGGATTCACAAGAAAAGTTGAATACCTATGGAGATTCCTGGATTCAACAAGCTCC
GCATTCAAAGGTCGAGAAAGCGAGGAAAGAAAGTTGATGGAAGGCGAAGCCGCAAAGGCGCTGAGGAACGAAGGCGAAATGAACGAGGAAATGGAAAAATGGTGCGAAAG
AATGAGAAATGCTGGATTTGCAAGAAAGTTGTTCGGTGAAGACACCATTGATACGGCTCGAGCTTCAATGAGAAGGTATGATAACAACTGGGAGATGAGAGTTGAAGAGA
AAGATGGATGCGTAGGGCTATGGTGGAAAGGGCAACCTGTTTCGTTTTGCTCGTTTTGGAAGTTGGGGATGAAATCCAATGGCGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCATTGAAGATCATATATTGGATTGGTTAGCGGATTCAGTTCCTTTTTTTTCTTCCCCGTTCCCGGATGATTCCTACAACTCTAGCTCTATAAACTGCTATCAATG
GTGGGATGAGAACCAAGACATAGGCCAAGATCTGATTAATGGCTGTCTAAGCAGTTCCCCCACCACTGTCAGTACTAAACCACCTAACACTTCCACTTCCCATCACTTGA
CGCCATCTGATTTGACTAAGAAAAGAAAAGCTCCGGATGATACAGTTCATAAACCATCGCAAACCCAACAGAATCAGAGGAAGAACCAGAACAATCAGAGCAGAAATGGT
GCAGATAAAAGCAGTGGAGCTGTTGCAGGTGTGACTGTGATGAAGAAATCAGTGGGGAACAAAAGGAATTCATCAAAAGCCACAGGGAATAACAATAATAATGGGAATAA
CAAGGAAGGAAGGTGGGCAGAGCAATTGCTAAATCCTTGTGCTAATGCTATCATAAAAGGGGATGCAACAAGAGTACATCACCTTCTTTGTGTTCTTCAAGAGCTCGCCT
CGCCCACCGGCGATGCTAACCACCGGCTCGCCGCCTATGGTCTCCGAGCTTTGGCCCATTACCTGTCCTCCAATTCTTCATGTTCTTCTTCTTCCACACTTGCGCCGGTT
ACTTTTGCTTCGACGGACCCTCGATTCTTTCAGAGGTCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGCCTTTCCAAATAACATCGCAAATTCTTCGATTCTCCA
CATTCTCTCTGAAGAACCAAATCGCTCGCGCAATCTTCACATTCTTGACATCGGGGTTTCTCATGGTGTGCAATGGCCGACGCTGCTCGAGGCCTTAACTCGCCGTTCCG
GTGGACCCCCGCCGCTAATTCGCCTCACAGTCATCGCTCCAACCGTCGAACATGATCAAAATGCAGAGACGCCGTTTTCGATTGGTCCACCGGGAGACAACATCTCCTCT
CGGCTTCTTAGCTTCGCGAAATCATTAAACATCAATTTACAGATCAACCGCCTCGATAGTCTCTCGCTACTGAGTTTAAATGCGCAAGTAATCGGCAAGTTTCCGGACGA
AATCTTAATCGTTTGCGCACAGTTCAGACTCCACCAATTGAAACACTACGCTCCAGACGAAAGATTCGAGTTCTTACAAAACCTAAGAAAGATAGAACCGAAAGCTGTGA
TTCTAAGCGAAAACAACATGGCATGTAGCTGTACCAACTGCGGAAATTTCGACACCGGATTCACAAGAAAAGTTGAATACCTATGGAGATTCCTGGATTCAACAAGCTCC
GCATTCAAAGGTCGAGAAAGCGAGGAAAGAAAGTTGATGGAAGGCGAAGCCGCAAAGGCGCTGAGGAACGAAGGCGAAATGAACGAGGAAATGGAAAAATGGTGCGAAAG
AATGAGAAATGCTGGATTTGCAAGAAAGTTGTTCGGTGAAGACACCATTGATACGGCTCGAGCTTCAATGAGAAGGTATGATAACAACTGGGAGATGAGAGTTGAAGAGA
AAGATGGATGCGTAGGGCTATGGTGGAAAGGGCAACCTGTTTCGTTTTGCTCGTTTTGGAAGTTGGGGATGAAATCCAATGGCGGTTGA
Protein sequenceShow/hide protein sequence
MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQTQQNQRKNQNNQSRNG
ADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSTLAPV
TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISS
RLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDTGFTRKVEYLWRFLDSTSS
AFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG