; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg13938 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg13938
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionnodulation-signaling pathway 1 protein
Genome locationCarg_Chr08:4354654..4356285
RNA-Seq ExpressionCarg13938
SyntenyCarg13938
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0042446 - hormone biosynthetic process (biological process)
GO:2000032 - regulation of secondary shoot formation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR005202 - Transcription factor GRAS
IPR030015 - Scarecrow-like protein 29/nodulation signalling pathway 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593549.1 Protein NODULATION SIGNALING PATHWAY 1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0099.45Show/hide
Query:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQR
        MTIEDHILDWLADSVPFFSSPFPDDSYNS+SINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQ QQNQR
Subjt:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQR

Query:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
        KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
Subjt:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA

Query:  LAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        LAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  LAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
        PKAVILSENNMACSCTNCGNFD GFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
Subjt:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS

Query:  MRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  MRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

KAG7025891.1 Nodulation-signaling pathway 1 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQR
        MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQR
Subjt:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQR

Query:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
        KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
Subjt:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA

Query:  LAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        LAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  LAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
        PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
Subjt:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS

Query:  MRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  MRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_022964298.1 nodulation-signaling pathway 1 protein [Cucurbita moschata]0.0e+0099.26Show/hide
Query:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQR
        MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQ QQNQR
Subjt:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQR

Query:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
        KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
Subjt:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA

Query:  LAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        LAHYLSSNSSC SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  LAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
        PKAVILSENNMACSCTNCGNFD GFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
Subjt:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS

Query:  MRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMRV+EKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  MRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_023000031.1 nodulation-signaling pathway 1 protein [Cucurbita maxima]1.6e-29995.55Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQN
        DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQ QQNQRKNQN
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQN

Query:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY
        NQS+NGADK SGAV GVTV+KKSVGNKRNSSKATGNNN+NG+NKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY
Subjt:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY

Query:  LSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT
        LSSNSSC SSSST+APVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT
Subjt:  LSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT

Query:  VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAV
        VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL S N+QVIGK PDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAV
Subjt:  VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAV

Query:  ILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY
        ILSENNMACSC NCGNFD GFTRKVEYLWRFLDSTSSAFKGRESEERK+MEGEAA+ L N+GEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY
Subjt:  ILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY

Query:  DNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        DNNWEMRV+EKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  DNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_023514155.1 nodulation-signaling pathway 1 protein [Cucurbita pepo subsp. pepo]2.7e-30797.24Show/hide
Query:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQR
        MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPN STSHHLTPSDLTKKRKAPDDTVHKPSQ QQNQR
Subjt:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQR

Query:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
        KNQNNQS+NGADKSSGAVAGVTVMKKSVGNKRNSSKATG+NNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
Subjt:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA

Query:  LAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        LAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  LAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVIAPT+EHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL SLN+QVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRK+E
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
        PKAVILSENNMACSC NCGNFD GFTRKVEYLWRFLDSTSSAFKGRESEERK+MEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
Subjt:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS

Query:  MRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMRV+EKDGCVGLWWKGQPVSFCSFWKLG+KSNGG
Subjt:  MRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

TrEMBL top hitse value%identityAlignment
A0A0A0KCK6 GRAS domain-containing protein1.1e-26185.24Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTT-VSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQ
        DHILDWL DSVPFFSS F D++ NSSSINCYQWWDENQD G+DLINGCLS+SPTT VST+ PNT TSH LTPSDLTKKRKAPDD+VHK SQ  QN RKNQ
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTT-VSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQ

Query:  NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAH
        NNQS+N ADK SGAV GVTVMKKSVGNK+N+SK+TGNN N+G+NKEGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHRLA +GLRALA+
Subjt:  NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAH

Query:  YLSSNSSC---SSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        +LSSNSS    SS SST+AP TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NR RNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  YLSSNSSC---SSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVIAPT+EHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD  SL SLN+Q I K  DEILIVCAQFRLHQLKH APDER EFL+NLRK+E
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRN-EGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARA
        PKAVILSENNM CSC+ CGNF++GF R VEY+W+FLDSTS+AFKGRESEER++MEGEAAKALRN +GEMNEE  KWCERMRN GF RK FGEDTIDTARA
Subjt:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRN-EGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARA

Query:  SMRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSN
        SMRRYDNNWEMR+++KDGCVGLWWKGQPVSFCSFWKLG+KSN
Subjt:  SMRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSN

A0A1S3CGQ3 nodulation-signaling pathway 1 protein2.0e-26084.87Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTT-VSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQ
        DHILDWL DSVPFFSS F D++ NSSSINCYQWWDENQD G+DLINGCLS+SPTT VST+ PNT TSHHL PSDLTKKRKAPDD+VHK SQ  QN RKNQ
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTT-VSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQ

Query:  NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAH
        NNQS+N ADK SGAV GVTV+KKSVGNK+N+SK+TGNN NNG+NKEGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHRLA +GLRALA+
Subjt:  NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAH

Query:  YLSSNSSC---SSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        +LSSNSS    SS SST++P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NR RNLH+LDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  YLSSNSSC---SSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD  SL SLN+Q I K  DEILIVC+QFRLHQLKH APDER EFLQNLRK+E
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRN-EGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARA
        PKAVILSENNM CSC+ C NF++GF R VEY+W+FLDSTS+AFKGRESEER++MEGEAAKALRN EGEMNEE  KWCERMRN GF RK FGEDTIDTARA
Subjt:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRN-EGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARA

Query:  SMRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSN
        SMRRYDNNWEMR+++KDGCVGLWWKGQPVSFCS WKLG+KSN
Subjt:  SMRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSN

A0A5A7V101 Nodulation-signaling pathway 1 protein5.9e-26084.87Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTT-VSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQ
        DHILDWL DSVPFFSS F D++ NSSSINCYQWWDENQD G+DLINGCLS+SPTT VST+ PNT TSHHL PSDLTKKRKAPDD+VHK SQ  QN RKNQ
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTT-VSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQ

Query:  NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAH
        NNQS+N ADK SGAV GVTV+KKSVGNK+N+SK+TGNN NNG+NKEGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHRLA +GLRALA+
Subjt:  NNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAH

Query:  YLSSNSSC---SSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        +LSSNSS    SS SST++P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NR RNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  YLSSNSSC---SSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVI PTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD  SL SLN+Q I K  DEILIVC+QFRLHQLKH APDER EFLQNLRK+E
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRN-EGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARA
        PKAVILSENNM CSC+ C NF++GF R VEY+W+FLDSTS+AFKGRESEER++MEGEAAKALRN EGEMNEE  KWCERMRN GF RK FGEDTIDTARA
Subjt:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRN-EGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARA

Query:  SMRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSN
        SMRRYDNNWEMR+++KDGCVGLWWKGQPVSFCS WKLG+KSN
Subjt:  SMRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSN

A0A6J1HKE2 nodulation-signaling pathway 1 protein0.0e+0099.26Show/hide
Query:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQR
        MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQ QQNQR
Subjt:  MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQR

Query:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
        KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA
Subjt:  KNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRA

Query:  LAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        LAHYLSSNSSC SSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  LAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
        IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE
Subjt:  IRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIE

Query:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
        PKAVILSENNMACSCTNCGNFD GFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS
Subjt:  PKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARAS

Query:  MRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMRV+EKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  MRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

A0A6J1KER4 nodulation-signaling pathway 1 protein7.6e-30095.55Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQN
        DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQ QQNQRKNQN
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQN

Query:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY
        NQS+NGADK SGAV GVTV+KKSVGNKRNSSKATGNNN+NG+NKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY
Subjt:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY

Query:  LSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT
        LSSNSSC SSSST+APVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT
Subjt:  LSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT

Query:  VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAV
        VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL S N+QVIGK PDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAV
Subjt:  VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAV

Query:  ILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY
        ILSENNMACSC NCGNFD GFTRKVEYLWRFLDSTSSAFKGRESEERK+MEGEAA+ L N+GEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY
Subjt:  ILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY

Query:  DNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        DNNWEMRV+EKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  DNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

SwissProt top hitse value%identityAlignment
A1DQP9 Protein NODULATION SIGNALING PATHWAY 14.0e-17360.44Show/hide
Query:  TIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQ--DLINGCLSSSPTTVSTKPPNTST-----SHHLTP-SDLTKKRKAPDDTVHKPS
        T  DHILDWL  SV FF S   +   NS  I  Y  WD+ Q         N   S++ T V+T   +T++     S++  P SDL KKR A D++  KP 
Subjt:  TIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQ--DLINGCLSSSPTTVSTKPPNTST-----SHHLTP-SDLTKKRKAPDDTVHKPS

Query:  QIQQNQRKNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRL
             Q KN+  ++R   +  +G       ++K   NK+  +KA G+N N+GN+KEGRWAEQLLNPCA AI  G+  RV HLL VL ELASPTGD NHRL
Subjt:  QIQQNQRKNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRL

Query:  AAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPN-RSRNLHILDIGVSHGVQWPTLLEALTR
        AA+GLRAL H+LSS+SS  +SS T   +TFAST+PRFFQ+SL+KF+EVSPWF+FPNNIAN+SIL +L+EE N  SR LHILDIGVSHGVQWPTLL+AL+R
Subjt:  AAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPN-RSRNLHILDIGVSHGVQWPTLLEALTR

Query:  RSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEF
        RSGGPP ++RLTV+  T E+DQN ETPFS  PPG N   RLL +A+S+NINLQINR+++ SL +LNAQ I   PDEILIVCAQFRLH L H +PDER EF
Subjt:  RSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEF

Query:  LQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGED
        L+ LR +EP+ VILSENN  C C+ CGNF  GFTR+VEYLWRFLDSTSSAFKGRES+ER++MEGEAAKAL N+ EMNEE EKWC RM+ AGFA ++FGED
Subjt:  LQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGED

Query:  TIDTARASMRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
         +D  RA +R+YD+NWEM+V+EK+  VGLWWKGQPVSFCS WKL     GG
Subjt:  TIDTARASMRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

Q4VYC8 Protein NODULATION SIGNALING PATHWAY 13.9e-17661.06Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDI-GQDLINGCLSSS------------PTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHK
        DHILDWL  SV FF S F DD YN+  I+ Y+ W++NQDI  Q  I+   +SS             TT ST     ++ +++  SDL KKR A D+   K
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDI-GQDLINGCLSSS------------PTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHK

Query:  PSQIQQNQRKNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANH
          Q Q  + K   ++  N +D    A+ G TV++KS GNK+ ++KA G+N+NNGNNK+GRWAEQLLNPCA AI  G+  RV HLL VL ELAS TGDANH
Subjt:  PSQIQQNQRKNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANH

Query:  RLAAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALT
        RLAA+GLRAL H+LSS    SSSS+    +TFAST+PRFFQ+SL+KF+E SPWF+FPNNIAN+SIL +L+EEPN  R LHILDIGVSHGVQWPT LEAL+
Subjt:  RLAAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALT

Query:  RRSGGPPPLIRLTVI--APTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDER
        RR GGPPPL+RLTV+  + + E+DQN ETPFSIGP GD  SS LL +A+SLN+NLQI +LD+  L +LNA+ +    DE LIVCAQFRLH L H  PDER
Subjt:  RRSGGPPPLIRLTVI--APTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDER

Query:  FEFLQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLF
         EFL+ LR +EPK VILSENNM C C++CG+F  GF+R+VEYLWRFLDSTSSAFK R+S+ERK+MEGEAAKAL N+ EMNE  EKWCERM+ AGFA ++F
Subjt:  FEFLQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLF

Query:  GEDTIDTARASMRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKL
        GED ID  RA +R+YDNNWEM+V+E    V LWWK QPVSFCS WKL
Subjt:  GEDTIDTARASMRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKL

Q84MQ9 Protein NODULATION SIGNALING PATHWAY 15.0e-8339.69Show/hide
Query:  WWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSK
        WW  +    QD I   ++      +  PP+T+     +PS  +    +P D    PS        + + + ++ A ++ G   G    KK  G       
Subjt:  WWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQNNQSRNGADKSSGAVAGVTVMKKSVGNKRNSSK

Query:  ATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSSTLAP-------VTFASTDPR
          G     G++++ RWAEQLLNPCA A+  G+ +RV HL  VL EL S +GDANHRLAA+GLRALA +L +    +++++   P         FA+ +PR
Subjt:  ATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSSTLAP-------VTFASTDPR

Query:  FFQRSLIKFHEVSPWFAFPNNIANSSILH--ILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVIAPTVEHDQNAETPFSIGPPG
         F+ SLI+FHEVSPWFA PN +AN++I             R LH++D+GVSHGVQWPTLLE+LTR+ GG  PP +RLTV+ P      +   PFS  PPG
Subjt:  FFQRSLIKFHEVSPWFAFPNNIANSSILH--ILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVIAPTVEHDQNAETPFSIGPPG

Query:  DNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCT--NCGNFDIG
         + S  LL +AKS+N++L+I+R  +L         +     E L+VC QFR   L H A +ER E L+  R + P+ V+LSE +        + G+    
Subjt:  DNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCT--NCGNFDIG

Query:  FTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAK--ALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRV-DEKDGCVGL
        F  ++E LWRFL+STS+AFKG++ EER+L+E EA    A  +     E  E W ERM  AGF    FG + +++AR+ +R+YD+ WEM         V L
Subjt:  FTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAK--ALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRV-DEKDGCVGL

Query:  WWKGQPVSFCSFWK
         WKGQPVSFCS W+
Subjt:  WWKGQPVSFCSFWK

Q9LRW3 Scarecrow-like protein 297.2e-13051.13Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQN
        DH+L WL DSV     P  DDSY     +  Q W+ +Q   QD  +G + S    +S              ++L    +AP   +  P +IQQ      N
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQN

Query:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY
        +QSR    +S         +KKS  +KR + K++  ++ +G NKEGRWAE+LLNPCA AI   +++RV H LCVL ELAS +GDAN RLAA+GLRAL H+
Subjt:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY

Query:  LSSNSSCSSSSSTLAPV-TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRL
        LSS    SS SS+  PV TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG+QWPTLLEAL+ R  GPPP +R+
Subjt:  LSSNSSCSSSSSTLAPV-TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRL

Query:  TVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKA
        TVI+     D  A+ PFS+GPPG N  S+LL FA+SL INLQI+ LD L       Q+I   P E LIVCAQFRLH LKH   DER E L+ +R + PK 
Subjt:  TVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKA

Query:  VILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRR
        V+L ENN  CS  +  +F  GF++K+EY+W+FLDSTSS FK   SEERKLMEGEA K L N G+MNE  EKW ERMR AGF  + F ED +D A++ +R+
Subjt:  VILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRR

Query:  YDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWK
        YDNNWE+R+++ D   GL WKG+ VSFCS WK
Subjt:  YDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWK

Q9SN22 Scarecrow-like protein 321.1e-3429.41Show/hide
Query:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFP
        +  + EQLL  CA AI   DA   H +L VL  +A P GD+  RL +  LRAL     S +   SS+ +  P   A    RF    L  F +++PW  F 
Subjt:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFP

Query:  NNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINR
           AN++IL  +         +HI+D+ ++H +Q PTL++A+  R   PPPL++LTV++       +   P  I    + + S+L++FA + NI ++   
Subjt:  NNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINR

Query:  L-----DSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDE---------RFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWR
        +     D  S L    ++     +E L+V      H +  Y P+E         R  FL+ LR + P+ V L E ++  +  N  N          Y W 
Subjt:  L-----DSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDE---------RFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWR

Query:  FLDSTSSAFKGRESEERKLMEGEAAKALRN----EG----EMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVDEKDGCVGLWWKGQ
          D+T +      SE+R+  E E +  + N    EG    E  E   +W ERMR A F      ED +   +A +  +   W M+ ++ D  + L WKG 
Subjt:  FLDSTSSAFKGRESEERKLMEGEAAKALRN----EG----EMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVDEKDGCVGLWWKGQ

Query:  PVSFCSFW
         V F + W
Subjt:  PVSFCSFW

Arabidopsis top hitse value%identityAlignment
AT3G03450.1 RGA-like 28.4e-2526.68Show/hide
Query:  CANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHI
        CA AI + +      L+  +  LA     A  ++A Y  +ALA  +  + +  +          A+ +P F +   + F+E  P+  F +  AN +IL  
Subjt:  CANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHI

Query:  LSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNA
          E    +R +H++D+G++ G+QWP L++AL  R GGPP   RLT I P     +N+++   +G        +L  FA+++ +  +   L + SL  L  
Subjt:  LSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNA

Query:  QVIGKFPD-EILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEA
        ++    P+ E L+V + F LH+L   +     + L  ++ I+P  V + E     +  N   F   F   + Y     DS   ++    S++R + E   
Subjt:  QVIGKFPD-EILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEA

Query:  AKALRN----EG----EMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKL
         + + N    EG    E +E   +W  RM++AGF     G      A   +  Y      RV+E DGC+ + W+ +P+   S WKL
Subjt:  AKALRN----EG----EMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKL

AT3G13840.1 GRAS family transcription factor5.1e-13151.13Show/hide
Query:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQN
        DH+L WL DSV     P  DDSY     +  Q W+ +Q   QD  +G + S    +S              ++L    +AP   +  P +IQQ      N
Subjt:  DHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQN

Query:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY
        +QSR    +S         +KKS  +KR + K++  ++ +G NKEGRWAE+LLNPCA AI   +++RV H LCVL ELAS +GDAN RLAA+GLRAL H+
Subjt:  NQSRNGADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHY

Query:  LSSNSSCSSSSSTLAPV-TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRL
        LSS    SS SS+  PV TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG+QWPTLLEAL+ R  GPPP +R+
Subjt:  LSSNSSCSSSSSTLAPV-TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRL

Query:  TVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKA
        TVI+     D  A+ PFS+GPPG N  S+LL FA+SL INLQI+ LD L       Q+I   P E LIVCAQFRLH LKH   DER E L+ +R + PK 
Subjt:  TVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKA

Query:  VILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRR
        V+L ENN  CS  +  +F  GF++K+EY+W+FLDSTSS FK   SEERKLMEGEA K L N G+MNE  EKW ERMR AGF  + F ED +D A++ +R+
Subjt:  VILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRR

Query:  YDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWK
        YDNNWE+R+++ D   GL WKG+ VSFCS WK
Subjt:  YDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWK

AT3G49950.1 GRAS family transcription factor8.1e-3629.41Show/hide
Query:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFP
        +  + EQLL  CA AI   DA   H +L VL  +A P GD+  RL +  LRAL     S +   SS+ +  P   A    RF    L  F +++PW  F 
Subjt:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFP

Query:  NNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINR
           AN++IL  +         +HI+D+ ++H +Q PTL++A+  R   PPPL++LTV++       +   P  I    + + S+L++FA + NI ++   
Subjt:  NNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINR

Query:  L-----DSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDE---------RFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWR
        +     D  S L    ++     +E L+V      H +  Y P+E         R  FL+ LR + P+ V L E ++  +  N  N          Y W 
Subjt:  L-----DSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDE---------RFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWR

Query:  FLDSTSSAFKGRESEERKLMEGEAAKALRN----EG----EMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVDEKDGCVGLWWKGQ
          D+T +      SE+R+  E E +  + N    EG    E  E   +W ERMR A F      ED +   +A +  +   W M+ ++ D  + L WKG 
Subjt:  FLDSTSSAFKGRESEERKLMEGEAAKALRN----EG----EMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVDEKDGCVGLWWKGQ

Query:  PVSFCSFW
         V F + W
Subjt:  PVSFCSFW

AT4G37650.1 GRAS family transcription factor3.1e-3529.43Show/hide
Query:  RWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNN
        +WA+ +L   A A    D  R   +L  L EL+SP GD   +LA+Y L+AL + ++ +      +   A  T  +      +++++KF EVSPW  F + 
Subjt:  RWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNN

Query:  IANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINR
         AN +IL  +  E      +HI+DI  +   QWPTLLEAL  RS   P L RLT  V+A    +DQ A            I +R+  FA+ + +  + N 
Subjt:  IANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VIAPTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQINR

Query:  LDSLSLLS-LNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRF----LDSTSSAF
        +  +  LS  +   +   PDE+L +     +H +       R   + + R++ P+ V + E          G FD  F R      R+     +S   +F
Subjt:  LDSLSLLS-LNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRF----LDSTSSAF

Query:  KGRESEERKLMEGEAAKAL--------RNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY-DNNWEMRVDEKDGCVGLWWKGQPVSFCSFW
          R S ER ++E  A +A+         +  E  E   KW  RMRN+GF    + ++  D  RA +RRY +  W M        + L W+ QPV + S W
Subjt:  KGRESEERKLMEGEAAKAL--------RNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRY-DNNWEMRVDEKDGCVGLWWKGQPVSFCSFW

Query:  K
        +
Subjt:  K

AT5G66770.1 GRAS family transcription factor1.3e-2529.08Show/hide
Query:  CANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHI
        CA  I   D       L  ++E  S  GD   R+A Y   AL++ LS NS  +SSSS       +ST+      S    ++  P+  F +  AN +IL  
Subjt:  CANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSSTLAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHI

Query:  LSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVI-APTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQ-INRLDSLSLLSL
          E   +S  +HI+D G+  G+QWP LL+AL  R+ G P  IR++ I AP++      E+P    P      +RL  FAK L++N   I  L  + L  L
Subjt:  LSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVI-APTVEHDQNAETPFSIGPPGDNISSRLLSFAKSLNINLQ-INRLDSLSLLSL

Query:  NAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFK---GRESEERKLM
        N       PDE+L V    +L++L    P      L+  + + P+ V L E  ++ +        +GF  +V+   +F  +   + +   GR+SEER  +
Subjt:  NAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTSSAFK---GRESEERKLM

Query:  EGE----------AAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYD-NNWEMRVDEKDGCVGLWWKGQPVSFCSFWK
        E E            +      E  EE E+W   M NAGF         +  A+  +  Y+ +N    V+ K G + L W   P+   S W+
Subjt:  EGE----------AAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYD-NNWEMRVDEKDGCVGLWWKGQPVSFCSFWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCATTGAAGATCATATATTGGATTGGTTAGCGGATTCAGTTCCTTTTTTTTCTTCCCCGTTCCCGGATGATTCCTACAACTCTAGCTCTATAAACTGCTATCAATG
GTGGGATGAGAACCAAGACATAGGCCAAGATCTGATTAATGGCTGTCTAAGCAGTTCCCCCACCACTGTCAGTACTAAACCACCTAACACTTCCACTTCCCATCACTTGA
CGCCATCTGATTTGACTAAGAAAAGAAAAGCTCCGGATGATACAGTTCATAAACCATCGCAAATCCAACAGAATCAGAGGAAGAACCAGAACAATCAGAGCAGAAATGGT
GCAGATAAAAGCAGTGGAGCTGTTGCAGGTGTTACTGTGATGAAGAAATCAGTGGGGAACAAAAGGAATTCATCAAAAGCCACAGGGAATAACAATAATAATGGGAATAA
CAAGGAAGGAAGGTGGGCAGAGCAATTGCTAAATCCTTGTGCTAATGCTATCATAAAAGGGGATGCAACAAGAGTACATCACCTTCTTTGTGTTCTTCAAGAGCTCGCCT
CGCCCACCGGCGATGCTAACCACCGGCTCGCCGCCTATGGTCTCCGAGCTTTGGCCCATTACCTGTCCTCCAATTCTTCATGTTCTTCTTCTTCTTCCACACTTGCGCCG
GTTACTTTTGCTTCGACGGACCCTCGATTCTTTCAGAGGTCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGCTTTTCCAAATAACATCGCAAATTCTTCGATTCT
CCACATTCTCTCTGAAGAACCAAATCGCTCGCGCAATCTTCACATTCTTGACATCGGGGTTTCTCATGGTGTGCAATGGCCGACGCTGCTCGAGGCCTTAACTCGCCGTT
CCGGTGGACCCCCGCCGCTAATTCGCCTCACAGTCATCGCTCCAACCGTCGAACATGATCAAAATGCAGAGACGCCGTTTTCGATTGGTCCACCGGGAGACAACATCTCC
TCTCGGCTTCTTAGCTTCGCGAAATCATTAAACATCAATTTACAGATCAACCGCCTCGATAGTCTCTCGCTACTGAGTTTAAATGCGCAAGTAATCGGCAAGTTTCCGGA
CGAAATCTTAATCGTTTGCGCACAGTTCAGACTCCACCAACTGAAACACTACGCTCCAGACGAAAGATTCGAGTTCTTACAAAACCTAAGAAAGATAGAACCGAAAGCTG
TGATTCTAAGCGAAAACAACATGGCATGTAGCTGTACCAACTGCGGAAATTTCGACATCGGATTCACAAGAAAAGTTGAATACCTATGGAGATTCCTGGATTCAACAAGC
TCCGCATTCAAAGGTCGAGAAAGCGAGGAAAGAAAGTTGATGGAAGGCGAAGCCGCAAAGGCGTTGAGGAACGAAGGCGAAATGAACGAGGAAATGGAAAAATGGTGCGA
AAGAATGAGAAATGCTGGATTTGCAAGAAAGTTGTTCGGTGAAGACACCATTGATACGGCTCGAGCTTCAATGAGAAGGTATGATAACAACTGGGAGATGAGAGTTGATG
AGAAAGATGGATGCGTAGGGCTATGGTGGAAAGGGCAACCTGTTTCGTTTTGTTCGTTTTGGAAGTTGGGGATGAAATCCAATGGCGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCATTGAAGATCATATATTGGATTGGTTAGCGGATTCAGTTCCTTTTTTTTCTTCCCCGTTCCCGGATGATTCCTACAACTCTAGCTCTATAAACTGCTATCAATG
GTGGGATGAGAACCAAGACATAGGCCAAGATCTGATTAATGGCTGTCTAAGCAGTTCCCCCACCACTGTCAGTACTAAACCACCTAACACTTCCACTTCCCATCACTTGA
CGCCATCTGATTTGACTAAGAAAAGAAAAGCTCCGGATGATACAGTTCATAAACCATCGCAAATCCAACAGAATCAGAGGAAGAACCAGAACAATCAGAGCAGAAATGGT
GCAGATAAAAGCAGTGGAGCTGTTGCAGGTGTTACTGTGATGAAGAAATCAGTGGGGAACAAAAGGAATTCATCAAAAGCCACAGGGAATAACAATAATAATGGGAATAA
CAAGGAAGGAAGGTGGGCAGAGCAATTGCTAAATCCTTGTGCTAATGCTATCATAAAAGGGGATGCAACAAGAGTACATCACCTTCTTTGTGTTCTTCAAGAGCTCGCCT
CGCCCACCGGCGATGCTAACCACCGGCTCGCCGCCTATGGTCTCCGAGCTTTGGCCCATTACCTGTCCTCCAATTCTTCATGTTCTTCTTCTTCTTCCACACTTGCGCCG
GTTACTTTTGCTTCGACGGACCCTCGATTCTTTCAGAGGTCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGCTTTTCCAAATAACATCGCAAATTCTTCGATTCT
CCACATTCTCTCTGAAGAACCAAATCGCTCGCGCAATCTTCACATTCTTGACATCGGGGTTTCTCATGGTGTGCAATGGCCGACGCTGCTCGAGGCCTTAACTCGCCGTT
CCGGTGGACCCCCGCCGCTAATTCGCCTCACAGTCATCGCTCCAACCGTCGAACATGATCAAAATGCAGAGACGCCGTTTTCGATTGGTCCACCGGGAGACAACATCTCC
TCTCGGCTTCTTAGCTTCGCGAAATCATTAAACATCAATTTACAGATCAACCGCCTCGATAGTCTCTCGCTACTGAGTTTAAATGCGCAAGTAATCGGCAAGTTTCCGGA
CGAAATCTTAATCGTTTGCGCACAGTTCAGACTCCACCAACTGAAACACTACGCTCCAGACGAAAGATTCGAGTTCTTACAAAACCTAAGAAAGATAGAACCGAAAGCTG
TGATTCTAAGCGAAAACAACATGGCATGTAGCTGTACCAACTGCGGAAATTTCGACATCGGATTCACAAGAAAAGTTGAATACCTATGGAGATTCCTGGATTCAACAAGC
TCCGCATTCAAAGGTCGAGAAAGCGAGGAAAGAAAGTTGATGGAAGGCGAAGCCGCAAAGGCGTTGAGGAACGAAGGCGAAATGAACGAGGAAATGGAAAAATGGTGCGA
AAGAATGAGAAATGCTGGATTTGCAAGAAAGTTGTTCGGTGAAGACACCATTGATACGGCTCGAGCTTCAATGAGAAGGTATGATAACAACTGGGAGATGAGAGTTGATG
AGAAAGATGGATGCGTAGGGCTATGGTGGAAAGGGCAACCTGTTTCGTTTTGTTCGTTTTGGAAGTTGGGGATGAAATCCAATGGCGGTTGA
Protein sequenceShow/hide protein sequence
MTIEDHILDWLADSVPFFSSPFPDDSYNSSSINCYQWWDENQDIGQDLINGCLSSSPTTVSTKPPNTSTSHHLTPSDLTKKRKAPDDTVHKPSQIQQNQRKNQNNQSRNG
ADKSSGAVAGVTVMKKSVGNKRNSSKATGNNNNNGNNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAYGLRALAHYLSSNSSCSSSSSTLAP
VTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNAETPFSIGPPGDNIS
SRLLSFAKSLNINLQINRLDSLSLLSLNAQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLRKIEPKAVILSENNMACSCTNCGNFDIGFTRKVEYLWRFLDSTS
SAFKGRESEERKLMEGEAAKALRNEGEMNEEMEKWCERMRNAGFARKLFGEDTIDTARASMRRYDNNWEMRVDEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG