; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g0189 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g0189
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionnodulation-signaling pathway 1 protein
Genome locationMC09:1668858..1670444
RNA-Seq ExpressionMC09g0189
SyntenyMC09g0189
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0042446 - hormone biosynthetic process (biological process)
GO:2000032 - regulation of secondary shoot formation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR005202 - Transcription factor GRAS
IPR030015 - Scarecrow-like protein 29/nodulation signalling pathway 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148342.1 nodulation-signaling pathway 1 protein [Momordica charantia]0.099.81Show/hide
Query:  PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPR
        PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPR
Subjt:  PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPR

Query:  KNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHL
        KNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHL
Subjt:  KNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHL

Query:  SSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVV
        SSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVV
Subjt:  SSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVV

Query:  APTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVIL
        APTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVIL
Subjt:  APTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVIL

Query:  SENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYDN
        SENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAE TIDTARASMRRYDN
Subjt:  SENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYDN

Query:  NWEMRMEEKDGCVGLWWKGQPISFCSFWK
        NWEMRMEEKDGCVGLWWKGQPISFCSFWK
Subjt:  NWEMRMEEKDGCVGLWWKGQPISFCSFWK

XP_022964298.1 nodulation-signaling pathway 1 protein [Cucurbita moschata]0.084.83Show/hide
Query:  DHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQNPRK
        DHILDWL DS PFFS PF D++YNSSSINCYQWWDE+Q+IGQDLINGCLSSSP   TT ST PPNT++   LTPSDL+KKRKAPDDT HK +Q  QN RK
Subjt:  DHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQNPRK

Query:  NQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRAL
        NQNNQS+NGADK  G V     +KKSVGNK++SSK+TGNN NNG+NKEGRWAEQLLNPCANAIIKGDATRVHHL+CVLQELASPTGDANHRLA +GLRAL
Subjt:  NQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRAL

Query:  AHHLSSNSS-SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI
        AH+LSSNSS SSSSTLAPV  FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEEPN SRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI
Subjt:  AHHLSSNSS-SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI

Query:  RLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEP
        RLTV+APTVEHDQ AETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL +LN+QVIGKF DEILIVCA FRLHQLKH APDER EFL+NLRK+EP
Subjt:  RLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEP

Query:  NAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASM
         AVILSENN+ACSC+NCGNFD  FTR+VEYLWRFLDSTSSAFKGRES+ER++MEGEAAKAL N+GEMNEE EKW ERMRNAGFARK F EDTIDTARASM
Subjt:  NAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASM

Query:  RRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
        RRYDNNWEMR+EEKDGCVGLWWKGQP+SFCSFWK
Subjt:  RRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

XP_023000031.1 nodulation-signaling pathway 1 protein [Cucurbita maxima]0.085.34Show/hide
Query:  NHPSDHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQ
        NHPSDHILDWL DS PFFS PF D++YNSSSINCYQWWDE+Q+IGQDLINGCLSSSP   TT ST PPNT++   LTPSDL+KKRKAPDDT HK +Q  Q
Subjt:  NHPSDHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQ

Query:  NPRKNQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHG
        N RKNQNNQSKNGADKG G V     +KKSVGNK++SSK+TGNN +NGSNKEGRWAEQLLNPCANAIIKGDATRVHHL+CVLQELASPTGDANHRLA +G
Subjt:  NPRKNQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHG

Query:  LRALAHHLSSNSS--SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG
        LRALAH+LSSNSS  SSSST+APV  FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEEPN SRNLHILDIGVSHGVQWPTLLEALTRRSGG
Subjt:  LRALAHHLSSNSS--SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG

Query:  PPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNL
        PPPLIRLTV+APTVEHDQ AETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SLQ+ NSQVIGK  DEILIVCA FRLHQLKH APDER EFL+NL
Subjt:  PPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNL

Query:  RKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDT
        RK+EP AVILSENN+ACSC+NCGNFD  FTR+VEYLWRFLDSTSSAFKGRES+ER+VMEGEAA+ LTNQGEMNEE EKW ERMRNAGFARK F EDTIDT
Subjt:  RKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDT

Query:  ARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
        ARASMRRYDNNWEMR+EEKDGCVGLWWKGQP+SFCSFWK
Subjt:  ARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

XP_023514155.1 nodulation-signaling pathway 1 protein [Cucurbita pepo subsp. pepo]0.085.23Show/hide
Query:  DHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQNPRK
        DHILDWL DS PFFS PF D++YNSSSINCYQWWDE+Q+IGQDLINGCLSSSP   TT ST PPN ++   LTPSDL+KKRKAPDDT HK +Q  QN RK
Subjt:  DHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQNPRK

Query:  NQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRAL
        NQNNQSKNGADK  G V     +KKSVGNK++SSK+TG+N NNG+NKEGRWAEQLLNPCANAIIKGDATRVHHL+CVLQELASPTGDANHRLA +GLRAL
Subjt:  NQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRAL

Query:  AHHLSSNSS--SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        AH+LSSNSS  SSSSTLAPV  FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEEPN SRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  AHHLSSNSS--SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKME
        IRLTV+APT+EHDQ AETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SLQ+LNSQVIGKF DEILIVCA FRLHQLKH APDER EFL+NLRKME
Subjt:  IRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKME

Query:  PNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARAS
        P AVILSENN+ACSC+NCGNFD  FTR+VEYLWRFLDSTSSAFKGRES+ER+VMEGEAAKAL N+GEMNEE EKW ERMRNAGFARK F EDTIDTARAS
Subjt:  PNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARAS

Query:  MRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
        MRRYDNNWEMR+EEKDGCVGLWWKGQP+SFCSFWK
Subjt:  MRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

XP_038897214.1 protein NODULATION SIGNALING PATHWAY 1 [Benincasa hispida]0.083.67Show/hide
Query:  PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPR
        PNHPSDHILDWLEDS PFFSPFLDET NSSSINCYQWWD +Q+ G+DLING LS+SP   +T     P +  LTPSDL+KKRKAPDD+ HK +Q HQN R
Subjt:  PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPR

Query:  KNQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRA
        KNQNNQSKNG   GGG V     +KKSVGNKK+SSK TGNN NNGSN+EGRWAEQLLNPCA+AIIKGDATRVHHL+CVLQELASPTGDANHRLADHGLRA
Subjt:  KNQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRA

Query:  LAHHLSSNSSSSS-----STLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG
        LAHHLSSNSSSSS     ST+APV  FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEEPN  RNLHILDIGVSHGVQWPTLLEALTRRSGG
Subjt:  LAHHLSSNSSSSS-----STLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG

Query:  PPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNL
        PP LIRLTV+ PT+EHDQ  ETPFSIGPPGDNISSRLLSFAKSLNINLQINRL+NHSLQ+LNSQVI KF DEILIVCA FRLHQLKH  PDER EFL+NL
Subjt:  PPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNL

Query:  RKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDT
        RKMEP AVILSENN+ CSCSNCGNFD  FTRRVEYLWRFLDSTS+AFKGRES+ERRVMEGEAAKALTN GEMNEEK KW ERMRNAGF RK FAEDTIDT
Subjt:  RKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDT

Query:  ARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
        ARASMRRYDNNWEMR+EEKDGC+GLWWKGQP+SFCSFWK
Subjt:  ARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

TrEMBL top hitse value%identityAlignment
A0A0A0KCK6 GRAS domain-containing protein0.083.06Show/hide
Query:  PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTT---SLTPSDLSKKRKAPDDTGHKTTQPHQ
        PNHPSDHILDWLEDS PFFS FLDET NSSSINCYQWWDE+Q+ G+DLINGCLS+SP   T  ST  PNT    SLTPSDL+KKRKAPDD+ HK +Q HQ
Subjt:  PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTT---SLTPSDLSKKRKAPDDTGHKTTQPHQ

Query:  NPRKNQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHG
        NPRKNQNNQSKN ADKG G V     +KKSVGNKK++SKSTGNNYN+GSNKEGRWAEQLLNPCANAI+KGDATRVHHL+CVLQELASPTGDANHRLADHG
Subjt:  NPRKNQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHG

Query:  LRALAHHLSSNSSSSS-----STLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRR
        LRALA+HLSSNSSSSS     ST+AP   FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEE N  RNLHILDIGVSHGVQWPTLLEALTRR
Subjt:  LRALAHHLSSNSSSSS-----STLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRR

Query:  SGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFL
        SGGPPPLIRLTV+APT+EHDQ  ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQ+LNSQ I K RDEILIVCA FRLHQLKH APDER EFL
Subjt:  SGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFL

Query:  RNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFARKFFAED
         NLRKMEP AVILSENN+ CSCS CGNF++ F R VEY+W+FLDSTS+AFKGRES+ERRVMEGEAAKAL N  GEMNEEK KW ERMRN GF RK F ED
Subjt:  RNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFARKFFAED

Query:  TIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
        TIDTARASMRRYDNNWEMRME+KDGCVGLWWKGQP+SFCSFWK
Subjt:  TIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

A0A5A7V101 Nodulation-signaling pathway 1 protein2.09e-31381.95Show/hide
Query:  PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQ
        P+HPSDHILDWLEDS PFFS FLDET NSSSINCYQWWDE+Q+ G+DLINGCLS+SP   T  ST  PNT +   L PSDL+KKRKAPDD+ HK +Q HQ
Subjt:  PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQ

Query:  NPRKNQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHG
        NPRKNQNNQSKN ADKG G V     +KKSVGNKK++SKSTGNNYNNGSNKEGRWAEQLLNPCANAI+KGDATRVHHL+CVLQELASPTGDANHRLADHG
Subjt:  NPRKNQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHG

Query:  LRALAHHLSSNSSSSS-----STLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRR
        LRALA+HLSSNSSSSS     ST++P+  FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEE N  RNLHILDIGVSHGVQWPTLLEALTRR
Subjt:  LRALAHHLSSNSSSSS-----STLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRR

Query:  SGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFL
        SGGPPPLIRLTV+ PTVEHDQ  ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQ+LNSQ I K RDEILIVC+ FRLHQLKH APDER EFL
Subjt:  SGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFL

Query:  RNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFARKFFAED
        +NLRKMEP AVILSENN+ CSCS C NF++ F R VEY+W+FLDSTS+AFKGRES+ERRVMEGEAAKAL N +GEMNEEK KW ERMRN GF RK F ED
Subjt:  RNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFARKFFAED

Query:  TIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
        TIDTARASMRRYDNNWEMRME+KDGCVGLWWKGQP+SFCS WK
Subjt:  TIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

A0A6J1D4T6 nodulation-signaling pathway 1 protein0.099.81Show/hide
Query:  PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPR
        PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPR
Subjt:  PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPR

Query:  KNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHL
        KNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHL
Subjt:  KNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHL

Query:  SSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVV
        SSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVV
Subjt:  SSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVV

Query:  APTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVIL
        APTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVIL
Subjt:  APTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVIL

Query:  SENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYDN
        SENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAE TIDTARASMRRYDN
Subjt:  SENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYDN

Query:  NWEMRMEEKDGCVGLWWKGQPISFCSFWK
        NWEMRMEEKDGCVGLWWKGQPISFCSFWK
Subjt:  NWEMRMEEKDGCVGLWWKGQPISFCSFWK

A0A6J1HKE2 nodulation-signaling pathway 1 protein0.084.83Show/hide
Query:  DHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQNPRK
        DHILDWL DS PFFS PF D++YNSSSINCYQWWDE+Q+IGQDLINGCLSSSP   TT ST PPNT++   LTPSDL+KKRKAPDDT HK +Q  QN RK
Subjt:  DHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQNPRK

Query:  NQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRAL
        NQNNQS+NGADK  G V     +KKSVGNK++SSK+TGNN NNG+NKEGRWAEQLLNPCANAIIKGDATRVHHL+CVLQELASPTGDANHRLA +GLRAL
Subjt:  NQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRAL

Query:  AHHLSSNSS-SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI
        AH+LSSNSS SSSSTLAPV  FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEEPN SRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI
Subjt:  AHHLSSNSS-SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI

Query:  RLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEP
        RLTV+APTVEHDQ AETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL +LN+QVIGKF DEILIVCA FRLHQLKH APDER EFL+NLRK+EP
Subjt:  RLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEP

Query:  NAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASM
         AVILSENN+ACSC+NCGNFD  FTR+VEYLWRFLDSTSSAFKGRES+ER++MEGEAAKAL N+GEMNEE EKW ERMRNAGFARK F EDTIDTARASM
Subjt:  NAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASM

Query:  RRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
        RRYDNNWEMR+EEKDGCVGLWWKGQP+SFCSFWK
Subjt:  RRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

A0A6J1KER4 nodulation-signaling pathway 1 protein0.085.34Show/hide
Query:  NHPSDHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQ
        NHPSDHILDWL DS PFFS PF D++YNSSSINCYQWWDE+Q+IGQDLINGCLSSSP   TT ST PPNT++   LTPSDL+KKRKAPDDT HK +Q  Q
Subjt:  NHPSDHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQ

Query:  NPRKNQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHG
        N RKNQNNQSKNGADKG G V     +KKSVGNK++SSK+TGNN +NGSNKEGRWAEQLLNPCANAIIKGDATRVHHL+CVLQELASPTGDANHRLA +G
Subjt:  NPRKNQNNQSKNGADKGGGGV-----VKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHG

Query:  LRALAHHLSSNSS--SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG
        LRALAH+LSSNSS  SSSST+APV  FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEEPN SRNLHILDIGVSHGVQWPTLLEALTRRSGG
Subjt:  LRALAHHLSSNSS--SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG

Query:  PPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNL
        PPPLIRLTV+APTVEHDQ AETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SLQ+ NSQVIGK  DEILIVCA FRLHQLKH APDER EFL+NL
Subjt:  PPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNL

Query:  RKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDT
        RK+EP AVILSENN+ACSC+NCGNFD  FTR+VEYLWRFLDSTSSAFKGRES+ER+VMEGEAA+ LTNQGEMNEE EKW ERMRNAGFARK F EDTIDT
Subjt:  RKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDT

Query:  ARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
        ARASMRRYDNNWEMR+EEKDGCVGLWWKGQP+SFCSFWK
Subjt:  ARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

SwissProt top hitse value%identityAlignment
A1DQP9 Protein NODULATION SIGNALING PATHWAY 11.8e-17362.66Show/hide
Query:  SDHILDWLEDSAPFFSPFLDETYNSSS-INCYQWWDESQNIGQDLINGCLSSSPAA-----ATTDSTTPPNTTSLTP---------SDLSKKRKAPDDTG
        SDHILDWLE S  FF  FLDE  N+S  I  Y  WD+ Q       +    SSP A     AT  +T+  +TTSL P         SDL KKR A D++ 
Subjt:  SDHILDWLEDSAPFFSPFLDETYNSSS-INCYQWWDESQNIGQDLINGCLSSSPAA-----ATTDSTTPPNTTSLTP---------SDLSKKRKAPDDTG

Query:  HKTTQPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLA
         K   P QN  KN+  +++   +   G  V+K   NKK  +K+ G+N N+G++KEGRWAEQLLNPCA AI  G+  RV HL+ VL ELASPTGD NHRLA
Subjt:  HKTTQPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLA

Query:  DHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNL-SRNLHILDIGVSHGVQWPTLLEALTRRS
         HGLRAL HHLS  SSSSS T +  + FAST+ RFFQ+SL+KF+EVSPWF+ PNNIAN+SIL  L+EE N+ SR LHILDIGVSHGVQWPTLL+AL+RRS
Subjt:  DHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNL-SRNLHILDIGVSHGVQWPTLLEALTRRS

Query:  GGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLR
        GGPP ++RLTVV  T E+DQ  ETPFS  PPG N   RLL +A+S+NINLQINR++NHSLQ LN+Q I    DEILIVCA FRLH L H +PDER+EFL+
Subjt:  GGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLR

Query:  NLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTI
         LR MEP  VILSENN  C CS CGNF   FTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQ EMNEEKEKW  RM+ AGFA + F ED +
Subjt:  NLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTI

Query:  DTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
        D  RA +R+YD+NWEM++EEK+  VGLWWKGQP+SFCS WK
Subjt:  DTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

Q4VYC8 Protein NODULATION SIGNALING PATHWAY 11.9e-17560.95Show/hide
Query:  PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNI-GQDLINGCLSSSPAAATTD----STTPPNTTSLTP--------SDLSKKRKAPDD
        PN  SDHILDWLE S  FF  FLD+ YN+  I+ Y+ W+++Q+I  Q  I+   +SS A  +T     ++T  +TTSL P        SDL KKR A D+
Subjt:  PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNI-GQDLINGCLSSSPAAATTD----STTPPNTTSLTP--------SDLSKKRKAPDD

Query:  TGHKTTQPHQNPRKNQNNQSKNGADKGG----GGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGD
           K  QP     K   ++  N +D G     G VV+KS GNKK ++K+ G+N NNG+NK+GRWAEQLLNPCA AI  G+  RV HL+ VL ELAS TGD
Subjt:  TGHKTTQPHQNPRKNQNNQSKNGADKGG----GGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGD

Query:  ANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEA
        ANHRLA HGLRAL HHL   SSSSSST +  + FAST+ RFFQ+SL+KF+E SPWF+ PNNIAN+SIL  L+EEPN  R LHILDIGVSHGVQWPT LEA
Subjt:  ANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEA

Query:  LTRRSGGPPPLIRLTVV--APTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPD
        L+RR GGPPPL+RLTVV  + + E+DQ  ETPFSIGP GD  SS LL +A+SLN+NLQI +LDNH LQ LN++ +    DE LIVCA FRLH L H  PD
Subjt:  LTRRSGGPPPLIRLTVV--APTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPD

Query:  ERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARK
        ER+EFL+ LR MEP  VILSENN+ C CS+CG+F   F+RRVEYLWRFLDSTSSAFK R+SDER++MEGEAAKALTNQ EMNE +EKW ERM+ AGFA +
Subjt:  ERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARK

Query:  FFAEDTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
         F ED ID  RA +R+YDNNWEM++EE    V LWWK QP+SFCS WK
Subjt:  FFAEDTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

Q84MQ9 Protein NODULATION SIGNALING PATHWAY 16.2e-8641.51Show/hide
Query:  WWDESQNIGQDLINGCL--------SSSPAAATTDSTTP-PNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKK
        WW  S    QD I   +        +++PAAA+    +P  ++ S  PS  SKKRK+P          H+ P        K G  KGGGG          
Subjt:  WWDESQNIGQDLINGCL--------SSSPAAATTDSTTP-PNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKK

Query:  SSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLA--------PVVNFAS
                    GS+++ RWAEQLLNPCA A+  G+ +RV HL  VL EL S +GDANHRLA HGLRALA  L +    +++           P   FA+
Subjt:  SSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLA--------PVVNFAS

Query:  TDARFFQRSLIKFHEVSPWFALPNNIANSSILH--TLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVVAPTVEHDQTAETPFSI
         + R F+ SLI+FHEVSPWFALPN +AN++I    T        R LH++D+GVSHGVQWPTLLE+LTR+ GG  PP +RLTVV P          PFS 
Subjt:  TDARFFQRSLIKFHEVSPWFALPNNIANSSILH--TLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVVAPTVEHDQTAETPFSI

Query:  GPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSE--NNIACSCSNCGN
         PPG + S  LL +AKS+N++L+I+R        L+  V G    E L+VC  FR   L H A +ER E LR  R + P  V+LSE  + +     + G+
Subjt:  GPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSE--NNIACSCSNCGN

Query:  FDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAK--ALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRM-EEKDG
            F  R+E LWRFL+STS+AFKG++ +ERR++E EA    A  +     E +E W ERM  AGF    F  + +++AR+ +R+YD+ WEM        
Subjt:  FDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAK--ALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRM-EEKDG

Query:  CVGLWWKGQPISFCSFWK
         V L WKGQP+SFCS W+
Subjt:  CVGLWWKGQPISFCSFWK

Q9LRW3 Scarecrow-like protein 293.6e-12649.35Show/hide
Query:  PNHPSDHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSS-----SPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQ
        PN   DH+L WLEDS      P  D++Y     +  Q W+  Q   QD  +G + S     S A    ++T     T     DL        D   +  Q
Subjt:  PNHPSDHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSS-----SPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQ

Query:  PHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLR
        P+   RK  ++             VKKS  +K+ + KS+  +  +G NKEGRWAE+LLNPCA AI   +++RV H +CVL ELAS +GDAN RLA  GLR
Subjt:  PHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLR

Query:  ALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        AL HHLS  SSS SS+  PV  FAS + + FQ++L+KF+EVSPWFALPNN+ANS+IL  L+++P   ++LHI+DIGVSHG+QWPTLLEAL+ R  GPPP 
Subjt:  ALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKME
        +R+TV++     D TA+ PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I     E LIVCA FRLH LKH   DER E L+ +R + 
Subjt:  IRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKME

Query:  PNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARAS
        P  V+L ENN  CS S   +F   F++++EY+W+FLDSTSS FK   S+ER++MEGEA K L N G+MNE KEKWYERMR AGF  + F ED +D A++ 
Subjt:  PNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARAS

Query:  MRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
        +R+YDNNWE+RME+ D   GL WKG+ +SFCS WK
Subjt:  MRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

Q9SN22 Scarecrow-like protein 326.5e-3529.03Show/hide
Query:  EGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPN
        +  + EQLL  CA AI   DA   H ++ VL  +A P GD+  RL    LRAL     S + + SST++  +  A    RF    L  F +++PW     
Subjt:  EGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPN

Query:  NIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRL
          AN++IL  +         +HI+D+ ++H +Q PTL++A+  R   PPPL++LTVV+       +   P  I    + + S+L++FA + NI ++   +
Subjt:  NIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRL

Query:  DNHSLQNLNS-----QVIGKFRDEILIVCAHFRL-----HQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDST
         +      +S     ++     +E L+V  H  L       L   +   RT FL+ LR + P  V L E ++  +  N  N          Y W   D+T
Subjt:  DNHSLQNLNS-----QVIGKFRDEILIVCAHFRL-----HQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDST

Query:  SSAFKGRESDERRVMEGE--------AAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFC
         +      S++RR  E E         AK    + E  E K +W ERMR A F      ED +   +A +  +   W M+ E+ D  + L WKG  + F 
Subjt:  SSAFKGRESDERRVMEGE--------AAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFC

Query:  SFW
        + W
Subjt:  SFW

Arabidopsis top hitse value%identityAlignment
AT3G03450.1 RGA-like 21.9e-2123.79Show/hide
Query:  KSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQ
        +SS +ST +     S + G      L  CA AI + +      L+  +  LA     A  ++A +  +ALA  +  + ++ +   A V      +  F +
Subjt:  KSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQ

Query:  RSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSR
           + F+E  P+    +  AN +IL    E    +R +H++D+G++ G+QWP L++AL  R GGPP      +  P  E+  + +           +  +
Subjt:  RSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSR

Query:  LLSFAKSLNINLQINRLDNHSLQNLNSQVI-GKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEY
        L  FA+++ +  +   L   SL +L  ++   +   E L+V + F LH+L  R+     + L  ++ ++P+ V + E     +  N   F   F   + Y
Subjt:  LLSFAKSLNINLQINRLDNHSLQNLNSQVI-GKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEY

Query:  LWRFLDSTSSAFKGRESDE--------RRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRMEEKDGCVGLWW
             DS   ++     D         R+++   AA+  +++ E +E   +W  RM++AGF            A   +  Y      R+EE DGC+ + W
Subjt:  LWRFLDSTSSAFKGRESDE--------RRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRMEEKDGCVGLWW

Query:  KGQPISFCSFWK
        + +P+   S WK
Subjt:  KGQPISFCSFWK

AT3G13840.1 GRAS family transcription factor2.6e-12749.35Show/hide
Query:  PNHPSDHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSS-----SPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQ
        PN   DH+L WLEDS      P  D++Y     +  Q W+  Q   QD  +G + S     S A    ++T     T     DL        D   +  Q
Subjt:  PNHPSDHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSS-----SPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQ

Query:  PHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLR
        P+   RK  ++             VKKS  +K+ + KS+  +  +G NKEGRWAE+LLNPCA AI   +++RV H +CVL ELAS +GDAN RLA  GLR
Subjt:  PHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLR

Query:  ALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        AL HHLS  SSS SS+  PV  FAS + + FQ++L+KF+EVSPWFALPNN+ANS+IL  L+++P   ++LHI+DIGVSHG+QWPTLLEAL+ R  GPPP 
Subjt:  ALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKME
        +R+TV++     D TA+ PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I     E LIVCA FRLH LKH   DER E L+ +R + 
Subjt:  IRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKME

Query:  PNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARAS
        P  V+L ENN  CS S   +F   F++++EY+W+FLDSTSS FK   S+ER++MEGEA K L N G+MNE KEKWYERMR AGF  + F ED +D A++ 
Subjt:  PNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARAS

Query:  MRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
        +R+YDNNWE+RME+ D   GL WKG+ +SFCS WK
Subjt:  MRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

AT3G49950.1 GRAS family transcription factor4.6e-3629.03Show/hide
Query:  EGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPN
        +  + EQLL  CA AI   DA   H ++ VL  +A P GD+  RL    LRAL     S + + SST++  +  A    RF    L  F +++PW     
Subjt:  EGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPN

Query:  NIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRL
          AN++IL  +         +HI+D+ ++H +Q PTL++A+  R   PPPL++LTVV+       +   P  I    + + S+L++FA + NI ++   +
Subjt:  NIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRL

Query:  DNHSLQNLNS-----QVIGKFRDEILIVCAHFRL-----HQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDST
         +      +S     ++     +E L+V  H  L       L   +   RT FL+ LR + P  V L E ++  +  N  N          Y W   D+T
Subjt:  DNHSLQNLNS-----QVIGKFRDEILIVCAHFRL-----HQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDST

Query:  SSAFKGRESDERRVMEGE--------AAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFC
         +      S++RR  E E         AK    + E  E K +W ERMR A F      ED +   +A +  +   W M+ E+ D  + L WKG  + F 
Subjt:  SSAFKGRESDERRVMEGE--------AAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFC

Query:  SFW
        + W
Subjt:  SFW

AT4G37650.1 GRAS family transcription factor7.4e-3428.68Show/hide
Query:  RWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFAST-DARFFQRSLIKFHEVSPWFALPNN
        +WA+ +L   A A    D  R   ++  L EL+SP GD   +LA + L+AL + ++ +      T+        T      +++++KF EVSPW    + 
Subjt:  RWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFAST-DARFFQRSLIKFHEVSPWFALPNN

Query:  IANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINR
         AN +IL  +  E      +HI+DI  +   QWPTLLEAL  RS   P L RLT  VVA    +DQTA            I +R+  FA+ + +  + N 
Subjt:  IANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINR

Query:  LDN-HSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRF----LDSTSSAF
        + +   L   +   +    DE+L +     +H +  R    R   + + R++ P  V + E          G FD  F R      R+     +S   +F
Subjt:  LDN-HSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRF----LDSTSSAF

Query:  KGRESDERRVMEGEAAKAL--------TNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRY-DNNWEMRMEEKDGCVGLWWKGQPISFCSFW
          R S+ER ++E  A +A+        ++  E  E   KW  RMRN+GF    ++++  D  RA +RRY +  W M        + L W+ QP+ + S W
Subjt:  KGRESDERRVMEGEAAKAL--------TNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRY-DNNWEMRMEEKDGCVGLWWKGQPISFCSFW

Query:  K
        +
Subjt:  K

AT5G66770.1 GRAS family transcription factor2.9e-2226.92Show/hide
Query:  CANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTL
        CA  I   D       +  ++E  S  GD   R+A +   AL++ LS NS ++SS        +S+       S    ++  P+    +  AN +IL   
Subjt:  CANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTL

Query:  SEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT-VVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNS
         E    S  +HI+D G+  G+QWP LL+AL  R+ G P  IR++ + AP++      E+P    P      +RL  FAK L++N     +    +  LN 
Subjt:  SEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT-VVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNS

Query:  QVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFK---GRESDER----R
               DE+L V    +L++L    P      LR  + + P  V L E  ++ +        + F  RV+   +F  +   + +   GR+S+ER    R
Subjt:  QVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFK---GRESDER----R

Query:  VMEGEAAKALTN------QGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYD-NNWEMRMEEKDGCVGLWWKGQPISFCSFWK
         + G     L          E  EEKE+W   M NAGF     +   +  A+  +  Y+ +N    +E K G + L W   P+   S W+
Subjt:  VMEGEAAKALTN------QGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYD-NNWEMRMEEKDGCVGLWWKGQPISFCSFWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CCAAACCACCCTTCAGATCACATATTGGACTGGTTAGAGGATTCAGCTCCTTTCTTTTCCCCATTCCTGGACGAGACTTACAACTCTAGCTCTATAAACTGCTATCAATG
GTGGGATGAGAGCCAAAACATAGGCCAAGATCTGATTAATGGCTGTCTCAGTAGCTCCCCTGCCGCTGCCACCACAGACAGTACTACACCACCAAACACTACCAGTTTGA
CGCCATCGGATTTGTCAAAGAAAAGGAAAGCCCCAGATGACACAGGTCATAAGACAACACAACCCCATCAGAACCCAAGGAAGAACCAGAACAATCAGAGCAAAAATGGT
GCAGATAAAGGCGGTGGGGGGGTGGTAAAGAAGTCAGTGGGGAACAAGAAAAGTTCATCAAAATCCACAGGAAATAATTATAATAACGGAAGTAACAAGGAAGGAAGGTG
GGCGGAGCAATTGCTAAATCCCTGTGCAAATGCTATCATAAAAGGAGATGCGACACGAGTACATCACCTTATTTGTGTTCTGCAAGAGCTCGCTTCCCCCACCGGCGACG
CCAATCACCGGCTCGCCGATCATGGTCTCCGAGCTCTGGCCCATCACCTGTCCTCCAATTCATCATCTTCTTCTTCCACACTTGCGCCGGTGGTTAATTTCGCTTCGACG
GACGCGCGATTCTTCCAGCGGTCGTTGATCAAATTCCACGAGGTGAGTCCCTGGTTTGCTCTTCCGAACAACATCGCGAATTCTTCAATCCTCCACACTCTCTCTGAAGA
ACCTAATCTCTCGCGCAATCTTCACATTCTTGACATTGGGGTTTCTCACGGTGTGCAATGGCCGACGCTGCTCGAGGCCTTGACTCGCCGTTCCGGTGGGCCTCCGCCGC
TAATTCGGCTCACAGTTGTCGCTCCCACCGTCGAACACGACCAAACTGCGGAGACGCCGTTCTCCATTGGTCCACCGGGAGACAACATCTCCTCTCGGCTACTTAGTTTC
GCCAAATCCTTGAACATCAATTTACAGATCAACCGCCTCGACAATCACTCGTTACAGAACTTAAATTCGCAAGTAATCGGTAAGTTCCGGGACGAAATTTTGATCGTTTG
CGCACACTTCAGACTCCACCAGTTGAAACACCGCGCTCCAGACGAAAGAACAGAGTTCCTACGAAATCTGAGAAAAATGGAGCCAAATGCAGTGATTCTGAGCGAAAACA
ACATAGCATGTAGCTGCAGCAACTGCGGGAATTTCGACATCACATTCACTCGGCGAGTGGAGTACTTGTGGAGGTTTCTGGACTCGACGAGCTCCGCATTCAAAGGGCGA
GAAAGCGACGAAAGAAGAGTGATGGAAGGAGAGGCGGCGAAGGCGTTGACGAATCAGGGCGAAATGAACGAGGAAAAGGAGAAATGGTACGAGAGAATGAGAAATGCAGG
ATTCGCTAGAAAATTCTTCGCAGAAGACACCATTGATACGGCTCGAGCTTCCATGAGAAGATATGACAATAACTGGGAAATGAGAATGGAAGAGAAAGATGGATGCGTGG
GGTTATGGTGGAAAGGGCAGCCAATTTCGTTTTGTTCGTTTTGGAAA
mRNA sequenceShow/hide mRNA sequence
CCAAACCACCCTTCAGATCACATATTGGACTGGTTAGAGGATTCAGCTCCTTTCTTTTCCCCATTCCTGGACGAGACTTACAACTCTAGCTCTATAAACTGCTATCAATG
GTGGGATGAGAGCCAAAACATAGGCCAAGATCTGATTAATGGCTGTCTCAGTAGCTCCCCTGCCGCTGCCACCACAGACAGTACTACACCACCAAACACTACCAGTTTGA
CGCCATCGGATTTGTCAAAGAAAAGGAAAGCCCCAGATGACACAGGTCATAAGACAACACAACCCCATCAGAACCCAAGGAAGAACCAGAACAATCAGAGCAAAAATGGT
GCAGATAAAGGCGGTGGGGGGGTGGTAAAGAAGTCAGTGGGGAACAAGAAAAGTTCATCAAAATCCACAGGAAATAATTATAATAACGGAAGTAACAAGGAAGGAAGGTG
GGCGGAGCAATTGCTAAATCCCTGTGCAAATGCTATCATAAAAGGAGATGCGACACGAGTACATCACCTTATTTGTGTTCTGCAAGAGCTCGCTTCCCCCACCGGCGACG
CCAATCACCGGCTCGCCGATCATGGTCTCCGAGCTCTGGCCCATCACCTGTCCTCCAATTCATCATCTTCTTCTTCCACACTTGCGCCGGTGGTTAATTTCGCTTCGACG
GACGCGCGATTCTTCCAGCGGTCGTTGATCAAATTCCACGAGGTGAGTCCCTGGTTTGCTCTTCCGAACAACATCGCGAATTCTTCAATCCTCCACACTCTCTCTGAAGA
ACCTAATCTCTCGCGCAATCTTCACATTCTTGACATTGGGGTTTCTCACGGTGTGCAATGGCCGACGCTGCTCGAGGCCTTGACTCGCCGTTCCGGTGGGCCTCCGCCGC
TAATTCGGCTCACAGTTGTCGCTCCCACCGTCGAACACGACCAAACTGCGGAGACGCCGTTCTCCATTGGTCCACCGGGAGACAACATCTCCTCTCGGCTACTTAGTTTC
GCCAAATCCTTGAACATCAATTTACAGATCAACCGCCTCGACAATCACTCGTTACAGAACTTAAATTCGCAAGTAATCGGTAAGTTCCGGGACGAAATTTTGATCGTTTG
CGCACACTTCAGACTCCACCAGTTGAAACACCGCGCTCCAGACGAAAGAACAGAGTTCCTACGAAATCTGAGAAAAATGGAGCCAAATGCAGTGATTCTGAGCGAAAACA
ACATAGCATGTAGCTGCAGCAACTGCGGGAATTTCGACATCACATTCACTCGGCGAGTGGAGTACTTGTGGAGGTTTCTGGACTCGACGAGCTCCGCATTCAAAGGGCGA
GAAAGCGACGAAAGAAGAGTGATGGAAGGAGAGGCGGCGAAGGCGTTGACGAATCAGGGCGAAATGAACGAGGAAAAGGAGAAATGGTACGAGAGAATGAGAAATGCAGG
ATTCGCTAGAAAATTCTTCGCAGAAGACACCATTGATACGGCTCGAGCTTCCATGAGAAGATATGACAATAACTGGGAAATGAGAATGGAAGAGAAAGATGGATGCGTGG
GGTTATGGTGGAAAGGGCAGCCAATTTCGTTTTGTTCGTTTTGGAAA
Protein sequenceShow/hide protein sequence
PNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPRKNQNNQSKNG
ADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFAST
DARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSF
AKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGR
ESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK