; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g02010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g02010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionnodulation-signaling pathway 1 protein
Genome locationchr9:1656410..1658044
RNA-Seq ExpressionMoc09g02010
SyntenyMoc09g02010
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0042446 - hormone biosynthetic process (biological process)
GO:2000032 - regulation of secondary shoot formation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR005202 - Transcription factor GRAS
IPR030015 - Scarecrow-like protein 29/nodulation signalling pathway 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141813.1 protein NODULATION SIGNALING PATHWAY 1 [Cucumis sativus]5.7e-25782.73Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTT---SLTPSDLSKKRKAPDDTGH
        MTIEE GPNHPSDHILDWLEDS PFFS FLDET NSSSINCYQWWDE+Q+ G+DLINGCLS+SP   T  ST  PNT    SLTPSDL+KKRKAPDD+ H
Subjt:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTT---SLTPSDLSKKRKAPDDTGH

Query:  KTTQPHQNPRKNQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDAN
        K +Q HQNPRKNQNNQSKN ADKG G      V+KKSVGNKK++SKSTGNNYN+GSNKEGRWAEQLLNPCANAI+KGDATRVHHL+CVLQELASPTGDAN
Subjt:  KTTQPHQNPRKNQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDAN

Query:  HRLADHGLRALAHHLSSNSSSS-----SSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTL
        HRLADHGLRALA+HLSSNSSSS     SST+AP   FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEE N  RNLHILDIGVSHGVQWPTL
Subjt:  HRLADHGLRALAHHLSSNSSSS-----SSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTL

Query:  LEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAP
        LEALTRRSGGPPPLIRLTV+APT+EHDQ  ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQ+LNSQ I K RDEILIVCA FRLHQLKH AP
Subjt:  LEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAP

Query:  DERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFA
        DER EFL NLRKMEP AVILSENN+ CSCS CGNF++ F R VEY+W+FLDSTS+AFKGRES+ERRVMEGEAAKAL N  GEMNEEK KW ERMRN GF 
Subjt:  DERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFA

Query:  RKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPN
        RK F E TIDTARASMRRYDNNWEMRME+KDGCVGLWWKGQP+SFCSFWKLG K N
Subjt:  RKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPN

XP_022148342.1 nodulation-signaling pathway 1 protein [Momordica charantia]0.0e+0099.82Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTT
        MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTT
Subjt:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTT

Query:  QPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGL
        QPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGL
Subjt:  QPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGL

Query:  RALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPP
        RALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPP
Subjt:  RALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPP

Query:  LIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKM
        LIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKM
Subjt:  LIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKM

Query:  EPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARA
        EPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAE TIDTARA
Subjt:  EPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARA

Query:  SMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG
        SMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG
Subjt:  SMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG

XP_023000031.1 nodulation-signaling pathway 1 protein [Cucurbita maxima]2.1e-26485.05Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSAPFF-SPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTG
        MTIEEPG NHPSDHILDWL DS PFF SPF D++YNSSSINCYQWWDE+Q+IGQDLINGCLSSSP   TT ST PPNT++   LTPSDL+KKRKAPDDT 
Subjt:  MTIEEPGPNHPSDHILDWLEDSAPFF-SPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTG

Query:  HKTTQPHQNPRKNQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDA
        HK +Q  QN RKNQNNQSKNGADKG G      V+KKSVGNK++SSK+TGNN +NGSNKEGRWAEQLLNPCANAIIKGDATRVHHL+CVLQELASPTGDA
Subjt:  HKTTQPHQNPRKNQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDA

Query:  NHRLADHGLRALAHHLSSNSS--SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLE
        NHRLA +GLRALAH+LSSNSS  SSSST+AP V FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEEPN SRNLHILDIGVSHGVQWPTLLE
Subjt:  NHRLADHGLRALAHHLSSNSS--SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDE
        ALTRRSGGPPPLIRLTV+APTVEHDQ AETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SLQ+ NSQVIGK  DEILIVCA FRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDE

Query:  RTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKF
        R EFL+NLRK+EP AVILSENN+ACSC+NCGNFD  FTR+VEYLWRFLDSTSSAFKGRES+ER+VMEGEAA+ LTNQGEMNEE EKW ERMRNAGFARK 
Subjt:  RTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKF

Query:  FAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG
        F E TIDTARASMRRYDNNWEMR+EEKDGCVGLWWKGQP+SFCSFWKLG K NGG
Subjt:  FAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG

XP_023514155.1 nodulation-signaling pathway 1 protein [Cucurbita pepo subsp. pepo]7.4e-25784.9Show/hide
Query:  DHILDWLEDSAPFF-SPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQNPRK
        DHILDWL DS PFF SPF D++YNSSSINCYQWWDE+Q+IGQDLINGCLSSSP   TT ST PPN ++   LTPSDL+KKRKAPDDT HK +Q  QN RK
Subjt:  DHILDWLEDSAPFF-SPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQNPRK

Query:  NQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRAL
        NQNNQSKNGADK  G      V+KKSVGNK++SSK+TG+N NNG+NKEGRWAEQLLNPCANAIIKGDATRVHHL+CVLQELASPTGDANHRLA +GLRAL
Subjt:  NQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRAL

Query:  AHHLSSNS--SSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        AH+LSSNS  SSSSSTLAP V FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEEPN SRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  AHHLSSNS--SSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKME
        IRLTV+APT+EHDQ AETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SLQ+LNSQVIGKF DEILIVCA FRLHQLKH APDER EFL+NLRKME
Subjt:  IRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKME

Query:  PNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARAS
        P AVILSENN+ACSC+NCGNFD  FTR+VEYLWRFLDSTSSAFKGRES+ER+VMEGEAAKAL N+GEMNEE EKW ERMRNAGFARK F E TIDTARAS
Subjt:  PNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARAS

Query:  MRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG
        MRRYDNNWEMR+EEKDGCVGLWWKGQP+SFCSFWKLG K NGG
Subjt:  MRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG

XP_038897214.1 protein NODULATION SIGNALING PATHWAY 1 [Benincasa hispida]1.3e-25883.33Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTT
        MTIEE GPNHPSDHILDWLEDS PFFSPFLDET NSSSINCYQWWD +Q+ G+DLING LS+SP   +T     P +  LTPSDL+KKRKAPDD+ HK +
Subjt:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTT

Query:  QPHQNPRKNQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRL
        Q HQN RKNQNNQSKNG   GGG      V+KKSVGNKK+SSK TGNN NNGSN+EGRWAEQLLNPCA+AIIKGDATRVHHL+CVLQELASPTGDANHRL
Subjt:  QPHQNPRKNQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRL

Query:  ADHGLRALAHHLSSNSSSS-----SSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEA
        ADHGLRALAHHLSSNSSSS     SST+AP V FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEEPN  RNLHILDIGVSHGVQWPTLLEA
Subjt:  ADHGLRALAHHLSSNSSSS-----SSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEA

Query:  LTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDER
        LTRRSGGPP LIRLTV+ PT+EHDQ  ETPFSIGPPGDNISSRLLSFAKSLNINLQINRL+NHSLQ+LNSQVI KF DEILIVCA FRLHQLKH  PDER
Subjt:  LTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDER

Query:  TEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFF
         EFL+NLRKMEP AVILSENN+ CSCSNCGNFD  FTRRVEYLWRFLDSTS+AFKGRES+ERRVMEGEAAKALTN GEMNEEK KW ERMRNAGF RK F
Subjt:  TEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFF

Query:  AEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPN
        AE TIDTARASMRRYDNNWEMR+EEKDGC+GLWWKGQP+SFCSFWKLG K N
Subjt:  AEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPN

TrEMBL top hitse value%identityAlignment
A0A0A0KCK6 GRAS domain-containing protein2.7e-25782.73Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTT---SLTPSDLSKKRKAPDDTGH
        MTIEE GPNHPSDHILDWLEDS PFFS FLDET NSSSINCYQWWDE+Q+ G+DLINGCLS+SP   T  ST  PNT    SLTPSDL+KKRKAPDD+ H
Subjt:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTT---SLTPSDLSKKRKAPDDTGH

Query:  KTTQPHQNPRKNQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDAN
        K +Q HQNPRKNQNNQSKN ADKG G      V+KKSVGNKK++SKSTGNNYN+GSNKEGRWAEQLLNPCANAI+KGDATRVHHL+CVLQELASPTGDAN
Subjt:  KTTQPHQNPRKNQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDAN

Query:  HRLADHGLRALAHHLSSNSSSS-----SSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTL
        HRLADHGLRALA+HLSSNSSSS     SST+AP   FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEE N  RNLHILDIGVSHGVQWPTL
Subjt:  HRLADHGLRALAHHLSSNSSSS-----SSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTL

Query:  LEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAP
        LEALTRRSGGPPPLIRLTV+APT+EHDQ  ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQ+LNSQ I K RDEILIVCA FRLHQLKH AP
Subjt:  LEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAP

Query:  DERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFA
        DER EFL NLRKMEP AVILSENN+ CSCS CGNF++ F R VEY+W+FLDSTS+AFKGRES+ERRVMEGEAAKAL N  GEMNEEK KW ERMRN GF 
Subjt:  DERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFA

Query:  RKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPN
        RK F E TIDTARASMRRYDNNWEMRME+KDGCVGLWWKGQP+SFCSFWKLG K N
Subjt:  RKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPN

A0A1S3CGQ3 nodulation-signaling pathway 1 protein2.2e-25481.65Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGH
        MTIEE GP+HPSDHILDWLEDS PFFS FLDET NSSSINCYQWWDE+Q+ G+DLINGCLS+SP   T  ST  PNT +   L PSDL+KKRKAPDD+ H
Subjt:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGH

Query:  KTTQPHQNPRKNQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDAN
        K +Q HQNPRKNQNNQSKN ADKG G      V+KKSVGNKK++SKSTGNNYNNGSNKEGRWAEQLLNPCANAI+KGDATRVHHL+CVLQELASPTGDAN
Subjt:  KTTQPHQNPRKNQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDAN

Query:  HRLADHGLRALAHHLSSNSSSS-----SSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTL
        HRLADHGLRALA+HLSSNSSSS     SST++P + FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEE N  RNLH+LDIGVSHGVQWPTL
Subjt:  HRLADHGLRALAHHLSSNSSSS-----SSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTL

Query:  LEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAP
        LEALTRRSGGPPPLIRLTV+APTVEHDQ  ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQ+LNSQ I K RDEILIVC+ FRLHQLKH AP
Subjt:  LEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAP

Query:  DERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFA
        DER EFL+NLRKMEP AVILSENN+ CSCS C NF++ F R VEY+W+FLDSTS+AFKGRES+ERRVMEGEAAKAL N +GEMNEEK KW ERMRN GF 
Subjt:  DERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFA

Query:  RKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPN
        RK F E TIDTARASMRRYDNNWEMRME+KDGCVGLWWKGQP+SFCS WKLG K N
Subjt:  RKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPN

A0A6J1D4T6 nodulation-signaling pathway 1 protein0.0e+0099.82Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTT
        MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTT
Subjt:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTT

Query:  QPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGL
        QPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGL
Subjt:  QPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGL

Query:  RALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPP
        RALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPP
Subjt:  RALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPP

Query:  LIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKM
        LIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKM
Subjt:  LIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKM

Query:  EPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARA
        EPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAE TIDTARA
Subjt:  EPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARA

Query:  SMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG
        SMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG
Subjt:  SMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG

A0A6J1HKE2 nodulation-signaling pathway 1 protein1.5e-25584.5Show/hide
Query:  DHILDWLEDSAPFF-SPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQNPRK
        DHILDWL DS PFF SPF D++YNSSSINCYQWWDE+Q+IGQDLINGCLSSSP   TT ST PPNT++   LTPSDL+KKRKAPDDT HK +Q  QN RK
Subjt:  DHILDWLEDSAPFF-SPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTGHKTTQPHQNPRK

Query:  NQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRAL
        NQNNQS+NGADK  G      V+KKSVGNK++SSK+TGNN NNG+NKEGRWAEQLLNPCANAIIKGDATRVHHL+CVLQELASPTGDANHRLA +GLRAL
Subjt:  NQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRAL

Query:  AHHLSSNSS-SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI
        AH+LSSNSS SSSSTLAP V FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEEPN SRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI
Subjt:  AHHLSSNSS-SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLI

Query:  RLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEP
        RLTV+APTVEHDQ AETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL +LN+QVIGKF DEILIVCA FRLHQLKH APDER EFL+NLRK+EP
Subjt:  RLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEP

Query:  NAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASM
         AVILSENN+ACSC+NCGNFD  FTR+VEYLWRFLDSTSSAFKGRES+ER++MEGEAAKAL N+GEMNEE EKW ERMRNAGFARK F E TIDTARASM
Subjt:  NAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASM

Query:  RRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG
        RRYDNNWEMR+EEKDGCVGLWWKGQP+SFCSFWKLG K NGG
Subjt:  RRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG

A0A6J1KER4 nodulation-signaling pathway 1 protein1.0e-26485.05Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSAPFF-SPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTG
        MTIEEPG NHPSDHILDWL DS PFF SPF D++YNSSSINCYQWWDE+Q+IGQDLINGCLSSSP   TT ST PPNT++   LTPSDL+KKRKAPDDT 
Subjt:  MTIEEPGPNHPSDHILDWLEDSAPFF-SPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTS---LTPSDLSKKRKAPDDTG

Query:  HKTTQPHQNPRKNQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDA
        HK +Q  QN RKNQNNQSKNGADKG G      V+KKSVGNK++SSK+TGNN +NGSNKEGRWAEQLLNPCANAIIKGDATRVHHL+CVLQELASPTGDA
Subjt:  HKTTQPHQNPRKNQNNQSKNGADKGGGG-----VVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDA

Query:  NHRLADHGLRALAHHLSSNSS--SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLE
        NHRLA +GLRALAH+LSSNSS  SSSST+AP V FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEEPN SRNLHILDIGVSHGVQWPTLLE
Subjt:  NHRLADHGLRALAHHLSSNSS--SSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDE
        ALTRRSGGPPPLIRLTV+APTVEHDQ AETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SLQ+ NSQVIGK  DEILIVCA FRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDE

Query:  RTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKF
        R EFL+NLRK+EP AVILSENN+ACSC+NCGNFD  FTR+VEYLWRFLDSTSSAFKGRES+ER+VMEGEAA+ LTNQGEMNEE EKW ERMRNAGFARK 
Subjt:  RTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKF

Query:  FAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG
        F E TIDTARASMRRYDNNWEMR+EEKDGCVGLWWKGQP+SFCSFWKLG K NGG
Subjt:  FAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG

SwissProt top hitse value%identityAlignment
A1DQP9 Protein NODULATION SIGNALING PATHWAY 11.3e-17461.68Show/hide
Query:  MTIEEPGPNH-PSDHILDWLEDSAPFFSPFLDETYNSSS-INCYQWWDESQNIGQDLINGCLSSSPAA-----ATTDSTTPPNTTSLTP---------SD
        MT+E   PN   SDHILDWLE S  FF  FLDE  N+S  I  Y  WD+ Q       +    SSP A     AT  +T+  +TTSL P         SD
Subjt:  MTIEEPGPNH-PSDHILDWLEDSAPFFSPFLDETYNSSS-INCYQWWDESQNIGQDLINGCLSSSPAA-----ATTDSTTPPNTTSLTP---------SD

Query:  LSKKRKAPDDTGHKTTQPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQEL
        L KKR A D++  K   P QN  KN+  +++   +   G  V+K   NKK  +K+ G+N N+G++KEGRWAEQLLNPCA AI  G+  RV HL+ VL EL
Subjt:  LSKKRKAPDDTGHKTTQPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQEL

Query:  ASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNL-SRNLHILDIGVSHGVQ
        ASPTGD NHRLA HGLRAL HHLS  SSSSS T +  + FAST+ RFFQ+SL+KF+EVSPWF+ PNNIAN+SIL  L+EE N+ SR LHILDIGVSHGVQ
Subjt:  ASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNL-SRNLHILDIGVSHGVQ

Query:  WPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLK
        WPTLL+AL+RRSGGPP ++RLTVV  T E+DQ  ETPFS  PPG N   RLL +A+S+NINLQINR++NHSLQ LN+Q I    DEILIVCA FRLH L 
Subjt:  WPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLK

Query:  HRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNA
        H +PDER+EFL+ LR MEP  VILSENN  C CS CGNF   FTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQ EMNEEKEKW  RM+ A
Subjt:  HRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNA

Query:  GFARKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG
        GFA + F E  +D  RA +R+YD+NWEM++EEK+  VGLWWKGQP+SFCS WKL     GG
Subjt:  GFARKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG

Q4VYC8 Protein NODULATION SIGNALING PATHWAY 11.8e-17660.36Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNI-GQDLINGCLSSSPAAATTD----STTPPNTTSLTP--------SDLSK
        MT+E   PN  SDHILDWLE S  FF  FLD+ YN+  I+ Y+ W+++Q+I  Q  I+   +SS A  +T     ++T  +TTSL P        SDL K
Subjt:  MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNI-GQDLINGCLSSSPAAATTD----STTPPNTTSLTP--------SDLSK

Query:  KRKAPDDTGHKTTQPHQNPRKNQNNQSKNGADKGG----GGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQE
        KR A D+   K  QP     K   ++  N +D G     G VV+KS GNKK ++K+ G+N NNG+NK+GRWAEQLLNPCA AI  G+  RV HL+ VL E
Subjt:  KRKAPDDTGHKTTQPHQNPRKNQNNQSKNGADKGG----GGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQE

Query:  LASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQ
        LAS TGDANHRLA HGLRAL HHL   SSSSSST +  + FAST+ RFFQ+SL+KF+E SPWF+ PNNIAN+SIL  L+EEPN  R LHILDIGVSHGVQ
Subjt:  LASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQ

Query:  WPTLLEALTRRSGGPPPLIRLTVV--APTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQ
        WPT LEAL+RR GGPPPL+RLTVV  + + E+DQ  ETPFSIGP GD  SS LL +A+SLN+NLQI +LDNH LQ LN++ +    DE LIVCA FRLH 
Subjt:  WPTLLEALTRRSGGPPPLIRLTVV--APTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQ

Query:  LKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMR
        L H  PDER+EFL+ LR MEP  VILSENN+ C CS+CG+F   F+RRVEYLWRFLDSTSSAFK R+SDER++MEGEAAKALTNQ EMNE +EKW ERM+
Subjt:  LKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMR

Query:  NAGFARKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKP
         AGFA + F E  ID  RA +R+YDNNWEM++EE    V LWWK QP+SFCS WKL  +P
Subjt:  NAGFARKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKP

Q84MQ9 Protein NODULATION SIGNALING PATHWAY 11.1e-8541.46Show/hide
Query:  WWDESQNIGQDLINGCL--------SSSPAAATTDSTTP-PNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKK
        WW  S    QD I   +        +++PAAA+    +P  ++ S  PS  SKKRK+P          H+ P        K G  KGGGG          
Subjt:  WWDESQNIGQDLINGCL--------SSSPAAATTDSTTP-PNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKK

Query:  SSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLA--------PVVNFAS
                    GS+++ RWAEQLLNPCA A+  G+ +RV HL  VL EL S +GDANHRLA HGLRALA  L +    +++           P   FA+
Subjt:  SSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLA--------PVVNFAS

Query:  TDARFFQRSLIKFHEVSPWFALPNNIANSSILH--TLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVVAPTVEHDQTAETPFSI
         + R F+ SLI+FHEVSPWFALPN +AN++I    T        R LH++D+GVSHGVQWPTLLE+LTR+ GG  PP +RLTVV P          PFS 
Subjt:  TDARFFQRSLIKFHEVSPWFALPNNIANSSILH--TLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVVAPTVEHDQTAETPFSI

Query:  GPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSE--NNIACSCSNCGN
         PPG + S  LL +AKS+N++L+I+R        L+  V G    E L+VC  FR   L H A +ER E LR  R + P  V+LSE  + +     + G+
Subjt:  GPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSE--NNIACSCSNCGN

Query:  FDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAK--ALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASMRRYDNNWEMRM-EEKDG
            F  R+E LWRFL+STS+AFKG++ +ERR++E EA    A  +     E +E W ERM  AGF    F    +++AR+ +R+YD+ WEM        
Subjt:  FDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAK--ALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASMRRYDNNWEMRM-EEKDG

Query:  CVGLWWKGQPISFCSFWKLGA
         V L WKGQP+SFCS W+  A
Subjt:  CVGLWWKGQPISFCSFWKLGA

Q9LRW3 Scarecrow-like protein 291.8e-12548.98Show/hide
Query:  EEPGPNHPSDHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSS-----SPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGH
        E   PN   DH+L WLEDS      P  D++Y     +  Q W+  Q   QD  +G + S     S A    ++T     T     DL        D   
Subjt:  EEPGPNHPSDHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSS-----SPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGH

Query:  KTTQPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLAD
        +  QP+   RK  ++             VKKS  +K+ + KS+  +  +G NKEGRWAE+LLNPCA AI   +++RV H +CVL ELAS +GDAN RLA 
Subjt:  KTTQPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLAD

Query:  HGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG
         GLRAL HHLS  SSS SS+  PV  FAS + + FQ++L+KF+EVSPWFALPNN+ANS+IL  L+++P   ++LHI+DIGVSHG+QWPTLLEAL+ R  G
Subjt:  HGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG

Query:  PPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNL
        PPP +R+TV++     D TA+ PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I     E LIVCA FRLH LKH   DER E L+ +
Subjt:  PPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNL

Query:  RKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDT
        R + P  V+L ENN  CS S   +F   F++++EY+W+FLDSTSS FK   S+ER++MEGEA K L N G+MNE KEKWYERMR AGF  + F E  +D 
Subjt:  RKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDT

Query:  ARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
        A++ +R+YDNNWE+RME+ D   GL WKG+ +SFCS WK
Subjt:  ARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

Q9SN22 Scarecrow-like protein 323.3e-3428.78Show/hide
Query:  EGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPN
        +  + EQLL  CA AI   DA   H ++ VL  +A P GD+  RL    LRAL     S + + SST++  +  A    RF    L  F +++PW     
Subjt:  EGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPN

Query:  NIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRL
          AN++IL  +         +HI+D+ ++H +Q PTL++A+  R   PPPL++LTVV+       +   P  I    + + S+L++FA + NI ++   +
Subjt:  NIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRL

Query:  DNHSLQNLNS-----QVIGKFRDEILIVCAHFRL-----HQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDST
         +      +S     ++     +E L+V  H  L       L   +   RT FL+ LR + P  V L E ++  +  N  N          Y W   D+T
Subjt:  DNHSLQNLNS-----QVIGKFRDEILIVCAHFRL-----HQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDST

Query:  SSAFKGRESDERRVMEGE--------AAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFC
         +      S++RR  E E         AK    + E  E K +W ERMR A F      E  +   +A +  +   W M+ E+ D  + L WKG  + F 
Subjt:  SSAFKGRESDERRVMEGE--------AAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFC

Query:  SFW
        + W
Subjt:  SFW

Arabidopsis top hitse value%identityAlignment
AT3G03450.1 RGA-like 25.1e-2223.97Show/hide
Query:  KSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQ
        +SS +ST +     S + G      L  CA AI + +      L+  +  LA     A  ++A +  +ALA  +  + ++ +   A V      +  F +
Subjt:  KSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQ

Query:  RSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSR
           + F+E  P+    +  AN +IL    E    +R +H++D+G++ G+QWP L++AL  R GGPP      +  P  E+  + +           +  +
Subjt:  RSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSR

Query:  LLSFAKSLNINLQINRLDNHSLQNLNSQVI-GKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEY
        L  FA+++ +  +   L   SL +L  ++   +   E L+V + F LH+L  R+     + L  ++ ++P+ V + E     +  N   F   F   + Y
Subjt:  LLSFAKSLNINLQINRLDNHSLQNLNSQVI-GKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEY

Query:  LWRFLDSTSSAFKGRESDE--------RRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWW
             DS   ++     D         R+++   AA+  +++ E +E   +W  RM++AGF            A   +  Y      R+EE DGC+ + W
Subjt:  LWRFLDSTSSAFKGRESDE--------RRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWW

Query:  KGQPISFCSFWKL
        + +P+   S WKL
Subjt:  KGQPISFCSFWKL

AT3G13840.1 GRAS family transcription factor1.3e-12648.98Show/hide
Query:  EEPGPNHPSDHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSS-----SPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGH
        E   PN   DH+L WLEDS      P  D++Y     +  Q W+  Q   QD  +G + S     S A    ++T     T     DL        D   
Subjt:  EEPGPNHPSDHILDWLEDSAPFFS-PFLDETYNSSSINCYQWWDESQNIGQDLINGCLSS-----SPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGH

Query:  KTTQPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLAD
        +  QP+   RK  ++             VKKS  +K+ + KS+  +  +G NKEGRWAE+LLNPCA AI   +++RV H +CVL ELAS +GDAN RLA 
Subjt:  KTTQPHQNPRKNQNNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLAD

Query:  HGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG
         GLRAL HHLS  SSS SS+  PV  FAS + + FQ++L+KF+EVSPWFALPNN+ANS+IL  L+++P   ++LHI+DIGVSHG+QWPTLLEAL+ R  G
Subjt:  HGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGG

Query:  PPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNL
        PPP +R+TV++     D TA+ PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I     E LIVCA FRLH LKH   DER E L+ +
Subjt:  PPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNL

Query:  RKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDT
        R + P  V+L ENN  CS S   +F   F++++EY+W+FLDSTSS FK   S+ER++MEGEA K L N G+MNE KEKWYERMR AGF  + F E  +D 
Subjt:  RKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDT

Query:  ARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK
        A++ +R+YDNNWE+RME+ D   GL WKG+ +SFCS WK
Subjt:  ARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWK

AT3G49950.1 GRAS family transcription factor2.4e-3528.78Show/hide
Query:  EGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPN
        +  + EQLL  CA AI   DA   H ++ VL  +A P GD+  RL    LRAL     S + + SST++  +  A    RF    L  F +++PW     
Subjt:  EGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPN

Query:  NIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRL
          AN++IL  +         +HI+D+ ++H +Q PTL++A+  R   PPPL++LTVV+       +   P  I    + + S+L++FA + NI ++   +
Subjt:  NIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRL

Query:  DNHSLQNLNS-----QVIGKFRDEILIVCAHFRL-----HQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDST
         +      +S     ++     +E L+V  H  L       L   +   RT FL+ LR + P  V L E ++  +  N  N          Y W   D+T
Subjt:  DNHSLQNLNS-----QVIGKFRDEILIVCAHFRL-----HQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDST

Query:  SSAFKGRESDERRVMEGE--------AAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFC
         +      S++RR  E E         AK    + E  E K +W ERMR A F      E  +   +A +  +   W M+ E+ D  + L WKG  + F 
Subjt:  SSAFKGRESDERRVMEGE--------AAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFC

Query:  SFW
        + W
Subjt:  SFW

AT4G37650.1 GRAS family transcription factor1.7e-3328.68Show/hide
Query:  RWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFAST-DARFFQRSLIKFHEVSPWFALPNN
        +WA+ +L   A A    D  R   ++  L EL+SP GD   +LA + L+AL + ++ +      T+        T      +++++KF EVSPW    + 
Subjt:  RWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFAST-DARFFQRSLIKFHEVSPWFALPNN

Query:  IANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINR
         AN +IL  +  E      +HI+DI  +   QWPTLLEAL  RS   P L RLT  VVA    +DQTA            I +R+  FA+ + +  + N 
Subjt:  IANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINR

Query:  LDN-HSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRF----LDSTSSAF
        + +   L   +   +    DE+L +     +H +  R    R   + + R++ P  V + E          G FD  F R      R+     +S   +F
Subjt:  LDN-HSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRF----LDSTSSAF

Query:  KGRESDERRVMEGEAAKAL--------TNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASMRRY-DNNWEMRMEEKDGCVGLWWKGQPISFCSFW
          R S+ER ++E  A +A+        ++  E  E   KW  RMRN+GF    +++   D  RA +RRY +  W M        + L W+ QP+ + S W
Subjt:  KGRESDERRVMEGEAAKAL--------TNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASMRRY-DNNWEMRMEEKDGCVGLWWKGQPISFCSFW

Query:  K
        +
Subjt:  K

AT5G66770.1 GRAS family transcription factor2.3e-2226.92Show/hide
Query:  CANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTL
        CA  I   D       +  ++E  S  GD   R+A +   AL++ LS NS ++SS        +S+       S    ++  P+    +  AN +IL   
Subjt:  CANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAPVVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTL

Query:  SEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT-VVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNS
         E    S  +HI+D G+  G+QWP LL+AL  R+ G P  IR++ + AP++      E+P    P      +RL  FAK L++N     +    +  LN 
Subjt:  SEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT-VVAPTVEHDQTAETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQNLNS

Query:  QVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFK---GRESDER----R
               DE+L V    +L++L    P      LR  + + P  V L E  ++ +        + F  RV+   +F  +   + +   GR+S+ER    R
Subjt:  QVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDSTSSAFK---GRESDER----R

Query:  VMEGEAAKALTN------QGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASMRRYD-NNWEMRMEEKDGCVGLWWKGQPISFCSFWK
         + G     L          E  EEKE+W   M NAGF     +   +  A+  +  Y+ +N    +E K G + L W   P+   S W+
Subjt:  VMEGEAAKALTN------QGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASMRRYD-NNWEMRMEEKDGCVGLWWKGQPISFCSFWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCATTGAAGAACCAGGGCCAAACCACCCTTCAGATCACATATTGGACTGGTTAGAGGATTCAGCTCCTTTCTTTTCCCCATTCCTGGACGAGACTTACAACTCTAG
CTCTATAAACTGCTATCAATGGTGGGATGAGAGCCAAAACATAGGCCAAGATCTGATTAATGGCTGTCTCAGTAGCTCCCCTGCCGCTGCCACCACAGACAGTACTACAC
CACCAAACACTACCAGTTTGACGCCATCGGATTTGTCAAAGAAAAGGAAAGCCCCAGATGACACAGGTCATAAGACAACACAACCCCATCAGAACCCAAGGAAGAACCAG
AACAATCAGAGCAAAAATGGTGCAGATAAAGGCGGTGGGGGGGTGGTAAAGAAGTCAGTGGGGAACAAGAAAAGTTCATCAAAATCCACAGGAAATAATTATAATAACGG
AAGTAACAAGGAAGGAAGGTGGGCGGAGCAATTGCTAAATCCCTGTGCAAATGCTATCATAAAAGGAGATGCGACACGAGTACATCACCTTATTTGTGTTCTGCAAGAGC
TCGCTTCCCCCACCGGCGACGCCAATCACCGGCTCGCCGATCATGGTCTCCGAGCTCTGGCCCATCACCTGTCCTCCAATTCATCATCTTCTTCTTCCACACTTGCGCCG
GTGGTTAATTTCGCTTCGACGGACGCGCGATTCTTCCAGCGGTCGTTGATCAAATTCCACGAGGTGAGTCCCTGGTTTGCTCTTCCGAACAACATCGCGAATTCTTCAAT
CCTCCACACTCTCTCTGAAGAACCTAATCTCTCGCGCAATCTTCACATTCTTGACATTGGGGTTTCTCACGGTGTGCAATGGCCGACGCTGCTCGAGGCCTTGACTCGCC
GTTCCGGTGGGCCTCCGCCGCTAATTCGGCTCACAGTTGTCGCTCCCACCGTCGAACACGACCAAACTGCGGAGACGCCGTTCTCCATTGGTCCACCGGGAGACAACATC
TCCTCTCGGCTACTTAGTTTCGCCAAATCCTTGAACATCAATTTACAGATCAACCGCCTCGACAATCACTCGTTACAGAACTTAAATTCGCAAGTAATCGGTAAGTTCCG
GGACGAAATTTTGATCGTTTGCGCACACTTCAGACTCCACCAGTTGAAACACCGCGCTCCAGACGAAAGAACAGAGTTCCTACGAAATCTGAGAAAAATGGAGCCAAATG
CAGTGATTCTGAGCGAAAACAACATAGCATGTAGCTGCAGCAACTGCGGGAATTTCGACATCACATTCACTCGGCGAGTGGAGTACTTGTGGAGGTTTCTGGACTCGACG
AGCTCCGCATTCAAAGGGCGAGAAAGCGACGAAAGAAGAGTGATGGAAGGAGAGGCGGCGAAGGCGTTGACGAATCAGGGCGAAATGAACGAGGAAAAGGAGAAATGGTA
CGAGAGAATGAGAAATGCAGGATTCGCTAGAAAATTCTTCGCAGAAGGCACCATTGATACGGCTCGAGCTTCCATGAGAAGATATGACAATAACTGGGAAATGAGAATGG
AAGAGAAAGATGGATGCGTGGGGTTATGGTGGAAAGGGCAGCCAATTTCGTTTTGTTCGTTTTGGAAATTGGGAGCGAAACCCAATGGCGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGACCATTGAAGAACCAGGGCCAAACCACCCTTCAGATCACATATTGGACTGGTTAGAGGATTCAGCTCCTTTCTTTTCCCCATTCCTGGACGAGACTTACAACTCTAG
CTCTATAAACTGCTATCAATGGTGGGATGAGAGCCAAAACATAGGCCAAGATCTGATTAATGGCTGTCTCAGTAGCTCCCCTGCCGCTGCCACCACAGACAGTACTACAC
CACCAAACACTACCAGTTTGACGCCATCGGATTTGTCAAAGAAAAGGAAAGCCCCAGATGACACAGGTCATAAGACAACACAACCCCATCAGAACCCAAGGAAGAACCAG
AACAATCAGAGCAAAAATGGTGCAGATAAAGGCGGTGGGGGGGTGGTAAAGAAGTCAGTGGGGAACAAGAAAAGTTCATCAAAATCCACAGGAAATAATTATAATAACGG
AAGTAACAAGGAAGGAAGGTGGGCGGAGCAATTGCTAAATCCCTGTGCAAATGCTATCATAAAAGGAGATGCGACACGAGTACATCACCTTATTTGTGTTCTGCAAGAGC
TCGCTTCCCCCACCGGCGACGCCAATCACCGGCTCGCCGATCATGGTCTCCGAGCTCTGGCCCATCACCTGTCCTCCAATTCATCATCTTCTTCTTCCACACTTGCGCCG
GTGGTTAATTTCGCTTCGACGGACGCGCGATTCTTCCAGCGGTCGTTGATCAAATTCCACGAGGTGAGTCCCTGGTTTGCTCTTCCGAACAACATCGCGAATTCTTCAAT
CCTCCACACTCTCTCTGAAGAACCTAATCTCTCGCGCAATCTTCACATTCTTGACATTGGGGTTTCTCACGGTGTGCAATGGCCGACGCTGCTCGAGGCCTTGACTCGCC
GTTCCGGTGGGCCTCCGCCGCTAATTCGGCTCACAGTTGTCGCTCCCACCGTCGAACACGACCAAACTGCGGAGACGCCGTTCTCCATTGGTCCACCGGGAGACAACATC
TCCTCTCGGCTACTTAGTTTCGCCAAATCCTTGAACATCAATTTACAGATCAACCGCCTCGACAATCACTCGTTACAGAACTTAAATTCGCAAGTAATCGGTAAGTTCCG
GGACGAAATTTTGATCGTTTGCGCACACTTCAGACTCCACCAGTTGAAACACCGCGCTCCAGACGAAAGAACAGAGTTCCTACGAAATCTGAGAAAAATGGAGCCAAATG
CAGTGATTCTGAGCGAAAACAACATAGCATGTAGCTGCAGCAACTGCGGGAATTTCGACATCACATTCACTCGGCGAGTGGAGTACTTGTGGAGGTTTCTGGACTCGACG
AGCTCCGCATTCAAAGGGCGAGAAAGCGACGAAAGAAGAGTGATGGAAGGAGAGGCGGCGAAGGCGTTGACGAATCAGGGCGAAATGAACGAGGAAAAGGAGAAATGGTA
CGAGAGAATGAGAAATGCAGGATTCGCTAGAAAATTCTTCGCAGAAGGCACCATTGATACGGCTCGAGCTTCCATGAGAAGATATGACAATAACTGGGAAATGAGAATGG
AAGAGAAAGATGGATGCGTGGGGTTATGGTGGAAAGGGCAGCCAATTTCGTTTTGTTCGTTTTGGAAATTGGGAGCGAAACCCAATGGCGGTTAA
Protein sequenceShow/hide protein sequence
MTIEEPGPNHPSDHILDWLEDSAPFFSPFLDETYNSSSINCYQWWDESQNIGQDLINGCLSSSPAAATTDSTTPPNTTSLTPSDLSKKRKAPDDTGHKTTQPHQNPRKNQ
NNQSKNGADKGGGGVVKKSVGNKKSSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLICVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSTLAP
VVNFASTDARFFQRSLIKFHEVSPWFALPNNIANSSILHTLSEEPNLSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVVAPTVEHDQTAETPFSIGPPGDNI
SSRLLSFAKSLNINLQINRLDNHSLQNLNSQVIGKFRDEILIVCAHFRLHQLKHRAPDERTEFLRNLRKMEPNAVILSENNIACSCSNCGNFDITFTRRVEYLWRFLDST
SSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFARKFFAEGTIDTARASMRRYDNNWEMRMEEKDGCVGLWWKGQPISFCSFWKLGAKPNGG