; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10023445 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10023445
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionnodulation-signaling pathway 1 protein
Genome locationChr05:34294756..34296420
RNA-Seq ExpressionHG10023445
SyntenyHG10023445
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0042446 - hormone biosynthetic process (biological process)
GO:2000032 - regulation of secondary shoot formation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR005202 - Transcription factor GRAS
IPR030015 - Scarecrow-like protein 29/nodulation signalling pathway 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059401.1 nodulation-signaling pathway 1 protein [Cucumis melo var. makuwa]2.5e-29291.37Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EETGP+HPSDHILDWLEDSVPFFS FLDET+NSSSINCYQWWDENQDTGEDLINGCLSNSPTT VSTR PNTPTSHHL PSDLT+KRKAPDDSVHKK
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSKN A KGSG VEGVTV+KKSVGNKKN+SKSTGNNYNNGSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE
        LADHGLRALA+HLSSN SSSSSFSSYSSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLHILDIGVSHGVQWPTLLE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVI PTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ INK  DEILIVC+QFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK
        R EFL+NLRKMEPKAVILSENNMGCSCS C NF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N +GEMNEEKGKWCERMRN GFERK
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK

Query:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV
         FGEDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWK QPVSFCS WKLGIKSNA+
Subjt:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV

XP_004141813.1 protein NODULATION SIGNALING PATHWAY 1 [Cucumis sativus]1.1e-29592.45Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EETGPNHPSDHILDWLEDSVPFFS FLDET+NSSSINCYQWWDENQDTGEDLINGCLSNSPTT VSTR PNTPTSH LTPSDLT+KRKAPDDSVHKK
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSKN A KGSG VEGVTVMKKSVGNKKN+SKSTGNNYN+GSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE
        LADHGLRALA+HLSSN SSSSSFSSYSSTVAP TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLHILDIGVSHGVQWPTLLE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVIAPT+EHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ INK  DEILIVCAQFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK
        R EFL NLRKMEPKAVILSENNMGCSCS CGNF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N DGEMNEEKGKWCERMRN GFERK
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK

Query:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV
         FGEDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWK QPVSFCSFWKLGIKSNA+
Subjt:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV

XP_008462311.1 PREDICTED: nodulation-signaling pathway 1 protein [Cucumis melo]8.5e-29391.37Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EETGP+HPSDHILDWLEDSVPFFS FLDET+NSSSINCYQWWDENQDTGEDLINGCLSNSPTT VSTR PNTPTSHHL PSDLT+KRKAPDDSVHKK
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSKN A KGSG VEGVTV+KKSVGNKKN+SKSTGNNYNNGSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE
        LADHGLRALA+HLSSN SSSSSFSSYSSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLH+LDIGVSHGVQWPTLLE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ INK  DEILIVC+QFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK
        R EFL+NLRKMEPKAVILSENNMGCSCS C NF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N +GEMNEEKGKWCERMRN GFERK
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK

Query:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV
         FGEDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWK QPVSFCS WKLGIKSNA+
Subjt:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV

XP_023000031.1 nodulation-signaling pathway 1 protein [Cucurbita maxima]8.0e-28388.97Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFF-SPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EE G NHPSDHILDWL DSVPFF SPF D++ NSSSINCYQWWDENQD G+DLINGCLS+SPTTVST+ PNT TSHHLTPSDLT+KRKAPDD+VHK 
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFF-SPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQT QN RKNQNNQSKNGA KGSG VEGVTV+KKSVGNK+NSSK+TGNN +NGSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE
        LA +GLRALAH+LSSNSS  SS    SSTVAPVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNR RNLHILDIGVSHGVQWPTLLE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SLQS NSQVI K PDEILIVCAQFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKL
        RFEFL+NLRK+EPKAVILSENNM CSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRESEER+VMEGEAA+ LTN GEMNEE  KWCERMRNAGF RKL
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKL

Query:  FGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSN
        FGEDTIDTARASMRRYDNNWEMR+EEKDGCVGLWWK QPVSFCSFWKLG+KSN
Subjt:  FGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSN

XP_038897214.1 protein NODULATION SIGNALING PATHWAY 1 [Benincasa hispida]3.1e-30394.4Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKS
        MT EETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWD NQDTGEDLING LSNSPTTVST+L N PTSHHLTPSDLT+KRKAPDDSVHKKS
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKS

Query:  QTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRL
        QTHQN RKNQNNQSKNG   G G VEGVTVMKKSVGNKKNSSK TGNN NNGSNREGRWAEQLLNPCA+AI+KGDATRVHHLLCVLQELASPTGDANHRL
Subjt:  QTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRL

Query:  ADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA
        ADHGLRALAHHLSSN SSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA
Subjt:  ADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA

Query:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER
        LTRRSGGPP LIRLTVI PT+EHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRL+NHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHC PDER
Subjt:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER

Query:  FEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLF
        FEFL+NLRKMEPKAVILSENNMGCSCSNCGNFDT FTRRVEYLWRFLDSTS+AFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLF
Subjt:  FEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLF

Query:  GEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV
         EDTIDTARASMRRYDNNWEMR+EEKDGC+GLWWK QPVSFCSFWKLGIKSNA+
Subjt:  GEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV

TrEMBL top hitse value%identityAlignment
A0A0A0KCK6 GRAS domain-containing protein5.2e-29692.45Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EETGPNHPSDHILDWLEDSVPFFS FLDET+NSSSINCYQWWDENQDTGEDLINGCLSNSPTT VSTR PNTPTSH LTPSDLT+KRKAPDDSVHKK
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSKN A KGSG VEGVTVMKKSVGNKKN+SKSTGNNYN+GSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE
        LADHGLRALA+HLSSN SSSSSFSSYSSTVAP TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLHILDIGVSHGVQWPTLLE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVIAPT+EHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ INK  DEILIVCAQFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK
        R EFL NLRKMEPKAVILSENNMGCSCS CGNF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N DGEMNEEKGKWCERMRN GFERK
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK

Query:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV
         FGEDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWK QPVSFCSFWKLGIKSNA+
Subjt:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV

A0A1S3CGQ3 nodulation-signaling pathway 1 protein4.1e-29391.37Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EETGP+HPSDHILDWLEDSVPFFS FLDET+NSSSINCYQWWDENQDTGEDLINGCLSNSPTT VSTR PNTPTSHHL PSDLT+KRKAPDDSVHKK
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSKN A KGSG VEGVTV+KKSVGNKKN+SKSTGNNYNNGSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE
        LADHGLRALA+HLSSN SSSSSFSSYSSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLH+LDIGVSHGVQWPTLLE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ INK  DEILIVC+QFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK
        R EFL+NLRKMEPKAVILSENNMGCSCS C NF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N +GEMNEEKGKWCERMRN GFERK
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK

Query:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV
         FGEDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWK QPVSFCS WKLGIKSNA+
Subjt:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV

A0A5A7V101 Nodulation-signaling pathway 1 protein1.2e-29291.37Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EETGP+HPSDHILDWLEDSVPFFS FLDET+NSSSINCYQWWDENQDTGEDLINGCLSNSPTT VSTR PNTPTSHHL PSDLT+KRKAPDDSVHKK
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSKN A KGSG VEGVTV+KKSVGNKKN+SKSTGNNYNNGSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE
        LADHGLRALA+HLSSN SSSSSFSSYSSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLHILDIGVSHGVQWPTLLE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVI PTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ INK  DEILIVC+QFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK
        R EFL+NLRKMEPKAVILSENNMGCSCS C NF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N +GEMNEEKGKWCERMRN GFERK
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK

Query:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV
         FGEDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWK QPVSFCS WKLGIKSNA+
Subjt:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSNAV

A0A6J1HKE2 nodulation-signaling pathway 1 protein1.2e-27688.72Show/hide
Query:  DHILDWLEDSVPFF-SPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKSQTHQNPRKNQN
        DHILDWL DSVPFF SPF D++ NSSSINCYQWWDENQD G+DLINGCLS+SPTTVST+ PNT TSHHLTPSDLT+KRKAPDD+VHK SQT QN RKNQN
Subjt:  DHILDWLEDSVPFF-SPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKSQTHQNPRKNQN

Query:  NQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHH
        NQS+NGA K SG V GVTVMKKSVGNK+NSSK+TGNN NNG+N+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHRLA +GLRALAH+
Subjt:  NQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHH

Query:  LSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        LSSNSS SS     SST+APVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNR RNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  LSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKME
        IRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL SLN+QVI KFPDEILIVCAQFRLHQLKH APDERFEFL+NLRK+E
Subjt:  IRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKME

Query:  PKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARAS
        PKAVILSENNM CSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRESEER++MEGEAAKAL N+GEMNEE  KWCERMRNAGF RKLFGEDTIDTARAS
Subjt:  PKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARAS

Query:  MRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSN
        MRRYDNNWEMR+EEKDGCVGLWWK QPVSFCSFWKLG+KSN
Subjt:  MRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSN

A0A6J1KER4 nodulation-signaling pathway 1 protein3.9e-28388.97Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFF-SPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EE G NHPSDHILDWL DSVPFF SPF D++ NSSSINCYQWWDENQD G+DLINGCLS+SPTTVST+ PNT TSHHLTPSDLT+KRKAPDD+VHK 
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFF-SPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQT QN RKNQNNQSKNGA KGSG VEGVTV+KKSVGNK+NSSK+TGNN +NGSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE
        LA +GLRALAH+LSSNSS  SS    SSTVAPVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNR RNLHILDIGVSHGVQWPTLLE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SLQS NSQVI K PDEILIVCAQFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKL
        RFEFL+NLRK+EPKAVILSENNM CSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRESEER+VMEGEAA+ LTN GEMNEE  KWCERMRNAGF RKL
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKL

Query:  FGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSN
        FGEDTIDTARASMRRYDNNWEMR+EEKDGCVGLWWK QPVSFCSFWKLG+KSN
Subjt:  FGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIKSN

SwissProt top hitse value%identityAlignment
A1DQP9 Protein NODULATION SIGNALING PATHWAY 18.8e-17661.97Show/hide
Query:  SDHILDWLEDSVPFFSPFLDE-TNNSSSINCYQWWDENQ---DTGEDLINGCLSNSPTTVSTRLPNTPT-----SHHLTP-SDLTRKRKAPDDSVHKKSQ
        SDHILDWLE SV FF  FLDE  NNS  I  Y  WD+ Q   +TG    N   S + T V+T   +T +     S++  P SDL +KR A D+S  K   
Subjt:  SDHILDWLEDSVPFFSPFLDE-TNNSSSINCYQWWDENQ---DTGEDLINGCLSNSPTTVSTRLPNTPT-----SHHLTP-SDLTRKRKAPDDSVHKKSQ

Query:  THQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLA
          QN  K    +  N    G  + +          NKK  +K+ G+N N+G+++EGRWAEQLLNPCA AI  G+  RV HLL VL ELASPTGD NHRLA
Subjt:  THQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLA

Query:  DHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPN-RPRNLHILDIGVSHGVQWPTLLEA
         HGLRAL HHLSS+SSS +S  +       +TFAST+PRFFQ+SL+KF+EVSPWF+FPNNIAN+SIL +L+EE N   R LHILDIGVSHGVQWPTLL+A
Subjt:  DHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPN-RPRNLHILDIGVSHGVQWPTLLEA

Query:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER
        L+RRSGGPP ++RLTV+  T E+DQN ETPFS  PPG N   RLL +A+S+NINLQINR++NHSLQ+LN+Q I+  PDEILIVCAQFRLH L H +PDER
Subjt:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER

Query:  FEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLF
         EFL+ LR MEP+ VILSENN  C CS CGNF  GFTRRVEYLWRFLDSTSSAFKGRES+ERRVMEGEAAKALTN  EMNEEK KWC RM+ AGF  ++F
Subjt:  FEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLF

Query:  GEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKL
        GED +D  RA +R+YD+NWEM++EEK+  VGLWWK QPVSFCS WKL
Subjt:  GEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKL

Q4VYC8 Protein NODULATION SIGNALING PATHWAY 12.0e-18060.43Show/hide
Query:  PNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNS-------------PTTVSTRLPNTPTSHHLTPSDLTRKRKAPDD
        PN  SDHILDWLE SV FF  FLD+  N+  I+ Y+ W++NQD          +NS              TT ST      + +++  SDL +KR A D+
Subjt:  PNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNS-------------PTTVSTRLPNTPTSHHLTPSDLTRKRKAPDD

Query:  -SVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPT
         S+ K+ Q  +N  K   ++  N +  G   +EG TV++KS GNKK ++K+ G+N NNG+N++GRWAEQLLNPCA AI  G+  RV HLL VL ELAS T
Subjt:  -SVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPT

Query:  GDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQ
        GDANHRLA HGLRAL HHLSS+SSS+ S          +TFAST+PRFFQ+SL+KF+E SPWF+FPNNIAN+SIL +L+EEPN  R LHILDIGVSHGVQ
Subjt:  GDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQ

Query:  WPTLLEALTRRSGGPPPLIRLTVI--APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQ
        WPT LEAL+RR GGPPPL+RLTV+  + + E+DQN ETPFSIGP GD  SS LL +A+SLN+NLQI +LDNH LQ+LN++ ++   DE LIVCAQFRLH 
Subjt:  WPTLLEALTRRSGGPPPLIRLTVI--APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQ

Query:  LKHCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMR
        L H  PDER EFL+ LR MEPK VILSENNM C CS+CG+F TGF+RRVEYLWRFLDSTSSAFK R+S+ER++MEGEAAKALTN  EMNE + KWCERM+
Subjt:  LKHCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMR

Query:  NAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKL
         AGF  ++FGED ID  RA +R+YDNNWEM++EE    V LWWK QPVSFCS WKL
Subjt:  NAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKL

Q75I13 Protein SHORT-ROOT 24.4e-3428.95Show/hide
Query:  SSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPR
        SS   G       +  GRWA QLL  CA A+   D+ RV  L+ +L ELASP GD + +LA + L+ L   L+++   +    + +S     +F ST   
Subjt:  SSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPR

Query:  FFQRSLIKFHEVSPWFAFPNNIANSSILHIL---------------SEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQ
          +R+ +KF E+SPW  F +  AN +IL                  S     P  LHILD+  +   QWPTLLEAL  RS    P + +T + PT     
Subjt:  FFQRSLIKFHEVSPWFAFPNNIANSSILHIL---------------SEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQ

Query:  NTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHS--LQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSENNMG
          +           I  RL  FA+ + +     R  +HS  L  L+   ++          A   ++ L+  A   R  F+ +LR++EP+ V + E    
Subjt:  NTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHS--LQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSENNMG

Query:  -----CSCSNCGNFDTGFTR----RVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKAL--------TNDGEMNEEKGKWCERMRNAGFERKLFGEDTID
                S+  + D  F +     + +   ++DS   +F  + S ER  +E    +A+        +   E  E    W  RMR+AGF    F ED  D
Subjt:  -----CSCSNCGNFDTGFTR----RVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKAL--------TNDGEMNEEKGKWCERMRNAGFERKLFGEDTID

Query:  TARASMRRYDNNWEMR-----LEEKDGCVG----LWWKCQPVSFCSFWK
          R+ +RRY   W MR      ++  G       L WK QPV + S WK
Subjt:  TARASMRRYDNNWEMR-----LEEKDGCVG----LWWKCQPVSFCSFWK

Q84MQ9 Protein NODULATION SIGNALING PATHWAY 17.2e-8540.27Show/hide
Query:  WWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSK
        WW  +    +D I   ++ + +  ST  P   +    +P+  +        S  +KS  H+ P        K G GKG G                    
Subjt:  WWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSK

Query:  STGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSS--NSSSSSSFSSYSSTVAPVT-FASTDPR
                GS+R+ RWAEQLLNPCA A+  G+ +RV HL  VL EL S +GDANHRLA HGLRALA  L +    +++++      +  P T FA+ +PR
Subjt:  STGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSS--NSSSSSSFSSYSSTVAPVT-FASTDPR

Query:  FFQRSLIKFHEVSPWFAFPNNIANSSILH--ILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVIAPTVEHDQNTETPFSIGPPG
         F+ SLI+FHEVSPWFA PN +AN++I            PR LH++D+GVSHGVQWPTLLE+LTR+ GG  PP +RLTV+ P      +   PFS  PPG
Subjt:  FFQRSLIKFHEVSPWFAFPNNIANSSILH--ILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVIAPTVEHDQNTETPFSIGPPG

Query:  DNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSE--NNMGCSCSNCGNFDTG
         + S  LL +AKS+N++L+I+R       +     +     E L+VC QFR   L H A +ER E LR  R + P+ V+LSE  + +G    + G+    
Subjt:  DNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSE--NNMGCSCSNCGNFDTG

Query:  FTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAK--ALTNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRL-EEKDGCVGL
        F  R+E LWRFL+STS+AFKG++ EERR++E EA    A  +     E +  W ERM  AGFE   FG + +++AR+ +R+YD+ WEM         V L
Subjt:  FTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAK--ALTNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRL-EEKDGCVGL

Query:  WWKCQPVSFCSFWK
         WK QPVSFCS W+
Subjt:  WWKCQPVSFCSFWK

Q9LRW3 Scarecrow-like protein 299.3e-12548.64Show/hide
Query:  MTTEET-GPNHPSDHILDWLEDSVPFFS-PFLDETNNSSSINCYQ-W-WDENQDTGEDLINGCLSN-SPTTVSTRLPN--TPTSHHLTPSDLTRKRKAPD
        M  EET  PN   DH+L WLEDSV     P  D++      +  Q W WD+ QD     I     + S   V     N    T       DL  + + P+
Subjt:  MTTEET-GPNHPSDHILDWLEDSVPFFS-PFLDETNNSSSINCYQ-W-WDENQDTGEDLINGCLSN-SPTTVSTRLPN--TPTSHHLTPSDLTRKRKAPD

Query:  DSVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPT
        D   K+S                      G +E   V KKS  +K+ + KS+  +  +G N+EGRWAE+LLNPCA AI   +++RV H LCVL ELAS +
Subjt:  DSVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPT

Query:  GDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQ
        GDAN RLA  GLRAL HHLSS+S SSS +  +       TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG+Q
Subjt:  GDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQ

Query:  WPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLK
        WPTLLEAL+ R  GPPP +R+TVI+     D   + PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I+  P E LIVCAQFRLH LK
Subjt:  WPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLK

Query:  HCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNA
        H   DER E L+ +R + PK V+L ENN  CS S   +F  GF++++EY+W+FLDSTSS FK   SEER++MEGEA K L N G+MNE K KW ERMR A
Subjt:  HCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNA

Query:  GFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWK
        GF  + F ED +D A++ +R+YDNNWE+R+E+ D   GL WK + VSFCS WK
Subjt:  GFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWK

Arabidopsis top hitse value%identityAlignment
AT3G03450.1 RGA-like 21.2e-2326.25Show/hide
Query:  KNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTD
        ++S +ST +     S   G      L  CA AI + +      L+  +  LA     A  ++A +  +ALA  +  + ++ +              A+ +
Subjt:  KNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTD

Query:  PRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGD
        P F +   + F+E  P+  F +  AN +IL    E     R +H++D+G++ G+QWP L++AL  R GGPP   RLT I P       TE   S+     
Subjt:  PRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGD

Query:  NISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPD-EILIVCAQFRLHQLKHCAPDERFEFLRN-LRKMEPKAVILSENNMGCSCSNCGNFDTGF
         +  +L  FA+++ +  +   L   SL  L  ++    P+ E L+V + F LH+L   A     E L N ++ ++P  V + E        N   F   F
Subjt:  NISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPD-EILIVCAQFRLHQLKHCAPDERFEFLRN-LRKMEPKAVILSENNMGCSCSNCGNFDTGF

Query:  TRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDG
           + Y     DS   ++    S++R + E    + + N    +G    E +E   +W  RM++AGF+    G      A   +  Y      R+EE DG
Subjt:  TRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDG

Query:  CVGLWWKCQPVSFCSFWKL
        C+ + W+ +P+   S WKL
Subjt:  CVGLWWKCQPVSFCSFWKL

AT3G13840.1 GRAS family transcription factor6.6e-12648.64Show/hide
Query:  MTTEET-GPNHPSDHILDWLEDSVPFFS-PFLDETNNSSSINCYQ-W-WDENQDTGEDLINGCLSN-SPTTVSTRLPN--TPTSHHLTPSDLTRKRKAPD
        M  EET  PN   DH+L WLEDSV     P  D++      +  Q W WD+ QD     I     + S   V     N    T       DL  + + P+
Subjt:  MTTEET-GPNHPSDHILDWLEDSVPFFS-PFLDETNNSSSINCYQ-W-WDENQDTGEDLINGCLSN-SPTTVSTRLPN--TPTSHHLTPSDLTRKRKAPD

Query:  DSVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPT
        D   K+S                      G +E   V KKS  +K+ + KS+  +  +G N+EGRWAE+LLNPCA AI   +++RV H LCVL ELAS +
Subjt:  DSVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPT

Query:  GDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQ
        GDAN RLA  GLRAL HHLSS+S SSS +  +       TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG+Q
Subjt:  GDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQ

Query:  WPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLK
        WPTLLEAL+ R  GPPP +R+TVI+     D   + PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I+  P E LIVCAQFRLH LK
Subjt:  WPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLK

Query:  HCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNA
        H   DER E L+ +R + PK V+L ENN  CS S   +F  GF++++EY+W+FLDSTSS FK   SEER++MEGEA K L N G+MNE K KW ERMR A
Subjt:  HCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNA

Query:  GFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWK
        GF  + F ED +D A++ +R+YDNNWE+R+E+ D   GL WK + VSFCS WK
Subjt:  GFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWK

AT3G49950.1 GRAS family transcription factor1.6e-3429.37Show/hide
Query:  EGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPW
        +  + EQLL  CA AI   DA   H +L VL  +A P GD+  RL    LRAL       S + S   + SST++ +  A    RF    L  F +++PW
Subjt:  EGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPW

Query:  FAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINL
          F    AN++IL  +         +HI+D+ ++H +Q PTL++A+  R   PPPL++LTV++       +   P  I    + + S+L++FA + NI +
Subjt:  FAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINL

Query:  QINRLDNHSLQSLNS--QVINKFP---DEILIVCAQFRLHQLKHCAPDE---------RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVE
        +   + +      +S  Q +  +P   +E L+V      H +    P+E         R  FL+ LR + P+ V L E ++  +  N  N          
Subjt:  QINRLDNHSLQSLNS--QVINKFP---DEILIVCAQFRLHQLKHCAPDE---------RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVE

Query:  YLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLW
        Y W   D+T +      SE+RR  E E +  + N    +G    E  E K +W ERMR A F      ED +   +A +  +   W M+ E+ D  + L 
Subjt:  YLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLW

Query:  WKCQPVSFCSFW
        WK   V F + W
Subjt:  WKCQPVSFCSFW

AT4G37650.1 GRAS family transcription factor9.8e-2926.11Show/hide
Query:  ENQDTGEDLINGCLSNSPT-TVSTRLPNTPTSHHLTPSDLTR--------KRKAPDDSVHKKSQTHQNPR--------KNQNNQSKNGAGKGSGIVEGVT
        + Q   + +I    S S T T +T  P T   ++   +D+          +      S H     H NP           Q + + +     +     + 
Subjt:  ENQDTGEDLINGCLSNSPT-TVSTRLPNTPTSHHLTPSDLTR--------KRKAPDDSVHKKSQTHQNPR--------KNQNNQSKNGAGKGSGIVEGVT

Query:  VMKKSVGNKKNSS----KSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLS-SNSSSSSSFSS
            S G+  + S      T  +++  +N   +WA+ +L   A A    D  R   +L  L EL+SP GD   +LA + L+AL + ++ S      +  +
Subjt:  VMKKSVGNKKNSS----KSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLS-SNSSSSSSFSS

Query:  YSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VIAPTVE
         ++T    +F ST     +++++KF EVSPW  F +  AN +IL  +  E      +HI+DI  +   QWPTLLEAL  RS   P L RLT  V+A    
Subjt:  YSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VIAPTVE

Query:  HDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKF---PDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSE
        +DQ              I +R+  FA+ + +  + N +  H +  L+   +N+    PDE+L +     +H +       R   + + R++ P+ V + E
Subjt:  HDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKF---PDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSE

Query:  NNMGCSCSNCGNFDTGFTRRVEYLWRF----LDSTSSAFKGRESEERRVMEGEAAKAL--------TNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDT
                  G FD  F R      R+     +S   +F  R S ER ++E  A +A+        ++  E  E   KW  RMRN+GF    + ++  D 
Subjt:  NNMGCSCSNCGNFDTGFTRRVEYLWRF----LDSTSSAFKGRESEERRVMEGEAAKAL--------TNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDT

Query:  ARASMRRY-DNNWEMRLEEKDGCVGLWWKCQPVSFCSFWK
         RA +RRY +  W M        + L W+ QPV + S W+
Subjt:  ARASMRRY-DNNWEMRLEEKDGCVGLWWKCQPVSFCSFWK

AT5G66770.1 GRAS family transcription factor4.3e-2428.61Show/hide
Query:  CANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSS
        CA  I   D       L  ++E  S  GD   R+A +   AL++ LS NS ++SS SS           ST+      S    ++  P+  F +  AN +
Subjt:  CANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSS

Query:  ILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVI-APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSL
        IL    E   +   +HI+D G+  G+QWP LL+AL  R+ G P  IR++ I AP++      E+P    P      +RL  FAK L++N     +    +
Subjt:  ILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVI-APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSL

Query:  QSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFK---GRESEER
          LN       PDE+L V    +L++L    P      LR  + + P+ V L E  +  +         GF  RV+   +F  +   + +   GR+SEER
Subjt:  QSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFK---GRESEER

Query:  ----RVMEGEAAKALTN------DGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYD-NNWEMRLEEKDGCVGLWWKCQPVSFCSFWK
            R + G     L          E  EEK +W   M NAGFE        +  A+  +  Y+ +N    +E K G + L W   P+   S W+
Subjt:  ----RVMEGEAAKALTN------DGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYD-NNWEMRLEEKDGCVGLWWKCQPVSFCSFWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCACTGAAGAAACAGGGCCAAACCACCCTTCAGATCACATATTGGACTGGTTAGAGGACTCGGTTCCTTTCTTTTCCCCATTCCTGGATGAGACTAACAACTCCAG
CTCTATAAACTGCTATCAATGGTGGGATGAGAACCAGGACACAGGCGAAGATCTGATTAATGGCTGTCTCAGCAACTCCCCCACTACTGTTAGTACTAGACTACCAAACA
CACCCACTTCCCACCACTTGACACCATCTGATTTGACCAGGAAAAGAAAAGCTCCAGATGATTCAGTTCATAAGAAATCACAAACCCATCAGAACCCAAGGAAGAACCAG
AACAATCAGAGCAAAAATGGTGCAGGTAAAGGCAGTGGAATTGTTGAGGGAGTGACTGTGATGAAGAAATCAGTAGGGAACAAGAAGAATTCATCAAAATCCACAGGAAA
TAACTATAATAACGGAAGCAACAGGGAAGGAAGGTGGGCAGAGCAATTGCTAAATCCCTGTGCTAATGCTATCATGAAAGGGGATGCGACAAGAGTACATCACCTTCTTT
GTGTTCTTCAAGAGCTCGCCTCACCCACCGGCGACGCCAACCACCGGCTTGCCGACCACGGTCTCCGAGCTTTGGCCCATCACCTGTCCTCCAATTCTTCGTCTTCTTCT
TCTTTTTCTTCTTATTCCTCCACAGTTGCCCCGGTTACTTTCGCTTCAACGGACCCTCGATTCTTCCAGAGATCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGC
ATTTCCGAACAACATTGCAAATTCTTCAATCCTCCACATTCTCTCTGAAGAACCTAATCGCCCGCGAAATCTTCACATTCTTGACATTGGGGTTTCTCATGGTGTGCAAT
GGCCGACGCTCCTCGAGGCCTTGACTCGCCGTTCCGGTGGACCTCCGCCGCTAATTCGTCTTACAGTTATCGCTCCAACCGTCGAACATGACCAAAATACCGAGACGCCG
TTTTCAATTGGTCCACCAGGAGACAACATCTCGTCTCGGCTTCTCAGTTTCGCCAAATCCTTGAACATCAATTTACAAATCAACCGCCTCGACAATCACTCACTCCAGAG
TTTAAATTCGCAAGTAATCAACAAATTCCCAGACGAAATCCTGATCGTTTGCGCACAGTTCAGACTCCATCAGTTGAAACACTGTGCTCCTGACGAAAGATTCGAGTTCT
TACGAAATCTAAGAAAAATGGAACCAAAGGCAGTGATTCTAAGTGAAAACAACATGGGATGTAGCTGTAGCAACTGCGGAAATTTCGACACCGGATTCACACGACGAGTT
GAATACCTATGGAGATTTCTGGATTCAACAAGCTCGGCATTCAAAGGGAGAGAAAGCGAAGAAAGAAGAGTGATGGAAGGTGAAGCGGCAAAAGCACTGACGAATGATGG
GGAAATGAACGAGGAAAAGGGAAAATGGTGCGAAAGAATGAGAAATGCGGGTTTTGAGAGAAAATTATTCGGTGAAGACACCATTGATACAGCCCGAGCTTCAATGAGAA
GGTATGATAATAACTGGGAGATGAGATTGGAAGAGAAAGATGGATGTGTAGGCTTATGGTGGAAATGCCAACCCGTTTCCTTTTGTTCGTTTTGGAAGTTGGGGATTAAA
TCCAATGCCGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCACTGAAGAAACAGGGCCAAACCACCCTTCAGATCACATATTGGACTGGTTAGAGGACTCGGTTCCTTTCTTTTCCCCATTCCTGGATGAGACTAACAACTCCAG
CTCTATAAACTGCTATCAATGGTGGGATGAGAACCAGGACACAGGCGAAGATCTGATTAATGGCTGTCTCAGCAACTCCCCCACTACTGTTAGTACTAGACTACCAAACA
CACCCACTTCCCACCACTTGACACCATCTGATTTGACCAGGAAAAGAAAAGCTCCAGATGATTCAGTTCATAAGAAATCACAAACCCATCAGAACCCAAGGAAGAACCAG
AACAATCAGAGCAAAAATGGTGCAGGTAAAGGCAGTGGAATTGTTGAGGGAGTGACTGTGATGAAGAAATCAGTAGGGAACAAGAAGAATTCATCAAAATCCACAGGAAA
TAACTATAATAACGGAAGCAACAGGGAAGGAAGGTGGGCAGAGCAATTGCTAAATCCCTGTGCTAATGCTATCATGAAAGGGGATGCGACAAGAGTACATCACCTTCTTT
GTGTTCTTCAAGAGCTCGCCTCACCCACCGGCGACGCCAACCACCGGCTTGCCGACCACGGTCTCCGAGCTTTGGCCCATCACCTGTCCTCCAATTCTTCGTCTTCTTCT
TCTTTTTCTTCTTATTCCTCCACAGTTGCCCCGGTTACTTTCGCTTCAACGGACCCTCGATTCTTCCAGAGATCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGC
ATTTCCGAACAACATTGCAAATTCTTCAATCCTCCACATTCTCTCTGAAGAACCTAATCGCCCGCGAAATCTTCACATTCTTGACATTGGGGTTTCTCATGGTGTGCAAT
GGCCGACGCTCCTCGAGGCCTTGACTCGCCGTTCCGGTGGACCTCCGCCGCTAATTCGTCTTACAGTTATCGCTCCAACCGTCGAACATGACCAAAATACCGAGACGCCG
TTTTCAATTGGTCCACCAGGAGACAACATCTCGTCTCGGCTTCTCAGTTTCGCCAAATCCTTGAACATCAATTTACAAATCAACCGCCTCGACAATCACTCACTCCAGAG
TTTAAATTCGCAAGTAATCAACAAATTCCCAGACGAAATCCTGATCGTTTGCGCACAGTTCAGACTCCATCAGTTGAAACACTGTGCTCCTGACGAAAGATTCGAGTTCT
TACGAAATCTAAGAAAAATGGAACCAAAGGCAGTGATTCTAAGTGAAAACAACATGGGATGTAGCTGTAGCAACTGCGGAAATTTCGACACCGGATTCACACGACGAGTT
GAATACCTATGGAGATTTCTGGATTCAACAAGCTCGGCATTCAAAGGGAGAGAAAGCGAAGAAAGAAGAGTGATGGAAGGTGAAGCGGCAAAAGCACTGACGAATGATGG
GGAAATGAACGAGGAAAAGGGAAAATGGTGCGAAAGAATGAGAAATGCGGGTTTTGAGAGAAAATTATTCGGTGAAGACACCATTGATACAGCCCGAGCTTCAATGAGAA
GGTATGATAATAACTGGGAGATGAGATTGGAAGAGAAAGATGGATGTGTAGGCTTATGGTGGAAATGCCAACCCGTTTCCTTTTGTTCGTTTTGGAAGTTGGGGATTAAA
TCCAATGCCGTTTGA
Protein sequenceShow/hide protein sequence
MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKSQTHQNPRKNQ
NNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSS
SFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETP
FSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRV
EYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKCQPVSFCSFWKLGIK
SNAV