; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G029020 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G029020
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionnodulation-signaling pathway 1 protein
Genome locationchr02:35244960..35246624
RNA-Seq ExpressionLsi02G029020
SyntenyLsi02G029020
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0042446 - hormone biosynthetic process (biological process)
GO:2000032 - regulation of secondary shoot formation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR005202 - Transcription factor GRAS
IPR030015 - Scarecrow-like protein 29/nodulation signalling pathway 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059401.1 nodulation-signaling pathway 1 protein [Cucumis melo var. makuwa]5.0e-29391.37Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EETGP+HPSDHILDWLEDSVPFFS FLDET+NSSSINCYQWWDENQDTGEDLINGCLSNSPTT VSTR PNTPTSHHL PSDLT+KRKAPDDSVHKK
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSKN A KGSG VEGVTV+KKSVGNKKN+SKSTGNNYNNGSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE
        LADHGLRALA+HLSSN SSSSSFSSYSSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLHILDIGVSHGVQWPT LE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVI PTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ INK  DEILIVC+QFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK
        R EFL+NLRKMEPKAVILSENNMGCSCS C NF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N +GEMNEEKGKWCERMRN GFERK
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK

Query:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV
         FGEDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCS WKLGIKSNA+
Subjt:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV

XP_004141813.1 protein NODULATION SIGNALING PATHWAY 1 [Cucumis sativus]2.2e-29692.45Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EETGPNHPSDHILDWLEDSVPFFS FLDET+NSSSINCYQWWDENQDTGEDLINGCLSNSPTT VSTR PNTPTSH LTPSDLT+KRKAPDDSVHKK
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSKN A KGSG VEGVTVMKKSVGNKKN+SKSTGNNYN+GSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE
        LADHGLRALA+HLSSN SSSSSFSSYSSTVAP TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLHILDIGVSHGVQWPT LE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVIAPT+EHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ INK  DEILIVCAQFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK
        R EFL NLRKMEPKAVILSENNMGCSCS CGNF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N DGEMNEEKGKWCERMRN GFERK
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK

Query:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV
         FGEDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCSFWKLGIKSNA+
Subjt:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV

XP_008462311.1 PREDICTED: nodulation-signaling pathway 1 protein [Cucumis melo]1.7e-29391.37Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EETGP+HPSDHILDWLEDSVPFFS FLDET+NSSSINCYQWWDENQDTGEDLINGCLSNSPTT VSTR PNTPTSHHL PSDLT+KRKAPDDSVHKK
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSKN A KGSG VEGVTV+KKSVGNKKN+SKSTGNNYNNGSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE
        LADHGLRALA+HLSSN SSSSSFSSYSSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLH+LDIGVSHGVQWPT LE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ INK  DEILIVC+QFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK
        R EFL+NLRKMEPKAVILSENNMGCSCS C NF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N +GEMNEEKGKWCERMRN GFERK
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK

Query:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV
         FGEDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCS WKLGIKSNA+
Subjt:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV

XP_023000031.1 nodulation-signaling pathway 1 protein [Cucurbita maxima]1.6e-28388.97Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFF-SPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EE G NHPSDHILDWL DSVPFF SPF D++ NSSSINCYQWWDENQD G+DLINGCLS+SPTTVST+ PNT TSHHLTPSDLT+KRKAPDD+VHK 
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFF-SPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQT QN RKNQNNQSKNGA KGSG VEGVTV+KKSVGNK+NSSK+TGNN +NGSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE
        LA +GLRALAH+LSSNSS  SS    SSTVAPVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNR RNLHILDIGVSHGVQWPT LE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SLQS NSQVI K PDEILIVCAQFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKL
        RFEFL+NLRK+EPKAVILSENNM CSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRESEER+VMEGEAA+ LTN GEMNEE  KWCERMRNAGF RKL
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKL

Query:  FGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSN
        FGEDTIDTARASMRRYDNNWEMR+EEKDGCVGLWWKGQPVSFCSFWKLG+KSN
Subjt:  FGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSN

XP_038897214.1 protein NODULATION SIGNALING PATHWAY 1 [Benincasa hispida]6.3e-30494.4Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKS
        MT EETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWD NQDTGEDLING LSNSPTTVST+L N PTSHHLTPSDLT+KRKAPDDSVHKKS
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKS

Query:  QTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRL
        QTHQN RKNQNNQSKNG   G G VEGVTVMKKSVGNKKNSSK TGNN NNGSNREGRWAEQLLNPCA+AI+KGDATRVHHLLCVLQELASPTGDANHRL
Subjt:  QTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRL

Query:  ADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEA
        ADHGLRALAHHLSSN SSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPT LEA
Subjt:  ADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEA

Query:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER
        LTRRSGGPP LIRLTVI PT+EHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRL+NHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHC PDER
Subjt:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER

Query:  FEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLF
        FEFL+NLRKMEPKAVILSENNMGCSCSNCGNFDT FTRRVEYLWRFLDSTS+AFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLF
Subjt:  FEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLF

Query:  GEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV
         EDTIDTARASMRRYDNNWEMR+EEKDGC+GLWWKGQPVSFCSFWKLGIKSNA+
Subjt:  GEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV

TrEMBL top hitse value%identityAlignment
A0A0A0KCK6 GRAS domain-containing protein1.0e-29692.45Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EETGPNHPSDHILDWLEDSVPFFS FLDET+NSSSINCYQWWDENQDTGEDLINGCLSNSPTT VSTR PNTPTSH LTPSDLT+KRKAPDDSVHKK
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSKN A KGSG VEGVTVMKKSVGNKKN+SKSTGNNYN+GSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE
        LADHGLRALA+HLSSN SSSSSFSSYSSTVAP TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLHILDIGVSHGVQWPT LE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVIAPT+EHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ INK  DEILIVCAQFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK
        R EFL NLRKMEPKAVILSENNMGCSCS CGNF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N DGEMNEEKGKWCERMRN GFERK
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK

Query:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV
         FGEDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCSFWKLGIKSNA+
Subjt:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV

A0A1S3CGQ3 nodulation-signaling pathway 1 protein8.3e-29491.37Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EETGP+HPSDHILDWLEDSVPFFS FLDET+NSSSINCYQWWDENQDTGEDLINGCLSNSPTT VSTR PNTPTSHHL PSDLT+KRKAPDDSVHKK
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSKN A KGSG VEGVTV+KKSVGNKKN+SKSTGNNYNNGSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE
        LADHGLRALA+HLSSN SSSSSFSSYSSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLH+LDIGVSHGVQWPT LE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ INK  DEILIVC+QFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK
        R EFL+NLRKMEPKAVILSENNMGCSCS C NF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N +GEMNEEKGKWCERMRN GFERK
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK

Query:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV
         FGEDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCS WKLGIKSNA+
Subjt:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV

A0A5A7V101 Nodulation-signaling pathway 1 protein2.4e-29391.37Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EETGP+HPSDHILDWLEDSVPFFS FLDET+NSSSINCYQWWDENQDTGEDLINGCLSNSPTT VSTR PNTPTSHHL PSDLT+KRKAPDDSVHKK
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSKN A KGSG VEGVTV+KKSVGNKKN+SKSTGNNYNNGSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE
        LADHGLRALA+HLSSN SSSSSFSSYSSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLHILDIGVSHGVQWPT LE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVI PTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ INK  DEILIVC+QFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK
        R EFL+NLRKMEPKAVILSENNMGCSCS C NF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N +GEMNEEKGKWCERMRN GFERK
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERMRNAGFERK

Query:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV
         FGEDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCS WKLGIKSNA+
Subjt:  LFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSNAV

A0A6J1HKE2 nodulation-signaling pathway 1 protein2.4e-27788.72Show/hide
Query:  DHILDWLEDSVPFF-SPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKSQTHQNPRKNQN
        DHILDWL DSVPFF SPF D++ NSSSINCYQWWDENQD G+DLINGCLS+SPTTVST+ PNT TSHHLTPSDLT+KRKAPDD+VHK SQT QN RKNQN
Subjt:  DHILDWLEDSVPFF-SPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKSQTHQNPRKNQN

Query:  NQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHH
        NQS+NGA K SG V GVTVMKKSVGNK+NSSK+TGNN NNG+N+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHRLA +GLRALAH+
Subjt:  NQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHH

Query:  LSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGGPPPL
        LSSNSS SS     SST+APVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNR RNLHILDIGVSHGVQWPT LEALTRRSGGPPPL
Subjt:  LSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKME
        IRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL SLN+QVI KFPDEILIVCAQFRLHQLKH APDERFEFL+NLRK+E
Subjt:  IRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKME

Query:  PKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARAS
        PKAVILSENNM CSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRESEER++MEGEAAKAL N+GEMNEE  KWCERMRNAGF RKLFGEDTIDTARAS
Subjt:  PKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARAS

Query:  MRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSN
        MRRYDNNWEMR+EEKDGCVGLWWKGQPVSFCSFWKLG+KSN
Subjt:  MRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSN

A0A6J1KER4 nodulation-signaling pathway 1 protein7.8e-28488.97Show/hide
Query:  MTTEETGPNHPSDHILDWLEDSVPFF-SPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK
        MT EE G NHPSDHILDWL DSVPFF SPF D++ NSSSINCYQWWDENQD G+DLINGCLS+SPTTVST+ PNT TSHHLTPSDLT+KRKAPDD+VHK 
Subjt:  MTTEETGPNHPSDHILDWLEDSVPFF-SPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR
        SQT QN RKNQNNQSKNGA KGSG VEGVTV+KKSVGNK+NSSK+TGNN +NGSN+EGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE
        LA +GLRALAH+LSSNSS  SS    SSTVAPVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNR RNLHILDIGVSHGVQWPT LE
Subjt:  LADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE
        ALTRRSGGPPPLIRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SLQS NSQVI K PDEILIVCAQFRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKL
        RFEFL+NLRK+EPKAVILSENNM CSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRESEER+VMEGEAA+ LTN GEMNEE  KWCERMRNAGF RKL
Subjt:  RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKL

Query:  FGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSN
        FGEDTIDTARASMRRYDNNWEMR+EEKDGCVGLWWKGQPVSFCSFWKLG+KSN
Subjt:  FGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIKSN

SwissProt top hitse value%identityAlignment
A1DQP9 Protein NODULATION SIGNALING PATHWAY 11.4e-17661.97Show/hide
Query:  SDHILDWLEDSVPFFSPFLDE-TNNSSSINCYQWWDENQ---DTGEDLINGCLSNSPTTVSTRLPNTPT-----SHHLTP-SDLTRKRKAPDDSVHKKSQ
        SDHILDWLE SV FF  FLDE  NNS  I  Y  WD+ Q   +TG    N   S + T V+T   +T +     S++  P SDL +KR A D+S  K   
Subjt:  SDHILDWLEDSVPFFSPFLDE-TNNSSSINCYQWWDENQ---DTGEDLINGCLSNSPTTVSTRLPNTPT-----SHHLTP-SDLTRKRKAPDDSVHKKSQ

Query:  THQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLA
          QN  K    +  N    G  + +          NKK  +K+ G+N N+G+++EGRWAEQLLNPCA AI  G+  RV HLL VL ELASPTGD NHRLA
Subjt:  THQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLA

Query:  DHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPN-RPRNLHILDIGVSHGVQWPTFLEA
         HGLRAL HHLSS+SSS +S  +       +TFAST+PRFFQ+SL+KF+EVSPWF+FPNNIAN+SIL +L+EE N   R LHILDIGVSHGVQWPT L+A
Subjt:  DHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPN-RPRNLHILDIGVSHGVQWPTFLEA

Query:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER
        L+RRSGGPP ++RLTV+  T E+DQN ETPFS  PPG N   RLL +A+S+NINLQINR++NHSLQ+LN+Q I+  PDEILIVCAQFRLH L H +PDER
Subjt:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER

Query:  FEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLF
         EFL+ LR MEP+ VILSENN  C CS CGNF  GFTRRVEYLWRFLDSTSSAFKGRES+ERRVMEGEAAKALTN  EMNEEK KWC RM+ AGF  ++F
Subjt:  FEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLF

Query:  GEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKL
        GED +D  RA +R+YD+NWEM++EEK+  VGLWWKGQPVSFCS WKL
Subjt:  GEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKL

Q4VYC8 Protein NODULATION SIGNALING PATHWAY 11.8e-18160.61Show/hide
Query:  PNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNS-------------PTTVSTRLPNTPTSHHLTPSDLTRKRKAPDD
        PN  SDHILDWLE SV FF  FLD+  N+  I+ Y+ W++NQD          +NS              TT ST      + +++  SDL +KR A D+
Subjt:  PNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNS-------------PTTVSTRLPNTPTSHHLTPSDLTRKRKAPDD

Query:  -SVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPT
         S+ K+ Q  +N  K   ++  N +  G   +EG TV++KS GNKK ++K+ G+N NNG+N++GRWAEQLLNPCA AI  G+  RV HLL VL ELAS T
Subjt:  -SVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPT

Query:  GDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQ
        GDANHRLA HGLRAL HHLSS+SSS+ S          +TFAST+PRFFQ+SL+KF+E SPWF+FPNNIAN+SIL +L+EEPN  R LHILDIGVSHGVQ
Subjt:  GDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQ

Query:  WPTFLEALTRRSGGPPPLIRLTVI--APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQ
        WPTFLEAL+RR GGPPPL+RLTV+  + + E+DQN ETPFSIGP GD  SS LL +A+SLN+NLQI +LDNH LQ+LN++ ++   DE LIVCAQFRLH 
Subjt:  WPTFLEALTRRSGGPPPLIRLTVI--APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQ

Query:  LKHCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMR
        L H  PDER EFL+ LR MEPK VILSENNM C CS+CG+F TGF+RRVEYLWRFLDSTSSAFK R+S+ER++MEGEAAKALTN  EMNE + KWCERM+
Subjt:  LKHCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMR

Query:  NAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKL
         AGF  ++FGED ID  RA +R+YDNNWEM++EE    V LWWK QPVSFCS WKL
Subjt:  NAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKL

Q84MQ9 Protein NODULATION SIGNALING PATHWAY 11.1e-8540.27Show/hide
Query:  WWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSK
        WW  +    +D I   ++ + +  ST  P   +    +P+  +        S  +KS  H+ P        K G GKG G                    
Subjt:  WWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSK

Query:  STGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSS--NSSSSSSFSSYSSTVAPVT-FASTDPR
                GS+R+ RWAEQLLNPCA A+  G+ +RV HL  VL EL S +GDANHRLA HGLRALA  L +    +++++      +  P T FA+ +PR
Subjt:  STGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSS--NSSSSSSFSSYSSTVAPVT-FASTDPR

Query:  FFQRSLIKFHEVSPWFAFPNNIANSSILH--ILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGG-PPPLIRLTVIAPTVEHDQNTETPFSIGPPG
         F+ SLI+FHEVSPWFA PN +AN++I            PR LH++D+GVSHGVQWPT LE+LTR+ GG  PP +RLTV+ P      +   PFS  PPG
Subjt:  FFQRSLIKFHEVSPWFAFPNNIANSSILH--ILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGG-PPPLIRLTVIAPTVEHDQNTETPFSIGPPG

Query:  DNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSE--NNMGCSCSNCGNFDTG
         + S  LL +AKS+N++L+I+R       +     +     E L+VC QFR   L H A +ER E LR  R + P+ V+LSE  + +G    + G+    
Subjt:  DNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSE--NNMGCSCSNCGNFDTG

Query:  FTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAK--ALTNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRL-EEKDGCVGL
        F  R+E LWRFL+STS+AFKG++ EERR++E EA    A  +     E +  W ERM  AGFE   FG + +++AR+ +R+YD+ WEM         V L
Subjt:  FTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAK--ALTNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRL-EEKDGCVGL

Query:  WWKGQPVSFCSFWK
         WKGQPVSFCS W+
Subjt:  WWKGQPVSFCSFWK

Q9LRW3 Scarecrow-like protein 291.4e-12548.64Show/hide
Query:  MTTEET-GPNHPSDHILDWLEDSVPFFS-PFLDETNNSSSINCYQ-W-WDENQDTGEDLINGCLSN-SPTTVSTRLPN--TPTSHHLTPSDLTRKRKAPD
        M  EET  PN   DH+L WLEDSV     P  D++      +  Q W WD+ QD     I     + S   V     N    T       DL  + + P+
Subjt:  MTTEET-GPNHPSDHILDWLEDSVPFFS-PFLDETNNSSSINCYQ-W-WDENQDTGEDLINGCLSN-SPTTVSTRLPN--TPTSHHLTPSDLTRKRKAPD

Query:  DSVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPT
        D   K+S                      G +E   V KKS  +K+ + KS+  +  +G N+EGRWAE+LLNPCA AI   +++RV H LCVL ELAS +
Subjt:  DSVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPT

Query:  GDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQ
        GDAN RLA  GLRAL HHLSS+S SSS +  +       TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG+Q
Subjt:  GDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQ

Query:  WPTFLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLK
        WPT LEAL+ R  GPPP +R+TVI+     D   + PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I+  P E LIVCAQFRLH LK
Subjt:  WPTFLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLK

Query:  HCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNA
        H   DER E L+ +R + PK V+L ENN  CS S   +F  GF++++EY+W+FLDSTSS FK   SEER++MEGEA K L N G+MNE K KW ERMR A
Subjt:  HCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNA

Query:  GFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWK
        GF  + F ED +D A++ +R+YDNNWE+R+E+ D   GL WKG+ VSFCS WK
Subjt:  GFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWK

Q9SN22 Scarecrow-like protein 323.4e-3429.37Show/hide
Query:  EGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPW
        +  + EQLL  CA AI   DA   H +L VL  +A P GD+  RL    LRAL       S + S   + SST++ +  A    RF    L  F +++PW
Subjt:  EGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPW

Query:  FAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINL
          F    AN++IL  +         +HI+D+ ++H +Q PT ++A+  R   PPPL++LTV++       +   P  I    + + S+L++FA + NI +
Subjt:  FAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINL

Query:  QINRLDNHSLQSLNS--QVINKFP---DEILIVCAQFRLHQLKHCAPDE---------RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVE
        +   + +      +S  Q +  +P   +E L+V      H +    P+E         R  FL+ LR + P+ V L E ++  +  N  N          
Subjt:  QINRLDNHSLQSLNS--QVINKFP---DEILIVCAQFRLHQLKHCAPDE---------RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVE

Query:  YLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLW
        Y W   D+T +      SE+RR  E E +  + N    +G    E  E K +W ERMR A F      ED +   +A +  +   W M+ E+ D  + L 
Subjt:  YLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLW

Query:  WKGQPVSFCSFW
        WKG  V F + W
Subjt:  WKGQPVSFCSFW

Arabidopsis top hitse value%identityAlignment
AT3G03450.1 RGA-like 22.8e-2326.01Show/hide
Query:  KNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTD
        ++S +ST +     S   G      L  CA AI + +      L+  +  LA     A  ++A +  +ALA  +  + ++ +              A+ +
Subjt:  KNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTD

Query:  PRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGD
        P F +   + F+E  P+  F +  AN +IL    E     R +H++D+G++ G+QWP  ++AL  R GGPP   RLT I P       TE   S+     
Subjt:  PRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGD

Query:  NISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPD-EILIVCAQFRLHQLKHCAPDERFEFLRN-LRKMEPKAVILSENNMGCSCSNCGNFDTGF
         +  +L  FA+++ +  +   L   SL  L  ++    P+ E L+V + F LH+L   A     E L N ++ ++P  V + E        N   F   F
Subjt:  NISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPD-EILIVCAQFRLHQLKHCAPDERFEFLRN-LRKMEPKAVILSENNMGCSCSNCGNFDTGF

Query:  TRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDG
           + Y     DS   ++    S++R + E    + + N    +G    E +E   +W  RM++AGF+    G      A   +  Y      R+EE DG
Subjt:  TRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDG

Query:  CVGLWWKGQPVSFCSFWKL
        C+ + W+ +P+   S WKL
Subjt:  CVGLWWKGQPVSFCSFWKL

AT3G13840.1 GRAS family transcription factor1.0e-12648.64Show/hide
Query:  MTTEET-GPNHPSDHILDWLEDSVPFFS-PFLDETNNSSSINCYQ-W-WDENQDTGEDLINGCLSN-SPTTVSTRLPN--TPTSHHLTPSDLTRKRKAPD
        M  EET  PN   DH+L WLEDSV     P  D++      +  Q W WD+ QD     I     + S   V     N    T       DL  + + P+
Subjt:  MTTEET-GPNHPSDHILDWLEDSVPFFS-PFLDETNNSSSINCYQ-W-WDENQDTGEDLINGCLSN-SPTTVSTRLPN--TPTSHHLTPSDLTRKRKAPD

Query:  DSVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPT
        D   K+S                      G +E   V KKS  +K+ + KS+  +  +G N+EGRWAE+LLNPCA AI   +++RV H LCVL ELAS +
Subjt:  DSVHKKSQTHQNPRKNQNNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPT

Query:  GDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQ
        GDAN RLA  GLRAL HHLSS+S SSS +  +       TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG+Q
Subjt:  GDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQ

Query:  WPTFLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLK
        WPT LEAL+ R  GPPP +R+TVI+     D   + PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I+  P E LIVCAQFRLH LK
Subjt:  WPTFLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLK

Query:  HCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNA
        H   DER E L+ +R + PK V+L ENN  CS S   +F  GF++++EY+W+FLDSTSS FK   SEER++MEGEA K L N G+MNE K KW ERMR A
Subjt:  HCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNA

Query:  GFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWK
        GF  + F ED +D A++ +R+YDNNWE+R+E+ D   GL WKG+ VSFCS WK
Subjt:  GFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWK

AT3G49950.1 GRAS family transcription factor2.4e-3529.37Show/hide
Query:  EGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPW
        +  + EQLL  CA AI   DA   H +L VL  +A P GD+  RL    LRAL       S + S   + SST++ +  A    RF    L  F +++PW
Subjt:  EGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPW

Query:  FAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINL
          F    AN++IL  +         +HI+D+ ++H +Q PT ++A+  R   PPPL++LTV++       +   P  I    + + S+L++FA + NI +
Subjt:  FAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINL

Query:  QINRLDNHSLQSLNS--QVINKFP---DEILIVCAQFRLHQLKHCAPDE---------RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVE
        +   + +      +S  Q +  +P   +E L+V      H +    P+E         R  FL+ LR + P+ V L E ++  +  N  N          
Subjt:  QINRLDNHSLQSLNS--QVINKFP---DEILIVCAQFRLHQLKHCAPDE---------RFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVE

Query:  YLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLW
        Y W   D+T +      SE+RR  E E +  + N    +G    E  E K +W ERMR A F      ED +   +A +  +   W M+ E+ D  + L 
Subjt:  YLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLW

Query:  WKGQPVSFCSFW
        WKG  V F + W
Subjt:  WKGQPVSFCSFW

AT4G37650.1 GRAS family transcription factor9.8e-2925.93Show/hide
Query:  ENQDTGEDLINGCLSNSPT-TVSTRLPNTPTSHHLTPSDLTR--------KRKAPDDSVHKKSQTHQNPR--------KNQNNQSKNGAGKGSGIVEGVT
        + Q   + +I    S S T T +T  P T   ++   +D+          +      S H     H NP           Q + + +     +     + 
Subjt:  ENQDTGEDLINGCLSNSPT-TVSTRLPNTPTSHHLTPSDLTR--------KRKAPDDSVHKKSQTHQNPR--------KNQNNQSKNGAGKGSGIVEGVT

Query:  VMKKSVGNKKNSS----KSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLS-SNSSSSSSFSS
            S G+  + S      T  +++  +N   +WA+ +L   A A    D  R   +L  L EL+SP GD   +LA + L+AL + ++ S      +  +
Subjt:  VMKKSVGNKKNSS----KSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLS-SNSSSSSSFSS

Query:  YSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGGPPPLIRLT--VIAPTVE
         ++T    +F ST     +++++KF EVSPW  F +  AN +IL  +  E      +HI+DI  +   QWPT LEAL  RS   P L RLT  V+A    
Subjt:  YSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGGPPPLIRLT--VIAPTVE

Query:  HDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKF---PDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSE
        +DQ              I +R+  FA+ + +  + N +  H +  L+   +N+    PDE+L +     +H +       R   + + R++ P+ V + E
Subjt:  HDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKF---PDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSE

Query:  NNMGCSCSNCGNFDTGFTRRVEYLWRF----LDSTSSAFKGRESEERRVMEGEAAKAL--------TNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDT
                  G FD  F R      R+     +S   +F  R S ER ++E  A +A+        ++  E  E   KW  RMRN+GF    + ++  D 
Subjt:  NNMGCSCSNCGNFDTGFTRRVEYLWRF----LDSTSSAFKGRESEERRVMEGEAAKAL--------TNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDT

Query:  ARASMRRY-DNNWEMRLEEKDGCVGLWWKGQPVSFCSFWK
         RA +RRY +  W M        + L W+ QPV + S W+
Subjt:  ARASMRRY-DNNWEMRLEEKDGCVGLWWKGQPVSFCSFWK

AT5G66770.1 GRAS family transcription factor4.3e-2428.35Show/hide
Query:  CANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSS
        CA  I   D       L  ++E  S  GD   R+A +   AL++ LS NS ++SS SS           ST+      S    ++  P+  F +  AN +
Subjt:  CANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSSSFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSS

Query:  ILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGGPPPLIRLTVI-APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSL
        IL    E   +   +HI+D G+  G+QWP  L+AL  R+ G P  IR++ I AP++      E+P    P      +RL  FAK L++N     +    +
Subjt:  ILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGGPPPLIRLTVI-APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSL

Query:  QSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFK---GRESEER
          LN       PDE+L V    +L++L    P      LR  + + P+ V L E  +  +         GF  RV+   +F  +   + +   GR+SEER
Subjt:  QSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFK---GRESEER

Query:  ----RVMEGEAAKALTN------DGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYD-NNWEMRLEEKDGCVGLWWKGQPVSFCSFWK
            R + G     L          E  EEK +W   M NAGFE        +  A+  +  Y+ +N    +E K G + L W   P+   S W+
Subjt:  ----RVMEGEAAKALTN------DGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYD-NNWEMRLEEKDGCVGLWWKGQPVSFCSFWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCACTGAAGAAACAGGGCCAAACCACCCTTCAGATCACATATTGGACTGGTTAGAGGACTCGGTTCCTTTCTTTTCCCCATTCCTGGATGAGACTAACAACTCCAG
CTCTATAAACTGCTATCAATGGTGGGATGAGAACCAGGACACAGGCGAAGATCTGATTAATGGCTGTCTCAGCAACTCCCCCACTACTGTTAGTACTAGACTACCAAACA
CACCCACTTCCCACCACTTGACACCATCTGATTTGACCAGGAAAAGAAAAGCTCCAGATGATTCAGTTCATAAGAAATCACAAACCCATCAGAACCCAAGGAAGAACCAG
AACAATCAGAGCAAAAATGGTGCAGGTAAAGGCAGTGGAATTGTTGAGGGAGTGACTGTGATGAAGAAATCAGTAGGGAACAAGAAGAATTCATCAAAATCCACAGGAAA
TAACTATAATAACGGAAGCAACAGGGAAGGAAGGTGGGCAGAGCAATTGCTAAATCCCTGTGCTAATGCTATCATGAAAGGGGATGCGACAAGAGTACATCACCTTCTTT
GTGTTCTTCAAGAGCTCGCCTCACCCACCGGCGACGCCAACCACCGGCTTGCCGACCACGGTCTCCGAGCTTTGGCCCATCACCTGTCCTCCAATTCTTCGTCTTCTTCT
TCTTTTTCTTCTTATTCCTCCACAGTTGCCCCGGTTACTTTCGCTTCAACGGACCCTCGATTCTTCCAGAGATCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGC
ATTTCCGAACAACATTGCAAATTCTTCAATCCTCCACATTCTCTCTGAAGAACCTAATCGCCCGCGAAATCTTCACATTCTTGACATTGGGGTTTCTCATGGTGTGCAAT
GGCCGACGTTCCTCGAGGCCTTGACTCGCCGTTCCGGTGGACCTCCGCCGCTAATTCGTCTTACAGTTATCGCTCCAACCGTCGAACATGACCAAAATACCGAGACGCCG
TTTTCAATTGGTCCACCAGGAGACAACATCTCGTCTCGGCTTCTCAGTTTCGCCAAATCCTTGAACATCAATTTACAAATCAACCGCCTCGACAATCACTCACTCCAGAG
TTTAAATTCGCAAGTAATCAACAAATTCCCAGACGAAATCCTGATCGTTTGCGCACAGTTCAGACTCCATCAGTTGAAACACTGTGCTCCTGACGAAAGATTCGAGTTCT
TACGAAATCTAAGAAAAATGGAACCAAAGGCAGTGATTCTAAGTGAAAACAACATGGGATGTAGCTGTAGCAACTGCGGAAATTTCGACACCGGATTCACACGACGAGTT
GAATACCTATGGAGATTTCTGGATTCAACAAGCTCGGCATTCAAAGGGAGAGAAAGCGAAGAAAGAAGAGTGATGGAAGGTGAAGCGGCAAAAGCACTGACGAATGATGG
GGAAATGAACGAGGAAAAGGGAAAATGGTGCGAAAGAATGAGAAATGCGGGTTTTGAGAGAAAATTATTCGGTGAAGACACCATTGATACAGCCCGAGCTTCAATGAGAA
GGTATGATAATAACTGGGAGATGAGATTGGAAGAGAAAGATGGATGTGTAGGCTTATGGTGGAAAGGCCAACCCGTTTCCTTTTGTTCGTTTTGGAAGTTGGGGATTAAA
TCCAATGCCGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCACTGAAGAAACAGGGCCAAACCACCCTTCAGATCACATATTGGACTGGTTAGAGGACTCGGTTCCTTTCTTTTCCCCATTCCTGGATGAGACTAACAACTCCAG
CTCTATAAACTGCTATCAATGGTGGGATGAGAACCAGGACACAGGCGAAGATCTGATTAATGGCTGTCTCAGCAACTCCCCCACTACTGTTAGTACTAGACTACCAAACA
CACCCACTTCCCACCACTTGACACCATCTGATTTGACCAGGAAAAGAAAAGCTCCAGATGATTCAGTTCATAAGAAATCACAAACCCATCAGAACCCAAGGAAGAACCAG
AACAATCAGAGCAAAAATGGTGCAGGTAAAGGCAGTGGAATTGTTGAGGGAGTGACTGTGATGAAGAAATCAGTAGGGAACAAGAAGAATTCATCAAAATCCACAGGAAA
TAACTATAATAACGGAAGCAACAGGGAAGGAAGGTGGGCAGAGCAATTGCTAAATCCCTGTGCTAATGCTATCATGAAAGGGGATGCGACAAGAGTACATCACCTTCTTT
GTGTTCTTCAAGAGCTCGCCTCACCCACCGGCGACGCCAACCACCGGCTTGCCGACCACGGTCTCCGAGCTTTGGCCCATCACCTGTCCTCCAATTCTTCGTCTTCTTCT
TCTTTTTCTTCTTATTCCTCCACAGTTGCCCCGGTTACTTTCGCTTCAACGGACCCTCGATTCTTCCAGAGATCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGC
ATTTCCGAACAACATTGCAAATTCTTCAATCCTCCACATTCTCTCTGAAGAACCTAATCGCCCGCGAAATCTTCACATTCTTGACATTGGGGTTTCTCATGGTGTGCAAT
GGCCGACGTTCCTCGAGGCCTTGACTCGCCGTTCCGGTGGACCTCCGCCGCTAATTCGTCTTACAGTTATCGCTCCAACCGTCGAACATGACCAAAATACCGAGACGCCG
TTTTCAATTGGTCCACCAGGAGACAACATCTCGTCTCGGCTTCTCAGTTTCGCCAAATCCTTGAACATCAATTTACAAATCAACCGCCTCGACAATCACTCACTCCAGAG
TTTAAATTCGCAAGTAATCAACAAATTCCCAGACGAAATCCTGATCGTTTGCGCACAGTTCAGACTCCATCAGTTGAAACACTGTGCTCCTGACGAAAGATTCGAGTTCT
TACGAAATCTAAGAAAAATGGAACCAAAGGCAGTGATTCTAAGTGAAAACAACATGGGATGTAGCTGTAGCAACTGCGGAAATTTCGACACCGGATTCACACGACGAGTT
GAATACCTATGGAGATTTCTGGATTCAACAAGCTCGGCATTCAAAGGGAGAGAAAGCGAAGAAAGAAGAGTGATGGAAGGTGAAGCGGCAAAAGCACTGACGAATGATGG
GGAAATGAACGAGGAAAAGGGAAAATGGTGCGAAAGAATGAGAAATGCGGGTTTTGAGAGAAAATTATTCGGTGAAGACACCATTGATACAGCCCGAGCTTCAATGAGAA
GGTATGATAATAACTGGGAGATGAGATTGGAAGAGAAAGATGGATGTGTAGGCTTATGGTGGAAAGGCCAACCCGTTTCCTTTTGTTCGTTTTGGAAGTTGGGGATTAAA
TCCAATGCCGTTTGA
Protein sequenceShow/hide protein sequence
MTTEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSSSINCYQWWDENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSDLTRKRKAPDDSVHKKSQTHQNPRKNQ
NNQSKNGAGKGSGIVEGVTVMKKSVGNKKNSSKSTGNNYNNGSNREGRWAEQLLNPCANAIMKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSSSS
SFSSYSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTFLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETP
FSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLRNLRKMEPKAVILSENNMGCSCSNCGNFDTGFTRRV
EYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERMRNAGFERKLFGEDTIDTARASMRRYDNNWEMRLEEKDGCVGLWWKGQPVSFCSFWKLGIK
SNAV