; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G01030 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G01030
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionnodulation-signaling pathway 1 protein
Genome locationClcChr09:809799..811451
RNA-Seq ExpressionClc09G01030
SyntenyClc09G01030
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0042446 - hormone biosynthetic process (biological process)
GO:2000032 - regulation of secondary shoot formation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR005202 - Transcription factor GRAS
IPR030015 - Scarecrow-like protein 29/nodulation signalling pathway 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059401.1 nodulation-signaling pathway 1 protein [Cucumis melo var. makuwa]3.3e-28990.27Show/hide
Query:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK
        MTIEETGP+HPSDHILDWLEDSVPFFS FLDET+NS S+NCYQWW ENQDTGEDLINGCLSNSPTT VSTR PNTPTSHHL PS+LTKKRKAPDDSVHKK
Subjt:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSK  ADKGSGAVEGVTV+KKS+GNKKN+SKSTGNN NNGSN+EGRWAEQLLNPCA+AIVKGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA
        LADHGLRALA+HLSSN   SSFSSYSSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLHILDIGVSHGVQWPTLLEA
Subjt:  LADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA

Query:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER
        LTRRSGGPPPLIRLTVI PTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSL+SLNSQ INK  DEILIVC+QFRLHQLKH APDER
Subjt:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER

Query:  FEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERIRNAGFERKI
         EFLQNLRKMEPKAVILSENNM CSCS C NF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N +GEMNEEKGKWCER+RN GFERK 
Subjt:  FEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERIRNAGFERKI

Query:  FAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV
        F EDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCSLWKLGIKSNA+
Subjt:  FAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV

XP_004141813.1 protein NODULATION SIGNALING PATHWAY 1 [Cucumis sativus]1.0e-29090.63Show/hide
Query:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK
        MTIEETGPNHPSDHILDWLEDSVPFFS FLDET+NS S+NCYQWW ENQDTGEDLINGCLSNSPTT VSTR PNTPTSH LTPS+LTKKRKAPDDSVHKK
Subjt:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSK  ADKGSGAVEGVTVMKKS+GNKKN+SKSTGNN N+GSN+EGRWAEQLLNPCA+AIVKGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA
        LADHGLRALA+HLSSN   SSFSSYSSTV P TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLHILDIGVSHGVQWPTLLEA
Subjt:  LADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA

Query:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER
        LTRRSGGPPPLIRLTVIAPT+EHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSL+SLNSQ INK  DEILIVCAQFRLHQLKH APDER
Subjt:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER

Query:  FEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERIRNAGFERKI
         EFL+NLRKMEPKAVILSENNM CSCS CGNF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N DGEMNEEKGKWCER+RN GFERK 
Subjt:  FEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERIRNAGFERKI

Query:  FAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV
        F EDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCS WKLGIKSNA+
Subjt:  FAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV

XP_008462311.1 PREDICTED: nodulation-signaling pathway 1 protein [Cucumis melo]1.1e-28990.27Show/hide
Query:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK
        MTIEETGP+HPSDHILDWLEDSVPFFS FLDET+NS S+NCYQWW ENQDTGEDLINGCLSNSPTT VSTR PNTPTSHHL PS+LTKKRKAPDDSVHKK
Subjt:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSK  ADKGSGAVEGVTV+KKS+GNKKN+SKSTGNN NNGSN+EGRWAEQLLNPCA+AIVKGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA
        LADHGLRALA+HLSSN   SSFSSYSSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLH+LDIGVSHGVQWPTLLEA
Subjt:  LADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA

Query:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER
        LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSL+SLNSQ INK  DEILIVC+QFRLHQLKH APDER
Subjt:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER

Query:  FEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERIRNAGFERKI
         EFLQNLRKMEPKAVILSENNM CSCS C NF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N +GEMNEEKGKWCER+RN GFERK 
Subjt:  FEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERIRNAGFERKI

Query:  FAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV
        F EDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCSLWKLGIKSNA+
Subjt:  FAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV

XP_023000031.1 nodulation-signaling pathway 1 protein [Cucurbita maxima]9.4e-28488.71Show/hide
Query:  MTIEETGPNHPSDHILDWLEDSVPFF-SPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK
        MTIEE G NHPSDHILDWL DSVPFF SPF D++ NS S+NCYQWW ENQD G+DLINGCLS+SPTTVST+ PNT TSHHLTPS+LTKKRKAPDD+VHK 
Subjt:  MTIEETGPNHPSDHILDWLEDSVPFF-SPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR
        SQT QN RKNQNNQSK GADKGSGAVEGVTV+KKS+GNK+NSSK+TGNNN+NGSN+EGRWAEQLLNPCA+AI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTR
        LA +GLRALAH+LSSNSS  S SSTV PVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNR RNLHILDIGVSHGVQWPTLLEALTR
Subjt:  LADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTR

Query:  RSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEF
        RSGGPPPLIRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SL+S NSQVI K PDEILIVCAQFRLHQLKH APDERFEF
Subjt:  RSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEF

Query:  LQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRNAGFERKIFAED
        LQNLRK+EPKAVILSENNMACSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRESEER+VMEGEAA+ LTN GEMNEE  KWCER+RNAGF RK+F ED
Subjt:  LQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRNAGFERKIFAED

Query:  TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSN
        TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCS WKLG+KSN
Subjt:  TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSN

XP_038897214.1 protein NODULATION SIGNALING PATHWAY 1 [Benincasa hispida]1.4e-30093.13Show/hide
Query:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKKS
        MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNS S+NCYQWW  NQDTGEDLING LSNSPTTVST+L N PTSHHLTPS+LTKKRKAPDDSVHKKS
Subjt:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKKS

Query:  QTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRL
        QTHQN RKNQNNQSK G   G GAVEGVTVMKKS+GNKKNSSK TGNN NNGSNREGRWAEQLLNPCASAI+KGDATRVHHLLCVLQELASPTGDANHRL
Subjt:  QTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRL

Query:  ADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEAL
        ADHGLRALAHHLSSN   SSFSSYSSTV PVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEAL
Subjt:  ADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEAL

Query:  TRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERF
        TRRSGGPP LIRLTVI PT+EHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRL+NHSL+SLNSQVINKFPDEILIVCAQFRLHQLKHC PDERF
Subjt:  TRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERF

Query:  EFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRNAGFERKIFA
        EFLQNLRKMEPKAVILSENNM CSCSNCGNFDT FTRRVEYLWRFLDSTS+AFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCER+RNAGFERK+FA
Subjt:  EFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRNAGFERKIFA

Query:  EDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV
        EDTIDTARASMRRYDNNWEMR+EEKDGC+GLWWKGQPVSFCS WKLGIKSNA+
Subjt:  EDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV

TrEMBL top hitse value%identityAlignment
A0A0A0KCK6 GRAS domain-containing protein5.0e-29190.63Show/hide
Query:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK
        MTIEETGPNHPSDHILDWLEDSVPFFS FLDET+NS S+NCYQWW ENQDTGEDLINGCLSNSPTT VSTR PNTPTSH LTPS+LTKKRKAPDDSVHKK
Subjt:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSK  ADKGSGAVEGVTVMKKS+GNKKN+SKSTGNN N+GSN+EGRWAEQLLNPCA+AIVKGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA
        LADHGLRALA+HLSSN   SSFSSYSSTV P TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLHILDIGVSHGVQWPTLLEA
Subjt:  LADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA

Query:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER
        LTRRSGGPPPLIRLTVIAPT+EHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSL+SLNSQ INK  DEILIVCAQFRLHQLKH APDER
Subjt:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER

Query:  FEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERIRNAGFERKI
         EFL+NLRKMEPKAVILSENNM CSCS CGNF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N DGEMNEEKGKWCER+RN GFERK 
Subjt:  FEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERIRNAGFERKI

Query:  FAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV
        F EDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCS WKLGIKSNA+
Subjt:  FAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV

A0A1S3CGQ3 nodulation-signaling pathway 1 protein5.5e-29090.27Show/hide
Query:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK
        MTIEETGP+HPSDHILDWLEDSVPFFS FLDET+NS S+NCYQWW ENQDTGEDLINGCLSNSPTT VSTR PNTPTSHHL PS+LTKKRKAPDDSVHKK
Subjt:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSK  ADKGSGAVEGVTV+KKS+GNKKN+SKSTGNN NNGSN+EGRWAEQLLNPCA+AIVKGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA
        LADHGLRALA+HLSSN   SSFSSYSSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLH+LDIGVSHGVQWPTLLEA
Subjt:  LADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA

Query:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER
        LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSL+SLNSQ INK  DEILIVC+QFRLHQLKH APDER
Subjt:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER

Query:  FEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERIRNAGFERKI
         EFLQNLRKMEPKAVILSENNM CSCS C NF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N +GEMNEEKGKWCER+RN GFERK 
Subjt:  FEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERIRNAGFERKI

Query:  FAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV
        F EDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCSLWKLGIKSNA+
Subjt:  FAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV

A0A5A7V101 Nodulation-signaling pathway 1 protein1.6e-28990.27Show/hide
Query:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK
        MTIEETGP+HPSDHILDWLEDSVPFFS FLDET+NS S+NCYQWW ENQDTGEDLINGCLSNSPTT VSTR PNTPTSHHL PS+LTKKRKAPDDSVHKK
Subjt:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTT-VSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR
        SQTHQNPRKNQNNQSK  ADKGSGAVEGVTV+KKS+GNKKN+SKSTGNN NNGSN+EGRWAEQLLNPCA+AIVKGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA
        LADHGLRALA+HLSSN   SSFSSYSSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE NRPRNLHILDIGVSHGVQWPTLLEA
Subjt:  LADHGLRALAHHLSSN---SSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEA

Query:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER
        LTRRSGGPPPLIRLTVI PTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSL+SLNSQ INK  DEILIVC+QFRLHQLKH APDER
Subjt:  LTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDER

Query:  FEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERIRNAGFERKI
         EFLQNLRKMEPKAVILSENNM CSCS C NF+ GF R VEY+W+FLDSTS+AFKGRESEERRVMEGEAAKAL N +GEMNEEKGKWCER+RN GFERK 
Subjt:  FEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN-DGEMNEEKGKWCERIRNAGFERKI

Query:  FAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV
        F EDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCSLWKLGIKSNA+
Subjt:  FAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV

A0A6J1HKE2 nodulation-signaling pathway 1 protein1.2e-27688.64Show/hide
Query:  DHILDWLEDSVPFF-SPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKKSQTHQNPRKNQN
        DHILDWL DSVPFF SPF D++ NS S+NCYQWW ENQD G+DLINGCLS+SPTTVST+ PNT TSHHLTPS+LTKKRKAPDD+VHK SQT QN RKNQN
Subjt:  DHILDWLEDSVPFF-SPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKKSQTHQNPRKNQN

Query:  NQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHH
        NQS+ GADK SGAV GVTVMKKS+GNK+NSSK+TGNNNNNG+N+EGRWAEQLLNPCA+AI+KGDATRVHHLLCVLQELASPTGDANHRLA +GLRALAH+
Subjt:  NQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHH

Query:  LSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT
        LSSNSS SS SST+ PVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNR RNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT
Subjt:  LSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT

Query:  VIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLQNLRKMEPKAV
        VIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL SLN+QVI KFPDEILIVCAQFRLHQLKH APDERFEFLQNLRK+EPKAV
Subjt:  VIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLQNLRKMEPKAV

Query:  ILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRNAGFERKIFAEDTIDTARASMRRY
        ILSENNMACSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRESEER++MEGEAAKAL N+GEMNEE  KWCER+RNAGF RK+F EDTIDTARASMRRY
Subjt:  ILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRNAGFERKIFAEDTIDTARASMRRY

Query:  DNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSN
        DNNWEMRVEEKDGCVGLWWKGQPVSFCS WKLG+KSN
Subjt:  DNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSN

A0A6J1KER4 nodulation-signaling pathway 1 protein4.5e-28488.71Show/hide
Query:  MTIEETGPNHPSDHILDWLEDSVPFF-SPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK
        MTIEE G NHPSDHILDWL DSVPFF SPF D++ NS S+NCYQWW ENQD G+DLINGCLS+SPTTVST+ PNT TSHHLTPS+LTKKRKAPDD+VHK 
Subjt:  MTIEETGPNHPSDHILDWLEDSVPFF-SPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKK

Query:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR
        SQT QN RKNQNNQSK GADKGSGAVEGVTV+KKS+GNK+NSSK+TGNNN+NGSN+EGRWAEQLLNPCA+AI+KGDATRVHHLLCVLQELASPTGDANHR
Subjt:  SQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHR

Query:  LADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTR
        LA +GLRALAH+LSSNSS  S SSTV PVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNR RNLHILDIGVSHGVQWPTLLEALTR
Subjt:  LADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTR

Query:  RSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEF
        RSGGPPPLIRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SL+S NSQVI K PDEILIVCAQFRLHQLKH APDERFEF
Subjt:  RSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEF

Query:  LQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRNAGFERKIFAED
        LQNLRK+EPKAVILSENNMACSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRESEER+VMEGEAA+ LTN GEMNEE  KWCER+RNAGF RK+F ED
Subjt:  LQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRNAGFERKIFAED

Query:  TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSN
        TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCS WKLG+KSN
Subjt:  TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSN

SwissProt top hitse value%identityAlignment
A1DQP9 Protein NODULATION SIGNALING PATHWAY 13.6e-17761.98Show/hide
Query:  MTIEETGPNH-PSDHILDWLEDSVPFFSPFLDE-TNNSCSLNCYQWWGENQ---DTGEDLINGCLSNSPTTVSTRLPNTPT-----SHHLTP-SELTKKR
        MT+E   PN   SDHILDWLE SV FF  FLDE  NNS  +  Y  W + Q   +TG    N   S + T V+T   +T +     S++  P S+L KKR
Subjt:  MTIEETGPNH-PSDHILDWLEDSVPFFSPFLDE-TNNSCSLNCYQWWGENQ---DTGEDLINGCLSNSPTTVSTRLPNTPT-----SHHLTP-SELTKKR

Query:  KAPDDSVHKKSQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQEL
         A D+S  K       P +N+N + KT   +     E    ++K   NKK  +K+ G+N N+G+++EGRWAEQLLNPCA+AI  G+  RV HLL VL EL
Subjt:  KAPDDSVHKKSQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQEL

Query:  ASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPN-RPRNLHILDIGVSHGV
        ASPTGD NHRLA HGLRAL HHLSS+SS  + S T   +TFAST+PRFFQ+SL+KF+EVSPWF+FPNNIAN+SIL +L+EE N   R LHILDIGVSHGV
Subjt:  ASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPN-RPRNLHILDIGVSHGV

Query:  QWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQL
        QWPTLL+AL+RRSGGPP ++RLTV+  T E+DQN ETPFS  PPG N   RLL +A+S+NINLQINR++NHSL++LN+Q I+  PDEILIVCAQFRLH L
Subjt:  QWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQL

Query:  KHCAPDERFEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRN
         H +PDER EFL+ LR MEP+ VILSENN  C CS CGNF  GFTRRVEYLWRFLDSTSSAFKGRES+ERRVMEGEAAKALTN  EMNEEK KWC R++ 
Subjt:  KHCAPDERFEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRN

Query:  AGFERKIFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKL
        AGF  ++F ED +D  RA +R+YD+NWEM+VEEK+  VGLWWKGQPVSFCSLWKL
Subjt:  AGFERKIFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKL

Q4VYC8 Protein NODULATION SIGNALING PATHWAY 16.9e-18160.64Show/hide
Query:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNS-------------PTTVSTRLPNTPTSHHLTPSELTK
        MT+E   PN  SDHILDWLE SV FF  FLD+  N+  ++ Y+ W +NQD          +NS              TT ST      + +++  S+L K
Subjt:  MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNS-------------PTTVSTRLPNTPTSHHLTPSELTK

Query:  KRKAPDD-SVHKKSQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVL
        KR A D+ S+ K+ Q  +N R      ++  +D G  A+EG TV++KS GNKK ++K+ G+N+NNG+N++GRWAEQLLNPCA AI  G+  RV HLL VL
Subjt:  KRKAPDD-SVHKKSQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVL

Query:  QELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSH
         ELAS TGDANHRLA HGLRAL HHLSS+SS S+ S T   +TFAST+PRFFQ+SL+KF+E SPWF+FPNNIAN+SIL +L+EEPN  R LHILDIGVSH
Subjt:  QELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSH

Query:  GVQWPTLLEALTRRSGGPPPLIRLTVI--APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFR
        GVQWPT LEAL+RR GGPPPL+RLTV+  + + E+DQN ETPFSIGP GD  SS LL +A+SLN+NLQI +LDNH L++LN++ ++   DE LIVCAQFR
Subjt:  GVQWPTLLEALTRRSGGPPPLIRLTVI--APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFR

Query:  LHQLKHCAPDERFEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCE
        LH L H  PDER EFL+ LR MEPK VILSENNM C CS+CG+F TGF+RRVEYLWRFLDSTSSAFK R+S+ER++MEGEAAKALTN  EMNE + KWCE
Subjt:  LHQLKHCAPDERFEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCE

Query:  RIRNAGFERKIFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKL
        R++ AGF  ++F ED ID  RA +R+YDNNWEM+VEE    V LWWK QPVSFCSLWKL
Subjt:  RIRNAGFERKIFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKL

Q75I13 Protein SHORT-ROOT 22.3e-3528.76Show/hide
Query:  SSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQR
        SS   G       +  GRWA QLL  CA A+   D+ RV  L+ +L ELASP GD + +LA + L+ L   L+++   +  +        AS D    +R
Subjt:  SSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQR

Query:  SLIKFHEVSPWFAFPNNIANSSILHIL---------------SEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTET
        + +KF E+SPW  F +  AN +IL                  S     P  LHILD+  +   QWPTLLEAL  RS    P + +T + PT       + 
Subjt:  SLIKFHEVSPWFAFPNNIANSSILHIL---------------SEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTET

Query:  PFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHS--LRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLQNLRKMEPKAVILSENNM-----
                  I  RL  FA+ + +     R  +HS  L  L+   ++          A   ++ L+  A   R  F+ +LR++EP+ V + E        
Subjt:  PFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHS--LRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLQNLRKMEPKAVILSENNM-----

Query:  ACSCSNCGNFDTGFTR----RVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKAL--------TNDGEMNEEKGKWCERIRNAGFERKIFAEDTIDTARA
            S+  + D  F +     + +   ++DS   +F  + S ER  +E    +A+        +   E  E    W  R+R+AGF    F+ED  D  R+
Subjt:  ACSCSNCGNFDTGFTR----RVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKAL--------TNDGEMNEEKGKWCERIRNAGFERKIFAEDTIDTARA

Query:  SMRRYDNNWEMR-----VEEKDGCVG----LWWKGQPVSFCSLWK
         +RRY   W MR      ++  G       L WK QPV + S WK
Subjt:  SMRRYDNNWEMR-----VEEKDGCVG----LWWKGQPVSFCSLWK

Q84MQ9 Protein NODULATION SIGNALING PATHWAY 18.4e-8639.69Show/hide
Query:  WWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKKSQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSK
        WW  +    +D I   ++ + +  ST  P   +    +P+  +        S  +KS  H+ P         TG  KG G                    
Subjt:  WWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKKSQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSK

Query:  STGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTP-------VTFASTDPR
          G     GS+R+ RWAEQLLNPCA A+  G+ +RV HL  VL EL S +GDANHRLA HGLRALA  L +    ++ ++   P         FA+ +PR
Subjt:  STGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTP-------VTFASTDPR

Query:  FFQRSLIKFHEVSPWFAFPNNIANSSILH--ILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVIAPTVEHDQNTETPFSIGPPG
         F+ SLI+FHEVSPWFA PN +AN++I            PR LH++D+GVSHGVQWPTLLE+LTR+ GG  PP +RLTV+ P      +   PFS  PPG
Subjt:  FFQRSLIKFHEVSPWFAFPNNIANSSILH--ILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVIAPTVEHDQNTETPFSIGPPG

Query:  DNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLQNLRKMEPKAVILSE--NNMACSCSNCGNFDTG
         + S  LL +AKS+N++L+I+R       +     +     E L+VC QFR   L H A +ER E L+  R + P+ V+LSE  + +     + G+    
Subjt:  DNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLQNLRKMEPKAVILSE--NNMACSCSNCGNFDTG

Query:  FTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAK--ALTNDGEMNEEKGKWCERIRNAGFERKIFAEDTIDTARASMRRYDNNWEMRV-EEKDGCVGL
        F  R+E LWRFL+STS+AFKG++ EERR++E EA    A  +     E +  W ER+  AGFE   F  + +++AR+ +R+YD+ WEM         V L
Subjt:  FTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAK--ALTNDGEMNEEKGKWCERIRNAGFERKIFAEDTIDTARASMRRYDNNWEMRV-EEKDGCVGL

Query:  WWKGQPVSFCSLWK
         WKGQPVSFCSLW+
Subjt:  WWKGQPVSFCSLWK

Q9LRW3 Scarecrow-like protein 296.4e-12648.99Show/hide
Query:  MTIEET-GPNHPSDHILDWLEDSVPFFS-PFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHK
        M +EET  PN   DH+L WLEDSV     P  D++      +  Q W    D  +D  +G + +    +S        ++      L    +AP   +  
Subjt:  MTIEET-GPNHPSDHILDWLEDSVPFFS-PFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHK

Query:  KSQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANH
          +  Q      N+QS+  +  G    + V   KKS  +K+ + KS+  ++ +G N+EGRWAE+LLNPCA AI   +++RV H LCVL ELAS +GDAN 
Subjt:  KSQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANH

Query:  RLADHGLRALAHHLSSNSSFSSYSSTVTPV-TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEAL
        RLA  GLRAL HHLSS    SS SS+  PV TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG+QWPTLLEAL
Subjt:  RLADHGLRALAHHLSSNSSFSSYSSTVTPV-TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEAL

Query:  TRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERF
        + R  GPPP +R+TVI+     D   + PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I+  P E LIVCAQFRLH LKH   DER 
Subjt:  TRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERF

Query:  EFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRNAGFERKIFA
        E L+ +R + PK V+L ENN  CS S   +F  GF++++EY+W+FLDSTSS FK   SEER++MEGEA K L N G+MNE K KW ER+R AGF  + F 
Subjt:  EFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRNAGFERKIFA

Query:  EDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWK
        ED +D A++ +R+YDNNWE+R+E+ D   GL WKG+ VSFCSLWK
Subjt:  EDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWK

Arabidopsis top hitse value%identityAlignment
AT3G03450.1 RGA-like 23.6e-2326.75Show/hide
Query:  KNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFF
        ++S +ST +     S   G      L  CA AI + +      L+  +  LA     A  ++A +  +ALA  +     +  Y  T      A+ +P F 
Subjt:  KNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFF

Query:  QRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISS
        +   + F+E  P+  F +  AN +IL    E     R +H++D+G++ G+QWP L++AL  R GGPP   RLT I P       TE   S+      +  
Subjt:  QRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISS

Query:  RLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPD-EILIVCAQFRLHQLKHCAPDERFEFLQN-LRKMEPKAVILSENNMACSCSNCGNFDTGFTRRV
        +L  FA+++ +  +   L   SL  L  ++    P+ E L+V + F LH+L   A     E L N ++ ++P  V + E     +  N   F   F   +
Subjt:  RLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPD-EILIVCAQFRLHQLKHCAPDERFEFLQN-LRKMEPKAVILSENNMACSCSNCGNFDTGFTRRV

Query:  EYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERIRNAGFERKIFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGL
         Y     DS   ++    S++R + E    + + N    +G    E +E   +W  R+++AGF+           A   +  Y      RVEE DGC+ +
Subjt:  EYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERIRNAGFERKIFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGL

Query:  WWKGQPVSFCSLWKL
         W+ +P+   S WKL
Subjt:  WWKGQPVSFCSLWKL

AT3G13840.1 GRAS family transcription factor4.5e-12748.99Show/hide
Query:  MTIEET-GPNHPSDHILDWLEDSVPFFS-PFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHK
        M +EET  PN   DH+L WLEDSV     P  D++      +  Q W    D  +D  +G + +    +S        ++      L    +AP   +  
Subjt:  MTIEET-GPNHPSDHILDWLEDSVPFFS-PFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHK

Query:  KSQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANH
          +  Q      N+QS+  +  G    + V   KKS  +K+ + KS+  ++ +G N+EGRWAE+LLNPCA AI   +++RV H LCVL ELAS +GDAN 
Subjt:  KSQTHQNPRKNQNNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANH

Query:  RLADHGLRALAHHLSSNSSFSSYSSTVTPV-TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEAL
        RLA  GLRAL HHLSS    SS SS+  PV TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG+QWPTLLEAL
Subjt:  RLADHGLRALAHHLSSNSSFSSYSSTVTPV-TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEAL

Query:  TRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERF
        + R  GPPP +R+TVI+     D   + PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I+  P E LIVCAQFRLH LKH   DER 
Subjt:  TRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERF

Query:  EFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRNAGFERKIFA
        E L+ +R + PK V+L ENN  CS S   +F  GF++++EY+W+FLDSTSS FK   SEER++MEGEA K L N G+MNE K KW ER+R AGF  + F 
Subjt:  EFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRNAGFERKIFA

Query:  EDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWK
        ED +D A++ +R+YDNNWE+R+E+ D   GL WKG+ VSFCSLWK
Subjt:  EDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWK

AT3G49950.1 GRAS family transcription factor6.3e-3629.41Show/hide
Query:  EGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFP
        +  + EQLL  CA+AI   DA   H +L VL  +A P GD+  RL    LRAL     S +   + SST++ +  A    RF    L  F +++PW  F 
Subjt:  EGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFP

Query:  NNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINR
           AN++IL  +         +HI+D+ ++H +Q PTL++A+  R   PPPL++LTV++       +   P  I    + + S+L++FA + NI ++   
Subjt:  NNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINR

Query:  LDNHSLRSLNS--QVINKFP---DEILIVCAQFRLHQLKHCAPDE---------RFEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWR
        + +      +S  Q +  +P   +E L+V      H +    P+E         R  FL+ LR + P+ V L E ++  +  N  N          Y W 
Subjt:  LDNHSLRSLNS--QVINKFP---DEILIVCAQFRLHQLKHCAPDE---------RFEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWR

Query:  FLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERIRNAGFERKIFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQ
          D+T +      SE+RR  E E +  + N    +G    E  E K +W ER+R A F      ED +   +A +  +   W M+ E+ D  + L WKG 
Subjt:  FLDSTSSAFKGRESEERRVMEGEAAKALTN----DG----EMNEEKGKWCERIRNAGFERKIFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQ

Query:  PVSFCSLW
         V F ++W
Subjt:  PVSFCSLW

AT4G37650.1 GRAS family transcription factor9.7e-2925.19Show/hide
Query:  ENQDTGEDLINGCLSNSPT-TVSTRLPNTPTSHHLTPSELTK--------KRKAPDDSVHKKSQTHQNPRKNQNN-QSKTGADKGSGAVEGVTVMKKSIG
        + Q   + +I    S S T T +T  P T   ++   +++ +        +      S H     H NP    +   + T     + +    T    ++ 
Subjt:  ENQDTGEDLINGCLSNSPT-TVSTRLPNTPTSHHLTPSELTK--------KRKAPDDSVHKKSQTHQNPRKNQNN-QSKTGADKGSGAVEGVTVMKKSIG

Query:  NKKNSSKSTGNNNNNGS------------NREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSST
        +  +SS   G++N+  +            +   +WA+ +L   A A    D  R   +L  L EL+SP GD   +LA + L+AL + ++ +      +  
Subjt:  NKKNSSKSTGNNNNNGS------------NREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSST

Query:  VTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VIAPTVEHDQN
            T  +      +++++KF EVSPW  F +  AN +IL  +  E      +HI+DI  +   QWPTLLEAL  RS   P L RLT  V+A    +DQ 
Subjt:  VTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VIAPTVEHDQN

Query:  TETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKF---PDEILIVCAQFRLHQLKHCAPDERFEFLQNLRKMEPKAVILSENNMA
                     I +R+  FA+ + +  + N +  H +  L+   +N+    PDE+L +     +H +       R   + + R++ P+ V + E    
Subjt:  TETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKF---PDEILIVCAQFRLHQLKHCAPDERFEFLQNLRKMEPKAVILSENNMA

Query:  CSCSNCGNFDTGFTRRVEYLWRF----LDSTSSAFKGRESEERRVMEGEAAKAL--------TNDGEMNEEKGKWCERIRNAGFERKIFAEDTIDTARAS
              G FD  F R      R+     +S   +F  R S ER ++E  A +A+        ++  E  E   KW  R+RN+GF    ++++  D  RA 
Subjt:  CSCSNCGNFDTGFTRRVEYLWRF----LDSTSSAFKGRESEERRVMEGEAAKAL--------TNDGEMNEEKGKWCERIRNAGFERKIFAEDTIDTARAS

Query:  MRRY-DNNWEMRVEEKDGCVGLWWKGQPVSFCSLWK
        +RRY +  W M        + L W+ QPV + S W+
Subjt:  MRRY-DNNWEMRVEEKDGCVGLWWKGQPVSFCSLWK

AT5G66770.1 GRAS family transcription factor2.5e-2427.91Show/hide
Query:  IVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE
        I   D       L  ++E  S  GD   R+A +   AL++ LS NS  +S SS+ T     S             ++  P+  F +  AN +IL    E 
Subjt:  IVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSFSSYSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE

Query:  PNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVI-APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVI
          +   +HI+D G+  G+QWP LL+AL  R+ G P  IR++ I AP++      E+P    P      +RL  FAK L++N     +    +  LN    
Subjt:  PNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVI-APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVI

Query:  NKFPDEILIVCAQFRLHQLKHCAPDERFEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFK---GRESEER----RVME
           PDE+L V    +L++L    P      L+  + + P+ V L E  ++ +         GF  RV+   +F  +   + +   GR+SEER    R + 
Subjt:  NKFPDEILIVCAQFRLHQLKHCAPDERFEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFK---GRESEER----RVME

Query:  GEAAKALTN------DGEMNEEKGKWCERIRNAGFERKIFAEDTIDTARASMRRYD-NNWEMRVEEKDGCVGLWWKGQPVSFCSLWK
        G     L          E  EEK +W   + NAGFE    +   +  A+  +  Y+ +N    VE K G + L W   P+   S W+
Subjt:  GEAAKALTN------DGEMNEEKGKWCERIRNAGFERKIFAEDTIDTARASMRRYD-NNWEMRVEEKDGCVGLWWKGQPVSFCSLWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCATTGAAGAAACAGGGCCAAACCACCCTTCAGATCACATATTGGACTGGTTAGAGGACTCGGTTCCTTTCTTTTCCCCATTCCTGGACGAGACTAACAACTCTTG
CTCTTTAAACTGCTATCAATGGTGGGGTGAGAACCAGGATACAGGCGAAGATCTGATTAATGGCTGTCTTAGCAACTCACCCACTACTGTTAGTACTAGACTACCAAACA
CACCCACTTCCCACCACTTGACACCTTCTGAGTTGACCAAGAAAAGAAAAGCTCCAGATGATTCAGTTCATAAGAAATCACAAACCCATCAGAACCCAAGGAAGAACCAG
AACAATCAGAGCAAAACTGGTGCTGATAAAGGCAGTGGAGCTGTTGAGGGAGTGACTGTGATGAAGAAATCAATAGGGAACAAGAAAAATTCATCAAAATCCACAGGAAA
TAACAATAATAACGGAAGCAACAGGGAAGGAAGGTGGGCAGAGCAATTGCTAAATCCCTGTGCTAGTGCTATCGTAAAAGGGGATGCGACAAGAGTACATCACCTTCTTT
GTGTTCTTCAAGAGCTCGCCTCACCCACCGGCGACGCTAACCACCGGCTCGCCGACCACGGTCTCCGAGCTTTGGCCCATCACCTGTCCTCCAATTCTTCTTTTTCTTCT
TATTCCTCCACAGTTACCCCGGTTACTTTCGCTTCGACGGACCCTCGATTCTTCCAGAGATCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGCATTTCCTAACAA
CATCGCAAACTCTTCAATCCTCCACATTCTCTCTGAAGAACCTAATCGCCCGCGAAATCTTCACATTCTTGACATTGGGGTTTCTCATGGTGTGCAATGGCCGACACTGC
TCGAGGCCTTGACTCGCCGTTCCGGCGGACCTCCGCCGCTAATTCGTCTTACAGTTATCGCTCCAACCGTCGAACATGATCAAAATACCGAGACGCCGTTTTCAATCGGT
CCACCCGGAGACAACATCTCCTCTCGGCTTCTCAGTTTCGCCAAATCCTTGAACATCAATTTACAAATCAACCGCCTCGACAACCACTCGCTCCGGAGTCTAAATTCGCA
AGTAATCAACAAATTCCCTGACGAAATCCTGATCGTTTGCGCACAGTTCAGACTCCACCAGTTGAAACACTGCGCTCCTGACGAAAGATTCGAGTTCTTACAAAATCTGA
GAAAAATGGAACCAAAGGCAGTGATTCTGAGTGAAAACAACATGGCATGTAGCTGTAGCAACTGCGGAAATTTCGACACCGGATTCACACGACGAGTAGAATACCTATGG
AGATTTCTGGATTCAACAAGCTCGGCATTCAAAGGGAGAGAAAGCGAGGAAAGAAGAGTGATGGAAGGGGAAGCCGCAAAAGCACTGACGAATGATGGGGAAATGAACGA
GGAAAAGGGAAAATGGTGCGAAAGAATTAGAAATGCGGGTTTTGAGAGAAAAATCTTCGCGGAAGACACCATTGATACGGCTCGAGCTTCAATGAGAAGGTATGATAACA
ACTGGGAGATGAGAGTGGAAGAGAAAGATGGATGCGTAGGGTTATGGTGGAAAGGCCAACCCGTTTCCTTTTGTTCGTTATGGAAGTTGGGGATCAAATCCAACGCCGTT
TGA
mRNA sequenceShow/hide mRNA sequence
ATGACCATTGAAGAAACAGGGCCAAACCACCCTTCAGATCACATATTGGACTGGTTAGAGGACTCGGTTCCTTTCTTTTCCCCATTCCTGGACGAGACTAACAACTCTTG
CTCTTTAAACTGCTATCAATGGTGGGGTGAGAACCAGGATACAGGCGAAGATCTGATTAATGGCTGTCTTAGCAACTCACCCACTACTGTTAGTACTAGACTACCAAACA
CACCCACTTCCCACCACTTGACACCTTCTGAGTTGACCAAGAAAAGAAAAGCTCCAGATGATTCAGTTCATAAGAAATCACAAACCCATCAGAACCCAAGGAAGAACCAG
AACAATCAGAGCAAAACTGGTGCTGATAAAGGCAGTGGAGCTGTTGAGGGAGTGACTGTGATGAAGAAATCAATAGGGAACAAGAAAAATTCATCAAAATCCACAGGAAA
TAACAATAATAACGGAAGCAACAGGGAAGGAAGGTGGGCAGAGCAATTGCTAAATCCCTGTGCTAGTGCTATCGTAAAAGGGGATGCGACAAGAGTACATCACCTTCTTT
GTGTTCTTCAAGAGCTCGCCTCACCCACCGGCGACGCTAACCACCGGCTCGCCGACCACGGTCTCCGAGCTTTGGCCCATCACCTGTCCTCCAATTCTTCTTTTTCTTCT
TATTCCTCCACAGTTACCCCGGTTACTTTCGCTTCGACGGACCCTCGATTCTTCCAGAGATCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGCATTTCCTAACAA
CATCGCAAACTCTTCAATCCTCCACATTCTCTCTGAAGAACCTAATCGCCCGCGAAATCTTCACATTCTTGACATTGGGGTTTCTCATGGTGTGCAATGGCCGACACTGC
TCGAGGCCTTGACTCGCCGTTCCGGCGGACCTCCGCCGCTAATTCGTCTTACAGTTATCGCTCCAACCGTCGAACATGATCAAAATACCGAGACGCCGTTTTCAATCGGT
CCACCCGGAGACAACATCTCCTCTCGGCTTCTCAGTTTCGCCAAATCCTTGAACATCAATTTACAAATCAACCGCCTCGACAACCACTCGCTCCGGAGTCTAAATTCGCA
AGTAATCAACAAATTCCCTGACGAAATCCTGATCGTTTGCGCACAGTTCAGACTCCACCAGTTGAAACACTGCGCTCCTGACGAAAGATTCGAGTTCTTACAAAATCTGA
GAAAAATGGAACCAAAGGCAGTGATTCTGAGTGAAAACAACATGGCATGTAGCTGTAGCAACTGCGGAAATTTCGACACCGGATTCACACGACGAGTAGAATACCTATGG
AGATTTCTGGATTCAACAAGCTCGGCATTCAAAGGGAGAGAAAGCGAGGAAAGAAGAGTGATGGAAGGGGAAGCCGCAAAAGCACTGACGAATGATGGGGAAATGAACGA
GGAAAAGGGAAAATGGTGCGAAAGAATTAGAAATGCGGGTTTTGAGAGAAAAATCTTCGCGGAAGACACCATTGATACGGCTCGAGCTTCAATGAGAAGGTATGATAACA
ACTGGGAGATGAGAGTGGAAGAGAAAGATGGATGCGTAGGGTTATGGTGGAAAGGCCAACCCGTTTCCTTTTGTTCGTTATGGAAGTTGGGGATCAAATCCAACGCCGTT
TGA
Protein sequenceShow/hide protein sequence
MTIEETGPNHPSDHILDWLEDSVPFFSPFLDETNNSCSLNCYQWWGENQDTGEDLINGCLSNSPTTVSTRLPNTPTSHHLTPSELTKKRKAPDDSVHKKSQTHQNPRKNQ
NNQSKTGADKGSGAVEGVTVMKKSIGNKKNSSKSTGNNNNNGSNREGRWAEQLLNPCASAIVKGDATRVHHLLCVLQELASPTGDANHRLADHGLRALAHHLSSNSSFSS
YSSTVTPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPNRPRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIG
PPGDNISSRLLSFAKSLNINLQINRLDNHSLRSLNSQVINKFPDEILIVCAQFRLHQLKHCAPDERFEFLQNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLW
RFLDSTSSAFKGRESEERRVMEGEAAKALTNDGEMNEEKGKWCERIRNAGFERKIFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSLWKLGIKSNAV