; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0021714 (gene) of Chayote v1 genome

Gene IDSed0021714
OrganismSechium edule (Chayote v1)
Descriptionnodulation-signaling pathway 1 protein
Genome locationLG04:6354137..6356234
RNA-Seq ExpressionSed0021714
SyntenySed0021714
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0042446 - hormone biosynthetic process (biological process)
GO:2000032 - regulation of secondary shoot formation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR005202 - Transcription factor GRAS
IPR030015 - Scarecrow-like protein 29/nodulation signalling pathway 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593549.1 Protein NODULATION SIGNALING PATHWAY 1, partial [Cucurbita argyrosperma subsp. sororia]1.2e-23778.61Show/hide
Query:  LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLST-----------VTTSQHLMQSDFTKKRKAPYDKVH----------------TKNG
        L +SVP FS  + D+++NS+S+NC++WW    D  QDLINGCLS+            +TS HL  SD TKKRKAP D VH                ++NG
Subjt:  LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLST-----------VTTSQHLMQSDFTKKRKAPYDKVH----------------TKNG

Query:  AHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNT----EGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNP-
        A K + AV GVTV+K++VGNK++ SK+ GNN+N     EGRWAEQLLNPCANAI KGDATRVHHLLCVL+ELAS TGDANHRLAA+GLRALAHYLSSN  
Subjt:  AHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNT----EGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNP-

Query:  -SSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTV
         SSSSS + PVTFASTDPRFFQRSLIK HEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPP IRLTV+APTV
Subjt:  -SSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTV

Query:  EHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENN
        EHDQN  T FS+GPPGDNISSRLLSFAKSLNINLQINRLD+ SL SLN+QVIGK PDEILIVCAQFRLHQL+HYAPDERFEFL+NLRK+EPKAVILSENN
Subjt:  EHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENN

Query:  MACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEM
        MACSC+NCGNFDTGFTR+VEYLW+FLDSTSSAFKGRES+ERK+MEGEAAK+L N+GEMNEE EKWCERMRNAGFARK F EDTIDTARASMRRYDNNWEM
Subjt:  MACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEM

Query:  RVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        RV+EKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  RVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

KAG7025891.1 Nodulation-signaling pathway 1 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.7e-23778.61Show/hide
Query:  LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLST-----------VTTSQHLMQSDFTKKRKAPYDKVH----------------TKNG
        L +SVP FS  + D+++NSSS+NC++WW    D  QDLINGCLS+            +TS HL  SD TKKRKAP D VH                ++NG
Subjt:  LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLST-----------VTTSQHLMQSDFTKKRKAPYDKVH----------------TKNG

Query:  AHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNT----EGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNP-
        A K + AV GVTV+K++VGNK++ SK+ GNN+N     EGRWAEQLLNPCANAI KGDATRVHHLLCVL+ELAS TGDANHRLAA+GLRALAHYLSSN  
Subjt:  AHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNT----EGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNP-

Query:  -SSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTV
         SSSSS + PVTFASTDPRFFQRSLIK HEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPP IRLTV+APTV
Subjt:  -SSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTV

Query:  EHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENN
        EHDQN  T FS+GPPGDNISSRLLSFAKSLNINLQINRLD+ SL SLN+QVIGK PDEILIVCAQFRLHQL+HYAPDERFEFL+NLRK+EPKAVILSENN
Subjt:  EHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENN

Query:  MACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEM
        MACSC+NCGNFD GFTR+VEYLW+FLDSTSSAFKGRES+ERK+MEGEAAK+L N+GEMNEE EKWCERMRNAGFARK F EDTIDTARASMRRYDNNWEM
Subjt:  MACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEM

Query:  RVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        RV+EKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  RVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_022964298.1 nodulation-signaling pathway 1 protein [Cucurbita moschata]1.9e-23879.14Show/hide
Query:  LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLST-----------VTTSQHLMQSDFTKKRKAPYDKVH----------------TKNG
        L +SVP FS  + D+++NSSS+NC++WW    D  QDLINGCLS+            +TS HL  SD TKKRKAP D VH                ++NG
Subjt:  LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLST-----------VTTSQHLMQSDFTKKRKAPYDKVH----------------TKNG

Query:  AHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNT----EGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNPS
        A K + AV GVTV+K++VGNK++ SK+ GNN+N     EGRWAEQLLNPCANAI KGDATRVHHLLCVL+ELAS TGDANHRLAA+GLRALAHYLSSN S
Subjt:  AHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNT----EGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNPS

Query:  -SSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTVE
         SSSS + PVTFASTDPRFFQRSLIK HEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPP IRLTV+APTVE
Subjt:  -SSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTVE

Query:  HDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENNM
        HDQN  T FS+GPPGDNISSRLLSFAKSLNINLQINRLD+ SL SLN+QVIGK PDEILIVCAQFRLHQL+HYAPDERFEFL+NLRK+EPKAVILSENNM
Subjt:  HDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENNM

Query:  ACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMR
        ACSC+NCGNFDTGFTR+VEYLW+FLDSTSSAFKGRES+ERK+MEGEAAK+L N+GEMNEE EKWCERMRNAGFARK F EDTIDTARASMRRYDNNWEMR
Subjt:  ACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMR

Query:  VEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        VEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  VEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_023000031.1 nodulation-signaling pathway 1 protein [Cucurbita maxima]1.6e-24579.49Show/hide
Query:  MSIEEAQASNPLDQILE---ESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLST-----------VTTSQHLMQSDFTKKRKAPYDKVH--
        M+IEE  A++P D IL+   +SVP FS  + D+++NSSS+NC++WW    D  QDLINGCLS+            +TS HL  SD TKKRKAP D VH  
Subjt:  MSIEEAQASNPLDQILE---ESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLST-----------VTTSQHLMQSDFTKKRKAPYDKVH--

Query:  --------------TKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNN----SNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHR
                      +KNGA KG+ AVEGVTV+K++VGNK++ SK+ GNN    SN EGRWAEQLLNPCANAI KGDATRVHHLLCVL+ELAS TGDANHR
Subjt:  --------------TKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNN----SNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHR

Query:  LAAHGLRALAHYLSSNPS--SSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTR
        LAA+GLRALAHYLSSN S  SSSS V PVTFASTD RFFQRSLIK HEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTR
Subjt:  LAAHGLRALAHYLSSNPS--SSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTR

Query:  RSGGPPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEF
        RSGGPPP IRLTV+APTVEHDQN  T FS+GPPGDNISSRLLSFAKSLNINLQINRLDN SLQS NSQVIGKLPDEILIVCAQFRLHQL+HYAPDERFEF
Subjt:  RSGGPPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEF

Query:  LRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAED
        L+NLRK+EPKAVILSENNMACSC+NCGNFDTGFTR+VEYLW+FLDSTSSAFKGRES+ERKVMEGEAA+ LTNQGEMNEE EKWCERMRNAGFARK F ED
Subjt:  LRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAED

Query:  TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_023514155.1 nodulation-signaling pathway 1 protein [Cucurbita pepo subsp. pepo]5.9e-24079.55Show/hide
Query:  LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLSTV-----------TTSQHLMQSDFTKKRKAPYDKVH----------------TKNG
        L +SVP FS  + D+++NSSS+NC++WW    D  QDLINGCLS+            +TS HL  SD TKKRKAP D VH                +KNG
Subjt:  LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLSTV-----------TTSQHLMQSDFTKKRKAPYDKVH----------------TKNG

Query:  AHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNT----EGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNP-
        A K + AV GVTV+K++VGNK++ SK+ G+N+N     EGRWAEQLLNPCANAI KGDATRVHHLLCVL+ELAS TGDANHRLAA+GLRALAHYLSSN  
Subjt:  AHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNT----EGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNP-

Query:  -SSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTV
         SSSSS + PVTFASTDPRFFQRSLIK HEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPP IRLTV+APT+
Subjt:  -SSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTV

Query:  EHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENN
        EHDQN  T FS+GPPGDNISSRLLSFAKSLNINLQINRLDN SLQSLNSQVIGK PDEILIVCAQFRLHQL+HYAPDERFEFL+NLRKMEPKAVILSENN
Subjt:  EHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENN

Query:  MACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEM
        MACSC+NCGNFDTGFTR+VEYLW+FLDSTSSAFKGRES+ERKVMEGEAAK+L N+GEMNEE EKWCERMRNAGFARK F EDTIDTARASMRRYDNNWEM
Subjt:  MACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEM

Query:  RVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        RVEEKDGCVGLWWKGQPVSFCSFWKLG+KSNGG
Subjt:  RVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

TrEMBL top hitse value%identityAlignment
A0A0A0KCK6 GRAS domain-containing protein2.8e-22775.45Show/hide
Query:  MSIEEAQASNPLDQI---LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLS------------TVTTSQHLMQSDFTKKRKAPYDKVH-
        M+IEE   ++P D I   LE+SVP FS S+LDET NSSS+NC++WW    D  +DLINGCLS               TS  L  SD TKKRKAP D VH 
Subjt:  MSIEEAQASNPLDQI---LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLS------------TVTTSQHLMQSDFTKKRKAPYDKVH-

Query:  ---------------TKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNN----SNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANH
                       +KN A KG+ AVEGVTV+K++VGNKK+ SKS GNN    SN EGRWAEQLLNPCANAI KGDATRVHHLLCVL+ELAS TGDANH
Subjt:  ---------------TKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNN----SNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANH

Query:  RLAAHGLRALAHYLSSNPSSS-----SSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLE
        RLA HGLRALA++LSSN SSS     SS V P TFASTDPRFFQRSLIK HEVSPWFAFPNNIANSSILHILSEE NR RNLHILDIGVSHGVQWPTLLE
Subjt:  RLAAHGLRALAHYLSSNPSSS-----SSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDE
        ALTRRSGGPPP IRLTV+APT+EHDQN  T FS+GPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ I K  DEILIVCAQFRLHQL+H APDE
Subjt:  ALTRRSGGPPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTN-QGEMNEEKEKWCERMRNAGFARK
        R EFL NLRKMEPKAVILSENNM CSCS CGNF+ GF R VEY+WKFLDSTS+AFKGRES+ER+VMEGEAAK+L N  GEMNEEK KWCERMRN GF RK
Subjt:  RFEFLRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTN-QGEMNEEKEKWCERMRNAGFARK

Query:  FFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSN
         F EDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCSFWKLG+KSN
Subjt:  FFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSN

A0A1S3CGQ3 nodulation-signaling pathway 1 protein1.4e-22675.09Show/hide
Query:  MSIEEAQASNPLDQI---LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLS------------TVTTSQHLMQSDFTKKRKAPYDKVH-
        M+IEE    +P D I   LE+SVP FS S+LDET NSSS+NC++WW    D  +DLINGCLS               TS HL  SD TKKRKAP D VH 
Subjt:  MSIEEAQASNPLDQI---LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLS------------TVTTSQHLMQSDFTKKRKAPYDKVH-

Query:  ---------------TKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNN----SNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANH
                       +KN A KG+ AVEGVTV+K++VGNKK+ SKS GNN    SN EGRWAEQLLNPCANAI KGDATRVHHLLCVL+ELAS TGDANH
Subjt:  ---------------TKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNN----SNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANH

Query:  RLAAHGLRALAHYLSSNPSSS-----SSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLE
        RLA HGLRALA++LSSN SSS     SS V P+TFASTDPRFFQRSLIK HEVSPWFAFPNNIANSSILHILSEE NR RNLH+LDIGVSHGVQWPTLLE
Subjt:  RLAAHGLRALAHYLSSNPSSS-----SSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDE
        ALTRRSGGPPP IRLTV+APTVEHDQN  T FS+GPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ I K  DEILIVC+QFRLHQL+H APDE
Subjt:  ALTRRSGGPPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDE

Query:  RFEFLRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTN-QGEMNEEKEKWCERMRNAGFARK
        R EFL+NLRKMEPKAVILSENNM CSCS C NF+ GF R VEY+WKFLDSTS+AFKGRES+ER+VMEGEAAK+L N +GEMNEEK KWCERMRN GF RK
Subjt:  RFEFLRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTN-QGEMNEEKEKWCERMRNAGFARK

Query:  FFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSN
         F EDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCS WKLG+KSN
Subjt:  FFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSN

A0A6J1D4T6 nodulation-signaling pathway 1 protein4.6e-23077.13Show/hide
Query:  MSIEEAQASNPLDQI---LEESVPSFSPSYLDETFNSSSMNCFEWWVDES----QDLINGCLS-----------TVTTSQHLMQSDFTKKRKAPYDKVH-
        M+IEE   ++P D I   LE+S P FSP +LDET+NSSS+NC++WW DES    QDLINGCLS           T   +  L  SD +KKRKAP D  H 
Subjt:  MSIEEAQASNPLDQI---LEESVPSFSPSYLDETFNSSSMNCFEWWVDES----QDLINGCLS-----------TVTTSQHLMQSDFTKKRKAPYDKVH-

Query:  ---------------TKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNN----SNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANH
                       +KNGA KG     G  VVK++VGNKKS SKS GNN    SN EGRWAEQLLNPCANAI KGDATRVHHL+CVL+ELAS TGDANH
Subjt:  ---------------TKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNN----SNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANH

Query:  RLAAHGLRALAHYLSSNPSSSSS-IVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTR
        RLA HGLRALAH+LSSN SSSSS + P V FASTD RFFQRSLIK HEVSPWFA PNNIANSSILH LSEEPN SRNLHILDIGVSHGVQWPTLLEALTR
Subjt:  RLAAHGLRALAHYLSSNPSSSSS-IVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTR

Query:  RSGGPPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEF
        RSGGPPP IRLTVVAPTVEHDQ   T FS+GPPGDNISSRLLSFAKSLNINLQINRLDNHSLQ+LNSQVIGK  DEILIVCA FRLHQL+H APDER EF
Subjt:  RSGGPPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEF

Query:  LRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAED
        LRNLRKMEP AVILSENN+ACSCSNCGNFD  FTRRVEYLW+FLDSTSSAFKGRESDER+VMEGEAAK+LTNQGEMNEEKEKW ERMRNAGFARKFFAE 
Subjt:  LRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAED

Query:  TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        TIDTARASMRRYDNNWEMR+EEKDGCVGLWWKGQP+SFCSFWKLG K NGG
Subjt:  TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

A0A6J1HKE2 nodulation-signaling pathway 1 protein9.2e-23979.14Show/hide
Query:  LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLST-----------VTTSQHLMQSDFTKKRKAPYDKVH----------------TKNG
        L +SVP FS  + D+++NSSS+NC++WW    D  QDLINGCLS+            +TS HL  SD TKKRKAP D VH                ++NG
Subjt:  LEESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLST-----------VTTSQHLMQSDFTKKRKAPYDKVH----------------TKNG

Query:  AHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNT----EGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNPS
        A K + AV GVTV+K++VGNK++ SK+ GNN+N     EGRWAEQLLNPCANAI KGDATRVHHLLCVL+ELAS TGDANHRLAA+GLRALAHYLSSN S
Subjt:  AHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNT----EGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNPS

Query:  -SSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTVE
         SSSS + PVTFASTDPRFFQRSLIK HEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPP IRLTV+APTVE
Subjt:  -SSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTVE

Query:  HDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENNM
        HDQN  T FS+GPPGDNISSRLLSFAKSLNINLQINRLD+ SL SLN+QVIGK PDEILIVCAQFRLHQL+HYAPDERFEFL+NLRK+EPKAVILSENNM
Subjt:  HDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENNM

Query:  ACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMR
        ACSC+NCGNFDTGFTR+VEYLW+FLDSTSSAFKGRES+ERK+MEGEAAK+L N+GEMNEE EKWCERMRNAGFARK F EDTIDTARASMRRYDNNWEMR
Subjt:  ACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMR

Query:  VEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        VEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  VEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

A0A6J1KER4 nodulation-signaling pathway 1 protein7.8e-24679.49Show/hide
Query:  MSIEEAQASNPLDQILE---ESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLST-----------VTTSQHLMQSDFTKKRKAPYDKVH--
        M+IEE  A++P D IL+   +SVP FS  + D+++NSSS+NC++WW    D  QDLINGCLS+            +TS HL  SD TKKRKAP D VH  
Subjt:  MSIEEAQASNPLDQILE---ESVPSFSPSYLDETFNSSSMNCFEWW---VDESQDLINGCLST-----------VTTSQHLMQSDFTKKRKAPYDKVH--

Query:  --------------TKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNN----SNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHR
                      +KNGA KG+ AVEGVTV+K++VGNK++ SK+ GNN    SN EGRWAEQLLNPCANAI KGDATRVHHLLCVL+ELAS TGDANHR
Subjt:  --------------TKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNN----SNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHR

Query:  LAAHGLRALAHYLSSNPS--SSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTR
        LAA+GLRALAHYLSSN S  SSSS V PVTFASTD RFFQRSLIK HEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTR
Subjt:  LAAHGLRALAHYLSSNPS--SSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTR

Query:  RSGGPPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEF
        RSGGPPP IRLTV+APTVEHDQN  T FS+GPPGDNISSRLLSFAKSLNINLQINRLDN SLQS NSQVIGKLPDEILIVCAQFRLHQL+HYAPDERFEF
Subjt:  RSGGPPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEF

Query:  LRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAED
        L+NLRK+EPKAVILSENNMACSC+NCGNFDTGFTR+VEYLW+FLDSTSSAFKGRES+ERKVMEGEAA+ LTNQGEMNEE EKWCERMRNAGFARK F ED
Subjt:  LRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAED

Query:  TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  TIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

SwissProt top hitse value%identityAlignment
A1DQP9 Protein NODULATION SIGNALING PATHWAY 13.2e-16458.85Show/hide
Query:  SNPLDQILE--ESVPSFSPSYLDETFNSSS-MNCFEWWVDESQDLING------------CLSTVTTSQHLMQ----------SDFTKKRKA-------P
        S   D IL+  E   SF PS+LDE  N+S  +  +  W     D   G             ++T  TS   ++          SD  KKR A       P
Subjt:  SNPLDQILE--ESVPSFSPSYLDETFNSSS-MNCFEWWVDESQDLING------------CLSTVTTSQHLMQ----------SDFTKKRKA-------P

Query:  YDKVHTKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYL
            + +      N    G  V K   G  K+   +  + ++ EGRWAEQLLNPCA AI  G+  RV HLL VL ELAS TGD NHRLAAHGLRAL H+L
Subjt:  YDKVHTKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYL

Query:  SSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPN-RSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVV
        SS+ SS +S    +TFAST+PRFFQ+SL+K +EVSPWF+FPNNIAN+SIL +L+EE N  SR LHILDIGVSHGVQWPTLL+AL+RRSGGPP  +RLTVV
Subjt:  SSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPN-RSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVV

Query:  APTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVIL
          T E+DQN  T FS  PPG N   RLL +A+S+NINLQINR++NHSLQ+LN+Q I   PDEILIVCAQFRLH L H +PDER EFL+ LR MEP+ VIL
Subjt:  APTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVIL

Query:  SENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDN
        SENN  C CS CGNF  GFTRRVEYLW+FLDSTSSAFKGRESDER+VMEGEAAK+LTNQ EMNEEKEKWC RM+ AGFA + F ED +D  RA +R+YD+
Subjt:  SENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDN

Query:  NWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        NWEM+VEEK+  VGLWWKGQPVSFCS WKL     GG
Subjt:  NWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

Q4VYC8 Protein NODULATION SIGNALING PATHWAY 11.2e-16657.76Show/hide
Query:  MSIEEAQASNPLDQILEESVPSFSPSYLDETFNSSSMNCFEWWVDESQDLIN----------------------GCLSTVTTS------QHLMQSDFTKK
        M++E    S+ +   LE SV SF PS+LD+ +N+  ++ +E W +++QD+ N                         +T TTS       ++  SD  KK
Subjt:  MSIEEAQASNPLDQILEESVPSFSPSYLDETFNSSSMNCFEWWVDESQDLIN----------------------GCLSTVTTS------QHLMQSDFTKK

Query:  RKAPYD---------------KVHTKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNT----EGRWAEQLLNPCANAIFKGDATRVHHLLCVLREL
        R A  +               K    N +  G+ A+EG TVV+++ GNKK  +K+ G+NSN     +GRWAEQLLNPCA AI  G+  RV HLL VL EL
Subjt:  RKAPYD---------------KVHTKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNT----EGRWAEQLLNPCANAIFKGDATRVHHLLCVLREL

Query:  ASATGDANHRLAAHGLRALAHYLSSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWP
        AS TGDANHRLAAHGLRAL H+LSS  SSSS+    +TFAST+PRFFQ+SL+K +E SPWF+FPNNIAN+SIL +L+EEPN  R LHILDIGVSHGVQWP
Subjt:  ASATGDANHRLAAHGLRALAHYLSSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWP

Query:  TLLEALTRRSGGPPPFIRLTVV--APTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLR
        T LEAL+RR GGPPP +RLTVV  + + E+DQN  T FS+GP GD  SS LL +A+SLN+NLQI +LDNH LQ+LN++ +    DE LIVCAQFRLH L 
Subjt:  TLLEALTRRSGGPPPFIRLTVV--APTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLR

Query:  HYAPDERFEFLRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNA
        H  PDER EFL+ LR MEPK VILSENNM C CS+CG+F TGF+RRVEYLW+FLDSTSSAFK R+SDERK+MEGEAAK+LTNQ EMNE +EKWCERM+ A
Subjt:  HYAPDERFEFLRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNA

Query:  GFARKFFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKL
        GFA + F ED ID  RA +R+YDNNWEM+VEE    V LWWK QPVSFCS WKL
Subjt:  GFARKFFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKL

Q75I13 Protein SHORT-ROOT 29.1e-3428.82Show/hide
Query:  AVGNKKSPSKSIGNNSNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRAL-AHYLSSNPSSSSSIVPP----VTFASTD
        AV +        G   ++ GRWA QLL  CA A+   D+ RV  L+ +L ELAS  GD + +LA++ L+ L A   +S P +  ++        +F ST 
Subjt:  AVGNKKSPSKSIGNNSNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRAL-AHYLSSNPSSSSSIVPP----VTFASTD

Query:  PRFFQRSLIKLHEVSPWFAFPNNIANSSILHIL---------------SEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTVEH
            +R+ +K  E+SPW  F +  AN +IL                  S        LHILD+  +   QWPTLLEAL  RS    P + +T V PT   
Subjt:  PRFFQRSLIKLHEVSPWFAFPNNIANSSILHIL---------------SEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTVEH

Query:  DQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHS--LQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENN
              S +       I  RL  FA+ + +     R  +HS  L  L+   +           A   ++ LR  A   R  F+ +LR++EP+ V + E  
Subjt:  DQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHS--LQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENN

Query:  M-----ACSCSNCGNFDTGFTR----RVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSL--------TNQGEMNEEKEKWCERMRNAGFARKFFAEDT
                  S+  + D  F +     + +   ++DS   +F  + S+ER  +E    +++        +   E  E    W  RMR+AGF+   F+ED 
Subjt:  M-----ACSCSNCGNFDTGFTR----RVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSL--------TNQGEMNEEKEKWCERMRNAGFARKFFAEDT

Query:  IDTARASMRRYDNNWEMR-----VEEKDGCVG----LWWKGQPVSFCSFWK
         D  R+ +RRY   W MR      ++  G       L WK QPV + S WK
Subjt:  IDTARASMRRYDNNWEMR-----VEEKDGCVG----LWWKGQPVSFCSFWK

Q84MQ9 Protein NODULATION SIGNALING PATHWAY 14.6e-8642.7Show/hide
Query:  STVTTSQHLMQSDFTKKRKAPYDKVHTKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELAS
        S   +S   + S  +KKRK+P  +     G  KG     G                  G  S+ + RWAEQLLNPCA A+  G+ +RV HL  VL EL S
Subjt:  STVTTSQHLMQSDFTKKRKAPYDKVHTKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELAS

Query:  ATGDANHRLAAHGLRALAHYLSS--NPSSSSSI-VPP------VTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILH--ILSEEPNRSRNLHILDI
         +GDANHRLAAHGLRALA +L +   P++++++ VPP        FA+ +PR F+ SLI+ HEVSPWFA PN +AN++I             R LH++D+
Subjt:  ATGDANHRLAAHGLRALAHYLSS--NPSSSSSI-VPP------VTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILH--ILSEEPNRSRNLHILDI

Query:  GVSHGVQWPTLLEALTRRSGG-PPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCA
        GVSHGVQWPTLLE+LTR+ GG  PP +RLTVV P      +    FS  PPG + S  LL +AKS+N++L+I+R       +L+  V G    E L+VC 
Subjt:  GVSHGVQWPTLLEALTRRSGG-PPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCA

Query:  QFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSE--NNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSL--TNQGEMNE
        QFR   L H A +ER E LR  R + P+ V+LSE  + +     + G+    F  R+E LW+FL+STS+AFKG++ +ER+++E EA   L   +     E
Subjt:  QFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSE--NNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSL--TNQGEMNE

Query:  EKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRV-EEKDGCVGLWWKGQPVSFCSFWK
         +E W ERM  AGF    F  + +++AR+ +R+YD+ WEM         V L WKGQPVSFCS W+
Subjt:  EKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRV-EEKDGCVGLWWKGQPVSFCSFWK

Q9LRW3 Scarecrow-like protein 291.9e-12449.15Show/hide
Query:  MSIEEAQASN-PLDQI---LEESV-----PSFSPSYLDETFNSSSMNCFEWWVDESQD---------------LINGCLST---VTTSQHLMQSDFTKKR
        M +EE +  N  LD +   LE+SV     P F  SYL   F+ S    +EW  D++QD                  GC +T   V T    +  D   + 
Subjt:  MSIEEAQASN-PLDQI---LEESV-----PSFSPSYLDETFNSSSMNCFEWWVDESQD---------------LINGCLST---VTTSQHLMQSDFTKKR

Query:  KAPYDKVHTKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALA
        + P D+  ++  +H G    + V    R+       S+    + N EGRWAE+LLNPCA AI   +++RV H LCVL ELAS++GDAN RLAA GLRAL 
Subjt:  KAPYDKVHTKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALA

Query:  HYLSSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLT
        H+LSS+ S SSS  P  TFAS + + FQ++L+K +EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG+QWPTLLEAL+ R  GPPP +R+T
Subjt:  HYLSSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLT

Query:  VVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAV
        V++     D      FSVGPPG N  S+LL FA+SL INLQI+ LD         Q+I   P E LIVCAQFRLH L+H   DER E L+ +R + PK V
Subjt:  VVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAV

Query:  ILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRY
        +L ENN  CS S   +F  GF++++EY+WKFLDSTSS FK   S+ERK+MEGEA K L N G+MNE KEKW ERMR AGF  + F ED +D A++ +R+Y
Subjt:  ILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRY

Query:  DNNWEMRVEEKDGCVGLWWKGQPVSFCSFWK
        DNNWE+R+E+ D   GL WKG+ VSFCS WK
Subjt:  DNNWEMRVEEKDGCVGLWWKGQPVSFCSFWK

Arabidopsis top hitse value%identityAlignment
AT3G03450.1 RGA-like 22.1e-2525Show/hide
Query:  CANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILS
        CA AI + +      L+  +  LA +   A  ++A +  +ALA  +  + ++ + +      A+ +P F +   +  +E  P+  F +  AN +IL    
Subjt:  CANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILS

Query:  EEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQV
        E    +R +H++D+G++ G+QWP L++AL  R GGPP F RLT + P    + +             +  +L  FA+++ +  +   L   SL  L  ++
Subjt:  EEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQV

Query:  IGKLPD-EILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDE--------RK
            P+ E L+V + F LH+L   +     + L  ++ ++P  V + E     +  N   F   F   + Y     DS   ++     D         R+
Subjt:  IGKLPD-EILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDE--------RK

Query:  VMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKL
        ++   AA+  +++ E +E   +W  RM++AGF            A   +  Y      RVEE DGC+ + W+ +P+   S WKL
Subjt:  VMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKL

AT3G13840.1 GRAS family transcription factor1.4e-12549.15Show/hide
Query:  MSIEEAQASN-PLDQI---LEESV-----PSFSPSYLDETFNSSSMNCFEWWVDESQD---------------LINGCLST---VTTSQHLMQSDFTKKR
        M +EE +  N  LD +   LE+SV     P F  SYL   F+ S    +EW  D++QD                  GC +T   V T    +  D   + 
Subjt:  MSIEEAQASN-PLDQI---LEESV-----PSFSPSYLDETFNSSSMNCFEWWVDESQD---------------LINGCLST---VTTSQHLMQSDFTKKR

Query:  KAPYDKVHTKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALA
        + P D+  ++  +H G    + V    R+       S+    + N EGRWAE+LLNPCA AI   +++RV H LCVL ELAS++GDAN RLAA GLRAL 
Subjt:  KAPYDKVHTKNGAHKGNTAVEGVTVVKRAVGNKKSPSKSIGNNSNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALA

Query:  HYLSSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLT
        H+LSS+ S SSS  P  TFAS + + FQ++L+K +EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG+QWPTLLEAL+ R  GPPP +R+T
Subjt:  HYLSSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLT

Query:  VVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAV
        V++     D      FSVGPPG N  S+LL FA+SL INLQI+ LD         Q+I   P E LIVCAQFRLH L+H   DER E L+ +R + PK V
Subjt:  VVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAV

Query:  ILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRY
        +L ENN  CS S   +F  GF++++EY+WKFLDSTSS FK   S+ERK+MEGEA K L N G+MNE KEKW ERMR AGF  + F ED +D A++ +R+Y
Subjt:  ILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRY

Query:  DNNWEMRVEEKDGCVGLWWKGQPVSFCSFWK
        DNNWE+R+E+ D   GL WKG+ VSFCS WK
Subjt:  DNNWEMRVEEKDGCVGLWWKGQPVSFCSFWK

AT3G50650.1 GRAS family transcription factor4.3e-2327.54Show/hide
Query:  LRELASATGDANHRLAAHGLRALAHYLSSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHG
        ++E  S +GD   R+  +   AL+H  + +PSSSS         S+    F  S   L++  P+  F +  AN +IL    E  N+S N+HI+D G+  G
Subjt:  LRELASATGDANHRLAAHGLRALAHYLSSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHG

Query:  VQWPTLLEALTRRSGGPPPFIRLT-VVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLH
        +QW  LL+AL  RS G P  IR++ + AP++          S GP      +RL  FA  L++N +   +    +Q LN       PDE+L+V     L+
Subjt:  VQWPTLLEALTRRSGGPPPFIRLT-VVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIGKLPDEILIVCAQFRLH

Query:  QLRHYAPDERFEFLRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFK---GRESDER----KVMEGEAAKSLTNQGEMN---
        +L           LR  R + P+ V L E  ++ +          F  RV+   +F  +   + +    R+S ER    +V+ G     L    + N   
Subjt:  QLRHYAPDERFEFLRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFK---GRESDER----KVMEGEAAKSLTNQGEMN---

Query:  -------EEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYD-NNWEMRVEEKDGCVGLWWKGQPVSFCSFWK
               EEKE+W   M  AGF     +   +  A+  +  Y+ +     VE + G + L W   P+   S W+
Subjt:  -------EEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYD-NNWEMRVEEKDGCVGLWWKGQPVSFCSFWK

AT4G37650.1 GRAS family transcription factor2.3e-3228.36Show/hide
Query:  NTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNPSSSSSIVPPVTFASTDP----RFFQRSLIKLHEVSPW
        +   +WA+ +L   A A    D  R   +L  L EL+S  GD   +LA++ L+AL + ++   S        VT A+T+        +++++K  EVSPW
Subjt:  NTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNPSSSSSIVPPVTFASTDP----RFFQRSLIKLHEVSPW

Query:  FAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLT--VVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNI
          F +  AN +IL  +  E      +HI+DI  +   QWPTLLEAL  RS    P +RLT  VVA    +DQ              I +R+  FA+ + +
Subjt:  FAFPNNIANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLT--VVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNI

Query:  NLQINRLDNHSLQSLNSQVIGKL---PDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENNMACSCSNCGNFD----TGFTRRVEYLWKF
          + N +  H +  L+   + +L   PDE+L +     +H +       R   + + R++ P+ V + E          G FD     GF   + +    
Subjt:  NLQINRLDNHSLQSLNSQVIGKL---PDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENNMACSCSNCGNFD----TGFTRRVEYLWKF

Query:  LDSTSSAFKGRESDERKVMEGEAAKSL--------TNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRY-DNNWEMRVEEKDGCVGLWWKGQ
         +S   +F  R S+ER ++E  A +++        ++  E  E   KW  RMRN+GF    ++++  D  RA +RRY +  W M        + L W+ Q
Subjt:  LDSTSSAFKGRESDERKVMEGEAAKSL--------TNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRY-DNNWEMRVEEKDGCVGLWWKGQ

Query:  PVSFCSFWK
        PV + S W+
Subjt:  PVSFCSFWK

AT5G66770.1 GRAS family transcription factor5.5e-2628.02Show/hide
Query:  CANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILS
        CA  I   D       L  +RE  S  GD   R+A +   AL++ LS N  ++SS       +S+       S   L++  P+  F +  AN +IL    
Subjt:  CANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNIANSSILHILS

Query:  EEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLT-VVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQ
        E   +S  +HI+D G+  G+QWP LL+AL  R+ G P  IR++ + AP++          S  P      +RL  FAK L++N     +    +  LN  
Subjt:  EEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLT-VVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQ

Query:  VIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFK---GRESDERKVMEGE
             PDE+L V    +L++L    P      LR  + + P+ V L E  ++ +         GF  RV+   +F  +   + +   GR+S+ER  +E E
Subjt:  VIGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFK---GRESDERKVMEGE

Query:  ----------AAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYD-NNWEMRVEEKDGCVGLWWKGQPVSFCSFWK
                    +      E  EEKE+W   M NAGF     +   +  A+  +  Y+ +N    VE K G + L W   P+   S W+
Subjt:  ----------AAKSLTNQGEMNEEKEKWCERMRNAGFARKFFAEDTIDTARASMRRYD-NNWEMRVEEKDGCVGLWWKGQPVSFCSFWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCATTGAAGAAGCACAGGCAAGCAACCCTTTAGATCAAATATTGGAGGAGTCAGTTCCTTCGTTTTCCCCATCATACTTGGATGAAACTTTCAACTCTAGCTCTAT
GAATTGCTTTGAATGGTGGGTGGATGAGAGCCAAGATCTGATTAATGGCTGTTTAAGCACTGTCACTACTTCCCAACACTTGATGCAATCTGATTTTACCAAGAAAAGGA
AAGCTCCATATGATAAAGTTCACACCAAAAATGGTGCACATAAAGGCAATACAGCTGTTGAAGGAGTGACTGTGGTAAAAAGGGCAGTGGGGAACAAGAAAAGTCCTTCA
AAATCCATAGGGAATAACTCTAATACTGAAGGAAGGTGGGCAGAGCAATTGCTAAATCCTTGTGCTAATGCTATCTTTAAAGGGGATGCAACAAGAGTACATCATCTTCT
TTGTGTTCTTCGAGAGCTCGCCTCAGCCACTGGTGACGCCAACCACCGTCTTGCCGCCCATGGTCTTCGAGCTTTGGCTCATTATCTGTCCTCCAATCCTTCTTCTTCTT
CTTCCATAGTTCCACCGGTTACTTTCGCTTCGACCGATCCTCGATTCTTCCAGAGGTCGTTGATCAAACTCCACGAGGTGAGTCCATGGTTTGCTTTTCCGAACAACATT
GCAAATTCTTCAATCCTTCACATTCTCTCTGAAGAACCTAATCGCTCGAGAAATCTTCACATTCTTGATATTGGGGTTTCTCATGGTGTGCAATGGCCGACGCTGCTCGA
AGCCTTGACTCGTCGTTCCGGTGGCCCTCCCCCATTCATTCGTCTTACCGTTGTTGCTCCAACCGTTGAACATGACCAAAATAAGCATACGTCGTTTTCGGTTGGTCCAC
CAGGAGACAATATCTCCTCTCGACTTCTAAGTTTCGCCAAATCCTTGAACATCAATCTACAAATCAACCGCCTCGACAATCACTCACTCCAAAGTTTAAATTCGCAAGTA
ATTGGCAAGCTTCCGGACGAAATCCTAATAGTTTGTGCACAGTTCAGACTCCACCAATTGAGACACTATGCTCCCGACGAAAGATTCGAGTTCTTACGTAACCTAAGAAA
AATGGAGCCAAAGGCAGTGATTCTAAGTGAAAACAACATGGCATGTAGTTGCAGCAACTGCGGTAATTTCGACACCGGATTCACACGACGAGTTGAATACCTATGGAAAT
TCCTAGATTCAACAAGTTCTGCATTCAAAGGCCGAGAGAGCGATGAAAGGAAAGTGATGGAAGGCGAAGCCGCGAAGTCCCTCACGAATCAAGGCGAAATGAACGAAGAA
AAGGAAAAATGGTGCGAAAGAATGAGAAATGCGGGTTTTGCGAGAAAATTCTTCGCGGAAGACACCATTGATACAGCTCGAGCTTCAATGAGAAGGTATGATAACAACTG
GGAGATGAGAGTTGAAGAGAAAGATGGATGCGTGGGGCTATGGTGGAAAGGGCAACCAGTTTCGTTTTGTTCGTTTTGGAAGTTGGGGATGAAATCAAATGGTGGTTAA
mRNA sequenceShow/hide mRNA sequence
TATCATTGTACGGGACATGGTTTCGAATCCCTCTACCCCAATTGTTTTAAAACTGACAAAAAGAAGAAAAATCCAAATGAAAGGCTAAAAGTTACAAAGAATGAGTTATA
TAACCCAACCCAGGGTGAGGAAGAGACAATACCAAACTTTTTTATATTGAGTAATAAAAAAAGTGAAGAAAACATCAAATCCTCCAAAAGAACAGCTGAAATAGCTTAGA
AAACACCCCACCATTGTCTGTTTATGGAGGCTGCAAGTGACCCCTTGGCATTTTATGAAAACATTATTGGGTCTATCCTCCTAATCCCAATTATTAAACTGTAAAGCCAG
CAACCCTTGAACTCTAATCCTCTCCATCAAAAGGGTCTCTTCTCTTTCTCTTTTTTCTCCTTTTTATCTGAAATTTTTTTATCAGCAAGCAAAAATGAGCATTGAAGAAG
CACAGGCAAGCAACCCTTTAGATCAAATATTGGAGGAGTCAGTTCCTTCGTTTTCCCCATCATACTTGGATGAAACTTTCAACTCTAGCTCTATGAATTGCTTTGAATGG
TGGGTGGATGAGAGCCAAGATCTGATTAATGGCTGTTTAAGCACTGTCACTACTTCCCAACACTTGATGCAATCTGATTTTACCAAGAAAAGGAAAGCTCCATATGATAA
AGTTCACACCAAAAATGGTGCACATAAAGGCAATACAGCTGTTGAAGGAGTGACTGTGGTAAAAAGGGCAGTGGGGAACAAGAAAAGTCCTTCAAAATCCATAGGGAATA
ACTCTAATACTGAAGGAAGGTGGGCAGAGCAATTGCTAAATCCTTGTGCTAATGCTATCTTTAAAGGGGATGCAACAAGAGTACATCATCTTCTTTGTGTTCTTCGAGAG
CTCGCCTCAGCCACTGGTGACGCCAACCACCGTCTTGCCGCCCATGGTCTTCGAGCTTTGGCTCATTATCTGTCCTCCAATCCTTCTTCTTCTTCTTCCATAGTTCCACC
GGTTACTTTCGCTTCGACCGATCCTCGATTCTTCCAGAGGTCGTTGATCAAACTCCACGAGGTGAGTCCATGGTTTGCTTTTCCGAACAACATTGCAAATTCTTCAATCC
TTCACATTCTCTCTGAAGAACCTAATCGCTCGAGAAATCTTCACATTCTTGATATTGGGGTTTCTCATGGTGTGCAATGGCCGACGCTGCTCGAAGCCTTGACTCGTCGT
TCCGGTGGCCCTCCCCCATTCATTCGTCTTACCGTTGTTGCTCCAACCGTTGAACATGACCAAAATAAGCATACGTCGTTTTCGGTTGGTCCACCAGGAGACAATATCTC
CTCTCGACTTCTAAGTTTCGCCAAATCCTTGAACATCAATCTACAAATCAACCGCCTCGACAATCACTCACTCCAAAGTTTAAATTCGCAAGTAATTGGCAAGCTTCCGG
ACGAAATCCTAATAGTTTGTGCACAGTTCAGACTCCACCAATTGAGACACTATGCTCCCGACGAAAGATTCGAGTTCTTACGTAACCTAAGAAAAATGGAGCCAAAGGCA
GTGATTCTAAGTGAAAACAACATGGCATGTAGTTGCAGCAACTGCGGTAATTTCGACACCGGATTCACACGACGAGTTGAATACCTATGGAAATTCCTAGATTCAACAAG
TTCTGCATTCAAAGGCCGAGAGAGCGATGAAAGGAAAGTGATGGAAGGCGAAGCCGCGAAGTCCCTCACGAATCAAGGCGAAATGAACGAAGAAAAGGAAAAATGGTGCG
AAAGAATGAGAAATGCGGGTTTTGCGAGAAAATTCTTCGCGGAAGACACCATTGATACAGCTCGAGCTTCAATGAGAAGGTATGATAACAACTGGGAGATGAGAGTTGAA
GAGAAAGATGGATGCGTGGGGCTATGGTGGAAAGGGCAACCAGTTTCGTTTTGTTCGTTTTGGAAGTTGGGGATGAAATCAAATGGTGGTTAAAAAAGTTTGAGGATTTC
CATAGTTTTTTGGTTGTTTTCTGCCAACTACTTGGGAAGTTCAAATCCTGTTTGGAAAGAAAGAAAGACATTGTGGCTTTGTGTTGGAATTCAGAGTGTTTGATTTTCAA
TCCTTCTG
Protein sequenceShow/hide protein sequence
MSIEEAQASNPLDQILEESVPSFSPSYLDETFNSSSMNCFEWWVDESQDLINGCLSTVTTSQHLMQSDFTKKRKAPYDKVHTKNGAHKGNTAVEGVTVVKRAVGNKKSPS
KSIGNNSNTEGRWAEQLLNPCANAIFKGDATRVHHLLCVLRELASATGDANHRLAAHGLRALAHYLSSNPSSSSSIVPPVTFASTDPRFFQRSLIKLHEVSPWFAFPNNI
ANSSILHILSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPFIRLTVVAPTVEHDQNKHTSFSVGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQV
IGKLPDEILIVCAQFRLHQLRHYAPDERFEFLRNLRKMEPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWKFLDSTSSAFKGRESDERKVMEGEAAKSLTNQGEMNEE
KEKWCERMRNAGFARKFFAEDTIDTARASMRRYDNNWEMRVEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG