; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0003137 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0003137
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionnodulation-signaling pathway 1 protein
Genome locationchr4:48371850..48373514
RNA-Seq ExpressionLag0003137
SyntenyLag0003137
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0042446 - hormone biosynthetic process (biological process)
GO:2000032 - regulation of secondary shoot formation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR005202 - Transcription factor GRAS
IPR030015 - Scarecrow-like protein 29/nodulation signalling pathway 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593549.1 Protein NODULATION SIGNALING PATHWAY 1, partial [Cucurbita argyrosperma subsp. sororia]4.0e-28289.87Show/hide
Query:  DHILDWLEDSVPFF-SPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVHKTSQSHQNQRK
        DHILDWL DSVPFF SPF +++YNS+SINCYQWWDENQD+GQDLINGCLSSSP   TTVST+PPNT TSHHLTPSDLTKKRKAPD+TVHK SQ+ QNQRK
Subjt:  DHILDWLEDSVPFF-SPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVHKTSQSHQNQRK

Query:  NQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRAL
        NQNNQS+NGADK  GAV GVTV+KKSVGNK+NSSK+TGNN NNG+NKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASP GDANHRLAA+GLRAL
Subjt:  NQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRAL

Query:  AHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        AH+LSSN SS SSSSST+APVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEP+RSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  AHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMD
        IRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL SLN+QVI KFPDEILIVCAQFRLHQL+HYAPDERFEFLQNLRK++
Subjt:  IRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMD

Query:  PKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARAS
        PKAVILSENNMACSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRES+ER++MEGEAAKAL N+GEMNEE EKWCERM+NAGFARK F EDTIDTARAS
Subjt:  PKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARAS

Query:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMR++EKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_022964298.1 nodulation-signaling pathway 1 protein [Cucurbita moschata]3.0e-28290.06Show/hide
Query:  DHILDWLEDSVPFF-SPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVHKTSQSHQNQRK
        DHILDWL DSVPFF SPF +++YNSSSINCYQWWDENQD+GQDLINGCLSSSP   TTVST+PPNT TSHHLTPSDLTKKRKAPD+TVHK SQ+ QNQRK
Subjt:  DHILDWLEDSVPFF-SPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVHKTSQSHQNQRK

Query:  NQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRAL
        NQNNQS+NGADK  GAV GVTV+KKSVGNK+NSSK+TGNN NNG+NKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASP GDANHRLAA+GLRAL
Subjt:  NQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRAL

Query:  AHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        AH+LSSNSS   SSSST+APVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEP+RSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  AHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMD
        IRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL SLN+QVI KFPDEILIVCAQFRLHQL+HYAPDERFEFLQNLRK++
Subjt:  IRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMD

Query:  PKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARAS
        PKAVILSENNMACSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRES+ER++MEGEAAKAL N+GEMNEE EKWCERM+NAGFARK F EDTIDTARAS
Subjt:  PKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARAS

Query:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMR+EEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_023000031.1 nodulation-signaling pathway 1 protein [Cucurbita maxima]3.6e-29190.99Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFF-SPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETV
        MTIEEPG NHPSDHILDWL DSVPFF SPF +++YNSSSINCYQWWDENQD+GQDLINGCLSSSP   TTVST+PPNT TSHHLTPSDLTKKRKAPD+TV
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFF-SPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETV

Query:  HKTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDA
        HK SQ+ QNQRKNQNNQSKNGADKG GAVEGVTVIKKSVGNK+NSSK+TGNN +NGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASP GDA
Subjt:  HKTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDA

Query:  NHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLE
        NHRLAA+GLRALAH+LSSN SS  SSSSTVAPVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEP+RSRNLHILDIGVSHGVQWPTLLE
Subjt:  NHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDE
        ALTRRSGGPPPLIRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SLQS NSQVI K PDEILIVCAQFRLHQL+HYAPDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDE

Query:  RFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKF
        RFEFLQNLRK++PKAVILSENNMACSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRES+ER+VMEGEAA+ L NQGEMNEE EKWCERM+NAGFARK 
Subjt:  RFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKF

Query:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        F EDTIDTARASMRRYDNNWEMR+EEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_023514155.1 nodulation-signaling pathway 1 protein [Cucurbita pepo subsp. pepo]4.2e-28490.61Show/hide
Query:  DHILDWLEDSVPFF-SPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVHKTSQSHQNQRK
        DHILDWL DSVPFF SPF +++YNSSSINCYQWWDENQD+GQDLINGCLSSSP   TTVST+PPN  TSHHLTPSDLTKKRKAPD+TVHK SQ+ QNQRK
Subjt:  DHILDWLEDSVPFF-SPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVHKTSQSHQNQRK

Query:  NQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRAL
        NQNNQSKNGADK  GAV GVTV+KKSVGNK+NSSK+TG+N NNG+NKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASP GDANHRLAA+GLRAL
Subjt:  NQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRAL

Query:  AHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        AH+LSSN SS SSSSST+APVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEP+RSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  AHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMD
        IRLTVIAPT+EHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SLQSLNSQVI KFPDEILIVCAQFRLHQL+HYAPDERFEFLQNLRKM+
Subjt:  IRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMD

Query:  PKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARAS
        PKAVILSENNMACSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRES+ER+VMEGEAAKAL N+GEMNEE EKWCERM+NAGFARK F EDTIDTARAS
Subjt:  PKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARAS

Query:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMR+EEKDGCVGLWWKGQPVSFCSFWKLG+KSNGG
Subjt:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_038897214.1 protein NODULATION SIGNALING PATHWAY 1 [Benincasa hispida]5.5e-28489.71Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVH
        MTIEE GPNHPSDHILDWLEDSVPFFSPFL+ET NSSSINCYQWWD NQD G+DLING LS+SP   TTVST+  N PTSHHLTPSDLTKKRKAPD++VH
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVH

Query:  KTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDAN
        K SQ+HQN+RKNQNNQSKNG   GGGAVEGVTV+KKSVGNKKNSSK TGNN NNGSN+EGRWAEQLLNPCA+AIIKGDATRVHHLLCVLQELASP GDAN
Subjt:  KTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDAN

Query:  HRLAAHGLRALAHHLSSNSSST--SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLL
        HRLA HGLRALAHHLSSNSSS+  SS SSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEP+R RNLHILDIGVSHGVQWPTLL
Subjt:  HRLAAHGLRALAHHLSSNSSST--SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLL

Query:  EALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPD
        EALTRRSGGPP LIRLTVI PT+EHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRL+NHSLQSLNSQVI KFPDEILIVCAQFRLHQL+H  PD
Subjt:  EALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPD

Query:  ERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARK
        ERFEFLQNLRKM+PKAVILSENNM CSCSNCGNFDT FTRRVEYLWRFLDSTS+AFKGRES+ERRVMEGEAAKAL N GEMNEEK KWCERM+NAGF RK
Subjt:  ERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARK

Query:  FFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSN
         FAEDTIDTARASMRRYDNNWEMRIEEKDGC+GLWWKGQPVSFCSFWKLG+KSN
Subjt:  FFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSN

TrEMBL top hitse value%identityAlignment
A0A0A0KCK6 GRAS domain-containing protein8.9e-28088.11Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVH
        MTIEE GPNHPSDHILDWLEDSVPFFS FL+ET NSSSINCYQWWDENQD G+DLINGCLS+SPTTV  VSTR PNTPTSH LTPSDLTKKRKAPD++VH
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVH

Query:  KTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDAN
        K SQ+HQN RKNQNNQSKN ADKG GAVEGVTV+KKSVGNKKN+SKSTGNNYN+GSNKEGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASP GDAN
Subjt:  KTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDAN

Query:  HRLAAHGLRALAHHLSSNSSST--SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLL
        HRLA HGLRALA+HLSSNSSS+  SS SSTVAP TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE +R RNLHILDIGVSHGVQWPTLL
Subjt:  HRLAAHGLRALAHHLSSNSSST--SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLL

Query:  EALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPD
        EALTRRSGGPPPLIRLTVIAPT+EHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ I K  DEILIVCAQFRLHQL+H APD
Subjt:  EALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPD

Query:  ERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMN-QGEMNEEKEKWCERMKNAGFAR
        ER EFL+NLRKM+PKAVILSENNM CSCS CGNF+ GF R VEY+W+FLDSTS+AFKGRES+ERRVMEGEAAKAL N  GEMNEEK KWCERM+N GF R
Subjt:  ERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMN-QGEMNEEKEKWCERMKNAGFAR

Query:  KFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSN
        K F EDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCSFWKLG+KSN
Subjt:  KFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSN

A0A1S3CGQ3 nodulation-signaling pathway 1 protein1.3e-27887.75Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVH
        MTIEE GP+HPSDHILDWLEDSVPFFS FL+ET NSSSINCYQWWDENQD G+DLINGCLS+SPTTV  VSTR PNTPTSHHL PSDLTKKRKAPD++VH
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVH

Query:  KTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDAN
        K SQ+HQN RKNQNNQSKN ADKG GAVEGVTVIKKSVGNKKN+SKSTGNNYNNGSNKEGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASP GDAN
Subjt:  KTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDAN

Query:  HRLAAHGLRALAHHLSSNSSST--SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLL
        HRLA HGLRALA+HLSSNSSS+  SS SSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEE +R RNLH+LDIGVSHGVQWPTLL
Subjt:  HRLAAHGLRALAHHLSSNSSST--SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLL

Query:  EALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPD
        EALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD HSLQSLNSQ I K  DEILIVC+QFRLHQL+H APD
Subjt:  EALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPD

Query:  ERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMN-QGEMNEEKEKWCERMKNAGFAR
        ER EFLQNLRKM+PKAVILSENNM CSCS C NF+ GF R VEY+W+FLDSTS+AFKGRES+ERRVMEGEAAKAL N +GEMNEEK KWCERM+N GF R
Subjt:  ERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMN-QGEMNEEKEKWCERMKNAGFAR

Query:  KFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSN
        K F EDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCS WKLG+KSN
Subjt:  KFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSN

A0A6J1D4T6 nodulation-signaling pathway 1 protein1.7e-27887.93Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVH
        MTIEEPGPNHPSDHILDWLEDS PFFSPFL+ETYNSSSINCYQWWDE+Q++GQDLINGCLSSSP   TT ST PPNT +   LTPSDL+KKRKAPD+T H
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVH

Query:  KTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDAN
        KT+Q HQN RKNQNNQSKNGADKGGG      V+KKSVGNKK+SSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHL+CVLQELASP GDAN
Subjt:  KTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDAN

Query:  HRLAAHGLRALAHHLSSNSSSTSSSSSTVAP-VTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLE
        HRLA HGLRALAHHLSSNS   SSSSST+AP V FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILH LSEEP+ SRNLHILDIGVSHGVQWPTLLE
Subjt:  HRLAAHGLRALAHHLSSNSSSTSSSSSTVAP-VTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDE
        ALTRRSGGPPPLIRLTV+APTVEHDQ  ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQ+LNSQVI KF DEILIVCA FRLHQL+H APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDE

Query:  RFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKF
        R EFL+NLRKM+P AVILSENN+ACSCSNCGNFD  FTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKAL NQGEMNEEKEKW ERM+NAGFARKF
Subjt:  RFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKF

Query:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        FAE TIDTARASMRRYDNNWEMR+EEKDGCVGLWWKGQP+SFCSFWKLG K NGG
Subjt:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

A0A6J1HKE2 nodulation-signaling pathway 1 protein1.5e-28290.06Show/hide
Query:  DHILDWLEDSVPFF-SPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVHKTSQSHQNQRK
        DHILDWL DSVPFF SPF +++YNSSSINCYQWWDENQD+GQDLINGCLSSSP   TTVST+PPNT TSHHLTPSDLTKKRKAPD+TVHK SQ+ QNQRK
Subjt:  DHILDWLEDSVPFF-SPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVHKTSQSHQNQRK

Query:  NQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRAL
        NQNNQS+NGADK  GAV GVTV+KKSVGNK+NSSK+TGNN NNG+NKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASP GDANHRLAA+GLRAL
Subjt:  NQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRAL

Query:  AHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        AH+LSSNSS   SSSST+APVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEP+RSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  AHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMD
        IRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLD+ SL SLN+QVI KFPDEILIVCAQFRLHQL+HYAPDERFEFLQNLRK++
Subjt:  IRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMD

Query:  PKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARAS
        PKAVILSENNMACSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRES+ER++MEGEAAKAL N+GEMNEE EKWCERM+NAGFARK F EDTIDTARAS
Subjt:  PKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARAS

Query:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMR+EEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

A0A6J1KER4 nodulation-signaling pathway 1 protein1.7e-29190.99Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFF-SPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETV
        MTIEEPG NHPSDHILDWL DSVPFF SPF +++YNSSSINCYQWWDENQD+GQDLINGCLSSSP   TTVST+PPNT TSHHLTPSDLTKKRKAPD+TV
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFF-SPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETV

Query:  HKTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDA
        HK SQ+ QNQRKNQNNQSKNGADKG GAVEGVTVIKKSVGNK+NSSK+TGNN +NGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASP GDA
Subjt:  HKTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDA

Query:  NHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLE
        NHRLAA+GLRALAH+LSSN SS  SSSSTVAPVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEP+RSRNLHILDIGVSHGVQWPTLLE
Subjt:  NHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDE
        ALTRRSGGPPPLIRLTVIAPTVEHDQN ETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN SLQS NSQVI K PDEILIVCAQFRLHQL+HYAPDE
Subjt:  ALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDE

Query:  RFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKF
        RFEFLQNLRK++PKAVILSENNMACSC+NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRES+ER+VMEGEAA+ L NQGEMNEE EKWCERM+NAGFARK 
Subjt:  RFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKF

Query:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        F EDTIDTARASMRRYDNNWEMR+EEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

SwissProt top hitse value%identityAlignment
A1DQP9 Protein NODULATION SIGNALING PATHWAY 11.3e-17962.01Show/hide
Query:  MTIEEPGPNH-PSDHILDWLEDSVPFFSPFLEETYNSSS-INCYQWWDENQDMGQDLINGCL--SSSPTTVTTVSTRPPNTPT-----SHHLTP-SDLTK
        MT+E   PN   SDHILDWLE SV FF  FL+E  N+S  I  Y  WD+ Q    D   G    +++ TT T V+T   +T +     S++  P SDL K
Subjt:  MTIEEPGPNH-PSDHILDWLEDSVPFFSPFLEETYNSSS-INCYQWWDENQDMGQDLINGCL--SSSPTTVTTVSTRPPNTPT-----SHHLTP-SDLTK

Query:  KRKAPDETVHKTSQSHQNQRKNQ-NNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVL
        KR A DE+  K  Q+   + K +  N+ +NG             ++K   NKK  +K+ G+N N+G++KEGRWAEQLLNPCA AI  G+  RV HLL VL
Subjt:  KRKAPDETVHKTSQSHQNQRKNQ-NNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVL

Query:  QELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPS-RSRNLHILDIGV
         ELASP GD NHRLAAHGLRAL HHLSS+SSS +SS +    +TFAST+PRFFQ+SL+KF+EVSPWF+FPNNIAN+SIL +L+EE +  SR LHILDIGV
Subjt:  QELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPS-RSRNLHILDIGV

Query:  SHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFR
        SHGVQWPTLL+AL+RRSGGPP ++RLTV+  T E+DQN ETPFS  PPG N   RLL +A+S+NINLQINR++NHSLQ+LN+Q I+  PDEILIVCAQFR
Subjt:  SHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFR

Query:  LHQLRHYAPDERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCE
        LH L H +PDER EFL+ LR M+P+ VILSENN  C CS CGNF  GFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKAL NQ EMNEEKEKWC 
Subjt:  LHQLRHYAPDERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCE

Query:  RMKNAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        RMK AGFA + F ED +D  RA +R+YD+NWEM++EEK+  VGLWWKGQPVSFCS WKL     GG
Subjt:  RMKNAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

Q4VYC8 Protein NODULATION SIGNALING PATHWAY 11.6e-18562.14Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLEETYNSSSINCYQWWDENQDM-GQDLINGCLSSSPTTVTTVSTRPPNTPTS---------HHLTPSDLTK
        MT+E   PN  SDHILDWLE SV FF  FL++ YN+  I+ Y+ W++NQD+  Q  I+   +SS  T +T +    +T TS         +++  SDL K
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLEETYNSSSINCYQWWDENQDM-GQDLINGCLSSSPTTVTTVSTRPPNTPTS---------HHLTPSDLTK

Query:  KRKAPDE-TVHKTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVL
        KR A DE ++ K  Q+ +N+R    ++  N +D G  A+EG TV++KS GNKK ++K+ G+N NNG+NK+GRWAEQLLNPCA AI  G+  RV HLL VL
Subjt:  KRKAPDE-TVHKTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVL

Query:  QELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVS
         ELAS  GDANHRLAAHGLRAL HHLSS+SSST S +     +TFAST+PRFFQ+SL+KF+E SPWF+FPNNIAN+SIL +L+EEP+  R LHILDIGVS
Subjt:  QELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVS

Query:  HGVQWPTLLEALTRRSGGPPPLIRLTVI--APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQF
        HGVQWPT LEAL+RR GGPPPL+RLTV+  + + E+DQN ETPFSIGP GD  SS LL +A+SLN+NLQI +LDNH LQ+LN++ +    DE LIVCAQF
Subjt:  HGVQWPTLLEALTRRSGGPPPLIRLTVI--APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQF

Query:  RLHQLRHYAPDERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWC
        RLH L H  PDER EFL+ LR M+PK VILSENNM C CS+CG+F TGF+RRVEYLWRFLDSTSSAFK R+SDER++MEGEAAKAL NQ EMNE +EKWC
Subjt:  RLHQLRHYAPDERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWC

Query:  ERMKNAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKL
        ERMK AGFA + F ED ID  RA +R+YDNNWEM++EE    V LWWK QPVSFCS WKL
Subjt:  ERMKNAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKL

Q84MQ9 Protein NODULATION SIGNALING PATHWAY 15.2e-8339.39Show/hide
Query:  WWDENQDMGQDLINGCLSS--SPTTVTTVSTRPPNTPTSHHLTPSDL----TKKRKAPDETVHKTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKS
        WW  +    QD I   ++   SP +    +   P+  +    +PSD+    +KKRK+P            ++        K G  KGGG           
Subjt:  WWDENQDMGQDLINGCLSS--SPTTVTTVSTRPPNTPTSHHLTPSDL----TKKRKAPDETVHKTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKS

Query:  VGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAP------
                         GS+++ RWAEQLLNPCA A+  G+ +RV HL  VL EL S  GDANHRLAAHGLRALA  L +     ++++  V P      
Subjt:  VGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAP------

Query:  VTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH--ILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVIAPTVEHDQNTE
          FA+ +PR F+ SLI+FHEVSPWFA PN +AN++I          +  R LH++D+GVSHGVQWPTLLE+LTR+ GG  PP +RLTV+ P      +  
Subjt:  VTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH--ILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVIAPTVEHDQNTE

Query:  TPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMDPKAVILSE--NNMACSC
         PFS  PPG + S  LL +AKS+N++L+I+R       +     +     E L+VC QFR   L H A +ER E L+  R ++P+ V+LSE  + +    
Subjt:  TPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMDPKAVILSE--NNMACSC

Query:  SNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAK--ALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARASMRRYDNNWEMRI-
         + G+    F  R+E LWRFL+STS+AFKG++ +ERR++E EA    A  +     E +E W ERM  AGF    F  + +++AR+ +R+YD+ WEM   
Subjt:  SNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAK--ALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARASMRRYDNNWEMRI-

Query:  EEKDGCVGLWWKGQPVSFCSFWK
              V L WKGQPVSFCS W+
Subjt:  EEKDGCVGLWWKGQPVSFCSFWK

Q9LRW3 Scarecrow-like protein 291.1e-12548.47Show/hide
Query:  EEPGPNHPSDHILDWLEDSVPFFS-PFLEETYNSSSINCYQ-W-WDENQD--------MGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKR
        E   PN   DH+L WLEDSV     P  +++Y     +  Q W WD+ QD          QDL    +    T +  V+  P            DL  + 
Subjt:  EEPGPNHPSDHILDWLEDSVPFFS-PFLEETYNSSSINCYQ-W-WDENQD--------MGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKR

Query:  KAPDETVHKTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQEL
        + P++   K  +SH    + Q                    +KKS  +K+ + KS+  +  +G NKEGRWAE+LLNPCA AI   +++RV H LCVL EL
Subjt:  KAPDETVHKTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQEL

Query:  ASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPV-TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHG
        AS  GDAN RLAA GLRAL HHLSS     SS SS+  PV TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG
Subjt:  ASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPV-TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHG

Query:  VQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQ
        +QWPTLLEAL+ R  GPPP +R+TVI+     D   + PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I   P E LIVCAQFRLH 
Subjt:  VQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQ

Query:  LRHYAPDERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMK
        L+H   DER E L+ +R + PK V+L ENN  CS S   +F  GF++++EY+W+FLDSTSS FK   S+ER++MEGEA K LMN G+MNE KEKW ERM+
Subjt:  LRHYAPDERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMK

Query:  NAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWK
         AGF  + F ED +D A++ +R+YDNNWE+R+E+ D   GL WKG+ VSFCS WK
Subjt:  NAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWK

Q9SN22 Scarecrow-like protein 326.2e-3630.07Show/hide
Query:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAF
        +  + EQLL  CA AI   DA   H +L VL  +A P GD+  RL +  LRAL   LS   S T + SST++ +  A    RF    L  F +++PW  F
Subjt:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAF

Query:  PNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQIN
            AN++IL  +    +    +HI+D+ ++H +Q PTL++A+  R   PPPL++LTV++       +   P  I    + + S+L++FA + NI ++  
Subjt:  PNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQIN

Query:  RLDNHSLQSLNS--QVIAKFP---DEILIVCAQFRLHQLRHYAPDE---------RFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLW
         + +      +S  Q +  +P   +E L+V      H +  Y P+E         R  FL+ LR ++P+ V L E ++  +  N  N          Y W
Subjt:  RLDNHSLQSLNS--QVIAKFP---DEILIVCAQFRLHQLRHYAPDE---------RFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLW

Query:  RFLDSTSSAFKGRESDERRVMEGE--------AAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKG
           D+T +      S++RR  E E         AK    + E  E K +W ERM+ A F      ED +   +A +  +   W M+ E+ D  + L WKG
Subjt:  RFLDSTSSAFKGRESDERRVMEGE--------AAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKG

Query:  QPVSFCSFW
          V F + W
Subjt:  QPVSFCSFW

Arabidopsis top hitse value%identityAlignment
AT3G03450.1 RGA-like 28.0e-2325.54Show/hide
Query:  KNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRF
        ++S +ST +     S + G      L  CA AI + +      L+  +  LA     A  ++A +  +ALA  +  + ++ +           A+ +P F
Subjt:  KNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRF

Query:  FQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNIS
         +   + F+E  P+  F +  AN +IL    E  + +R +H++D+G++ G+QWP L++AL  R GGPP   RLT I P       TE   S+      + 
Subjt:  FQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNIS

Query:  SRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPD-EILIVCAQFRLHQLRHYAPDERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRV
         +L  FA+++ +  +   L   SL  L  ++    P+ E L+V + F LH+L   +     + L  ++ + P  V + E     +  N   F   F   +
Subjt:  SRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPD-EILIVCAQFRLHQLRHYAPDERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRV

Query:  EYLWRFLDSTSSAFKGRESDE--------RRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGL
         Y     DS   ++     D         R+++   AA+   ++ E +E   +W  RMK+AGF            A   +  Y      R+EE DGC+ +
Subjt:  EYLWRFLDSTSSAFKGRESDE--------RRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGL

Query:  WWKGQPVSFCSFWKL
         W+ +P+   S WKL
Subjt:  WWKGQPVSFCSFWKL

AT3G13840.1 GRAS family transcription factor7.8e-12748.47Show/hide
Query:  EEPGPNHPSDHILDWLEDSVPFFS-PFLEETYNSSSINCYQ-W-WDENQD--------MGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKR
        E   PN   DH+L WLEDSV     P  +++Y     +  Q W WD+ QD          QDL    +    T +  V+  P            DL  + 
Subjt:  EEPGPNHPSDHILDWLEDSVPFFS-PFLEETYNSSSINCYQ-W-WDENQD--------MGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKR

Query:  KAPDETVHKTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQEL
        + P++   K  +SH    + Q                    +KKS  +K+ + KS+  +  +G NKEGRWAE+LLNPCA AI   +++RV H LCVL EL
Subjt:  KAPDETVHKTSQSHQNQRKNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQEL

Query:  ASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPV-TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHG
        AS  GDAN RLAA GLRAL HHLSS     SS SS+  PV TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL IL+++P   ++LHI+DIGVSHG
Subjt:  ASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPV-TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHG

Query:  VQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQ
        +QWPTLLEAL+ R  GPPP +R+TVI+     D   + PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I   P E LIVCAQFRLH 
Subjt:  VQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQ

Query:  LRHYAPDERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMK
        L+H   DER E L+ +R + PK V+L ENN  CS S   +F  GF++++EY+W+FLDSTSS FK   S+ER++MEGEA K LMN G+MNE KEKW ERM+
Subjt:  LRHYAPDERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMK

Query:  NAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWK
         AGF  + F ED +D A++ +R+YDNNWE+R+E+ D   GL WKG+ VSFCS WK
Subjt:  NAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWK

AT3G49950.1 GRAS family transcription factor4.4e-3730.07Show/hide
Query:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAF
        +  + EQLL  CA AI   DA   H +L VL  +A P GD+  RL +  LRAL   LS   S T + SST++ +  A    RF    L  F +++PW  F
Subjt:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAF

Query:  PNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQIN
            AN++IL  +    +    +HI+D+ ++H +Q PTL++A+  R   PPPL++LTV++       +   P  I    + + S+L++FA + NI ++  
Subjt:  PNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQIN

Query:  RLDNHSLQSLNS--QVIAKFP---DEILIVCAQFRLHQLRHYAPDE---------RFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLW
         + +      +S  Q +  +P   +E L+V      H +  Y P+E         R  FL+ LR ++P+ V L E ++  +  N  N          Y W
Subjt:  RLDNHSLQSLNS--QVIAKFP---DEILIVCAQFRLHQLRHYAPDE---------RFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLW

Query:  RFLDSTSSAFKGRESDERRVMEGE--------AAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKG
           D+T +      S++RR  E E         AK    + E  E K +W ERM+ A F      ED +   +A +  +   W M+ E+ D  + L WKG
Subjt:  RFLDSTSSAFKGRESDERRVMEGE--------AAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKG

Query:  QPVSFCSFW
          V F + W
Subjt:  QPVSFCSFW

AT4G37650.1 GRAS family transcription factor9.8e-2925.93Show/hide
Query:  ENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKK--RKAPDETVHKTSQSHQNQRKNQN--------------NQSKNGADKGGGAVEG
        + Q     +I    S S T+ TT  T  P T   ++   +D+ ++      DE    +S SH N   + N              + + +       A   
Subjt:  ENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKK--RKAPDETVHKTSQSHQNQRKNQN--------------NQSKNGADKGGGAVEG

Query:  VTVIKKSVGNKKNSS----KSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRALAHHLSSNSS----ST
        +     S G+  + S      T  +++  +N   +WA+ +L   A A    D  R   +L  L EL+SP GD   +LA++ L+AL + ++ +      + 
Subjt:  VTVIKKSVGNKKNSS----KSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRALAHHLSSNSS----ST

Query:  SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VIAPT
         ++++T    +F ST     +++++KF EVSPW  F +  AN +IL  +  E      +HI+DI  +   QWPTLLEAL  RS   P L RLT  V+A  
Subjt:  SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLT--VIAPT

Query:  VEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN-HSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMDPKAVILSE
          +DQ              I +R+  FA+ + +  + N + +   L   +   +   PDE+L +     +H +       R   + + R++ P+ V + E
Subjt:  VEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDN-HSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMDPKAVILSE

Query:  NNMACSCSNCGNFDTGFTRRVEYLWRF----LDSTSSAFKGRESDERRVMEGEAAKALM--------NQGEMNEEKEKWCERMKNAGFARKFFAEDTIDT
                  G FD  F R      R+     +S   +F  R S+ER ++E  A +A++        +  E  E   KW  RM+N+GF    ++++  D 
Subjt:  NNMACSCSNCGNFDTGFTRRVEYLWRF----LDSTSSAFKGRESDERRVMEGEAAKALM--------NQGEMNEEKEKWCERMKNAGFARKFFAEDTIDT

Query:  ARASMRRY-DNNWEMRIEEKDGCVGLWWKGQPVSFCSFWK
         RA +RRY +  W M        + L W+ QPV + S W+
Subjt:  ARASMRRY-DNNWEMRIEEKDGCVGLWWKGQPVSFCSFWK

AT5G66770.1 GRAS family transcription factor4.1e-2728.57Show/hide
Query:  CANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH
        CA  I   D       L  ++E  S +GD   R+A +   AL++ LS NS +TSSSSS+   +            S    ++  P+  F +  AN +IL 
Subjt:  CANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRALAHHLSSNSSSTSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH

Query:  ILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVI-APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSL
           E   +S  +HI+D G+  G+QWP LL+AL  R+ G P  IR++ I AP++      E+P    P      +RL  FAK L++N     +    +  L
Subjt:  ILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVI-APTVEHDQNTETPFSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSL

Query:  NSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFK---GRESDER---
        N       PDE+L V    +L++L    P      L+  + ++P+ V L E  ++ +         GF  RV+   +F  +   + +   GR+S+ER   
Subjt:  NSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRVEYLWRFLDSTSSAFK---GRESDER---

Query:  -RVMEGEAAKALMN------QGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARASMRRYD-NNWEMRIEEKDGCVGLWWKGQPVSFCSFWK
         R + G     L+         E  EEKE+W   M+NAGF     +   +  A+  +  Y+ +N    +E K G + L W   P+   S W+
Subjt:  -RVMEGEAAKALMN------QGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARASMRRYD-NNWEMRIEEKDGCVGLWWKGQPVSFCSFWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCATTGAAGAACCAGGGCCAAACCACCCTTCAGATCACATATTGGACTGGTTAGAGGACTCAGTTCCTTTCTTTTCCCCTTTCCTGGAGGAGACTTACAACTCTAG
CTCTATAAACTGCTATCAATGGTGGGATGAGAACCAAGACATGGGCCAAGATCTGATTAATGGCTGTCTCAGCAGCTCCCCCACCACTGTCACTACTGTCAGTACTAGAC
CACCAAACACTCCCACTTCCCATCACTTGACGCCATCTGATTTGACCAAGAAAAGGAAAGCCCCAGATGAGACAGTTCATAAGACATCACAATCCCATCAGAACCAGAGG
AAGAACCAGAACAACCAGAGCAAAAATGGTGCAGATAAAGGCGGTGGAGCTGTTGAGGGAGTGACTGTGATAAAGAAGTCAGTGGGGAACAAGAAAAATTCATCAAAATC
CACAGGAAATAACTATAATAACGGAAGTAACAAGGAAGGAAGGTGGGCGGAGCAATTGCTAAATCCTTGTGCTAATGCTATCATAAAAGGAGATGCAACAAGGGTACATC
ACCTTCTTTGTGTTCTTCAAGAGCTCGCGTCGCCCATCGGCGACGCCAACCATCGGCTCGCCGCCCATGGTCTCCGAGCTTTGGCCCATCACCTGTCCTCCAATTCATCA
TCAACTTCTTCTTCTTCTTCCACAGTTGCGCCGGTTACTTTCGCTTCGACCGACCCTCGATTCTTCCAGAGATCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGC
TTTTCCCAACAACATCGCAAATTCTTCAATCCTCCACATTCTCTCTGAAGAACCTAGTCGCTCGCGAAATCTTCACATTCTTGACATTGGGGTTTCCCATGGTGTGCAAT
GGCCAACGCTGCTCGAGGCCTTGACTCGCCGTTCCGGTGGACCTCCGCCGTTAATTCGCCTCACAGTCATCGCTCCGACCGTCGAACACGACCAAAATACAGAGACACCG
TTTTCAATTGGTCCACCGGGAGACAACATCTCCTCTAGGCTTCTTAGTTTCGCCAAATCCTTGAACATCAATTTACAGATCAACCGCCTCGACAATCACTCGCTACAGAG
TTTAAATTCGCAAGTAATCGCCAAGTTTCCGGACGAAATCCTAATCGTTTGCGCACAGTTCAGACTCCACCAGTTGAGACACTATGCTCCTGACGAAAGATTCGAATTCT
TACAAAACCTAAGAAAAATGGATCCAAAAGCAGTGATTCTTAGCGAAAACAACATGGCATGTAGCTGCAGCAACTGCGGGAATTTCGACACCGGATTCACTCGACGAGTT
GAATACCTATGGAGATTCCTCGATTCAACAAGCTCCGCATTCAAAGGCCGAGAAAGCGACGAAAGAAGAGTGATGGAAGGAGAAGCCGCAAAGGCGCTGATGAACCAAGG
CGAAATGAACGAGGAAAAGGAAAAATGGTGCGAAAGAATGAAGAATGCGGGGTTTGCGAGAAAATTCTTCGCTGAAGACACCATTGATACAGCTCGAGCTTCAATGAGAA
GGTACGATAACAACTGGGAGATGAGAATTGAAGAGAAAGATGGATGCGTGGGGTTATGGTGGAAAGGGCAACCAGTTTCGTTTTGTTCGTTTTGGAAGTTGGGGATGAAA
TCCAATGGCGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCATTGAAGAACCAGGGCCAAACCACCCTTCAGATCACATATTGGACTGGTTAGAGGACTCAGTTCCTTTCTTTTCCCCTTTCCTGGAGGAGACTTACAACTCTAG
CTCTATAAACTGCTATCAATGGTGGGATGAGAACCAAGACATGGGCCAAGATCTGATTAATGGCTGTCTCAGCAGCTCCCCCACCACTGTCACTACTGTCAGTACTAGAC
CACCAAACACTCCCACTTCCCATCACTTGACGCCATCTGATTTGACCAAGAAAAGGAAAGCCCCAGATGAGACAGTTCATAAGACATCACAATCCCATCAGAACCAGAGG
AAGAACCAGAACAACCAGAGCAAAAATGGTGCAGATAAAGGCGGTGGAGCTGTTGAGGGAGTGACTGTGATAAAGAAGTCAGTGGGGAACAAGAAAAATTCATCAAAATC
CACAGGAAATAACTATAATAACGGAAGTAACAAGGAAGGAAGGTGGGCGGAGCAATTGCTAAATCCTTGTGCTAATGCTATCATAAAAGGAGATGCAACAAGGGTACATC
ACCTTCTTTGTGTTCTTCAAGAGCTCGCGTCGCCCATCGGCGACGCCAACCATCGGCTCGCCGCCCATGGTCTCCGAGCTTTGGCCCATCACCTGTCCTCCAATTCATCA
TCAACTTCTTCTTCTTCTTCCACAGTTGCGCCGGTTACTTTCGCTTCGACCGACCCTCGATTCTTCCAGAGATCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGC
TTTTCCCAACAACATCGCAAATTCTTCAATCCTCCACATTCTCTCTGAAGAACCTAGTCGCTCGCGAAATCTTCACATTCTTGACATTGGGGTTTCCCATGGTGTGCAAT
GGCCAACGCTGCTCGAGGCCTTGACTCGCCGTTCCGGTGGACCTCCGCCGTTAATTCGCCTCACAGTCATCGCTCCGACCGTCGAACACGACCAAAATACAGAGACACCG
TTTTCAATTGGTCCACCGGGAGACAACATCTCCTCTAGGCTTCTTAGTTTCGCCAAATCCTTGAACATCAATTTACAGATCAACCGCCTCGACAATCACTCGCTACAGAG
TTTAAATTCGCAAGTAATCGCCAAGTTTCCGGACGAAATCCTAATCGTTTGCGCACAGTTCAGACTCCACCAGTTGAGACACTATGCTCCTGACGAAAGATTCGAATTCT
TACAAAACCTAAGAAAAATGGATCCAAAAGCAGTGATTCTTAGCGAAAACAACATGGCATGTAGCTGCAGCAACTGCGGGAATTTCGACACCGGATTCACTCGACGAGTT
GAATACCTATGGAGATTCCTCGATTCAACAAGCTCCGCATTCAAAGGCCGAGAAAGCGACGAAAGAAGAGTGATGGAAGGAGAAGCCGCAAAGGCGCTGATGAACCAAGG
CGAAATGAACGAGGAAAAGGAAAAATGGTGCGAAAGAATGAAGAATGCGGGGTTTGCGAGAAAATTCTTCGCTGAAGACACCATTGATACAGCTCGAGCTTCAATGAGAA
GGTACGATAACAACTGGGAGATGAGAATTGAAGAGAAAGATGGATGCGTGGGGTTATGGTGGAAAGGGCAACCAGTTTCGTTTTGTTCGTTTTGGAAGTTGGGGATGAAA
TCCAATGGCGGTTGA
Protein sequenceShow/hide protein sequence
MTIEEPGPNHPSDHILDWLEDSVPFFSPFLEETYNSSSINCYQWWDENQDMGQDLINGCLSSSPTTVTTVSTRPPNTPTSHHLTPSDLTKKRKAPDETVHKTSQSHQNQR
KNQNNQSKNGADKGGGAVEGVTVIKKSVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPIGDANHRLAAHGLRALAHHLSSNSS
STSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHILSEEPSRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPTVEHDQNTETP
FSIGPPGDNISSRLLSFAKSLNINLQINRLDNHSLQSLNSQVIAKFPDEILIVCAQFRLHQLRHYAPDERFEFLQNLRKMDPKAVILSENNMACSCSNCGNFDTGFTRRV
EYLWRFLDSTSSAFKGRESDERRVMEGEAAKALMNQGEMNEEKEKWCERMKNAGFARKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMK
SNGG