; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007499 (gene) of Snake gourd v1 genome

Gene IDTan0007499
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionnodulation-signaling pathway 1 protein
Genome locationLG06:1242771..1244764
RNA-Seq ExpressionTan0007499
SyntenyTan0007499
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0042446 - hormone biosynthetic process (biological process)
GO:2000032 - regulation of secondary shoot formation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR005202 - Transcription factor GRAS
IPR030015 - Scarecrow-like protein 29/nodulation signalling pathway 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148342.1 nodulation-signaling pathway 1 protein [Momordica charantia]1.6e-27887.57Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVH
        MTIEEPGPNHPSDHILDWLEDS PFFSPFLDETYNSSSINCYQWWDE+Q+IGQDLINGCLS SP   TT +T  PN  +   LTPSDL+KKRKAPDDT H
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVH

Query:  KKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDAN
        K +QPHQN RKNQNNQSKNGADKG G      V+KK+VGNKK+SSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHL+CVLQELASPTGDAN
Subjt:  KKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDAN

Query:  HRLAAHGLRALAHHLSSNSSSSSSSSSTVAP-VTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLE
        HRLA HGLRALAHHLSSN   SSSSSST+AP V FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILHTLSEEPN SRNLHILDIGVSHGVQWPTLLE
Subjt:  HRLAAHGLRALAHHLSSNSSSSSSSSSTVAP-VTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDE
        ALTRRSGGPPPLIRLTV+AP++EHD+  ETPFSIGPPGDNI SRLLSFAKSLNINLQINRLD+HSLQ+LNSQVIGKF DEILIVCA FRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDE

Query:  RFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKF
        R EFL+NLR+MEP AVILSENN+ACSC+NCGNFD  FTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGF RKF
Subjt:  RFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKF

Query:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        FAE TIDTARASMRRYDNNWEMR+EEKDGCVGLWWKGQP+SFCSFWKLG K NGG
Subjt:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_022964298.1 nodulation-signaling pathway 1 protein [Cucurbita moschata]1.6e-27889.13Show/hide
Query:  DHILDWLEDSVPFF-SPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVHKKSQPHQNQRK
        DHILDWL DSVPFF SPF D++YNSSSINCYQWWDENQDIGQDLINGCLS SP   TTV+T+ PN  TSHHLTPSDLTKKRKAPDDTVHK SQ  QNQRK
Subjt:  DHILDWLEDSVPFF-SPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVHKKSQPHQNQRK

Query:  NQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRAL
        NQNNQS+NGADK +G V  VTV+KK+VGNK+NSSK+TGNN NNG+NKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAA+GLRAL
Subjt:  NQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRAL

Query:  AHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        AH+LSSN  SS SSSST+APVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH LSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  AHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLREME
        IRLTVIAP++EHD+N ETPFSIGPPGDNI SRLLSFAKSLNINLQINRLDS SL SLN+QVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLR++E
Subjt:  IRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLREME

Query:  PKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARAS
        PKAVILSENNMACSC NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRES+ER++MEGEAAKAL N+GEMNEE EKW ERMRNAGF RK F EDTIDTARAS
Subjt:  PKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARAS

Query:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMR+EEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_023000031.1 nodulation-signaling pathway 1 protein [Cucurbita maxima]1.1e-28790.09Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFF-SPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTV
        MTIEEPG NHPSDHILDWL DSVPFF SPF D++YNSSSINCYQWWDENQDIGQDLINGCLS SP   TTV+T+ PN  TSHHLTPSDLTKKRKAPDDTV
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFF-SPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTV

Query:  HKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDA
        HK SQ  QNQRKNQNNQSKNGADKG+G VE VTVIKK+VGNK+NSSK+TGNN +NGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDA
Subjt:  HKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDA

Query:  NHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLE
        NHRLAA+GLRALAH+LSSN SS  SSSSTVAPVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILH LSEEPNRSRNLHILDIGVSHGVQWPTLLE
Subjt:  NHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDE
        ALTRRSGGPPPLIRLTVIAP++EHD+N ETPFSIGPPGDNI SRLLSFAKSLNINLQINRLD+ SLQS NSQVIGK PDEILIVCAQFRLHQLKHYAPDE
Subjt:  ALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDE

Query:  RFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKF
        RFEFLQNLR++EPKAVILSENNMACSCNNCGNFDTGFTR+VEYLWRFLDSTSSAFKGRES+ER+VMEGEAA+ LTNQGEMNEE EKW ERMRNAGF RK 
Subjt:  RFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKF

Query:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        F EDTIDTARASMRRYDNNWEMR+EEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_023514155.1 nodulation-signaling pathway 1 protein [Cucurbita pepo subsp. pepo]1.3e-28089.87Show/hide
Query:  DHILDWLEDSVPFF-SPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVHKKSQPHQNQRK
        DHILDWL DSVPFF SPF D++YNSSSINCYQWWDENQDIGQDLINGCLS SP   TTV+T+ PN  TSHHLTPSDLTKKRKAPDDTVHK SQ  QNQRK
Subjt:  DHILDWLEDSVPFF-SPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVHKKSQPHQNQRK

Query:  NQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRAL
        NQNNQSKNGADK +G V  VTV+KK+VGNK+NSSK+TG+N NNG+NKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAA+GLRAL
Subjt:  NQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRAL

Query:  AHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        AH+LSSN SS SSSSST+APVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH LSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  AHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLREME
        IRLTVIAP++EHD+N ETPFSIGPPGDNI SRLLSFAKSLNINLQINRLD+ SLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLR+ME
Subjt:  IRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLREME

Query:  PKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARAS
        PKAVILSENNMACSCNNCGNFDTGFTR+VEYLWRFLDSTSSAFKGRES+ER+VMEGEAAKAL N+GEMNEE EKW ERMRNAGF RK F EDTIDTARAS
Subjt:  PKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARAS

Query:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMR+EEKDGCVGLWWKGQPVSFCSFWKLG+KSNGG
Subjt:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

XP_038897214.1 protein NODULATION SIGNALING PATHWAY 1 [Benincasa hispida]1.0e-28289.53Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVH
        MTIEE GPNHPSDHILDWLEDSVPFFSPFLDET NSSSINCYQWWD NQD G+DLING LS SP   TTV+T+L NIPTSHHLTPSDLTKKRKAPDD+VH
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVH

Query:  KKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDAN
        KKSQ HQN+RKNQNNQSKNG   G G VE VTV+KK+VGNKKNSSK TGNN NNGSN+EGRWAEQLLNPCA+AIIKGDATRVHHLLCVLQELASPTGDAN
Subjt:  KKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDAN

Query:  HRLAAHGLRALAHHLSSNSSSS--SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLL
        HRLA HGLRALAHHLSSNSSSS  SS SSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH LSEEPNR RNLHILDIGVSHGVQWPTLL
Subjt:  HRLAAHGLRALAHHLSSNSSSS--SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLL

Query:  EALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPD
        EALTRRSGGPP LIRLTVI P+IEHD+N ETPFSIGPPGDNI SRLLSFAKSLNINLQINRL++HSLQSLNSQVI KFPDEILIVCAQFRLHQLKH  PD
Subjt:  EALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPD

Query:  ERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRK
        ERFEFLQNLR+MEPKAVILSENNM CSC+NCGNFDT FTRRVEYLWRFLDSTS+AFKGRES+ERRVMEGEAAKALTN GEMNEEK KW ERMRNAGF RK
Subjt:  ERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRK

Query:  FFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSN
         FAEDTIDTARASMRRYDNNWEMRIEEKDGC+GLWWKGQPVSFCSFWKLG+KSN
Subjt:  FFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSN

TrEMBL top hitse value%identityAlignment
A0A0A0KCK6 GRAS domain-containing protein4.1e-27787.57Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVH
        MTIEE GPNHPSDHILDWLEDSVPFFS FLDET NSSSINCYQWWDENQD G+DLINGCLS SPTTV  V+TR PN PTSH LTPSDLTKKRKAPDD+VH
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVH

Query:  KKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDAN
        KKSQ HQN RKNQNNQSKN ADKG+G VE VTV+KK+VGNKKN+SKSTGNNYN+GSNKEGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDAN
Subjt:  KKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDAN

Query:  HRLAAHGLRALAHHLSSNSSSS--SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLL
        HRLA HGLRALA+HLSSNSSSS  SS SSTVAP TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH LSEE NR RNLHILDIGVSHGVQWPTLL
Subjt:  HRLAAHGLRALAHHLSSNSSSS--SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLL

Query:  EALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPD
        EALTRRSGGPPPLIRLTVIAP+IEHD+N ETPFSIGPPGDNI SRLLSFAKSLNINLQINRLD HSLQSLNSQ I K  DEILIVCAQFRLHQLKH APD
Subjt:  EALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPD

Query:  ERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFVR
        ER EFL+NLR+MEPKAVILSENNM CSC+ CGNF+ GF R VEY+W+FLDSTS+AFKGRES+ERRVMEGEAAKAL N  GEMNEEK KW ERMRN GF R
Subjt:  ERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFVR

Query:  KFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSN
        K F EDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCSFWKLG+KSN
Subjt:  KFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSN

A0A1S3CGQ3 nodulation-signaling pathway 1 protein1.0e-27586.85Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVH
        MTIEE GP+HPSDHILDWLEDSVPFFS FLDET NSSSINCYQWWDENQD G+DLINGCLS SPTTV  V+TR PN PTSHHL PSDLTKKRKAPDD+VH
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVH

Query:  KKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDAN
        KKSQ HQN RKNQNNQSKN ADKG+G VE VTVIKK+VGNKKN+SKSTGNNYNNGSNKEGRWAEQLLNPCANAI+KGDATRVHHLLCVLQELASPTGDAN
Subjt:  KKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDAN

Query:  HRLAAHGLRALAHHLSSNSSSS--SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLL
        HRLA HGLRALA+HLSSNSSSS  SS SSTV+P+TFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH LSEE NR RNLH+LDIGVSHGVQWPTLL
Subjt:  HRLAAHGLRALAHHLSSNSSSS--SSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLL

Query:  EALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPD
        EALTRRSGGPPPLIRLTVIAP++EHD+N ETPFSIGPPGDNI SRLLSFAKSLNINLQINRLD HSLQSLNSQ I K  DEILIVC+QFRLHQLKH APD
Subjt:  EALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPD

Query:  ERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFVR
        ER EFLQNLR+MEPKAVILSENNM CSC+ C NF+ GF R VEY+W+FLDSTS+AFKGRES+ERRVMEGEAAKAL N +GEMNEEK KW ERMRN GF R
Subjt:  ERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTN-QGEMNEEKEKWYERMRNAGFVR

Query:  KFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSN
        K F EDTIDTARASMRRYDNNWEMR+E+KDGCVGLWWKGQPVSFCS WKLG+KSN
Subjt:  KFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSN

A0A6J1D4T6 nodulation-signaling pathway 1 protein7.5e-27987.57Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVH
        MTIEEPGPNHPSDHILDWLEDS PFFSPFLDETYNSSSINCYQWWDE+Q+IGQDLINGCLS SP   TT +T  PN  +   LTPSDL+KKRKAPDDT H
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVH

Query:  KKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDAN
        K +QPHQN RKNQNNQSKNGADKG G      V+KK+VGNKK+SSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHL+CVLQELASPTGDAN
Subjt:  KKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDAN

Query:  HRLAAHGLRALAHHLSSNSSSSSSSSSTVAP-VTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLE
        HRLA HGLRALAHHLSSN   SSSSSST+AP V FASTD RFFQRSLIKFHEVSPWFA PNNIANSSILHTLSEEPN SRNLHILDIGVSHGVQWPTLLE
Subjt:  HRLAAHGLRALAHHLSSNSSSSSSSSSTVAP-VTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDE
        ALTRRSGGPPPLIRLTV+AP++EHD+  ETPFSIGPPGDNI SRLLSFAKSLNINLQINRLD+HSLQ+LNSQVIGKF DEILIVCA FRLHQLKH APDE
Subjt:  ALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDE

Query:  RFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKF
        R EFL+NLR+MEP AVILSENN+ACSC+NCGNFD  FTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGF RKF
Subjt:  RFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKF

Query:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        FAE TIDTARASMRRYDNNWEMR+EEKDGCVGLWWKGQP+SFCSFWKLG K NGG
Subjt:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

A0A6J1HKE2 nodulation-signaling pathway 1 protein7.5e-27989.13Show/hide
Query:  DHILDWLEDSVPFF-SPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVHKKSQPHQNQRK
        DHILDWL DSVPFF SPF D++YNSSSINCYQWWDENQDIGQDLINGCLS SP   TTV+T+ PN  TSHHLTPSDLTKKRKAPDDTVHK SQ  QNQRK
Subjt:  DHILDWLEDSVPFF-SPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVHKKSQPHQNQRK

Query:  NQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRAL
        NQNNQS+NGADK +G V  VTV+KK+VGNK+NSSK+TGNN NNG+NKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAA+GLRAL
Subjt:  NQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRAL

Query:  AHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
        AH+LSSN  SS SSSST+APVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH LSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL
Subjt:  AHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPL

Query:  IRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLREME
        IRLTVIAP++EHD+N ETPFSIGPPGDNI SRLLSFAKSLNINLQINRLDS SL SLN+QVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLR++E
Subjt:  IRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLREME

Query:  PKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARAS
        PKAVILSENNMACSC NCGNFDTGFTR+VEYLWRFLDSTSSAFKGRES+ER++MEGEAAKAL N+GEMNEE EKW ERMRNAGF RK F EDTIDTARAS
Subjt:  PKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARAS

Query:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        MRRYDNNWEMR+EEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  MRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

A0A6J1KER4 nodulation-signaling pathway 1 protein5.2e-28890.09Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFF-SPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTV
        MTIEEPG NHPSDHILDWL DSVPFF SPF D++YNSSSINCYQWWDENQDIGQDLINGCLS SP   TTV+T+ PN  TSHHLTPSDLTKKRKAPDDTV
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFF-SPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTV

Query:  HKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDA
        HK SQ  QNQRKNQNNQSKNGADKG+G VE VTVIKK+VGNK+NSSK+TGNN +NGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDA
Subjt:  HKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDA

Query:  NHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLE
        NHRLAA+GLRALAH+LSSN SS  SSSSTVAPVTFASTD RFFQRSLIKFHEVSPWFAFPNNIANSSILH LSEEPNRSRNLHILDIGVSHGVQWPTLLE
Subjt:  NHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLE

Query:  ALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDE
        ALTRRSGGPPPLIRLTVIAP++EHD+N ETPFSIGPPGDNI SRLLSFAKSLNINLQINRLD+ SLQS NSQVIGK PDEILIVCAQFRLHQLKHYAPDE
Subjt:  ALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDE

Query:  RFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKF
        RFEFLQNLR++EPKAVILSENNMACSCNNCGNFDTGFTR+VEYLWRFLDSTSSAFKGRES+ER+VMEGEAA+ LTNQGEMNEE EKW ERMRNAGF RK 
Subjt:  RFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKF

Query:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        F EDTIDTARASMRRYDNNWEMR+EEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
Subjt:  FAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

SwissProt top hitse value%identityAlignment
A1DQP9 Protein NODULATION SIGNALING PATHWAY 12.0e-18061.74Show/hide
Query:  MTIEEPGPNH-PSDHILDWLEDSVPFFSPFLDETYNSSS-INCYQWWDE-----NQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRK
        MT+E   PN   SDHILDWLE SV FF  FLDE  N+S  I  Y  WD+     N     +  N   +    T  T TT L    ++++   SDL KKR 
Subjt:  MTIEEPGPNH-PSDHILDWLEDSVPFFSPFLDETYNSSS-INCYQWWDE-----NQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRK

Query:  APDDTVHKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELA
        A D++  K   P QN+ K    +  N  +  NGD  R         NKK  +K+ G+N N+G++KEGRWAEQLLNPCA AI  G+  RV HLL VL ELA
Subjt:  APDDTVHKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELA

Query:  SPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPN-RSRNLHILDIGVSHGV
        SPTGD NHRLAAHGLRAL HHLSS+SSS +SS +    +TFAST+PRFFQ+SL+KF+EVSPWF+FPNNIAN+SIL  L+EE N  SR LHILDIGVSHGV
Subjt:  SPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPN-RSRNLHILDIGVSHGV

Query:  QWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQL
        QWPTLL+AL+RRSGGPP ++RLTV+  + E+D+N+ETPFS  PPG N + RLL +A+S+NINLQINR+++HSLQ+LN+Q I   PDEILIVCAQFRLH L
Subjt:  QWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQL

Query:  KHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRN
         H +PDER EFL+ LR MEP+ VILSENN  C C+ CGNF  GFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQ EMNEEKEKW  RM+ 
Subjt:  KHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRN

Query:  AGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG
        AGF  + F ED +D  RA +R+YD+NWEM++EEK+  VGLWWKGQPVSFCS WKL     GG
Subjt:  AGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMKSNGG

Q4VYC8 Protein NODULATION SIGNALING PATHWAY 11.5e-18361.32Show/hide
Query:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLDETYNSSSINCYQWWDENQDI-----------GQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTP-SDL
        MT+E   PN  SDHILDWLE SV FF  FLD+ YN+  I+ Y+ W++NQDI             +  N   +    + TT TT L   P S +  P SDL
Subjt:  MTIEEPGPNHPSDHILDWLEDSVPFFSPFLDETYNSSSINCYQWWDENQDI-----------GQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTP-SDL

Query:  TKKRKAPDDTVHKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCV
         KKR A D+   KK QP   + K   ++  N +D G+  +E  TV++K+ GNKK ++K+ G+N NNG+NK+GRWAEQLLNPCA AI  G+  RV HLL V
Subjt:  TKKRKAPDDTVHKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCV

Query:  LQELASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGV
        L ELAS TGDANHRLAAHGLRAL HHLSS+SSS+ S +     +TFAST+PRFFQ+SL+KF+E SPWF+FPNNIAN+SIL  L+EEPN  R LHILDIGV
Subjt:  LQELASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGV

Query:  SHGVQWPTLLEALTRRSGGPPPLIRLTVI--APSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQ
        SHGVQWPT LEAL+RR GGPPPL+RLTV+  + S E+D+N+ETPFSIGP GD   S LL +A+SLN+NLQI +LD+H LQ+LN++ +    DE LIVCAQ
Subjt:  SHGVQWPTLLEALTRRSGGPPPLIRLTVI--APSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQ

Query:  FRLHQLKHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKW
        FRLH L H  PDER EFL+ LR MEPK VILSENNM C C++CG+F TGF+RRVEYLWRFLDSTSSAFK R+SDER++MEGEAAKALTNQ EMNE +EKW
Subjt:  FRLHQLKHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKW

Query:  YERMRNAGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKL
         ERM+ AGF  + F ED ID  RA +R+YDNNWEM++EE    V LWWK QPVSFCS WKL
Subjt:  YERMRNAGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKL

Q84MQ9 Protein NODULATION SIGNALING PATHWAY 12.1e-8439.96Show/hide
Query:  WWDENQDIGQDLINGCLSG--SPTTVTTVTTRLPNIPTSHHLTPSDL----TKKRKAPDDTVHKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKA
        WW  +    QD I   ++   SP +        P+I +    +PSD+    +KKRK+P            ++        K G  KG G           
Subjt:  WWDENQDIGQDLINGCLSG--SPTTVTTVTTRLPNIPTSHHLTPSDL----TKKRKAPDDTVHKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKA

Query:  VGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAP------
                         GS+++ RWAEQLLNPCA A+  G+ +RV HL  VL EL S +GDANHRLAAHGLRALA  L +    +++++  V P      
Subjt:  VGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAP------

Query:  VTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH--TLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVIAPSIEHDENIE
          FA+ +PR F+ SLI+FHEVSPWFA PN +AN++I    T        R LH++D+GVSHGVQWPTLLE+LTR+ GG  PP +RLTV+ P       + 
Subjt:  VTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH--TLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGG-PPPLIRLTVIAPSIEHDENIE

Query:  TPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLREMEPKAVILSE--NNMACSC
         PFS  PPG +    LL +AKS+N++L+I+R       +L+  V G    E L+VC QFR   L H A +ER E L+  R + P+ V+LSE  + +    
Subjt:  TPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLREMEPKAVILSE--NNMACSC

Query:  NNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAK--ALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARASMRRYDNNWEMRI-
         + G+    F  R+E LWRFL+STS+AFKG++ +ERR++E EA    A  +     E +E W ERM  AGF    F  + +++AR+ +R+YD+ WEM   
Subjt:  NNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAK--ALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARASMRRYDNNWEMRI-

Query:  EEKDGCVGLWWKGQPVSFCSFWK
              V L WKGQPVSFCS W+
Subjt:  EEKDGCVGLWWKGQPVSFCSFWK

Q9LRW3 Scarecrow-like protein 298.1e-12949.1Show/hide
Query:  EEPGPNHPSDHILDWLEDSVPFFS-PFLDETYNSSSINCYQ-W-WDENQD--------IGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKR
        E   PN   DH+L WLEDSV     P  D++Y     +  Q W WD+ QD          QDL +    G   T   V T  P+I         DL  + 
Subjt:  EEPGPNHPSDHILDWLEDSVPFFS-PFLDETYNSSSINCYQ-W-WDENQD--------IGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKR

Query:  KAPDDTVHKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQEL
        + P+D   K+S                     +G +E    +KK+  +K+ + KS+  +  +G NKEGRWAE+LLNPCA AI   +++RV H LCVL EL
Subjt:  KAPDDTVHKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQEL

Query:  ASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGV
        AS +GDAN RLAA GLRAL HHL    SSSS SSS     TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL  L+++P   ++LHI+DIGVSHG+
Subjt:  ASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGV

Query:  QWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQL
        QWPTLLEAL+ R  GPPP +R+TVI+     D   + PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I   P E LIVCAQFRLH L
Subjt:  QWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQL

Query:  KHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRN
        KH   DER E L+ +R + PK V+L ENN  CS  +  +F  GF++++EY+W+FLDSTSS FK   S+ER++MEGEA K L N G+MNE KEKWYERMR 
Subjt:  KHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRN

Query:  AGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWK
        AGF  + F ED +D A++ +R+YDNNWE+R+E+ D   GL WKG+ VSFCS WK
Subjt:  AGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWK

Q9SN22 Scarecrow-like protein 324.7e-3630.32Show/hide
Query:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAF
        +  + EQLL  CA AI   DA   H +L VL  +A P GD+  RL +  LRAL   LS   S + + SST++ +  A    RF    L  F +++PW  F
Subjt:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAF

Query:  PNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQIN
            AN++IL  +         +HI+D+ ++H +Q PTL++A+  R   PPPL++LTV++ S      I   +      + + S+L++FA + NI ++  
Subjt:  PNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQIN

Query:  RLDSHSLQSLNS--QVIGKFP---DEILIVCAQFRLHQLKHYAPDE---------RFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLW
         + S      +S  Q +  +P   +E L+V      H +  Y P+E         R  FL+ LR + P+ V L E ++  +  N  N          Y W
Subjt:  RLDSHSLQSLNS--QVIGKFP---DEILIVCAQFRLHQLKHYAPDE---------RFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLW

Query:  RFLDSTSSAFKGRESDERRVMEGE--------AAKALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKG
           D+T +      S++RR  E E         AK    + E  E K +W ERMR A F      ED +   +A +  +   W M+ E+ D  + L WKG
Subjt:  RFLDSTSSAFKGRESDERRVMEGE--------AAKALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKG

Query:  QPVSFCSFW
          V F + W
Subjt:  QPVSFCSFW

Arabidopsis top hitse value%identityAlignment
AT3G03450.1 RGA-like 27.3e-2425.3Show/hide
Query:  KNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRF
        ++S +ST +     S + G      L  CA AI + +      L+  +  LA     A  ++A +  +ALA  +  + ++ +           A+ +P F
Subjt:  KNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRF

Query:  FQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIF
         +   + F+E  P+  F +  AN +IL    E    +R +H++D+G++ G+QWP L++AL  R GGPP   RLT I P     EN ++   +G       
Subjt:  FQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIF

Query:  SRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPD-EILIVCAQFRLHQLKHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRV
         +L  FA+++ +  +   L + SL  L  ++    P+ E L+V + F LH+L   +     + L  ++ ++P  V + E     + +N   F   F   +
Subjt:  SRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPD-EILIVCAQFRLHQLKHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRV

Query:  EYLWRFLDSTSSAFKGRESDE--------RRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGL
         Y     DS   ++     D         R+++   AA+  +++ E +E   +W  RM++AGF            A   +  Y      R+EE DGC+ +
Subjt:  EYLWRFLDSTSSAFKGRESDE--------RRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGL

Query:  WWKGQPVSFCSFWKL
         W+ +P+   S WKL
Subjt:  WWKGQPVSFCSFWKL

AT3G13840.1 GRAS family transcription factor5.8e-13049.1Show/hide
Query:  EEPGPNHPSDHILDWLEDSVPFFS-PFLDETYNSSSINCYQ-W-WDENQD--------IGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKR
        E   PN   DH+L WLEDSV     P  D++Y     +  Q W WD+ QD          QDL +    G   T   V T  P+I         DL  + 
Subjt:  EEPGPNHPSDHILDWLEDSVPFFS-PFLDETYNSSSINCYQ-W-WDENQD--------IGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKR

Query:  KAPDDTVHKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQEL
        + P+D   K+S                     +G +E    +KK+  +K+ + KS+  +  +G NKEGRWAE+LLNPCA AI   +++RV H LCVL EL
Subjt:  KAPDDTVHKKSQPHQNQRKNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQEL

Query:  ASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGV
        AS +GDAN RLAA GLRAL HHL    SSSS SSS     TFAS + + FQ++L+KF+EVSPWFA PNN+ANS+IL  L+++P   ++LHI+DIGVSHG+
Subjt:  ASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGV

Query:  QWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQL
        QWPTLLEAL+ R  GPPP +R+TVI+     D   + PFS+GPPG N  S+LL FA+SL INLQI+ LD         Q+I   P E LIVCAQFRLH L
Subjt:  QWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQL

Query:  KHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRN
        KH   DER E L+ +R + PK V+L ENN  CS  +  +F  GF++++EY+W+FLDSTSS FK   S+ER++MEGEA K L N G+MNE KEKWYERMR 
Subjt:  KHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRN

Query:  AGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWK
        AGF  + F ED +D A++ +R+YDNNWE+R+E+ D   GL WKG+ VSFCS WK
Subjt:  AGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWK

AT3G49950.1 GRAS family transcription factor3.4e-3730.32Show/hide
Query:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAF
        +  + EQLL  CA AI   DA   H +L VL  +A P GD+  RL +  LRAL   LS   S + + SST++ +  A    RF    L  F +++PW  F
Subjt:  EGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAF

Query:  PNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQIN
            AN++IL  +         +HI+D+ ++H +Q PTL++A+  R   PPPL++LTV++ S      I   +      + + S+L++FA + NI ++  
Subjt:  PNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQIN

Query:  RLDSHSLQSLNS--QVIGKFP---DEILIVCAQFRLHQLKHYAPDE---------RFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLW
         + S      +S  Q +  +P   +E L+V      H +  Y P+E         R  FL+ LR + P+ V L E ++  +  N  N          Y W
Subjt:  RLDSHSLQSLNS--QVIGKFP---DEILIVCAQFRLHQLKHYAPDE---------RFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLW

Query:  RFLDSTSSAFKGRESDERRVMEGE--------AAKALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKG
           D+T +      S++RR  E E         AK    + E  E K +W ERMR A F      ED +   +A +  +   W M+ E+ D  + L WKG
Subjt:  RFLDSTSSAFKGRESDERRVMEGE--------AAKALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKG

Query:  QPVSFCSFW
          V F + W
Subjt:  QPVSFCSFW

AT4G37650.1 GRAS family transcription factor7.7e-3428.4Show/hide
Query:  RWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRALAHHLSSNSS----SSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWF
        +WA+ +L   A A    D  R   +L  L EL+SP GD   +LA++ L+AL + ++ +      +  ++++T    +F ST     +++++KF EVSPW 
Subjt:  RWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRALAHHLSSNSS----SSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWF

Query:  AFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNI--FSRLLSFAKSLNIN
         F +  AN +IL  +  E      +HI+DI  +   QWPTLLEAL  RS   P L   TV+  +   ++   +   +   G+ +  F+RL+      NI 
Subjt:  AFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETPFSIGPPGDNI--FSRLLSFAKSLNIN

Query:  LQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRF----LDST
          +  L    L  L+ +     PDE+L +     +H +       R   + + R + P+ V + E          G FD  F R      R+     +S 
Subjt:  LQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRF----LDST

Query:  SSAFKGRESDERRVMEGEAAKAL--------TNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARASMRRY-DNNWEMRIEEKDGCVGLWWKGQPVSF
          +F  R S+ER ++E  A +A+        ++  E  E   KW  RMRN+GF    ++++  D  RA +RRY +  W M        + L W+ QPV +
Subjt:  SSAFKGRESDERRVMEGEAAKAL--------TNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARASMRRY-DNNWEMRIEEKDGCVGLWWKGQPVSF

Query:  CSFWK
         S W+
Subjt:  CSFWK

AT5G66770.1 GRAS family transcription factor2.7e-2628.57Show/hide
Query:  CANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH
        CA  I   D       L  ++E  S  GD   R+A +   AL++ LS NS ++SSSSS+   +            S    ++  P+  F +  AN +IL 
Subjt:  CANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRALAHHLSSNSSSSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILH

Query:  TLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVI-APSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSL
           E   +S  +HI+D G+  G+QWP LL+AL  R+ G P  IR++ I APS+      E+P    P      +RL  FAK L++N     + +  +  L
Subjt:  TLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVI-APSIEHDENIETPFSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSL

Query:  NSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFK---GRESDER---
        N       PDE+L V    +L++L    P      L+  + + P+ V L E  ++ +         GF  RV+   +F  +   + +   GR+S+ER   
Subjt:  NSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRVEYLWRFLDSTSSAFK---GRESDER---

Query:  -RVMEGEAAKALTN------QGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARASMRRYD-NNWEMRIEEKDGCVGLWWKGQPVSFCSFWK
         R + G     L          E  EEKE+W   M NAGF     +   +  A+  +  Y+ +N    +E K G + L W   P+   S W+
Subjt:  -RVMEGEAAKALTN------QGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARASMRRYD-NNWEMRIEEKDGCVGLWWKGQPVSFCSFWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCATCGAAGAACCAGGGCCAAACCACCCTTCAGATCATATATTGGACTGGTTGGAGGATTCAGTTCCTTTCTTTTCCCCATTCCTGGATGAGACTTACAACTCTAG
CTCCATAAACTGCTATCAATGGTGGGATGAGAACCAAGACATAGGCCAAGATCTGATTAATGGCTGTCTAAGCGGTTCCCCCACCACTGTCACTACTGTCACTACTAGAC
TACCAAACATTCCCACTTCCCATCACTTGACACCATCTGATTTGACCAAGAAAAGAAAAGCCCCAGACGATACAGTTCATAAAAAATCACAACCCCATCAGAACCAGAGG
AAGAACCAGAATAATCAGAGCAAAAATGGTGCAGATAAAGGCAATGGAGATGTTGAAAGAGTGACTGTGATAAAGAAGGCAGTGGGGAACAAGAAAAATTCATCAAAGTC
CACAGGGAATAACTACAATAACGGAAGTAACAAGGAAGGAAGGTGGGCGGAGCAATTGCTAAATCCCTGTGCTAATGCTATCATTAAAGGGGATGCAACAAGAGTACATC
ACCTTCTTTGTGTTCTTCAAGAGCTCGCCTCGCCCACCGGCGACGCCAACCACCGGCTCGCCGCCCATGGTCTCCGAGCTTTGGCTCATCACCTGTCTTCCAATTCTTCT
TCTTCTTCTTCTTCTTCTTCCACAGTTGCGCCGGTTACTTTCGCTTCCACGGACCCTCGATTCTTCCAGAGGTCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGC
TTTTCCGAACAACATCGCAAATTCTTCAATTCTCCACACTCTCTCTGAAGAACCTAATCGCTCGCGCAATCTTCACATTCTTGACATCGGGGTTTCTCATGGTGTGCAAT
GGCCGACGCTGCTCGAGGCCTTGACTCGCCGTTCCGGTGGACCTCCGCCGCTAATTCGCCTCACAGTCATTGCTCCGTCAATTGAACATGACGAAAATATAGAGACGCCG
TTTTCAATCGGCCCACCGGGAGACAACATCTTCTCTCGGCTTCTTAGTTTCGCCAAATCCTTGAACATCAATTTACAGATCAACCGCCTCGACAGTCACTCACTACAGAG
TTTAAATTCGCAAGTAATCGGCAAGTTTCCAGACGAAATCTTAATCGTTTGCGCACAGTTCAGACTCCACCAGTTGAAACACTACGCTCCAGACGAAAGATTCGAGTTCT
TACAAAACCTAAGAGAAATGGAACCAAAGGCAGTGATTCTGAGCGAAAACAACATGGCATGTAGCTGTAACAACTGCGGAAATTTCGATACCGGATTCACACGACGAGTT
GAATACCTATGGAGATTCCTCGATTCAACAAGCTCCGCATTCAAAGGCCGAGAAAGCGACGAAAGAAGAGTGATGGAAGGCGAAGCCGCAAAGGCGCTGACGAATCAAGG
CGAAATGAACGAGGAAAAGGAAAAATGGTACGAAAGAATGAGAAATGCGGGTTTTGTGAGAAAATTCTTCGCTGAAGACACCATTGATACGGCTCGAGCTTCAATGAGAA
GGTATGATAATAACTGGGAGATGAGAATTGAAGAGAAAGATGGATGCGTAGGGCTATGGTGGAAAGGGCAACCAGTTTCGTTTTGTTCGTTTTGGAAGTTGGGGATGAAA
TCCAATGGCGGTTGA
mRNA sequenceShow/hide mRNA sequence
CCATCTTCTCCCTCCCCCATCAAAAGGGTCTCTCCTCTTTCTCTTTCTCTCTGTCTGAAATATTTTTTCACCAAGCAAAATGACCATCGAAGAACCAGGGCCAAACCACC
CTTCAGATCATATATTGGACTGGTTGGAGGATTCAGTTCCTTTCTTTTCCCCATTCCTGGATGAGACTTACAACTCTAGCTCCATAAACTGCTATCAATGGTGGGATGAG
AACCAAGACATAGGCCAAGATCTGATTAATGGCTGTCTAAGCGGTTCCCCCACCACTGTCACTACTGTCACTACTAGACTACCAAACATTCCCACTTCCCATCACTTGAC
ACCATCTGATTTGACCAAGAAAAGAAAAGCCCCAGACGATACAGTTCATAAAAAATCACAACCCCATCAGAACCAGAGGAAGAACCAGAATAATCAGAGCAAAAATGGTG
CAGATAAAGGCAATGGAGATGTTGAAAGAGTGACTGTGATAAAGAAGGCAGTGGGGAACAAGAAAAATTCATCAAAGTCCACAGGGAATAACTACAATAACGGAAGTAAC
AAGGAAGGAAGGTGGGCGGAGCAATTGCTAAATCCCTGTGCTAATGCTATCATTAAAGGGGATGCAACAAGAGTACATCACCTTCTTTGTGTTCTTCAAGAGCTCGCCTC
GCCCACCGGCGACGCCAACCACCGGCTCGCCGCCCATGGTCTCCGAGCTTTGGCTCATCACCTGTCTTCCAATTCTTCTTCTTCTTCTTCTTCTTCTTCCACAGTTGCGC
CGGTTACTTTCGCTTCCACGGACCCTCGATTCTTCCAGAGGTCGTTGATCAAATTCCACGAGGTGAGTCCATGGTTTGCTTTTCCGAACAACATCGCAAATTCTTCAATT
CTCCACACTCTCTCTGAAGAACCTAATCGCTCGCGCAATCTTCACATTCTTGACATCGGGGTTTCTCATGGTGTGCAATGGCCGACGCTGCTCGAGGCCTTGACTCGCCG
TTCCGGTGGACCTCCGCCGCTAATTCGCCTCACAGTCATTGCTCCGTCAATTGAACATGACGAAAATATAGAGACGCCGTTTTCAATCGGCCCACCGGGAGACAACATCT
TCTCTCGGCTTCTTAGTTTCGCCAAATCCTTGAACATCAATTTACAGATCAACCGCCTCGACAGTCACTCACTACAGAGTTTAAATTCGCAAGTAATCGGCAAGTTTCCA
GACGAAATCTTAATCGTTTGCGCACAGTTCAGACTCCACCAGTTGAAACACTACGCTCCAGACGAAAGATTCGAGTTCTTACAAAACCTAAGAGAAATGGAACCAAAGGC
AGTGATTCTGAGCGAAAACAACATGGCATGTAGCTGTAACAACTGCGGAAATTTCGATACCGGATTCACACGACGAGTTGAATACCTATGGAGATTCCTCGATTCAACAA
GCTCCGCATTCAAAGGCCGAGAAAGCGACGAAAGAAGAGTGATGGAAGGCGAAGCCGCAAAGGCGCTGACGAATCAAGGCGAAATGAACGAGGAAAAGGAAAAATGGTAC
GAAAGAATGAGAAATGCGGGTTTTGTGAGAAAATTCTTCGCTGAAGACACCATTGATACGGCTCGAGCTTCAATGAGAAGGTATGATAATAACTGGGAGATGAGAATTGA
AGAGAAAGATGGATGCGTAGGGCTATGGTGGAAAGGGCAACCAGTTTCGTTTTGTTCGTTTTGGAAGTTGGGGATGAAATCCAATGGCGGTTGAAACTTTTTTTCTTTTC
TTTTTTTTGCTCAGTGATTTTCAATGCAGAGTTCTGAAAAGCTATAGAATCAAAATGCTGGTTTTTGTTTGATTTCTGCCAACTGCTTTGAAAGTTCAAAACCTGTTTGG
CAAGAAATAAAGACTCTGTGGCTTTGTGTTGGAATTTGAGGGTTTGACTTTGAGTCCTTCTGCAACTTCCATTTCAACTCTTTGTTTTGAGTTGAAGAGTTTTGTTTATA
AGGGTTGATTCAAG
Protein sequenceShow/hide protein sequence
MTIEEPGPNHPSDHILDWLEDSVPFFSPFLDETYNSSSINCYQWWDENQDIGQDLINGCLSGSPTTVTTVTTRLPNIPTSHHLTPSDLTKKRKAPDDTVHKKSQPHQNQR
KNQNNQSKNGADKGNGDVERVTVIKKAVGNKKNSSKSTGNNYNNGSNKEGRWAEQLLNPCANAIIKGDATRVHHLLCVLQELASPTGDANHRLAAHGLRALAHHLSSNSS
SSSSSSSTVAPVTFASTDPRFFQRSLIKFHEVSPWFAFPNNIANSSILHTLSEEPNRSRNLHILDIGVSHGVQWPTLLEALTRRSGGPPPLIRLTVIAPSIEHDENIETP
FSIGPPGDNIFSRLLSFAKSLNINLQINRLDSHSLQSLNSQVIGKFPDEILIVCAQFRLHQLKHYAPDERFEFLQNLREMEPKAVILSENNMACSCNNCGNFDTGFTRRV
EYLWRFLDSTSSAFKGRESDERRVMEGEAAKALTNQGEMNEEKEKWYERMRNAGFVRKFFAEDTIDTARASMRRYDNNWEMRIEEKDGCVGLWWKGQPVSFCSFWKLGMK
SNGG