; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021986 (gene) of Snake gourd v1 genome

Gene IDTan0021986
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF630 and DUF632)
Genome locationLG01:13611775..13615538
RNA-Seq ExpressionTan0021986
SyntenyTan0021986
Gene Ontology termsNA
InterPro domainsIPR006867 - Domain of unknown function DUF632
IPR006868 - Domain of unknown function DUF630


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600640.1 Protein ALTERED PHOSPHATE STARVATION RESPONSE 1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0091.77Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE
        MGGSNSKIENNEALRLCKERKRFIKQAIDS+YALAAAHVCYV++LKNVGVSLRRYAEAEVLIESS+S+SATELDKTPSHSSYPSPCPSQTAD SESPL+E
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE

Query:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA
        SPISPPIATISYMVAG++ PLTVKVRPSSHSYAYEESVA   SPLP PPPPPHESGSSWDYFDT+DE ESFRF+GT GGMDVNFEDERMWKQFKGEMADA
Subjt:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA

Query:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
        KDEFQE  AK E ELNG E+GGH+SPSG VE++NLEMAR+EDKELNS S+SSKVLLEQSGSRG+I+LEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
Subjt:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF

Query:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM
         RASESGREISRMLEANKIRVGYLEANGN+SGLAFLDPLRRLRLVCCPAKPALFS+EPHKTKVITWKRSTSSRSSSSRNPLA KDD DDS SEFVEEFCM
Subjt:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM

Query:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
        ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
Subjt:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL

Query:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI
        IRMWKAMLECHHAQYITISLAYHSKSA MGTPRAD QRQITVQLQQEIE FGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRR IAPPIFI
Subjt:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI

Query:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK
        LCRDWF GIK LPSNELSDAIR ILGEL CSISQQAELQRKQK LVE   GEEPEGK +ENND VSSNLS IH SLTKVLD LTKFSEASLKMYED+RQK
Subjt:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK

Query:  SEAARTAYLNYRPRSEL
        S+AARTAYLNYRPRSEL
Subjt:  SEAARTAYLNYRPRSEL

KAG7031277.1 hypothetical protein SDJN02_05317, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0092.05Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE
        MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYV++LKNVGVSLRRYAEAEVLIESS+S+SATELDKTPSHSSYPSPCPSQTAD SESPL+E
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE

Query:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA
        SPISPPIATISYMVAG++ PLTVKVRPSSHSYAYEES+A   SPLP PPPPPHESGSSWDYFDT+DEIESFRF+GT GGMDVNFEDERMWKQFKGEMADA
Subjt:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA

Query:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
        KDEFQE  AK E ELNG E+GGH+SPSG VE++NLEMAR+EDKELNS S+SSKVLLEQSGSRG+I+LEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
Subjt:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF

Query:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM
         RASESGREISRMLEANKIRVGYLEANGN+SGLAFLDPLRRLRLVCCPAKPALFS+EPHKTKVITWKRSTSSRSSSSRNPLA KDD DDS SEFVEEFCM
Subjt:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM

Query:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
        ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
Subjt:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL

Query:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI
        IRMWKAMLECHHAQYITISLAYHSKSA MGTPRAD QRQITVQLQQEIE FGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRR IAPPIFI
Subjt:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI

Query:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK
        LCRDWF GIKDLPSNELSDAIR ILGEL CSISQQAELQRKQK LVE   GEEPEGK +ENND VSSNLS IH SLTKVLD LTKFSEASLKMYED+RQK
Subjt:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK

Query:  SEAARTAYLNYRPRSEL
        S+AARTAYLNYRPRSEL
Subjt:  SEAARTAYLNYRPRSEL

XP_022943262.1 nitrate regulatory gene2 protein-like [Cucurbita moschata]0.0e+0091.91Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE
        MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYV++LKNVGVSLRRYAEAEVLIESS+S+SATELDKTPSHSSYPSPCPSQTAD SESPL+E
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE

Query:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA
        SPISPPIATISYMVAG++ PLTVKV+PSSHSYAYEESVA   SPLP PPPPPHESGSSWDYFDT+DEIESFRF+GT GGMDVNFEDERMWKQFKGEMADA
Subjt:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA

Query:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
        KDEFQE  AK E ELNG E+GGH+SPSG VE++NLEMAR+EDKELNS S+SSKVLLEQSGSRG+I+LEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
Subjt:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF

Query:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM
         RASESGREISRMLEANKIRVGYLEANGN+SGLAFLDPLRRLRLVCCPAKPALFS+EPHKTKVITWKRSTSSRSSSSRNPLA KDD DDS SEFVEEFCM
Subjt:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM

Query:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
        ISGSHSSTLDRLYAWERKLYDEVKASESI+KEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
Subjt:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL

Query:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI
        IRMWKAMLECHHAQYITISLAYHSKSA MGTPRAD QRQITVQLQQEIE FGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRR IAPPIFI
Subjt:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI

Query:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK
        LCRDWF GIKDLPSNELSDAIR ILGEL CSISQQAELQRKQK LVE   GEEPEGK +ENND VSSNLS IH SLTKVLD LTKFSEASLKMYED+RQK
Subjt:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK

Query:  SEAARTAYLNYRPRSEL
        S+AARTAYLNYRPRSEL
Subjt:  SEAARTAYLNYRPRSEL

XP_022979261.1 nitrate regulatory gene2 protein-like [Cucurbita maxima]0.0e+0091.77Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE
        MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYV++LKNVGVSLRRYAEAEVLIESS+S+SATELDKTPSHSSYPSPCPSQTAD SESPL+E
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE

Query:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA
        SPISPPIATISYMVAG++ PLTVKVRPSSHSYAYEESVA   SPLP PPPPPHESGSSWDYFDT+DEIESFRF+GT GGMDVNFEDERMWKQFKGEMADA
Subjt:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA

Query:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
        KDEFQE  AK E +LNG E+GGH+SPSG VE++NLEMAR+E KELNS S+SSKVLLEQ+GSRG+I+LEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
Subjt:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF

Query:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM
         RASESGREISRMLEANKIRVGYLEANGN+SGLAFLDPLRRLRLVCCPAKPALFS+EPHKTKVITWKRSTSSRSSSSRNPLA KDD DDS SEFVEEFCM
Subjt:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM

Query:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
        ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
Subjt:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL

Query:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI
        IRMWKAMLECHHAQYITISLAYHSKSA MGTPRAD QRQITVQLQQEIE FGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRR IAPPIFI
Subjt:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI

Query:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK
        LCRDWF GIKDLPSNELSDAIR ILGEL CSISQQAELQRKQK LVE   GEEPEGK +ENND VSSNLS IH SLTKVLD LTKFSEASLKMYED+RQK
Subjt:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK

Query:  SEAARTAYLNYRPRSEL
        S+AARTAYLNYRPRSEL
Subjt:  SEAARTAYLNYRPRSEL

XP_023523136.1 nitrate regulatory gene2 protein-like [Cucurbita pepo subsp. pepo]0.0e+0091.49Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE
        MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYV++LKNVGVSLRRYAEAEVLIESS+S+SATELDKTPSHSSYPSPCPSQTAD SESPL+E
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE

Query:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA
        SPISPPIATISYMVAG++ PLTVKVRPSSHSYAYEESVA   SPLP PPPPPHESGSSWDYFDT+DEIESFRF+GT GGMDVNFEDERMWKQFKGEMADA
Subjt:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA

Query:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
        KDEFQE   K E ELNG E+GGH+SP G VE++NLEMAR+EDKELNS S+SSKVLLEQ GSRG+I+LEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
Subjt:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF

Query:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM
         RASESGREISRMLEANKIRVGYLEANGN+SGLAFLDPLRRLRLVCCPAKPALFS+E HKTKVITWKRSTSSRSSSSRNPLA KDD DDS SEFVEEFCM
Subjt:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM

Query:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
        ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
Subjt:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL

Query:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI
        IRMWKAMLECHHAQYITISLAYHSKSA MGTPRAD QRQITVQLQQEIE FGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPR  IAPPIFI
Subjt:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI

Query:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK
        LCRDWF GIKDLPSNELSDAIR ILGEL CSISQQAELQRKQK LVE   GEEPEGK +ENNDT+SSNLS IH SLTKVLD LTKFSEASLKMYED+RQK
Subjt:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK

Query:  SEAARTAYLNYRPRSEL
        S+AARTAYLNYRPRSEL
Subjt:  SEAARTAYLNYRPRSEL

TrEMBL top hitse value%identityAlignment
A0A0A0KUA2 Uncharacterized protein0.0e+0087.39Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE
        MGG+NSKIENNEALRLCKERKR+IKQAIDSRYALAAAHVCYVQAL+NVGVSLRRYAEAEVLIESSLSTSATE+DKTPSHSSYPSPCPS TADASESPL+E
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE

Query:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA--SPLPPPPPPP-HESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQFKGEMAD-A
        SPISPPIATISYMVAG   PLTVKVRPS+HS+ YEESVA  SPLPPPPPPP HESG SWDYFDTNDEIESFRFLGTGGMDV+FEDERMWKQFKGEM D  
Subjt:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA--SPLPPPPPPP-HESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQFKGEMAD-A

Query:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
        KD+  EG +K EA     ++G +LS    VE+RNLEM RREDKE+NS S+S+KV+LEQS SRG ++LEK LCTEQEDPSEFITHRAKDFLSSIKEI++RF
Subjt:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF

Query:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSN-EPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFC
         RASESGREISRMLEANKIRVGYLE NG+ISG AFLDPLRRLRLVCCPAKPAL+S+ E HKTKVITWKRSTS+RSSSSRNPLAAKDD DDSGSEFVEEFC
Subjt:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSN-EPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFC

Query:  MISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQG
        MISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQG
Subjt:  MISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQG

Query:  LIRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCIS-RERSKSRRPFSPRRVIAPPI
        LIRMWKAMLECHH+QYITISLAYHSKS AMGTPRADAQRQI++QLQQEIECFGLSFANWINSLASYVGALNGWLQHCI  ++RSKSRRPFSPRRVIAPPI
Subjt:  LIRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCIS-RERSKSRRPFSPRRVIAPPI

Query:  FILCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQ
        F+LCRDW  GI DLPSNELS+AIRA LGEL+CSISQQAELQRKQKLVEANTGEE EGK+DE N T SSN+S IH SLTKVLDRLTKFSEASLKMYEDVRQ
Subjt:  FILCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQ

Query:  KSEAARTAYLNYRP
        KSEAA++ YLNY+P
Subjt:  KSEAARTAYLNYRP

A0A5D3D206 DUF632 domain-containing protein/DUF630 domain-containing protein0.0e+0087.66Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE
        MGG+NSKI+NNEALRLCKERKR+IKQAIDSRYALAAAHVCYVQAL+NVGVSLRRYAEAEVLIESSLSTSATE+DKTPSHSSYPSPCPS TADASESPL+E
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE

Query:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA--SPLPPPPPPP-HESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQFKGEMAD-A
        SPISPPIATISYMVAG   PLTVKVRPS+HS  YEESVA  SPLPPPPPPP HESG SWDYFDTNDEIESFRF GTGGMDVNFEDERMWKQFKGE+ D  
Subjt:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA--SPLPPPPPPP-HESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQFKGEMAD-A

Query:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
        KD+  EG +K EA     ++  +LS    VE+RNLEM RREDKE+NS S+S+KV+LEQSGSRG ++LEK LCTEQEDPSEFITHRAKDFLSSIKEI++RF
Subjt:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF

Query:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM
         RASESGREISRMLEANKIRVGYLE NG+ISG AFLDPLRRLRLVCCPAKPALFS+EP KTKVITWKRSTS+RSSSSRNPLAAKDD DDSGSEFVEEFCM
Subjt:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM

Query:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
        ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
Subjt:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL

Query:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCIS-RERSKSRRPFSPRRVIAPPIF
        IRMWKAMLECHHAQYITISLAYHSKS AMGTPRADAQRQI+VQL+QEIECFGLSFANWINSLASYVGALNGWLQHCI  +ERSKSRRPFSPRRVIAPPIF
Subjt:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCIS-RERSKSRRPFSPRRVIAPPIF

Query:  ILCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK
        +LCRDW  GI DLPSNELS+AIRA L EL+CSISQQAELQRKQKLVEANTGEE EGK +ENN T SSNLS IH SLTKVLDRLTKFSEASLKMYEDVRQK
Subjt:  ILCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK

Query:  SEAARTAYLNYRP
        SEAA++ YLNY+P
Subjt:  SEAARTAYLNYRP

A0A6J1EU29 nitrate regulatory gene2 protein-like0.0e+0086.38Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE
        MGGSNSKIEN+EALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSH SYPSPCPSQTADASESPLRE
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE

Query:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVASPLPPPPPP-PHESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQFKGEMADAKDE
        SPI+PPI TISYMVAG S PLTVKV+PSSHSY YEESVASPLPPPPPP PH+ GSSWDYFDTNDEI+SF FL TGGMDVN E+ERMWKQFKG M DA DE
Subjt:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVASPLPPPPPP-PHESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQFKGEMADAKDE

Query:  FQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRFLRA
         QEG +K E     CE+G HLS S  VE+R  EMARR+DKELNSTS+S +VLLEQSGSRG +K+EKSLCTEQEDPSEFITHRAKDFLSSIKEI++RF RA
Subjt:  FQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRFLRA

Query:  SESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCMISG
        S+SGRE+SRMLE NKIRV YLE NG++SG AF DPLRRLRLVCCPAKP L S+EPHK KVITWKRST SRSSSS++PLAAKDDVDDSGSEFVEEFCMISG
Subjt:  SESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCMISG

Query:  SHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGLIRM
        SHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKR+EKMRDEEMQPQLIEL+QGLIRM
Subjt:  SHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGLIRM

Query:  WKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCIS--RERSKSRRPFSPRRVIAPPIFIL
        WK+MLECHHAQYITISLAYHSKSAAMGTPRADAQRQI VQLQQEIECFGLSFANWINSLASYVGALNGWLQHCI   +ERSKSRRPFSPRRV+APPIF+L
Subjt:  WKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCIS--RERSKSRRPFSPRRVIAPPIFIL

Query:  CRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLVEANTGEEPEGK-SDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQKS
        CRDW TGI +LPSNELS+AIR  L EL+CSIS++AELQRKQKLV+ NTGEEPEG  +++NN T SSNLS IH SLTK+LDRLTKFSEASLKMYEDVRQKS
Subjt:  CRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLVEANTGEEPEGK-SDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQKS

Query:  EAARTAYLNYRP
        EAA T YLNY+P
Subjt:  EAARTAYLNYRP

A0A6J1FSJ9 nitrate regulatory gene2 protein-like0.0e+0091.91Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE
        MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYV++LKNVGVSLRRYAEAEVLIESS+S+SATELDKTPSHSSYPSPCPSQTAD SESPL+E
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE

Query:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA
        SPISPPIATISYMVAG++ PLTVKV+PSSHSYAYEESVA   SPLP PPPPPHESGSSWDYFDT+DEIESFRF+GT GGMDVNFEDERMWKQFKGEMADA
Subjt:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA

Query:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
        KDEFQE  AK E ELNG E+GGH+SPSG VE++NLEMAR+EDKELNS S+SSKVLLEQSGSRG+I+LEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
Subjt:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF

Query:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM
         RASESGREISRMLEANKIRVGYLEANGN+SGLAFLDPLRRLRLVCCPAKPALFS+EPHKTKVITWKRSTSSRSSSSRNPLA KDD DDS SEFVEEFCM
Subjt:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM

Query:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
        ISGSHSSTLDRLYAWERKLYDEVKASESI+KEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
Subjt:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL

Query:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI
        IRMWKAMLECHHAQYITISLAYHSKSA MGTPRAD QRQITVQLQQEIE FGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRR IAPPIFI
Subjt:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI

Query:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK
        LCRDWF GIKDLPSNELSDAIR ILGEL CSISQQAELQRKQK LVE   GEEPEGK +ENND VSSNLS IH SLTKVLD LTKFSEASLKMYED+RQK
Subjt:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK

Query:  SEAARTAYLNYRPRSEL
        S+AARTAYLNYRPRSEL
Subjt:  SEAARTAYLNYRPRSEL

A0A6J1IQA5 nitrate regulatory gene2 protein-like0.0e+0091.77Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE
        MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYV++LKNVGVSLRRYAEAEVLIESS+S+SATELDKTPSHSSYPSPCPSQTAD SESPL+E
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE

Query:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA
        SPISPPIATISYMVAG++ PLTVKVRPSSHSYAYEESVA   SPLP PPPPPHESGSSWDYFDT+DEIESFRF+GT GGMDVNFEDERMWKQFKGEMADA
Subjt:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVA---SPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGT-GGMDVNFEDERMWKQFKGEMADA

Query:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
        KDEFQE  AK E +LNG E+GGH+SPSG VE++NLEMAR+E KELNS S+SSKVLLEQ+GSRG+I+LEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF
Subjt:  KDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRF

Query:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM
         RASESGREISRMLEANKIRVGYLEANGN+SGLAFLDPLRRLRLVCCPAKPALFS+EPHKTKVITWKRSTSSRSSSSRNPLA KDD DDS SEFVEEFCM
Subjt:  LRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCM

Query:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
        ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL
Subjt:  ISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGL

Query:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI
        IRMWKAMLECHHAQYITISLAYHSKSA MGTPRAD QRQITVQLQQEIE FGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRR IAPPIFI
Subjt:  IRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFI

Query:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK
        LCRDWF GIKDLPSNELSDAIR ILGEL CSISQQAELQRKQK LVE   GEEPEGK +ENND VSSNLS IH SLTKVLD LTKFSEASLKMYED+RQK
Subjt:  LCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQK-LVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQK

Query:  SEAARTAYLNYRPRSEL
        S+AARTAYLNYRPRSEL
Subjt:  SEAARTAYLNYRPRSEL

SwissProt top hitse value%identityAlignment
A0A178VBJ0 Protein ALTERED PHOSPHATE STARVATION RESPONSE 11.2e-4725.78Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE
        MG   S+I++ E +  CK RKR++K  + +R  L+ +H  Y+++L+ VG SL  ++  E                TP H  +  P PS        P   
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE

Query:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVASPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQFKGEMADAKDEF
         P+SP   T ++    TS                  SV  P PPPPPPP    S+WD++D         F+           E  W++            
Subjt:  SPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVASPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQFKGEMADAKDEF

Query:  QEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRFLRAS
        +   A   A   G +     +P+    Q +  ++      + +T+  S++ +                         ++   KD +  IKE++  FL+A+
Subjt:  QEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRFLRAS

Query:  ESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRS-TSSRSSSSRNPLAAKDDVDDSGSEFVEEFCMISG
        +SG  +S +LE +        +  + SG +    +       C   P  F           W R    S+ S  RN               +   C++ G
Subjt:  ESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRS-TSSRSSSSRNPLAAKDDVDDSGSEFVEEFCMISG

Query:  SHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGLIRM
        SHSST+DRLYAWE+KLY EVK +ESI+ ++++K +Q+R L  K       +K +  V+ L S++ V+  ++ S S  I K+R+ E+ PQL+EL++GL+ M
Subjt:  SHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGLIRM

Query:  WKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFILCR
        W++M E H  Q   +    +  +     P ++  RQ T+QL+ E++ +  SF N + +   Y+ +L GWL+  + +    S+ P   R      I+  C 
Subjt:  WKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFILCR

Query:  DWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLVEA
        +W   I  +P    S+ I++ L  +H  ++QQA+  +++K  E+
Subjt:  DWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLVEA

Q93YU8 Nitrate regulatory gene2 protein2.4e-3224.19Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSS------------YPSPCPS
        MG + SK++N +A+R CK+R+R +K+A+ +R+ LAAAH  Y ++L+  G +L  +A  E L  S   T A  L   P   S             PSP PS
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSS------------YPSPCPS

Query:  QTADASESPLRESPISPPIATISY-------------MVAGTSAPLT------VKVRPSSHSYAYEESVASPLP-------------PPPPPPHE--SGS
             S SP   S   P + + S               +   S+P +          P+ +  AY+ S  S  P             PP PP  E  +  
Subjt:  QTADASESPLRESPISPPIATISY-------------MVAGTSAPLT------VKVRPSSHSYAYEESVASPLP-------------PPPPPPHE--SGS

Query:  SWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQFKGEMADAKDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQ
        + +    +D   +     T   + +F D R  KQ + E    + E +    + E + +  E   H S +      + + A  E+++ +  S+S      +
Subjt:  SWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQFKGEMADAKDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQ

Query:  SGSRGTIKLEKSLCTEQEDPSEF-----------------------------------ITHRAKDFLSSIKEIEHRFLRASESGREISRMLEANKIRVGY
         GS  T++   S+    + PS                                     +  R +D    I  I+  F +A+ SG ++S+MLE     +G 
Subjt:  SGSRGTIKLEKSLCTEQEDPSEF-----------------------------------ITHRAKDFLSSIKEIEHRFLRASESGREISRMLEANKIRVGY

Query:  LEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCMISGSHSSTLDRLYAWERKLYDEV
         E + + S L                K  + S+        TW         +S+ PLA K  +D +  +        S S  STLDRL AWE+KLY+E+
Subjt:  LEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCMISGSHSSTLDRLYAWERKLYDEV

Query:  KASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGLIRMWKAMLECHHAQ-YITISLAY
        KA E  + E+++K  QL+    K      +DKT+A +  L S I V   +V + S  I ++RD ++ PQL+EL  G + MWK+M + H  Q  I   +  
Subjt:  KASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGLIRMWKAMLECHHAQ-YITISLAY

Query:  HSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGW-----LQHCISRERSKSRRPFSPRRVIAPPIFILCRDWFTGIKDLPSNEL
            +  G   ++  RQ T  L+  +  +  SF++ I     ++ +++ W     L  C     +  + P           +  C +W   +  +P    
Subjt:  HSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGW-----LQHCISRERSKSRRPFSPRRVIAPPIFILCRDWFTGIKDLPSNEL

Query:  SDAIRAILGELHCSISQQAELQRKQKLVEANTGEEPEGKSDENN
        S+AI++ +  +H   ++QA+  + +K  E+ + E  +  S   N
Subjt:  SDAIRAILGELHCSISQQAELQRKQKLVEANTGEEPEGKSDENN

Q9AQW1 Protein ROLLING AND ERECT LEAF 25.0e-3825.81Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESS-------LSTSATELDKTP--------SHSSYPSP
        MG + SK+E  + +R CKER+R +K+A+ SR  LA+AH  Y+++L+    +L R+A+    +  S       L+T+A  L  TP        + SS P P
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESS-------LSTSATELDKTP--------SHSSYPSP

Query:  CP----SQTADASESPLRESPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVASPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGTGGMDVNFE
         P     Q A     P +     PP+A    + A    P  +KV       +      S    P      S S+WD+ +         F      D  F 
Subjt:  CP----SQTADASESPLRESPISPPIATISYMVAGTSAPLTVKVRPSSHSYAYEESVASPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGTGGMDVNFE

Query:  DERMWKQFKGEMADAKDEFQEGPAKSEAELNGCEHGGHLSPSGVV----EQRNLEM-------------------ARREDKELNSTSMSSKVLLEQSGSR
        D R         AD ++  +    + E +  G  H  HL     V    ++R  EM                    R E+ E+ + S        + G  
Subjt:  DERMWKQFKGEMADAKDEFQEGPAKSEAELNGCEHGGHLSPSGVV----EQRNLEM-------------------ARREDKELNSTSMSSKVLLEQSGSR

Query:  GTIKLEKSLC------------TEQEDPSEFITH--------RAKDFLSSIKEIEHRFLRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRL
        GT   E +              +E  D S  +T         R +     +  IE  F++A+E+G  +S +LEA++ +   L+ N            R+L
Subjt:  GTIKLEKSLC------------TEQEDPSEFITH--------RAKDFLSSIKEIEHRFLRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRL

Query:  RLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCMISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRY
        +      K    SN    +   TW         +S+ PLA +  +D +  E      M   SH STL+RL AWE+KLY EVKA ES++ E+++K   L+ 
Subjt:  RLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCMISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRY

Query:  LFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGLIRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITV
        L  +   +  +DKT+A +  L S I V   +  + S  I ++RD E+ PQL+EL   L+ MW++M   H  Q   +         +M    +D  R  T 
Subjt:  LFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGLIRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITV

Query:  QLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFILCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQ
         L+  +  +  +F   I     Y+ AL GWL+  + +  S   +  +   +I+  +   C +W   +  LP    S+AI++ +  +H   ++QAE  + +
Subjt:  QLQQEIECFGLSFANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFILCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQ

Query:  KLVEANTGE
        K  E  + E
Subjt:  KLVEANTGE

Arabidopsis top hitse value%identityAlignment
AT2G17110.1 Protein of unknown function (DUF630 and DUF632)5.6e-5327.94Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLS--TSATELDKTPSHSSYPSPCPSQTADASESPL
        MG S SK+++  A+ LC++R  F++ AI  RYAL+ AHV Y Q+LK +  SL ++         S S   +  ++D    H  + S   S   D  +S L
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLS--TSATELDKTPSHSSYPSPCPSQTADASESPL

Query:  RESPISPPIAT----------ISYMVAGTSAP-LTVKVRPSSHSYAY----------------EESVASPLPPPPP-PPHESGSSWDYFD----------
          SP+   +            ++YM      P L  + RPSS    +                  +  S LPPPPP PP E    WD+ D          
Subjt:  RESPISPPIAT----------ISYMVAGTSAP-LTVKVRPSSHSYAY----------------EESVASPLPPPPP-PPHESGSSWDYFD----------

Query:  -TNDEIESFRFLGTGGMDVNFEDERMWKQFKGEMA-DAKDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSR
         + D  E    LG   ++   ED+ + K+  G+    A    +E    S A  +G   GG    + + + R      +E+ E +   +  K ++E SG  
Subjt:  -TNDEIESFRFLGTGGMDVNFEDERMWKQFKGEMA-DAKDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSR

Query:  GTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRFLRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTK
           K + ++          +   A       KEIE +FLRA+ESG EI+ MLE  K   G      N+S                     L+   P  + 
Subjt:  GTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRFLRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTK

Query:  VITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCMISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKD
        V + + STS ++ +     A+      + ++   E  + S + SSTL +L+ WE+KLYDEVKA E +R  +++K  +L+ +  +    Q +D TR +V+ 
Subjt:  VITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCMISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKD

Query:  LHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGLIRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSL
        L ++IR+AI  VD IS  I K+RDEE+  QL ELIQGL +MWK+MLECH +Q   I  A                 ++T  L  E+  + + F++W+++ 
Subjt:  LHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGLIRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSL

Query:  ASYVGALNGWLQHCISRERSKSRR---PFSPRRVIAPPIFILCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLV----EANTGEEPE-
          +V  LN WL  C+  E  ++     PFSP R+ AP IF++C  W   +  +   E+ +AIR     +   + +Q  L  +++++      N   E + 
Subjt:  ASYVGALNGWLQHCISRERSKSRR---PFSPRRVIAPPIFILCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLV----EANTGEEPE-

Query:  ---------------GKSDEN----NDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQKSE
                       G  ++N    +DT + +L     SL ++ + + +F+E SLK Y D+  ++E
Subjt:  ---------------GKSDEN----NDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQKSE

AT2G19090.1 Protein of unknown function (DUF630 and DUF632)3.6e-5227.78Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSP---CPSQTADASESP
        MG S+SK+++ EA+++CK+RKRFIKQAI+ R   A+ H+ Y+ +L+ V  +L      + +++     +  E   T    S+ +P    P +   +S S 
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSP---CPSQTADASESP

Query:  LRE----SPISPPIATI----------SYMVAGTSAPLTVKVR-----------PSSHSYAYEESVA-------------------SPLPPPP-------
          E    SP S P   I          +Y++A  S P+ V+ R           P S S  Y ES                     +PL  P        
Subjt:  LRE----SPISPPIATI----------SYMVAGTSAPLTVKVR-----------PSSHSYAYEESVA-------------------SPLPPPP-------

Query:  -PPPHESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERM--------WKQFKGEMADAKDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNL------
         PPP    S WD+F   +   S  + G    D    D R          ++ + E      E  + P K E  +N      + +P    E R        
Subjt:  -PPPHESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERM--------WKQFKGEMADAKDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNL------

Query:  --------------EMARREDKELNS-----------------TSMSSKVLLEQSGSR-------GTIKLEKSLCTEQEDP-----SEFITHRAKDFLSS
                      E    +D E                    T    K  + + G         G ++  K++   ++D      + ++  R       
Subjt:  --------------EMARREDKELNS-----------------TSMSSKVLLEQSGSR-------GTIKLEKSLCTEQEDP-----SEFITHRAKDFLSS

Query:  IKEIEHRFLRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGS
        IK++E +F    ++ +E+S +LEA   R  Y  +  + S    L+P+            ALF             RS SSRSSSSR  + +     +SGS
Subjt:  IKEIEHRFLRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGS

Query:  E----FVEEFCMISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEE
        E      +E CMISGSH +TLDRL+AWE+KLYDEV++ E +R+ Y++KC QLR    K      +DKTRA ++DL ++I+V+I+S++SISKRIE +RD+E
Subjt:  E----FVEEFCMISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEE

Query:  MQPQLIELIQGLIRMWKAMLECHHAQYITISLA-----------YHSKSAAMGTPRA-DAQR--QITVQLQQEIECFGLSFANWINSLASYVGALNGWLQ
        + PQL+EL++GL RMWK M E H  Q  T+  A            H K      P A ++QR  Q  + L+ ++  +   F  WI S  SY+ AL+GWL 
Subjt:  MQPQLIELIQGLIRMWKAMLECHHAQYITISLA-----------YHSKSAAMGTPRA-DAQR--QITVQLQQEIECFGLSFANWINSLASYVGALNGWLQ

Query:  HCISRERSKSRRPFSPRRVIAPPIFILCRDWFTGIKDLPSNELSDAIRAI---LGELHC-----------SISQQAELQRKQKLVEANTGEE
         C   +    +   S       PI+ +C  W   +  L    + D +      +G ++            S S++       +LV A+ GEE
Subjt:  HCISRERSKSRRPFSPRRVIAPPIFILCRDWFTGIKDLPSNELSDAIRAI---LGELHC-----------SISQQAELQRKQKLVEANTGEE

AT2G27090.1 Protein of unknown function (DUF630 and DUF632)3.1e-10436.35Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSL--STSATE------LDKTPSHSSYPSPCPSQTAD
        MG S S+I+ ++AL+LC+ERK+F++QA+D R  LAAAHV YVQ+LK+ G +LR+++E EV +ESSL  STSAT       ++K+ SH SY  P  S +  
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSL--STSATE------LDKTPSHSSYPSPCPSQTAD

Query:  ASESPLRESPISPPIATISYMVAGTS-------------APLTVKVRPSSHSYAYEESV----ASPLPPPPPPPHESGSSWDYFDTNDEIE---SFRFLG
         + SP    P + P         G S             A +T    P S S    ES     +S +PP  P        WDYF  +  I+   S   +G
Subjt:  ASESPLRESPISPPIATISYMVAGTS-------------APLTVKVRPSSHSYAYEESV----ASPLPPPPPPPHESGSSWDYFDTNDEIE---SFRFLG

Query:  TGGMDVNFEDERMWKQFKGEMADAKDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKEL------NSTSMSSKVLLEQSGSRGTIKLEKS
         G +  + ++E        E  D  ++F     +   + +  E     S + V    N    RR+   L       S S  ++       S     L  +
Subjt:  TGGMDVNFEDERMWKQFKGEMADAKDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKEL------NSTSMSSKVLLEQSGSRGTIKLEKS

Query:  LCTEQEDPSEFITHR--AKDFLSSIKEIEHRFLRASESGREISRMLEANKIR----VGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEP--HKTK
           +  +  +   ++   +DFLSS+KEIE  F++ASE+G+E+ RMLEANK+     V   E+    S L           + C   P     EP  +  K
Subjt:  LCTEQEDPSEFITHR--AKDFLSSIKEIEHRFLRASESGREISRMLEANKIR----VGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEP--HKTK

Query:  VITWKRSTSSRSSSSRNPLAA--KDDVDDSGSEFVEEFCMISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVV
         +TW R+ SSRSSSSRNPL     DDV++  S   E  CMI+GSH+STLDRLYAWERKLYDEVK S+++R+EYD KC  LR L ++   +Q IDKTRAVV
Subjt:  VITWKRSTSSRSSSSRNPLAA--KDDVDDSGSEFVEEFCMISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVV

Query:  KDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGLIRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWIN
        KDLHSRIRVAI+ +DSIS+RIE++RD E+QPQL ELI+GL RMW+ MLECH  Q+  I   Y   +  +   +++  RQ+T  L+ E+     SF  WI 
Subjt:  KDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGLIRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWIN

Query:  SLASYVGALNGWLQHCIS---RERSKSRRPFSPRRVIAPPIFILCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLVEANTGE-EPEGK
           SY+ A+N WL  C++   R + K R P    R   PPI+  C  W   ++ LP+ E+S +I+A+  ++   + +Q + + K+     N  +      
Subjt:  SLASYVGALNGWLQHCIS---RERSKSRRPFSPRRVIAPPIFILCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLVEANTGE-EPEGK

Query:  SDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQKSEAARTAY
         DE  +           SL   + +L +F+E+S+KMYE++++    A+  Y
Subjt:  SDENNDTVSSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQKSEAARTAY

AT4G30130.1 Protein of unknown function (DUF630 and DUF632)1.2e-6830.1Show/hide
Query:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE
        MG S+SK +++EA+++CK+RKRFIKQA++ R   A+ H+ Y+Q+L+ V  +LR Y E +   E  L T  T + +  S   +    P        S + +
Subjt:  MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRE

Query:  SPISPPIATISYMVAGTSAPLTVKVRP-------------SSHSYAYEESVASP------LPPPPPPPHESGSSWDYFDTNDEIESFRFLGTGGMDVNFE
        S     +   SY++A  S P+ V+ +P             +   +    ++ SP      +PPP P   +    W+ F   D+   + +    GMD +  
Subjt:  SPISPPIATISYMVAGTSAPLTVKVRP-------------SSHSYAYEESVASP------LPPPPPPPHESGSSWDYFDTNDEIESFRFLGTGGMDVNFE

Query:  DERMWKQFKGEMADAKDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLE-------MARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDP
          R  ++ +G     +DE+ +       +     +GG +     VE  N E       +    DK  N T     + + + G+ G +    S   + E P
Subjt:  DERMWKQFKGEMADAKDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLE-------MARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDP

Query:  --SEFITHRAKDFLSSIKEIEHRFLRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSS
          + ++  R       IK++E +F     +G+E+S +LEA+  RV Y  +N  +S +  L+P+            ALF            +   SSRSSS
Subjt:  --SEFITHRAKDFLSSIKEIEHRFLRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSS

Query:  SRNPL-----AAKDDVDDSGSEFVEEFCMISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAI
        S   L      ++    +S SEF EE CM+SGSH STLDRLYAWE+KLYDEVK+ + IR  Y++KC  LR    K   +  +DKTRA ++DLH++I+V+I
Subjt:  SRNPL-----AAKDDVDDSGSEFVEEFCMISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAI

Query:  YSVDSISKRIEKMRDEEMQPQLIELIQGLIRMWKAMLECHHAQYITISLA----------YHSKSAAMGTPRADAQR------QITVQLQQEIECFGLSF
        +S++SIS+RIE +RD+E+ PQL+EL+QGL +MWK M ECH  Q  T+  A           H K      P  ++QR       + VQL+    C    F
Subjt:  YSVDSISKRIEKMRDEEMQPQLIELIQGLIRMWKAMLECHHAQYITISLA----------YHSKSAAMGTPRADAQR------QITVQLQQEIECFGLSF

Query:  ANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFILCRDW---FTGIKDLPSNELSDAIRAILGELHC-----------SISQQAELQRK
          WI S  SY+ +L GWL  C   +    +   +       PI+ +C  W     G+ + P  +  D   + +G ++              S++      
Subjt:  ANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFILCRDW---FTGIKDLPSNELSDAIRAILGELHC-----------SISQQAELQRK

Query:  QKLVEANTGEE
         +LVEA   EE
Subjt:  QKLVEANTGEE

AT4G39790.1 Protein of unknown function (DUF630 and DUF632)1.4e-17351.99Show/hide
Query:  MGGSNSKI---ENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSH-SSYPSPCPSQTADASES
        MG SNSK    + NE L LCKERKRF+KQA+DSR ALAAAHV Y+++L+N+G  LR+YAEAE   ESS S +ATE +K+PSH SSYP        D+ +S
Subjt:  MGGSNSKI---ENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSH-SSYPSPCPSQTADASES

Query:  PL----RESPISPPIATISYMVAGT-SAPLTVKVRP-SSHSYAYEESVASPLPPPPPPPHE-SGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQF
        PL      +P   P+  +SYM   T S+ +T  + P S      E ++ +  PPPPP P     SSWDYFDT D+ +SFRF+G                 
Subjt:  PL----RESPISPPIATISYMVAGT-SAPLTVKVRP-SSHSYAYEESVASPLPPPPPPPHE-SGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQF

Query:  KGEMADAKDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLC---TEQEDPSEFITHRAKDFL
                 E  E  ++ +A + G E         +  Q N  +A+   + L  +S  +             K  K  C    E+EDPSEFITHRAKDF+
Subjt:  KGEMADAKDEFQEGPAKSEAELNGCEHGGHLSPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLC---TEQEDPSEFITHRAKDFL

Query:  SSIKEIEHRFLRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKP-ALFSNEP---HKTKVITWKRSTSSRSSSSRNPL--AAK
        SS+K+IEH+F RASESGRE+SRMLE NKIRVG+ +  G  + +AFL  L+R    CC  K  +  S EP     TKVI WKR++SSRSS+SRNPL   +K
Subjt:  SSIKEIEHRFLRASESGREISRMLEANKIRVGYLEANGNISGLAFLDPLRRLRLVCCPAKP-ALFSNEP---HKTKVITWKRSTSSRSSSSRNPL--AAK

Query:  DDVDD-SGSEFVEEFCMISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEK
        +D DD SGS+F+EEFCMISGSHSS+LDRLYAWERKLYDEVKASE IRKEYDRKC+QLR  FAKD+S + +DKTRA  KDLHSRIRVAI SV+SISKRIE+
Subjt:  DDVDD-SGSEFVEEFCMISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLFAKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEK

Query:  MRDEEMQPQLIELIQGLIRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCI--SRER
        +RD+E+ PQL+E +QGLIRMWKAMLECHH QYITISLAYH + ++     +  +R+I  +L +E ECFGLSF + ++S+ASYV ALNGWL +C+   +ER
Subjt:  MRDEEMQPQLIELIQGLIRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLSFANWINSLASYVGALNGWLQHCI--SRER

Query:  S-KSRRPFSPRRVIAPPIFILCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLD
        S ++RRP+SPRRV+APPIF+LCRDW  GIK LPS+ELS +I+          S   E+           GEE  G        + S+LS +H SL K+L+
Subjt:  S-KSRRPFSPRRVIAPPIFILCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLVEANTGEEPEGKSDENNDTVSSNLSFIHLSLTKVLD

Query:  RLTKFSEASLKMYEDVRQKSEAARTAYLN
        RL KFSEASLKMYEDV+ KSEAAR AY N
Subjt:  RLTKFSEASLKMYEDVRQKSEAARTAYLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGGTTCGAACTCCAAAATCGAGAACAACGAAGCCCTACGACTTTGTAAGGAACGAAAACGGTTCATTAAACAGGCCATCGATTCGAGGTACGCTCTAGCCGCCGC
CCATGTCTGTTATGTTCAGGCCCTAAAAAACGTCGGCGTTTCTCTCCGGCGATACGCCGAGGCTGAGGTGTTGATCGAGTCGTCTCTGTCAACGTCGGCCACCGAGCTCG
ACAAAACGCCGTCGCATTCCTCTTATCCATCGCCGTGCCCGTCGCAGACTGCGGATGCTTCGGAATCACCATTGCGAGAAAGCCCCATTTCGCCGCCGATAGCCACCATT
AGTTACATGGTTGCCGGAACTAGTGCCCCTCTTACTGTGAAGGTCCGGCCGAGTAGCCATAGTTATGCTTATGAAGAATCAGTTGCTTCGCCATTGCCCCCACCTCCCCC
GCCGCCTCATGAGTCGGGATCTTCTTGGGATTATTTTGATACCAATGATGAAATTGAGAGCTTCAGGTTTTTGGGGACTGGTGGGATGGATGTGAACTTTGAAGATGAGA
GAATGTGGAAGCAATTCAAGGGAGAAATGGCTGATGCCAAGGATGAGTTTCAGGAAGGACCTGCAAAATCAGAAGCAGAGCTAAATGGCTGTGAACATGGTGGCCATTTG
AGTCCCTCTGGGGTTGTTGAACAAAGAAATTTGGAGATGGCGAGACGGGAAGATAAAGAGCTTAATTCGACGAGTATGTCAAGCAAAGTTCTGCTCGAACAATCTGGTTC
GAGAGGAACGATCAAGTTGGAGAAGAGTTTGTGTACTGAACAGGAAGATCCTTCAGAGTTCATAACTCATAGAGCTAAAGATTTTCTTTCCAGCATTAAGGAGATTGAGC
ATCGGTTTCTGAGAGCTTCAGAATCTGGGAGGGAGATCTCTAGAATGCTCGAAGCCAATAAAATCAGAGTCGGATACCTCGAAGCAAATGGAAACATATCTGGTCTTGCT
TTTTTGGACCCTTTACGTCGTCTACGACTTGTTTGCTGTCCTGCAAAGCCTGCCCTCTTTTCTAATGAACCTCATAAGACAAAAGTGATTACATGGAAACGGTCAACGTC
ATCGCGGTCATCTTCATCAAGAAATCCTCTTGCTGCAAAAGATGATGTTGATGACAGTGGAAGCGAATTTGTAGAAGAGTTCTGTATGATCTCTGGAAGTCATTCCTCTA
CCTTGGACCGGCTTTATGCGTGGGAGAGAAAACTCTATGATGAAGTCAAGGCAAGCGAATCCATTCGAAAGGAGTATGATCGGAAGTGTGATCAACTTAGATACCTATTT
GCAAAGGATTACAGTACACAAGTCATCGACAAGACAAGGGCTGTCGTCAAGGATCTGCATTCCCGCATACGAGTTGCAATCTATTCTGTCGATTCGATATCAAAACGCAT
TGAGAAAATGAGGGACGAAGAGATGCAGCCACAACTCATAGAACTCATTCAGGGGTTGATCAGGATGTGGAAGGCGATGCTCGAATGCCATCATGCACAGTACATAACAA
TATCGTTGGCATATCATTCAAAGAGTGCAGCCATGGGTACACCTCGAGCAGATGCACAGAGGCAGATAACAGTCCAACTTCAGCAAGAGATTGAGTGTTTCGGCTTAAGC
TTTGCAAACTGGATCAACAGCCTTGCATCATATGTTGGTGCTTTGAATGGTTGGCTACAACACTGTATTTCACGGGAACGCTCAAAGAGCAGAAGGCCTTTCTCTCCTCG
TCGTGTCATTGCCCCGCCTATCTTCATCCTATGTCGTGATTGGTTTACTGGAATAAAAGACCTTCCATCTAATGAACTCAGCGATGCCATCAGAGCCATCTTGGGAGAAT
TGCACTGTTCAATATCCCAACAAGCAGAACTACAGAGGAAACAGAAGCTTGTTGAAGCAAACACTGGGGAAGAACCAGAGGGGAAGAGCGACGAGAACAACGATACTGTA
TCTTCAAACTTAAGTTTCATACATTTGAGTTTAACGAAGGTTCTCGATCGGCTAACGAAATTTTCTGAGGCCTCGCTCAAAATGTATGAAGATGTTAGACAGAAAAGCGA
GGCTGCACGTACAGCATATTTGAATTATAGGCCTCGCTCAGAATTATGA
mRNA sequenceShow/hide mRNA sequence
TTGCTTTAGTTTTTAATTTCTAAAATTTGATATTTTTTTTTTGTCGCGCCCATGAAAAGGAATGGCCACTATAATTGTTTAATTGAACAGTGAAGTATGTATGTTTTCTA
ATTTTTCCTGTTTTTCTCTTTTTTTTTTGAATTTTTGGTGAAAATCTACACATTTTTCGAGGATTTTTCTTGCAATGGTGATTGCAGCAGATTCCCACTAGGCTACCATT
TTTTTCCCTTCCTTTCTTGCTTTCTTACCTTGTTAGGTTTTTTGATCTCCATTTTGCTTCATCATTACTGCCTTTCCCTTCTTCATCTCTGCTAGAATCTTCGTTTCAGA
GGCTCATTGAAGACTGGTCATCTCAGGCTTGTTTTGTTTTAATGTTTTTGATTTCGTTTTCTTCAAGTTTTCCATTGATGGGTTTGGTTTTATGAGTTCTCTCAATCTCT
GCTCTTTCTGGCTTCCCTCTCGTTCGTACTTGAAAACAAGCTAATCCTCTGTTTTCTCTTACTCTGAATGTAATGTGAAGGGGTTGCTCTTTCTTTGCGAACACGCTTTC
CTCAGGTGGGTTCTTGACTTCGTGGGCGTTTCCGCGTTGTCTGTTGAAAAGGGGCTGCTCTGTTTTCTCGAGGGTTTTCGCTTTTTTTTTTGCTCTGTTTTCTCGAAATC
GGTTCATCTTGATGCTGGTGAGTTTCTTTGGCGAATCAAGGTGGGAAGCCATTGATTCTGAAGTGGGTGTTCTGGTAATCGAAATTTCATGGCTCCGACCTCGATTTCAT
GAACTGGGTCGTTGAAATTTTTGGAGAGAAATGGGTGGTTCGAACTCCAAAATCGAGAACAACGAAGCCCTACGACTTTGTAAGGAACGAAAACGGTTCATTAAACAGGC
CATCGATTCGAGGTACGCTCTAGCCGCCGCCCATGTCTGTTATGTTCAGGCCCTAAAAAACGTCGGCGTTTCTCTCCGGCGATACGCCGAGGCTGAGGTGTTGATCGAGT
CGTCTCTGTCAACGTCGGCCACCGAGCTCGACAAAACGCCGTCGCATTCCTCTTATCCATCGCCGTGCCCGTCGCAGACTGCGGATGCTTCGGAATCACCATTGCGAGAA
AGCCCCATTTCGCCGCCGATAGCCACCATTAGTTACATGGTTGCCGGAACTAGTGCCCCTCTTACTGTGAAGGTCCGGCCGAGTAGCCATAGTTATGCTTATGAAGAATC
AGTTGCTTCGCCATTGCCCCCACCTCCCCCGCCGCCTCATGAGTCGGGATCTTCTTGGGATTATTTTGATACCAATGATGAAATTGAGAGCTTCAGGTTTTTGGGGACTG
GTGGGATGGATGTGAACTTTGAAGATGAGAGAATGTGGAAGCAATTCAAGGGAGAAATGGCTGATGCCAAGGATGAGTTTCAGGAAGGACCTGCAAAATCAGAAGCAGAG
CTAAATGGCTGTGAACATGGTGGCCATTTGAGTCCCTCTGGGGTTGTTGAACAAAGAAATTTGGAGATGGCGAGACGGGAAGATAAAGAGCTTAATTCGACGAGTATGTC
AAGCAAAGTTCTGCTCGAACAATCTGGTTCGAGAGGAACGATCAAGTTGGAGAAGAGTTTGTGTACTGAACAGGAAGATCCTTCAGAGTTCATAACTCATAGAGCTAAAG
ATTTTCTTTCCAGCATTAAGGAGATTGAGCATCGGTTTCTGAGAGCTTCAGAATCTGGGAGGGAGATCTCTAGAATGCTCGAAGCCAATAAAATCAGAGTCGGATACCTC
GAAGCAAATGGAAACATATCTGGTCTTGCTTTTTTGGACCCTTTACGTCGTCTACGACTTGTTTGCTGTCCTGCAAAGCCTGCCCTCTTTTCTAATGAACCTCATAAGAC
AAAAGTGATTACATGGAAACGGTCAACGTCATCGCGGTCATCTTCATCAAGAAATCCTCTTGCTGCAAAAGATGATGTTGATGACAGTGGAAGCGAATTTGTAGAAGAGT
TCTGTATGATCTCTGGAAGTCATTCCTCTACCTTGGACCGGCTTTATGCGTGGGAGAGAAAACTCTATGATGAAGTCAAGGCAAGCGAATCCATTCGAAAGGAGTATGAT
CGGAAGTGTGATCAACTTAGATACCTATTTGCAAAGGATTACAGTACACAAGTCATCGACAAGACAAGGGCTGTCGTCAAGGATCTGCATTCCCGCATACGAGTTGCAAT
CTATTCTGTCGATTCGATATCAAAACGCATTGAGAAAATGAGGGACGAAGAGATGCAGCCACAACTCATAGAACTCATTCAGGGGTTGATCAGGATGTGGAAGGCGATGC
TCGAATGCCATCATGCACAGTACATAACAATATCGTTGGCATATCATTCAAAGAGTGCAGCCATGGGTACACCTCGAGCAGATGCACAGAGGCAGATAACAGTCCAACTT
CAGCAAGAGATTGAGTGTTTCGGCTTAAGCTTTGCAAACTGGATCAACAGCCTTGCATCATATGTTGGTGCTTTGAATGGTTGGCTACAACACTGTATTTCACGGGAACG
CTCAAAGAGCAGAAGGCCTTTCTCTCCTCGTCGTGTCATTGCCCCGCCTATCTTCATCCTATGTCGTGATTGGTTTACTGGAATAAAAGACCTTCCATCTAATGAACTCA
GCGATGCCATCAGAGCCATCTTGGGAGAATTGCACTGTTCAATATCCCAACAAGCAGAACTACAGAGGAAACAGAAGCTTGTTGAAGCAAACACTGGGGAAGAACCAGAG
GGGAAGAGCGACGAGAACAACGATACTGTATCTTCAAACTTAAGTTTCATACATTTGAGTTTAACGAAGGTTCTCGATCGGCTAACGAAATTTTCTGAGGCCTCGCTCAA
AATGTATGAAGATGTTAGACAGAAAAGCGAGGCTGCACGTACAGCATATTTGAATTATAGGCCTCGCTCAGAATTATGACCATGTAAATTAAGGGTGGAGAAGTACAAAT
ATTTCTCAAGTTTGGCTGAGATTGAATTTGTACAGTTTTGAAGAACTTGCAGAGGTTCTGTGATTGCATGATGATGATATGCTCATTTTATTCTGCTTCCAGAAATCCTA
TTGATTTCTTGAAAATCTGTTCTAGCTGTTCTTTTTATTCCTTAACTTTGTGATCTTAGGCTAAATCAATTTATTAG
Protein sequenceShow/hide protein sequence
MGGSNSKIENNEALRLCKERKRFIKQAIDSRYALAAAHVCYVQALKNVGVSLRRYAEAEVLIESSLSTSATELDKTPSHSSYPSPCPSQTADASESPLRESPISPPIATI
SYMVAGTSAPLTVKVRPSSHSYAYEESVASPLPPPPPPPHESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQFKGEMADAKDEFQEGPAKSEAELNGCEHGGHL
SPSGVVEQRNLEMARREDKELNSTSMSSKVLLEQSGSRGTIKLEKSLCTEQEDPSEFITHRAKDFLSSIKEIEHRFLRASESGREISRMLEANKIRVGYLEANGNISGLA
FLDPLRRLRLVCCPAKPALFSNEPHKTKVITWKRSTSSRSSSSRNPLAAKDDVDDSGSEFVEEFCMISGSHSSTLDRLYAWERKLYDEVKASESIRKEYDRKCDQLRYLF
AKDYSTQVIDKTRAVVKDLHSRIRVAIYSVDSISKRIEKMRDEEMQPQLIELIQGLIRMWKAMLECHHAQYITISLAYHSKSAAMGTPRADAQRQITVQLQQEIECFGLS
FANWINSLASYVGALNGWLQHCISRERSKSRRPFSPRRVIAPPIFILCRDWFTGIKDLPSNELSDAIRAILGELHCSISQQAELQRKQKLVEANTGEEPEGKSDENNDTV
SSNLSFIHLSLTKVLDRLTKFSEASLKMYEDVRQKSEAARTAYLNYRPRSEL