; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0001044 (gene) of Chayote v1 genome

Gene IDSed0001044
OrganismSechium edule (Chayote v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationLG08:32403769..32405807
RNA-Seq ExpressionSed0001044
SyntenySed0001044
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598306.1 hypothetical protein SDJN03_08084, partial [Cucurbita argyrosperma subsp. sororia]5.7e-19676.51Show/hide
Query:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME
        MGKRF         +  MLVVTL +I CG VEGGS+S QK       ++NSLRKQAIKSI+SED DIIDCVS+YDQPAFDHPALRNHTIQ+ PTYDPT++
Subjt:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME

Query:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD
        +H+KK   EREG EEK+SMVVKQ WRKSGSCP+GTIPIRR++K VLLKA+SIE YGRKKPM+ +E AQ+   +S + LL N SK  L T G+NYNAAKGD
Subjt:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD

Query:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM
        IKVCNP+VE DDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQT++EIALGSAIYPIST NGL YEI M
Subjt:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM

Query:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE
        FLF+D +T NWWVQY E+INIGYWP ELFSAL++TAETVQWGGEVYSTKI R PHT+T MG+GRFPDF+ G SGWVKR+R RDNSM+L FPGWVEHYSDE
Subjt:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE

Query:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP
        YDCYD+DF+RDYL+DPELYYGGPG+NPRCP
Subjt:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP

KAG7029277.1 hypothetical protein SDJN02_07615, partial [Cucurbita argyrosperma subsp. argyrosperma]2.0e-19676.74Show/hide
Query:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME
        MGKRF         +  MLVVTL +I CG VEGGS+S QK       ++NSLRKQAIKSI+SED DIIDCVS+YDQPAFDHPALRNHTIQ+ PTYDPT++
Subjt:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME

Query:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD
        +H+KK   EREG EEK+SMVVKQ WRKSGSCP+GTIPIRR+RK VLLKA+SIE YGRKKPM+ +E AQ+   +S + LL N SK  L T G+NYNAAKGD
Subjt:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD

Query:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM
        IKVCNP+VE DDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQT++EIALGSAIYPIST NGL YEI M
Subjt:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM

Query:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE
        FLF+D +T NWWVQY E+INIGYWP ELFSAL++TAETVQWGGEVYSTKI R PHT+T MG+GRFPDF+ G SGWVKR+R RDNSM+L FPGWVEHYSDE
Subjt:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE

Query:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP
        YDCYD+DF+RDYL+DPELYYGGPG+NPRCP
Subjt:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP

XP_022961958.1 uncharacterized protein LOC111462577 isoform X1 [Cucurbita moschata]3.4e-19676.51Show/hide
Query:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME
        MGKRF         +  ML+VTLT+I CG VEGGS+S QK       ++NSLRKQAIKSI+SED DIIDCVS+YDQPAFDHPALRNHTIQ+ PTYDPT++
Subjt:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME

Query:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD
        +H+KK   EREG EEK+SMVVKQ WRKSGSCP+GTIPIRR+RK VLLKA SIE YGRKKPM+ +E AQ+   +S + LL N SK  L T G+NYNAAKGD
Subjt:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD

Query:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM
        IKVCNP+VE DDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQT++EIALGSAIYPIST NGL YEI M
Subjt:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM

Query:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE
        FLF+D +T NWWVQY E+INIGYWP ELFSAL++TAETVQWGGEVYST I R PHT+T MG+GRFPDF+ G SGWVKR+R RDNSM+L FPGWVEHYSDE
Subjt:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE

Query:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP
        YDCYD+DF+RDYL+DPELYYGGPG+NPRCP
Subjt:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP

XP_023546232.1 uncharacterized protein LOC111805388 isoform X2 [Cucurbita pepo subsp. pepo]1.2e-19676.74Show/hide
Query:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME
        MGKRF         +  MLVVTLT+I CG VEGGS+S QK       ++NSLRKQAIKSI+SED DIIDCVS+YDQPAFDHPALRNHTIQ+ PTYDPT++
Subjt:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME

Query:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD
        +H+KK   EREG EEK+SMVVKQ WRKSGSCP+GTIPIRR+RK VLLKA+SIE YGRKKPM+ +E AQ+   +S + LL N SK  L T G+NYNAAKGD
Subjt:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD

Query:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM
        IKVCNP+VE DDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQT++EIALGSAIYPIST NGL YEI M
Subjt:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM

Query:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE
        FLF+D +T NWWVQY E+INIGYWP ELF AL++TAETVQWGGEVYSTKI R PHT+T MG+GRFPDF+ G SGWVKR+R RDNSM+L FPGWVEHYSDE
Subjt:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE

Query:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP
        YDCYD+DF+RDYL+DPELYYGGPG+NPRCP
Subjt:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP

XP_038886336.1 uncharacterized protein LOC120076548 [Benincasa hispida]7.0e-20278.04Show/hide
Query:  KRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTMEKH
        KRFS N      +FKMLVVTLT+I CG+VEGGS+ T++K L V KK+NSLRKQA KSIQS+D DIIDC+++YDQPAFDHPALRNHTIQM PTYDPTM++H
Subjt:  KRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTMEKH

Query:  TKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGDIK
        +KK   EREG+E KDSMVVKQ WRKSGSCPKGTIPIRR++K +L KADS+E YGRK+P   +EIAQ+SN++S + LL NHSK  L   G+NYN AKGDIK
Subjt:  TKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGDIK

Query:  VCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITMFL
        VCNPKVE DDEYSTSQVALLTGPYYN+EA+ESGWAVNPGVYGDRQTRLFVYWT DASHKTGCFDLTCPGFVQT++EIALGSAIYPISTS GL YEITMFL
Subjt:  VCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITMFL

Query:  FKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDEYD
        F+DL+T+NWWVQY E+I+IGYWPSELF+AL YTAETVQWGGEVYSTK+   PHT+TGMGNG+FPD++ G+SGWVKRIR RDNSM+LKFP WVEHYSDEYD
Subjt:  FKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDEYD

Query:  CYDIDFVRDYLEDPELYYGGPGRNPRCP
        CYDIDF+RDYL+DPELYYGGPG+NP+CP
Subjt:  CYDIDFVRDYLEDPELYYGGPGRNPRCP

TrEMBL top hitse value%identityAlignment
A0A0A0LQZ6 Uncharacterized protein1.0e-19075.58Show/hide
Query:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME
        MGK    NG     +FKMLV  LT+I CGVVE GS+S  K      KK++SLRKQA KSIQSED DIIDCVS+YDQPAFDHPALRNHTIQM PTYDPTM+
Subjt:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME

Query:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD
        KH+KK   E EG+ EK SM VKQ WRKSGSCPK TIPIRR+RK V LKA+S+  YG+K+P  L EIAQ+SN++S + LL NHSK +L   G N+N AKGD
Subjt:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD

Query:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM
        IKVCNP VE DDEYSTSQVALLTGPYYN+EAIESGWAVNPGVYGDRQTRLFVYWT DASHKTGCFDLTCPGFVQT++EIALGSAIYPISTS  L +EITM
Subjt:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM

Query:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE
        FLF+D ET+NWWVQY E+INIGYWPSELF AL YTAETVQWGGEVYSTK+   PHT TGMGNG+FPD++ G+SGWVKRIR RDNSMILKFP +VEHYSDE
Subjt:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE

Query:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP
        YDCYD+DF+R+YL+DPELYYGGPG+N RCP
Subjt:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP

A0A1S3BAI4 uncharacterized protein LOC1034877986.2e-18874.65Show/hide
Query:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME
        MGK    +G     +FKMLVV LT+I CGVVE GS+S  +      KK+NSLRKQA KSIQSED DIIDCVS+YDQPAFDHPALRNHTIQM PTYDPTM+
Subjt:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME

Query:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD
        KH+KK   E+EG+ EK+SM VKQ WR SGSCPK TIPIRR+RK     A+S+  YG+K+P  L EIAQ+SN++S + LL NHSK +L   G+N+N AKGD
Subjt:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD

Query:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM
        IKVCNP VE DDEYSTSQVALLTGPYYN+EAIESGWAVNPGVYGDRQTRLFVYWT DASHKTGCFDLTCPGFVQT++EIALGSAIYPISTS  L +EITM
Subjt:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM

Query:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE
        FLF+D ET+NWWVQY E+INIGYWPSELF AL YTAETVQWGGEVYSTK+   PHT TGMGNG+FPD++ G+SGWVKRIR RDNSMILKFP +VEHYSDE
Subjt:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE

Query:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP
        YDCYD+DF+R+YL+DPELYYGGPG+N RCP
Subjt:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP

A0A5A7V3A1 Uncharacterized protein6.2e-18874.65Show/hide
Query:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME
        MGK    +G     +FKMLVV LT+I CGVVE GS+S  +      KK+NSLRKQA KSIQSED DIIDCVS+YDQPAFDHPALRNHTIQM PTYDPTM+
Subjt:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME

Query:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD
        KH+KK   E+EG+ EK+SM VKQ WR SGSCPK TIPIRR+RK     A+S+  YG+K+P  L EIAQ+SN++S + LL NHSK +L   G+N+N AKGD
Subjt:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD

Query:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM
        IKVCNP VE DDEYSTSQVALLTGPYYN+EAIESGWAVNPGVYGDRQTRLFVYWT DASHKTGCFDLTCPGFVQT++EIALGSAIYPISTS  L +EITM
Subjt:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM

Query:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE
        FLF+D ET+NWWVQY E+INIGYWPSELF AL YTAETVQWGGEVYSTK+   PHT TGMGNG+FPD++ G+SGWVKRIR RDNSMILKFP +VEHYSDE
Subjt:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE

Query:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP
        YDCYD+DF+R+YL+DPELYYGGPG+N RCP
Subjt:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP

A0A6J1HDC5 uncharacterized protein LOC111462577 isoform X11.6e-19676.51Show/hide
Query:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME
        MGKRF         +  ML+VTLT+I CG VEGGS+S QK       ++NSLRKQAIKSI+SED DIIDCVS+YDQPAFDHPALRNHTIQ+ PTYDPT++
Subjt:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME

Query:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD
        +H+KK   EREG EEK+SMVVKQ WRKSGSCP+GTIPIRR+RK VLLKA SIE YGRKKPM+ +E AQ+   +S + LL N SK  L T G+NYNAAKGD
Subjt:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD

Query:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM
        IKVCNP+VE DDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQT++EIALGSAIYPIST NGL YEI M
Subjt:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM

Query:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE
        FLF+D +T NWWVQY E+INIGYWP ELFSAL++TAETVQWGGEVYST I R PHT+T MG+GRFPDF+ G SGWVKR+R RDNSM+L FPGWVEHYSDE
Subjt:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE

Query:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP
        YDCYD+DF+RDYL+DPELYYGGPG+NPRCP
Subjt:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP

A0A6J1K820 uncharacterized protein LOC111492515 isoform X16.2e-19676.28Show/hide
Query:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME
        MGKRF         +  MLVVTLT+I CG VEGGS+S QK       ++NSLRKQAIKSI+SED DIIDCVS+YDQPAFDHPAL NHTIQ+ PTYDPT++
Subjt:  MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTME

Query:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD
        +H+KK   EREG EEK+SMVVKQ WRKSGSCP+GTIPIRR+RK VLLKA+S+E YGRKKPM+ +E AQ+   +S + LL N SK  L T G+NYNAAKGD
Subjt:  KHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGD

Query:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM
        IKVCNP+VE DDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQT++EIALGSAIYPIST NGL YEI M
Subjt:  IKVCNPKVESDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITM

Query:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE
        FLF+D +T NWWVQY E+INIGYWP ELF AL++TAETVQWGGEVYSTKI R PHT+T MG+GRFPDF+ G SGWVKR+R RDNSM+L FPGWVEHYSDE
Subjt:  FLFKDLETSNWWVQYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDE

Query:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP
        YDCYD+DF+RDYL+DPELYYGGPG+NPRCP
Subjt:  YDCYDIDFVRDYLEDPELYYGGPGRNPRCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)6.1e-8740.35Show/hide
Query:  KKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDP----TMEKHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGT
        K++ EV K LN L K A+KSIQS D D+IDCV +  QPAFDHP L++H IQM+P Y P       K +     E+EG        + Q W + G C +GT
Subjt:  KKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDP----TMEKHTKKPEGEREGIEEKDSMVVKQRWRKSGSCPKGT

Query:  IPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVN---HSKTVLHTEGHNYNAAKGDIKVCNPKVESDDEYSTSQVALLTGPY-YNFEA
        IP+RR ++D +L+A S++ YG+KK         +   +S  P L+N   H   + + EG  Y  AK  I V  PK++  +E+S SQ+ LL G +  +  +
Subjt:  IPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVN---HSKTVLHTEGHNYNAAKGDIKVCNPKVESDDEYSTSQVALLTGPY-YNFEA

Query:  IESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITMFLFKDLETSNWWVQYNENINIGYWPSELFSA
        IE+GW V+P +YGD  TRLF YWT+DA   TGC++L C GF+Q N +IA+G++I P+S      Y+I++ ++KD +  +WW+Q+     +GYWPS LFS 
Subjt:  IESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITMFLFKDLETSNWWVQYNENINIGYWPSELFSA

Query:  LTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDEYDCYDIDFVRDYLEDPELYYGGPGRNPRCP
        LT +A  ++WGGEV +++     HT T MG+G+FP+  F  + + + I+  D S  LK P  +  ++++ +CYD+    +       YYGGPG+N +CP
Subjt:  LTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDEYDCYDIDFVRDYLEDPELYYGGPGRNPRCP

AT2G44210.1 Protein of Unknown Function (DUF239)6.1e-8740.05Show/hide
Query:  FKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTMEKHTKKPEGEREGIEE
        F  LV+T+ I+A  VV G +  +  K     K+LN   K A+KSI+S D D+IDCV + DQPAF HP L NHT+QM P+ +P       K   +    + 
Subjt:  FKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTMEKHTKKPEGEREGIEE

Query:  KDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVN-HSKTVLHTEGHNYNAAKGDIKVCNPKVESDDEY
        + S  + Q W  +G CPK TIPIRR R+  L +A S+E YG K     + I +  +++  N L  N H   +++ E   +  AK  I V  P VE  +E+
Subjt:  KDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVN-HSKTVLHTEGHNYNAAKGDIKVCNPKVESDDEY

Query:  STSQVALLTGPY-YNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITMFLFKDLETSNWWV
        S +Q+ +L G +  +  +IE+GW V+P +YGD +TRLF YWT+DA   TGC++L C GFVQ N EIA+G +I P+S      Y+IT+ ++KD +  +WW+
Subjt:  STSQVALLTGPY-YNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITMFLFKDLETSNWWV

Query:  QYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDEYDCYDIDFVRDYL
        Q+ E   IGYWP+ LFS L+ +A  ++WGGEV +++     HT T MG+GRF +  +G + + K ++  D S  L+ P  ++ ++D+ +CY++       
Subjt:  QYNENINIGYWPSELFSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDEYDCYDIDFVRDYL

Query:  EDPELYYGGPGRNPRCP
             YYGGPGRNP CP
Subjt:  EDPELYYGGPGRNPRCP

AT5G25950.1 Protein of Unknown Function (DUF239)1.4e-9441.97Show/hide
Query:  IIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTMEKHTKKPEGEREGIEEKDSMVVKQR
        +I CG     + +  K  L++  KL +L K A+K+I+SED DIIDC+ +Y Q AFDHPAL+NH IQM+P+     +K T    G  E I+        Q 
Subjt:  IIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTMEKHTKKPEGEREGIEEKDSMVVKQR

Query:  WRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKP---MYLEEIAQISNNQSLNPLLVNHSKTVLHTE------GHNYNAAKGDIKVCNPKVESDDEYS
        W KSG CP GTIP+RRV ++ + +A S   +GRK P    +L+   Q   N ++ P  +N ++  L +E      G N+  A+ DI + NP      +YS
Subjt:  WRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKP---MYLEEIAQISNNQSLNPLLVNHSKTVLHTE------GHNYNAAKGDIKVCNPKVESDDEYS

Query:  TSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITMFLFKDLETSNWWVQY
        T+Q+ L+ G   NFE++E GW VNP V+GD +TRLF+ WTTD   KTGC +L C GFVQT+ + ALG+ + P+S+S+   Y IT+ +F D  + NWW+  
Subjt:  TSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITMFLFKDLETSNWWVQY

Query:  NENINIGYWPSELFSALTYTAETVQWGGEVYSTK-IRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDEYDCYDIDFVR-DYL
          N+ +GYWP  LF+ L ++A  VQWGGEV+S   + + PHT T MG+G++  +++  + +   +R +D SM LK+P ++  Y+DEY+CY     R  Y+
Subjt:  NENINIGYWPSELFSALTYTAETVQWGGEVYSTK-IRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDEYDCYDIDFVR-DYL

Query:  EDPELYYGGPGRNPRCP
         +P  Y+GGPGRN RCP
Subjt:  EDPELYYGGPGRNPRCP

AT5G56530.1 Protein of Unknown Function (DUF239)1.5e-8840.9Show/hide
Query:  GSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDP-TMEKHTKKPEGEREGIEEKDSMVVKQRWRKSGSCP
        G +S  ++  EV K LN L K A+KSIQS D DIIDCV +  QPAFDHP L++H IQM P+Y P ++   +K  E  +E +       + Q W ++G C 
Subjt:  GSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDP-TMEKHTKKPEGEREGIEEKDSMVVKQRWRKSGSCP

Query:  KGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVN---HSKTVLHTEGHNYNAAKGDIKVCNPKVESDDEYSTSQVALLTGPY-YN
        +GTIP+RR +K+ +L+A S++ YG+KK +       +   +S +P L+N   H   + + EG  +  AK  I V  PKV+S +E+S SQ+ +L G +  +
Subjt:  KGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVN---HSKTVLHTEGHNYNAAKGDIKVCNPKVESDDEYSTSQVALLTGPY-YN

Query:  FEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITMFLFKDLETSNWWVQYNENINIGYWPSEL
          +IE+GW V+P +YGD  TRLF YWT+DA   TGC++L C GF+Q N +IA+G++I P+S  +   Y+I++ ++KD +  +WW+Q+ +   +GYWPS L
Subjt:  FEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITMFLFKDLETSNWWVQYNENINIGYWPSEL

Query:  FSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDEYDCYDIDFVRDYLEDPELYYGGPGRNPR
        FS L  +A  V+WGGEV + +     HT T MG+G+FPD  F  + + + I+  D+S  LK P  +  ++++ +CYD++  ++       YYGGPGRNP 
Subjt:  FSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDEYDCYDIDFVRDYLEDPELYYGGPGRNPR

Query:  C
        C
Subjt:  C

AT5G56530.2 Protein of Unknown Function (DUF239)1.5e-8840.9Show/hide
Query:  GSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDP-TMEKHTKKPEGEREGIEEKDSMVVKQRWRKSGSCP
        G +S  ++  EV K LN L K A+KSIQS D DIIDCV +  QPAFDHP L++H IQM P+Y P ++   +K  E  +E +       + Q W ++G C 
Subjt:  GSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDP-TMEKHTKKPEGEREGIEEKDSMVVKQRWRKSGSCP

Query:  KGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVN---HSKTVLHTEGHNYNAAKGDIKVCNPKVESDDEYSTSQVALLTGPY-YN
        +GTIP+RR +K+ +L+A S++ YG+KK +       +   +S +P L+N   H   + + EG  +  AK  I V  PKV+S +E+S SQ+ +L G +  +
Subjt:  KGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVN---HSKTVLHTEGHNYNAAKGDIKVCNPKVESDDEYSTSQVALLTGPY-YN

Query:  FEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITMFLFKDLETSNWWVQYNENINIGYWPSEL
          +IE+GW V+P +YGD  TRLF YWT+DA   TGC++L C GF+Q N +IA+G++I P+S  +   Y+I++ ++KD +  +WW+Q+ +   +GYWPS L
Subjt:  FEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITMFLFKDLETSNWWVQYNENINIGYWPSEL

Query:  FSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDEYDCYDIDFVRDYLEDPELYYGGPGRNPR
        FS L  +A  V+WGGEV + +     HT T MG+G+FPD  F  + + + I+  D+S  LK P  +  ++++ +CYD++  ++       YYGGPGRNP 
Subjt:  FSALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDEYDCYDIDFVRDYLEDPELYYGGPGRNPR

Query:  C
        C
Subjt:  C


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGAGATTTTCGTACAATGGTTTTGGATTCAATGGAATTTTCAAAATGTTGGTTGTGACATTGACAATCATAGCTTGTGGAGTTGTGGAGGGTGGTTCAGTTTC
TACACAGAAGAAGAGATTGGAAGTTTTTAAGAAATTAAACTCTCTTAGAAAGCAAGCAATAAAAAGCATTCAGAGTGAAGATGACGACATCATAGACTGCGTTAGCGTTT
ACGACCAGCCTGCTTTCGATCATCCTGCTCTGAGAAATCACACCATCCAGATGGAACCTACTTATGATCCAACCATGGAAAAGCATACAAAGAAACCTGAAGGAGAAAGG
GAAGGAATAGAGGAGAAGGATTCAATGGTTGTGAAACAAAGATGGAGGAAAAGTGGAAGTTGTCCTAAAGGAACAATACCGATTCGAAGGGTCCGAAAAGACGTCCTACT
CAAAGCCGATTCCATAGAATGCTATGGAAGAAAGAAACCGATGTATTTGGAAGAAATCGCACAGATTTCTAACAACCAAAGTTTAAATCCTCTGCTAGTGAATCATTCGA
AGACAGTTCTCCATACTGAAGGACACAACTACAATGCAGCCAAAGGAGACATTAAAGTATGTAACCCGAAGGTTGAATCCGACGACGAATATAGTACTTCCCAAGTGGCT
CTCTTAACTGGCCCTTACTACAATTTTGAGGCTATCGAATCTGGATGGGCTGTAAATCCAGGCGTTTACGGGGATCGACAGACTCGACTGTTCGTGTATTGGACCACTGA
TGCTTCCCACAAAACAGGCTGCTTTGATTTGACTTGCCCTGGTTTTGTTCAAACCAACCATGAAATTGCTCTTGGTTCTGCTATTTATCCAATCTCAACTTCAAATGGAC
TTACTTATGAAATAACAATGTTCCTTTTCAAGGATTTAGAGACGAGTAATTGGTGGGTGCAATATAATGAAAACATCAACATTGGATATTGGCCAAGTGAATTATTCAGT
GCACTAACTTACACAGCAGAAACAGTACAATGGGGAGGGGAAGTTTACAGCACAAAAATAAGAAGAACTCCTCATACTAAAACAGGCATGGGCAATGGAAGGTTTCCCGA
CTTTGTTTTCGGTAACTCGGGTTGGGTAAAACGGATACGAGCTCGAGATAACTCGATGATCTTGAAGTTTCCAGGTTGGGTTGAGCATTACTCTGATGAATATGATTGTT
ACGATATCGATTTCGTTCGAGATTACTTAGAAGATCCTGAACTATACTATGGAGGTCCTGGTAGAAATCCCAGGTGTCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAGAGATTTTCGTACAATGGTTTTGGATTCAATGGAATTTTCAAAATGTTGGTTGTGACATTGACAATCATAGCTTGTGGAGTTGTGGAGGGTGGTTCAGTTTC
TACACAGAAGAAGAGATTGGAAGTTTTTAAGAAATTAAACTCTCTTAGAAAGCAAGCAATAAAAAGCATTCAGAGTGAAGATGACGACATCATAGACTGCGTTAGCGTTT
ACGACCAGCCTGCTTTCGATCATCCTGCTCTGAGAAATCACACCATCCAGATGGAACCTACTTATGATCCAACCATGGAAAAGCATACAAAGAAACCTGAAGGAGAAAGG
GAAGGAATAGAGGAGAAGGATTCAATGGTTGTGAAACAAAGATGGAGGAAAAGTGGAAGTTGTCCTAAAGGAACAATACCGATTCGAAGGGTCCGAAAAGACGTCCTACT
CAAAGCCGATTCCATAGAATGCTATGGAAGAAAGAAACCGATGTATTTGGAAGAAATCGCACAGATTTCTAACAACCAAAGTTTAAATCCTCTGCTAGTGAATCATTCGA
AGACAGTTCTCCATACTGAAGGACACAACTACAATGCAGCCAAAGGAGACATTAAAGTATGTAACCCGAAGGTTGAATCCGACGACGAATATAGTACTTCCCAAGTGGCT
CTCTTAACTGGCCCTTACTACAATTTTGAGGCTATCGAATCTGGATGGGCTGTAAATCCAGGCGTTTACGGGGATCGACAGACTCGACTGTTCGTGTATTGGACCACTGA
TGCTTCCCACAAAACAGGCTGCTTTGATTTGACTTGCCCTGGTTTTGTTCAAACCAACCATGAAATTGCTCTTGGTTCTGCTATTTATCCAATCTCAACTTCAAATGGAC
TTACTTATGAAATAACAATGTTCCTTTTCAAGGATTTAGAGACGAGTAATTGGTGGGTGCAATATAATGAAAACATCAACATTGGATATTGGCCAAGTGAATTATTCAGT
GCACTAACTTACACAGCAGAAACAGTACAATGGGGAGGGGAAGTTTACAGCACAAAAATAAGAAGAACTCCTCATACTAAAACAGGCATGGGCAATGGAAGGTTTCCCGA
CTTTGTTTTCGGTAACTCGGGTTGGGTAAAACGGATACGAGCTCGAGATAACTCGATGATCTTGAAGTTTCCAGGTTGGGTTGAGCATTACTCTGATGAATATGATTGTT
ACGATATCGATTTCGTTCGAGATTACTTAGAAGATCCTGAACTATACTATGGAGGTCCTGGTAGAAATCCCAGGTGTCCCTAG
Protein sequenceShow/hide protein sequence
MGKRFSYNGFGFNGIFKMLVVTLTIIACGVVEGGSVSTQKKRLEVFKKLNSLRKQAIKSIQSEDDDIIDCVSVYDQPAFDHPALRNHTIQMEPTYDPTMEKHTKKPEGER
EGIEEKDSMVVKQRWRKSGSCPKGTIPIRRVRKDVLLKADSIECYGRKKPMYLEEIAQISNNQSLNPLLVNHSKTVLHTEGHNYNAAKGDIKVCNPKVESDDEYSTSQVA
LLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTTDASHKTGCFDLTCPGFVQTNHEIALGSAIYPISTSNGLTYEITMFLFKDLETSNWWVQYNENINIGYWPSELFS
ALTYTAETVQWGGEVYSTKIRRTPHTKTGMGNGRFPDFVFGNSGWVKRIRARDNSMILKFPGWVEHYSDEYDCYDIDFVRDYLEDPELYYGGPGRNPRCP