; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg003016 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg003016
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold12:37550902..37555319
RNA-Seq ExpressionSpg003016
SyntenySpg003016
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579477.1 hypothetical protein SDJN03_23925, partial [Cucurbita argyrosperma subsp. sororia]9.0e-19577.64Show/hide
Query:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAS DSS WS+++     KP SFMLKDYLL+D SSCSSNGFRSFPRRQCC TTVRFLLEIDLKVKDS++TKRF+PRTASRKIALSTISTLQ+ASDAVVRA
Subjt:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSGN
        FK+FPLPS  KPF  RS SRK+ILR FWKK D VD NTRR KSFQEFLDEKEPP S  + + CTA+ V GRNSISSCSNSISWTESEFTSEMIPSSSSGN
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSGN

Query:  SESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTP--AAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLV
        SESCSENDAVK DKDSPGNLIGKRDGVTFGKDS+EETTTAP+A   TTP    +REDIVK WPN+EEKEQ SPVSVLDFPFEDEDQD + SSF+CNLHL+
Subjt:  SESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTP--AAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLV

Query:  EGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAG-DFD
        +GKKQKH+ + KRFENGVE EPLDLKKRFAD+   R+ F  IS+KEHQREQKAFELL LVKSTTT        TENLLLDFFHEKLEEN+A AR G DFD
Subjt:  EGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAG-DFD

Query:  QPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        Q QVLK TEDWING+AGE    GWE PEGR LYIKDME+AGKWRS AGE+ ELAAE EAEVW+SL +ELLI++S
Subjt:  QPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

KAG7016946.1 hypothetical protein SDJN02_22057, partial [Cucurbita argyrosperma subsp. argyrosperma]3.4e-19477.89Show/hide
Query:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAS DSS WS+++     KP SFMLKDYLL+D SSCSSNGFRSFPRRQCC TTVRFLLEIDLKVKDS++TKRF+PRTASRKIALSTISTLQ+ASDAVVRA
Subjt:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNT-RRWKSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSG
        FK+FPLPS RKPF  RS SRK+ILRAFWKK D VD NT RR KSFQEFLDEKEPP S  + + CTA+ V GRNSISSCSNSISWTESEFTSEMIPSSSSG
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNT-RRWKSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSG

Query:  NSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTP--AAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHL
        NSESCSENDAVK DKDSPGNLIGKRDGVTFGKDS+EETTTAP+A   TTP    +REDIVK WPN+EEKEQ SPVSVLDFPFEDEDQD + SSF+CNLHL
Subjt:  NSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTP--AAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHL

Query:  VEGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAG-DF
        V+GKKQKH+ + KRFENGVE EPLDLKKRFAD+    + F  IS+KEHQREQKAFELL LVKSTTT        TENLLLDFFHEKLEEN+A AR G DF
Subjt:  VEGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAG-DF

Query:  DQPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        DQ QVLK TEDWING+AGE    GWE PEGR LYIKDME+AGKWRS AGE+ ELAAE EAEVW+SL +ELLI++S
Subjt:  DQPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

XP_022969906.1 uncharacterized protein LOC111468962 [Cucurbita maxima]1.2e-19478.48Show/hide
Query:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAS DSS WS+++     KP SFMLKDYLL+D SSCSSNGFRSFPRRQCC TTVRFLLEIDLKVKDSS+TKRF+PRTASRKIALSTISTLQ+ASDAVVRA
Subjt:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSGN
        FK+FPLPS RKPF  RS SRK+ILRAFWKK D VD NTRR KSFQEFLDEKEPP S  + + CTA+ V GRNSISSCSNSISWTESEFTSE IPSSSSGN
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSGN

Query:  SESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTP--AAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLV
        SESCSENDAVK DKDSPGNLIGKRDGVTFGKDS+EETTTAP+A   TTP    +REDIVK+WPN+EEKEQ SPVSVLDFPFEDEDQD + SSF+CNLHLV
Subjt:  SESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTP--AAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLV

Query:  EGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAG-DFD
        +GKK  H+QK+KRFENGVE EPLDLKKRFAD+    + FSLIS+KEHQREQKAFELL LVKSTTT        TENLLLDFFHEKLEEN+A AR G D D
Subjt:  EGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAG-DFD

Query:  QPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        Q QVLK TEDWING+AGE  V GWE PEGR LYIKDME+AGKWRS  GE+ ELAAE EAEVWISL +ELLI++S
Subjt:  QPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

XP_023551213.1 uncharacterized protein LOC111809098 [Cucurbita pepo subsp. pepo]2.8e-19678.48Show/hide
Query:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAS DSS WS+++     KP SFMLKDYLL+D SSCSSNGFRSFPRRQCC TTVRFLLEIDLKVKDSS+TKRF+PRTASRKIALSTISTLQ+ASDAVVRA
Subjt:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSGN
        FK+FPLPS RKPF  RS SRK+ILRAFWKK D VD NTRR KSFQEFLDEKEPP S  + + CTA+ V GRNSISSCSNSISWTESEFTSEMIPSSSSGN
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSGN

Query:  SESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTP--AAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLV
        SESCSENDAVK DKDSPGNLIGKRDGVTFGKDS+EETTTAP+A A TTP    +REDIVK+WPN+EEKEQ SPVSVLDFPFEDEDQD + SSF+CNLHLV
Subjt:  SESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTP--AAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLV

Query:  EGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAG-DFD
        +GKKQKH+Q+ KRFENGVE EPLDL KRFAD+   R+ FS IS+KEHQREQKAFELL LVKST T         ENLLLDFFHEKLEEN+ATAR G DFD
Subjt:  EGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAG-DFD

Query:  QPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        Q QVLK TEDWING+ GE    GWE PEGR LYIKDME+AGKWRS AGE+ ELAAE EAEVW+SL +ELLI++S
Subjt:  QPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

XP_038906459.1 uncharacterized protein LOC120092443 [Benincasa hispida]4.8e-19677.38Show/hide
Query:  ASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAF
        ASTDSSNW+L++     KP+SFMLKDYLL+D+SSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKD+S+TKRF+PRT SRKIALSTISTLQ+ASDAV+RAF
Subjt:  ASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAF

Query:  KQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSSDQNPSE---CTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSS
        KQFPLPSSRK FLPRSISRKLI +AFWKKS+IVD NTRRWKSF+EFLDEKEPPSSD N S+   CTAIAVAGRNS SSCSNSISWTESEFTSEMIPSSSS
Subjt:  KQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSSDQNPSE---CTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSS

Query:  GNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLV
        GNSESCSENDAVKDDKDSPGNLIGKRDGV+FGKDS+E+TTTAPAA A  T   YR+DIVK+W NDEEKEQ SPVSVLDFPFEDEDQDI SSSF+CN++LV
Subjt:  GNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLV

Query:  EGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAGDFDQ
        EGKKQKH +K+KR E G+ELEP+DLKKRF D+    + F+LI+KKEHQ E+KAFE L L+KSTT         TENLLLDFFH+KLEE+EATA   DFDQ
Subjt:  EGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAGDFDQ

Query:  PQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
          +LK T++WI+G+AG+M VMGWE PE R  YIKDME+AGKW SFAGE+ EL AE EAEVWISL N+LLI++S
Subjt:  PQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

TrEMBL top hitse value%identityAlignment
A0A0A0KP06 Uncharacterized protein6.3e-18674.53Show/hide
Query:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MASTDSSNW+L++     KP+S +LKDYLL+D SSCSSNGFRSFPRRQCC+TTVRFLLEIDLKVKDSSVTKRF+PRT SRKIALSTISTLQ+ASDAV+RA
Subjt:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSN-TRRWKSFQEFLDEKEPPSS---DQNPSE---CTAIAVAGRNSISSCSNSISWTESEFTSEMI
        FKQFPLPSSRK F PRSISRKLI +AF KKSDIVD N  +RWKSF+EFLDEKEPPSS   ++N S+   CTAIAVAGRNSISSCSNSISWTESEFTSE+I
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSN-TRRWKSFQEFLDEKEPPSS---DQNPSE---CTAIAVAGRNSISSCSNSISWTESEFTSEMI

Query:  PSSSSGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPA-AYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFS
        PSS SGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDS+EETTTAP + A  T A  YRED VK+W N+EEKEQFSPVSVLDFPFEDEDQDI SSSF+
Subjt:  PSSSSGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPA-AYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFS

Query:  CNLHLVEGKKQK-HSQKTKRFENGVELEPLDLKKRFADLG--HRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEA
        CN+HL+EGKKQK   QKTKR E G ELEP+DLKKRF ++     +  F+LI+KKEHQ E+KA E L L+KSTT         TENLLLDFFH+KL+E+EA
Subjt:  CNLHLVEGKKQK-HSQKTKRFENGVELEPLDLKKRFADLG--HRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEA

Query:  TARAGDFDQPQVLKLTEDWINGEAGEMKVMG-WELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        T+   DFDQPQ+LK  +DWI+G AGE+ VMG WELPE RN YIKDME+  KWRSF G++ EL AE E EVWISLLN+LLI++S
Subjt:  TARAGDFDQPQVLKLTEDWINGEAGEMKVMG-WELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

A0A1S3ATL0 uncharacterized protein LOC1034827065.7e-17972.73Show/hide
Query:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MASTDSSNW+L++     KP+S +LKDYLL+D SSCSSNGFRSFPRRQCC+TTVRFLLEIDLKVKDSS TK+F+PRT+SRKIALSTISTLQ+ASDAV+RA
Subjt:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSS---DQNPSE---CTAIAVAGRNSISSCSNSISWTESEFTSEMIP
        FKQFPLPSSRK F PRSISRKLI +AF KKSDIVD N RRWKSF+EFLDEKEPPSS   +QN S+   CTAIAVAGRNSISSCSNSISWTESEFTSE+IP
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSS---DQNPSE---CTAIAVAGRNSISSCSNSISWTESEFTSEMIP

Query:  SSSSGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAAYREDIVKEWP-NDEEKEQFSPVSVLDFPFEDEDQDISSSSFSC
        SS SGNSESCSEN AVKDDKDSP NLIGKRDGVTFGKDS+EET T  A  A      YRED VK+W  N+EEKEQFSPVSVLDFPFEDEDQDI SSS +C
Subjt:  SSSSGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAAYREDIVKEWP-NDEEKEQFSPVSVLDFPFEDEDQDISSSSFSC

Query:  NLHLVEGKKQK-HSQKTKRFENGVELEPLDLKKRFAD---LGHRRRDFSLISK-KEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENE
        N+HL+EGKKQK   QKTKR E G ELEP+DLKKRF +   +   +  F+LI+K KEHQ E+KA E L L+KSTT         TENLLLDFFH+KL+E+E
Subjt:  NLHLVEGKKQK-HSQKTKRFENGVELEPLDLKKRFAD---LGHRRRDFSLISK-KEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENE

Query:  ATARAGDFDQPQVLKLTEDWINGEAGEMKVMG-WELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        AT+   DFDQPQ+L+  +DW++G AGE+ VMG WELPE RN YIKDME+A KWRSF G++ EL AE EAEVWISLL++LLI++S
Subjt:  ATARAGDFDQPQVLKLTEDWINGEAGEMKVMG-WELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

A0A5A7TN51 Uncharacterized protein1.2e-17972.93Show/hide
Query:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MASTDSSNW+L++     KP+S +LKDYLL+D SSCSSNGFRSFPRRQCC+TTVRFLLEIDLKVKDSS TK+F+PRT+SRKIALSTISTLQ+ASDAV+RA
Subjt:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSS---DQNPSE---CTAIAVAGRNSISSCSNSISWTESEFTSEMIP
        FKQFPLPSSRK F PRSISRKLI +AF KKSDIVD N RRWKSF+EFLDEKEPPSS   +QN S+   CTAIAVAGRNSISSCSNSISWTESEFTSE+IP
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSS---DQNPSE---CTAIAVAGRNSISSCSNSISWTESEFTSEMIP

Query:  SSSSGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAAYREDIVKEWP-NDEEKEQFSPVSVLDFPFEDEDQDISSSSFSC
        SS SGNSESCSEN AVKDDKDSP NLIGKRDGVTFGKDS+EET T  A  A      YRED VK+W  N+EEKEQFSPVSVLDFPFEDEDQDI SSSF+C
Subjt:  SSSSGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAAYREDIVKEWP-NDEEKEQFSPVSVLDFPFEDEDQDISSSSFSC

Query:  NLHLVEGKKQK-HSQKTKRFENGVELEPLDLKKRFAD---LGHRRRDFSLISK-KEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENE
        N+HL+EGKKQK   QKTKR E G ELEP+DLKKRF +   +   +  F+LI+K KEHQ E+KA E L L+KSTT         TENLLLDFFH+KL+E+E
Subjt:  NLHLVEGKKQK-HSQKTKRFENGVELEPLDLKKRFAD---LGHRRRDFSLISK-KEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENE

Query:  ATARAGDFDQPQVLKLTEDWINGEAGEMKVMG-WELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        AT+   DFDQPQ+L+  +DW++G AGE+ VMG WELPE RN YIKDME+A KWRSF G++ EL AE EAEVWISLL++LLI++S
Subjt:  ATARAGDFDQPQVLKLTEDWINGEAGEMKVMG-WELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

A0A6J1E8H2 uncharacterized protein LOC1114303531.7e-19477.85Show/hide
Query:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAS DSS WS+++     KP SFMLKDYLL+D SSCSSNGFRSFPRRQCC TTVRFLLEIDLKVKDS++TKRF+PRTASRKIALSTISTLQ+ASDAVVRA
Subjt:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSGN
        FK+FPLPS  KPF  RS SRK+ILR FWKK D VD NTRR KSFQEFLDEKEPP S  + + CTA+ V GRNSISSCSNSISWTESEFTSEMIPSSSSGN
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSGN

Query:  SESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAA--YREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLV
        SESCSENDAVK DKDSPGNLIGKRDGVTFGKDS+EETTTAP+A   TTP    +REDIVK WPN+EEKEQ SPVSVLDFPFEDEDQD + SSF+CNLHLV
Subjt:  SESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAA--YREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLV

Query:  EGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAG-DFD
        +GKKQKH+ K KRFENGVE EPLDLKKRFAD+   R+ F  IS+KE+QREQKAFELL LVKSTTT        TENLLLDFFHEKLEEN+A AR G DFD
Subjt:  EGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAG-DFD

Query:  QPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        Q QVLK TEDWING+AGE    GWE PEGR LYIKDME AGKWRS AGE+ ELAAE EAEVW+SL +ELLI++S
Subjt:  QPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

A0A6J1HZ34 uncharacterized protein LOC1114689625.7e-19578.48Show/hide
Query:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAS DSS WS+++     KP SFMLKDYLL+D SSCSSNGFRSFPRRQCC TTVRFLLEIDLKVKDSS+TKRF+PRTASRKIALSTISTLQ+ASDAVVRA
Subjt:  MASTDSSNWSLMA----PKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSGN
        FK+FPLPS RKPF  RS SRK+ILRAFWKK D VD NTRR KSFQEFLDEKEPP S  + + CTA+ V GRNSISSCSNSISWTESEFTSE IPSSSSGN
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSGN

Query:  SESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTP--AAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLV
        SESCSENDAVK DKDSPGNLIGKRDGVTFGKDS+EETTTAP+A   TTP    +REDIVK+WPN+EEKEQ SPVSVLDFPFEDEDQD + SSF+CNLHLV
Subjt:  SESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTP--AAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLV

Query:  EGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAG-DFD
        +GKK  H+QK+KRFENGVE EPLDLKKRFAD+    + FSLIS+KEHQREQKAFELL LVKSTTT        TENLLLDFFHEKLEEN+A AR G D D
Subjt:  EGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAG-DFD

Query:  QPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        Q QVLK TEDWING+AGE  V GWE PEGR LYIKDME+AGKWRS  GE+ ELAAE EAEVWISL +ELLI++S
Subjt:  QPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G00770.1 unknown protein8.0e-1626.54Show/hide
Query:  RSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFKQF---PLPSSRKPFLPRS
        RS MLKD LLED +SCSSNGF+S PRR                           P    RK           A  AV+ A K      + S+    LPRS
Subjt:  RSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFKQF---PLPSSRKPFLPRS

Query:  ISRKLILR---------AFWKKSDIVDSNTRRW---KSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSGNSESCS
        +SR+L  +            +  DIV     RW   K   E +   EP          T    +   S +SCS   SW++ +FTSE +PSS   N E C 
Subjt:  ISRKLILR---------AFWKKSDIVDSNTRRW---KSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSGNSESCS

Query:  ENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLVEGKKQKH
        E  +VK++    G                E++ TA     T        ++  E     EKE  SPVSV +   E+ D + S SSFS  L  VE  KQK 
Subjt:  ENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLVEGKKQKH

Query:  SQKTKRFENGVELEPLDL-------------------KKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEE
         Q  +RFE+   + P +L                    K   D      D     +   + E+KA +L N VK    +        E+L++D+F ++L +
Subjt:  SQKTKRFENGVELEPLDL-------------------KKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEE

Query:  NEATARAGDFDQPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGE-LAAEVEAEVWISLLNELLIEIS
           +       + Q++   + W+ G+  E ++      + R    +++E          EE E +  ++E E++  L++E L  +S
Subjt:  NEATARAGDFDQPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGE-LAAEVEAEVWISLLNELLIEIS

AT4G11780.1 unknown protein2.3e-3128.34Show/hide
Query:  MASTDSSNWSLMAPKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQ--CCTTTVRFLLEIDLK----------VKDSSVTKRFIPRTASRKIALSTISTLQK
        M S+     S    + +  +L+DYLL+D+SSCSSNGF+SFPRRQ    ++TVR LL+ ++K           K   +T+R    T    I+      + K
Subjt:  MASTDSSNWSLMAPKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQ--CCTTTVRFLLEIDLK----------VKDSSVTKRFIPRTASRKIALSTISTLQK

Query:  ASDAVVRAFKQFPLPSS---RKPFLPRSISRKLILRAFWKKSDI----------VDSNTRRWKS--FQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSC
        AS A +   K  P PSS   ++    RS S++L+  +FW+K  +           D   + W+S  ++E LD++    S  + ++          +I+  
Subjt:  ASDAVVRAFKQFPLPSS---RKPFLPRSISRKLILRAFWKKSDI----------VDSNTRRWKS--FQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSC

Query:  SNSISWTESEFTSEMIPSSSSGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGK---DSIEETTTAPAATATTTPAAYREDIVKEWPNDEEKEQFSPVSV
           IS   S + SE   +SSS   +S S + +     +S   +  + D V  GK   DS++      ++         R++ V     +EEKEQ SPVS+
Subjt:  SNSISWTESEFTSEMIPSSSSGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGK---DSIEETTTAPAATATTTPAAYREDIVKEWPNDEEKEQFSPVSV

Query:  LDFPFEDEDQDISSSSFSCNLHLVEGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFS--LISKKEHQREQKAFELLNLVKSTTTKSQCFIFK-
        L+ PF+D+D+D   +             +K ++K++R    V LEPLDL KR      R+ ++S   +  +E + E +A  L  LVK    ++   +   
Subjt:  LDFPFEDEDQDISSSSFSCNLHLVEGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFS--LISKKEHQREQKAFELLNLVKSTTTKSQCFIFK-

Query:  -TENLLLDFFHEKLEENEATARAGDFDQPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGE-LAAEVEAEVWISLLNELLIE
          +NLLLD+  E           G  ++  ++K  EDW+ G   EM  M WE+   R +Y+K+M    KW    G+E E +  E+    + S ++E + +
Subjt:  -TENLLLDFFHEKLEENEATARAGDFDQPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGE-LAAEVEAEVWISLLNELLIE

Query:  I
        +
Subjt:  I

AT4G23020.1 unknown protein1.8e-2829.46Show/hide
Query:  MASTDSSNWSLMAPKPR--SFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFK
        M S  SS+  L   K R    +L+D+LL+D+SSCSSNGF+SFPR          LL  +++        R I         L+    + KAS A++ A K
Subjt:  MASTDSSNWSLMAPKPR--SFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFK

Query:  QFPLPSS-RKPFLPRSISRKLILRAFWKK----------------SDIVDSNTRRWKSFQEFLDEKEPPSSDQ----NPSEC-TAIAVAGRNSISSCSNS
          P PSS +     R   + L  R+FWKK                +D  +   +R +SF EFL E +   SDQ    +P++  +  A   ++++   S+S
Subjt:  QFPLPSS-RKPFLPRSISRKLILRAFWKK----------------SDIVDSNTRRWKSFQEFLDEKEPPSSDQ----NPSEC-TAIAVAGRNSISSCSNS

Query:  ISWTESEFTSEMIPSSSSGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAAYREDIVKEWPNDEEKEQFSPVSVLDFPFE
         S  +SE T      SSSG        D V        +L           D+ EE                          +EEKEQ SP+S+LD PF+
Subjt:  ISWTESEFTSEMIPSSSSGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAAYREDIVKEWPNDEEKEQFSPVSVLDFPFE

Query:  DEDQDISSSSFSCNLHLVEGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFS--LISKKEHQREQKAFELLNLVKSTTTKSQCFIFKT---ENL
        D+       + S   H  E  ++K  +K +R E+ V LEP+DL+KR  +    R+D+   +I  +E Q E +A  L  LVKS   + Q  +  +   +N+
Subjt:  DEDQDISSSSFSCNLHLVEGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFS--LISKKEHQREQKAFELLNLVKSTTTKSQCFIFKT---ENL

Query:  LLDFFHEKLEENEATARAGDFDQPQVLKLTEDWINGEAGE--MKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWI-SLLNELLIEIS
        LLDFF E    NE        D+ +++++ E+W+     +     M W++ E R +Y+K+M    KW    G+E E   E     ++ SL++EL+ +IS
Subjt:  LLDFFHEKLEENEATARAGDFDQPQVLKLTEDWINGEAGE--MKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWI-SLLNELLIEIS

AT4G23020.2 unknown protein5.3e-2829.28Show/hide
Query:  MASTDSSNWSLMAPKPR--SFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFK
        M S  SS+  L   K R    +L+D+LL+D+SSCSSNGF+SFPR          LL  +++        R I         L+    + KAS A++ A K
Subjt:  MASTDSSNWSLMAPKPR--SFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFK

Query:  QFPLPSS-RKPFLPRSISRKLILRAFWKK----------------SDIVDSNTRRWKSFQEFLDEKEPPSSDQ----NPSEC-TAIAVAGRNSISSCSNS
          P PSS +     R   + L  R+FWKK                +D  +   +R +SF EFL E +   SDQ    +P++  +  A   ++++   S+S
Subjt:  QFPLPSS-RKPFLPRSISRKLILRAFWKK----------------SDIVDSNTRRWKSFQEFLDEKEPPSSDQ----NPSEC-TAIAVAGRNSISSCSNS

Query:  ISWTESEFT---SEMIPSSSSGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAAYREDIVKEWPNDEEKEQFSPVSVLDF
         S  +SE T   S +I    SG+      +D    + ++   L G   G      S+                      +K    +EEKEQ SP+S+LD 
Subjt:  ISWTESEFT---SEMIPSSSSGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDSIEETTTAPAATATTTPAAYREDIVKEWPNDEEKEQFSPVSVLDF

Query:  PFEDEDQDISSSSFSCNLHLVEGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFS--LISKKEHQREQKAFELLNLVKSTTTKSQCFIFKT---
        PF+D+       + S   H  E  ++K  +K +R E+ V LEP+DL+KR  +    R+D+   +I  +E Q E +A  L  LVKS   + Q  +  +   
Subjt:  PFEDEDQDISSSSFSCNLHLVEGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFS--LISKKEHQREQKAFELLNLVKSTTTKSQCFIFKT---

Query:  ENLLLDFFHEKLEENEATARAGDFDQPQVLKLTEDWINGEAGE--MKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWI-SLLNELLIE
        +N+LLDFF E    NE        D+ +++++ E+W+     +     M W++ E R +Y+K+M    KW    G+E E   E     ++ SL++EL+ +
Subjt:  ENLLLDFFHEKLEENEATARAGDFDQPQVLKLTEDWINGEAGE--MKVMGWELPEGRNLYIKDMEMAGKWRSFAGEEGELAAEVEAEVWI-SLLNELLIE

Query:  IS
        IS
Subjt:  IS

AT5G03670.1 unknown protein3.1e-0728.89Show/hide
Query:  DEEKEQFSPVSVLDFPFEDEDQDI--SSSSFSCNLHLVEGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKS
        +EEKEQ SPVSVLD PF+D+D+DI    ++   +   V+  K    QK  RFE    L+P++L+KR +D            + E + E++  E+ +L   
Subjt:  DEEKEQFSPVSVLDFPFEDEDQDI--SSSSFSCNLHLVEGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRRRDFSLISKKEHQREQKAFELLNLVKS

Query:  TTTKSQCFIFKTENLLLDFFHEKLEENEAT-ARAGDFDQPQVLKLTEDWINGEAGEM--------KVMGWELPEGRNLYIK-----DMEMAGKWRS-FAG
              C I  T+ +L  +F E +E  E   A   D    +   L  D I+GEA           ++  W   E   + +        E  G WRS    
Subjt:  TTTKSQCFIFKTENLLLDFFHEKLEENEAT-ARAGDFDQPQVLKLTEDWINGEAGEM--------KVMGWELPEGRNLYIK-----DMEMAGKWRS-FAG

Query:  EEGELAAEVEAEVWISLLNELLIEI
        +  E   ++E E++  L+ EL  +I
Subjt:  EEGELAAEVEAEVWISLLNELLIEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTACTGATTCTTCTAATTGGTCTCTCATGGCGCCAAAACCTAGGTCGTTCATGCTCAAGGATTATCTTCTTGAAGATATGAGCTCTTGCTCCTCCAATGGCTT
CCGGTCCTTTCCGCGCCGCCAATGCTGCACCACAACCGTCCGATTTCTCCTCGAAATCGATCTCAAAGTGAAAGATTCTTCCGTAACTAAAAGATTCATTCCTCGAACGG
CCTCCAGGAAAATCGCTCTCTCCACGATCTCCACTTTGCAAAAGGCGTCCGACGCCGTCGTCAGAGCCTTCAAGCAATTTCCCCTGCCTTCTTCGCGGAAGCCGTTTCTG
CCGAGGAGTATTTCGCGGAAACTGATTCTCAGAGCGTTCTGGAAGAAATCGGATATCGTTGATTCCAACACCAGACGGTGGAAATCGTTTCAGGAATTTCTCGATGAGAA
AGAACCGCCGTCGTCTGACCAGAATCCCTCCGAGTGCACCGCCATTGCCGTTGCTGGAAGAAACTCGATCAGTAGCTGTAGTAACAGTATCAGTTGGACGGAGAGCGAAT
TTACATCGGAGATGATTCCGTCGTCTTCGAGCGGTAATTCCGAGAGTTGCAGCGAAAACGACGCCGTTAAGGACGATAAGGATTCGCCTGGTAATCTCATAGGCAAAAGA
GATGGCGTAACGTTCGGAAAAGATTCCATTGAAGAAACAACCACCGCCCCCGCCGCCACCGCCACCACAACCCCCGCCGCTTACCGGGAGGATATCGTTAAGGAATGGCC
AAATGATGAAGAAAAAGAACAATTCAGTCCTGTTTCAGTGTTGGATTTTCCATTCGAAGATGAAGATCAAGACATCTCCTCCTCATCTTTCAGTTGCAACCTTCACCTCG
TCGAAGGAAAGAAGCAGAAACATTCTCAGAAGACGAAGCGATTCGAGAACGGAGTCGAATTGGAGCCTCTAGACTTGAAAAAGCGATTCGCCGATTTAGGCCATCGCCGT
CGCGATTTCAGCTTAATATCCAAAAAAGAACACCAAAGGGAACAGAAGGCATTCGAGCTTCTAAACCTCGTGAAATCCACGACAACCAAATCGCAGTGCTTCATATTCAA
AACAGAGAATCTTCTTCTCGATTTCTTCCACGAGAAGCTCGAAGAAAACGAAGCAACTGCAAGAGCAGGCGATTTCGATCAGCCACAGGTTTTGAAATTGACCGAGGATT
GGATCAATGGGGAGGCCGGAGAAATGAAGGTAATGGGTTGGGAGTTGCCGGAGGGACGGAACTTGTACATTAAGGATATGGAGATGGCCGGAAAATGGAGAAGTTTCGCC
GGAGAAGAAGGAGAATTGGCGGCGGAGGTTGAAGCTGAGGTTTGGATTTCTTTGCTTAATGAGCTATTGATTGAAATCTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTACTGATTCTTCTAATTGGTCTCTCATGGCGCCAAAACCTAGGTCGTTCATGCTCAAGGATTATCTTCTTGAAGATATGAGCTCTTGCTCCTCCAATGGCTT
CCGGTCCTTTCCGCGCCGCCAATGCTGCACCACAACCGTCCGATTTCTCCTCGAAATCGATCTCAAAGTGAAAGATTCTTCCGTAACTAAAAGATTCATTCCTCGAACGG
CCTCCAGGAAAATCGCTCTCTCCACGATCTCCACTTTGCAAAAGGCGTCCGACGCCGTCGTCAGAGCCTTCAAGCAATTTCCCCTGCCTTCTTCGCGGAAGCCGTTTCTG
CCGAGGAGTATTTCGCGGAAACTGATTCTCAGAGCGTTCTGGAAGAAATCGGATATCGTTGATTCCAACACCAGACGGTGGAAATCGTTTCAGGAATTTCTCGATGAGAA
AGAACCGCCGTCGTCTGACCAGAATCCCTCCGAGTGCACCGCCATTGCCGTTGCTGGAAGAAACTCGATCAGTAGCTGTAGTAACAGTATCAGTTGGACGGAGAGCGAAT
TTACATCGGAGATGATTCCGTCGTCTTCGAGCGGTAATTCCGAGAGTTGCAGCGAAAACGACGCCGTTAAGGACGATAAGGATTCGCCTGGTAATCTCATAGGCAAAAGA
GATGGCGTAACGTTCGGAAAAGATTCCATTGAAGAAACAACCACCGCCCCCGCCGCCACCGCCACCACAACCCCCGCCGCTTACCGGGAGGATATCGTTAAGGAATGGCC
AAATGATGAAGAAAAAGAACAATTCAGTCCTGTTTCAGTGTTGGATTTTCCATTCGAAGATGAAGATCAAGACATCTCCTCCTCATCTTTCAGTTGCAACCTTCACCTCG
TCGAAGGAAAGAAGCAGAAACATTCTCAGAAGACGAAGCGATTCGAGAACGGAGTCGAATTGGAGCCTCTAGACTTGAAAAAGCGATTCGCCGATTTAGGCCATCGCCGT
CGCGATTTCAGCTTAATATCCAAAAAAGAACACCAAAGGGAACAGAAGGCATTCGAGCTTCTAAACCTCGTGAAATCCACGACAACCAAATCGCAGTGCTTCATATTCAA
AACAGAGAATCTTCTTCTCGATTTCTTCCACGAGAAGCTCGAAGAAAACGAAGCAACTGCAAGAGCAGGCGATTTCGATCAGCCACAGGTTTTGAAATTGACCGAGGATT
GGATCAATGGGGAGGCCGGAGAAATGAAGGTAATGGGTTGGGAGTTGCCGGAGGGACGGAACTTGTACATTAAGGATATGGAGATGGCCGGAAAATGGAGAAGTTTCGCC
GGAGAAGAAGGAGAATTGGCGGCGGAGGTTGAAGCTGAGGTTTGGATTTCTTTGCTTAATGAGCTATTGATTGAAATCTCCTAG
Protein sequenceShow/hide protein sequence
MASTDSSNWSLMAPKPRSFMLKDYLLEDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFKQFPLPSSRKPFL
PRSISRKLILRAFWKKSDIVDSNTRRWKSFQEFLDEKEPPSSDQNPSECTAIAVAGRNSISSCSNSISWTESEFTSEMIPSSSSGNSESCSENDAVKDDKDSPGNLIGKR
DGVTFGKDSIEETTTAPAATATTTPAAYREDIVKEWPNDEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLVEGKKQKHSQKTKRFENGVELEPLDLKKRFADLGHRR
RDFSLISKKEHQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEENEATARAGDFDQPQVLKLTEDWINGEAGEMKVMGWELPEGRNLYIKDMEMAGKWRSFA
GEEGELAAEVEAEVWISLLNELLIEIS