; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032778 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032778
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr11:37450344..37453849
RNA-Seq ExpressionLag0032778
SyntenyLag0032778
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579477.1 hypothetical protein SDJN03_23925, partial [Cucurbita argyrosperma subsp. sororia]1.8e-18775.68Show/hide
Query:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAS  SS W++++     KP SFMLKDYLLDD SSCSSNGFRSFPRRQCC TTVRFLLEIDLKVKDS++TKRF+PRTASRKIALSTISTLQ+ASDAVVRA
Subjt:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSS
        FK+FPLPS  KPF  RS SRK+ILR FWKK + VD NTRR KSFQEFLDEKEPP S    +DSA+CTA+ V GRNSIS CSNSISWTESEFTSEMIPSSS
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSS

Query:  SGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA----------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL
        SGNSESCS NDAVK DKDSP NLIGKRDG TFGKDS+EETTTAP+A          +REDIVK WPNEEEKEQ SPVSVLDFPFEDEDQD + SSF+CNL
Subjt:  SGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA----------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL

Query:  HLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAG-
        HLI+GKKQKH  + KRFE GVE EPLDLKKRFADI   R+ F  IS+KE+QREQKAFELL LVKSTTT        TENLLLDFFHEKLEE++A AR G 
Subjt:  HLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAG-

Query:  DFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        DFDQ QVLK TEDWING+AGE    GWE PEGR LYI+DME+AGKWRS AGE+ ELAAE EAEVW+SL +ELLI++S
Subjt:  DFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

XP_022922340.1 uncharacterized protein LOC111430353 [Cucurbita moschata]5.2e-18775.68Show/hide
Query:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAS  SS W++++     KP SFMLKDYLLDD SSCSSNGFRSFPRRQCC TTVRFLLEIDLKVKDS++TKRF+PRTASRKIALSTISTLQ+ASDAVVRA
Subjt:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSS
        FK+FPLPS  KPF  RS SRK+ILR FWKK + VD NTRR KSFQEFLDEKEPP S    +DSA+CTA+ V GRNSIS CSNSISWTESEFTSEMIPSSS
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSS

Query:  SGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA----------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL
        SGNSESCS NDAVK DKDSP NLIGKRDG TFGKDS+EETTTAP+A          +REDIVK WPNEEEKEQ SPVSVLDFPFEDEDQD + SSF+CNL
Subjt:  SGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA----------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL

Query:  HLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAG-
        HL++GKKQKH  K KRFE GVE EPLDLKKRFADI   R+ F  IS+KE QREQKAFELL LVKSTTT        TENLLLDFFHEKLEE++A AR G 
Subjt:  HLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAG-

Query:  DFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        DFDQ QVLK TEDWING+AGE    GWE PEGR LYI+DME AGKWRS AGE+ ELAAE EAEVW+SL +ELLI++S
Subjt:  DFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

XP_022969906.1 uncharacterized protein LOC111468962 [Cucurbita maxima]4.0e-18776.1Show/hide
Query:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAS  SS W++++     KP SFMLKDYLLDD SSCSSNGFRSFPRRQCC TTVRFLLEIDLKVKDSS+TKRF+PRTASRKIALSTISTLQ+ASDAVVRA
Subjt:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSS
        FK+FPLPS RKPF  RS SRK+ILRAFWKK + VD NTRR KSFQEFLDEKEPP S    +DSA+CTA+ V GRNSIS CSNSISWTESEFTSE IPSSS
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSS

Query:  SGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA----------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL
        SGNSESCS NDAVK DKDSP NLIGKRDG TFGKDS+EETTTAP+A          +REDIVK+WPNEEEKEQ SPVSVLDFPFEDEDQD + SSF+CNL
Subjt:  SGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA----------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL

Query:  HLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAG-
        HL++GKK  H QK+KRFE GVE EPLDLKKRFADI    + FSLIS+KE+QREQKAFELL LVKSTTT        TENLLLDFFHEKLEE++A AR G 
Subjt:  HLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAG-

Query:  DFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        D DQ QVLK TEDWING+AGE  V GWE PEGR LYI+DME+AGKWRS  GE+ ELAAE EAEVWISL +ELLI++S
Subjt:  DFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

XP_023551213.1 uncharacterized protein LOC111809098 [Cucurbita pepo subsp. pepo]2.8e-18875.89Show/hide
Query:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAS  SS W++++     KP SFMLKDYLLDD SSCSSNGFRSFPRRQCC TTVRFLLEIDLKVKDSS+TKRF+PRTASRKIALSTISTLQ+ASDAVVRA
Subjt:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSS
        FK+FPLPS RKPF  RS SRK+ILRAFWKK + VD NTRR KSFQEFLDEKEPP S    +DSA+CTA+ V GRNSIS CSNSISWTESEFTSEMIPSSS
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSS

Query:  SGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA----------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL
        SGNSESCS NDAVK DKDSP NLIGKRDG TFGKDS+EETTTAP+A          +REDIVK+WPNEEEKEQ SPVSVLDFPFEDEDQD + SSF+CNL
Subjt:  SGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA----------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL

Query:  HLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAG-
        HL++GKKQKH Q+ KRFE GVE EPLDL KRFADI   R+ FS IS+KE+QREQKAFELL LVKST T         ENLLLDFFHEKLEE++ATAR G 
Subjt:  HLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAG-

Query:  DFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        DFDQ QVLK TEDWING+ GE    GWE PEGR LYI+DME+AGKWRS AGE+ ELAAE EAEVW+SL +ELLI++S
Subjt:  DFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

XP_038906459.1 uncharacterized protein LOC120092443 [Benincasa hispida]8.9e-19576.69Show/hide
Query:  ASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAF
        AST SS+WTL++     KP+SFMLKDYLLDD+SSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKD+S+TKRF+PRT SRKIALSTISTLQ+ASDAV+RAF
Subjt:  ASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAF

Query:  KQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSSS
        KQFPLPSSRK FLPRSISRKLI +AFWKKSEIVD NTRRWKSF+EFLDEKEPPSSD N +DSA+CTAIAVAGRNS S CSNSISWTESEFTSEMIPSSSS
Subjt:  KQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSSS

Query:  GNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA-------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLIE
        GNSESCS NDAVKDDKDSP NLIGKRDG +FGKDS+E+TTTAPAA       YR+DIVK+W N+EEKEQ SPVSVLDFPFEDEDQDI SSSF+CN++L+E
Subjt:  GNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA-------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLIE

Query:  GKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAGDFDQP
        GKKQKH +K+KR EKG+ELEP+DLKKRF DI    + F+LI+KKE+Q E+KAFE L L+KSTT         TENLLLDFFH+KLEEHEATA   DFDQ 
Subjt:  GKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAGDFDQP

Query:  QVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
         +LK T++WI+G+AG+M VMGWE PE R  YI+DME+AGKW SFAGE+ EL AE EAEVWISL N+LLI++S
Subjt:  QVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

TrEMBL top hitse value%identityAlignment
A0A0A0KP06 Uncharacterized protein6.9e-18573.91Show/hide
Query:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAST SS+WTL++     KP+S +LKDYLLDD SSCSSNGFRSFPRRQCC+TTVRFLLEIDLKVKDSSVTKRF+PRT SRKIALSTISTLQ+ASDAV+RA
Subjt:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSN-TRRWKSFQEFLDEKEPPSS---DQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMI
        FKQFPLPSSRK F PRSISRKLI +AF KKS+IVD N  +RWKSF+EFLDEKEPPSS   ++N +DSA+CTAIAVAGRNSIS CSNSISWTESEFTSE+I
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSN-TRRWKSFQEFLDEKEPPSS---DQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMI

Query:  PSSSSGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA---------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFS
        PSS SGNSESCS NDAVKDDKDSP NLIGKRDG TFGKDS+EETTTAP +         YRED VK+W NEEEKEQFSPVSVLDFPFEDEDQDI SSSF+
Subjt:  PSSSSGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA---------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFS

Query:  CNLHLIEGKKQK-HCQKTKRFEKGVELEPLDLKKRFADIG--HRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEA
        CN+HL+EGKKQK   QKTKR EKG ELEP+DLKKRF +I     +  F+LI+KKE+Q E+KA E L L+KSTT         TENLLLDFFH+KL+EHEA
Subjt:  CNLHLIEGKKQK-HCQKTKRFEKGVELEPLDLKKRFADIG--HRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEA

Query:  TARAGDFDQPQVLKLTEDWINGEAGEMTVMG-WELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        T+   DFDQPQ+LK  +DWI+G AGE+TVMG WELPE RN YI+DME+  KWRSF G++ EL AE E EVWISLLN+LLI++S
Subjt:  TARAGDFDQPQVLKLTEDWINGEAGEMTVMG-WELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

A0A1S3ATL0 uncharacterized protein LOC1034827063.9e-18073.03Show/hide
Query:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAST SS+WTL++     KP+S +LKDYLLDD SSCSSNGFRSFPRRQCC+TTVRFLLEIDLKVKDSS TK+F+PRT+SRKIALSTISTLQ+ASDAV+RA
Subjt:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSS---DQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIP
        FKQFPLPSSRK F PRSISRKLI +AF KKS+IVD N RRWKSF+EFLDEKEPPSS   +QN +DSA+CTAIAVAGRNSIS CSNSISWTESEFTSE+IP
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSS---DQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIP

Query:  SSSSGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTT------APAAYREDIVKEWP-NEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL
        SS SGNSESCS N AVKDDKDSP NLIGKRDG TFGKDS+EET T      A   YRED VK+W  NEEEKEQFSPVSVLDFPFEDEDQDI SSS +CN+
Subjt:  SSSSGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTT------APAAYREDIVKEWP-NEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL

Query:  HLIEGKKQK-HCQKTKRFEKGVELEPLDLKKRFAD---IGHRRRDFSLISK-KEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEAT
        HL+EGKKQK   QKTKR EKG ELEP+DLKKRF +   I   +  F+LI+K KE+Q E+KA E L L+KSTT         TENLLLDFFH+KL+EHEAT
Subjt:  HLIEGKKQK-HCQKTKRFEKGVELEPLDLKKRFAD---IGHRRRDFSLISK-KEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEAT

Query:  ARAGDFDQPQVLKLTEDWINGEAGEMTVMG-WELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        +   DFDQPQ+L+  +DW++G AGE+TVMG WELPE RN YI+DME+A KWRSF G++ EL AE EAEVWISLL++LLI++S
Subjt:  ARAGDFDQPQVLKLTEDWINGEAGEMTVMG-WELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

A0A5A7TN51 Uncharacterized protein7.9e-18173.24Show/hide
Query:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAST SS+WTL++     KP+S +LKDYLLDD SSCSSNGFRSFPRRQCC+TTVRFLLEIDLKVKDSS TK+F+PRT+SRKIALSTISTLQ+ASDAV+RA
Subjt:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSS---DQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIP
        FKQFPLPSSRK F PRSISRKLI +AF KKS+IVD N RRWKSF+EFLDEKEPPSS   +QN +DSA+CTAIAVAGRNSIS CSNSISWTESEFTSE+IP
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSS---DQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIP

Query:  SSSSGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTT------APAAYREDIVKEWP-NEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL
        SS SGNSESCS N AVKDDKDSP NLIGKRDG TFGKDS+EET T      A   YRED VK+W  NEEEKEQFSPVSVLDFPFEDEDQDI SSSF+CN+
Subjt:  SSSSGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTT------APAAYREDIVKEWP-NEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL

Query:  HLIEGKKQK-HCQKTKRFEKGVELEPLDLKKRFAD---IGHRRRDFSLISK-KEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEAT
        HL+EGKKQK   QKTKR EKG ELEP+DLKKRF +   I   +  F+LI+K KE+Q E+KA E L L+KSTT         TENLLLDFFH+KL+EHEAT
Subjt:  HLIEGKKQK-HCQKTKRFEKGVELEPLDLKKRFAD---IGHRRRDFSLISK-KEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEAT

Query:  ARAGDFDQPQVLKLTEDWINGEAGEMTVMG-WELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        +   DFDQPQ+L+  +DW++G AGE+TVMG WELPE RN YI+DME+A KWRSF G++ EL AE EAEVWISLL++LLI++S
Subjt:  ARAGDFDQPQVLKLTEDWINGEAGEMTVMG-WELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

A0A6J1E8H2 uncharacterized protein LOC1114303532.5e-18775.68Show/hide
Query:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAS  SS W++++     KP SFMLKDYLLDD SSCSSNGFRSFPRRQCC TTVRFLLEIDLKVKDS++TKRF+PRTASRKIALSTISTLQ+ASDAVVRA
Subjt:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSS
        FK+FPLPS  KPF  RS SRK+ILR FWKK + VD NTRR KSFQEFLDEKEPP S    +DSA+CTA+ V GRNSIS CSNSISWTESEFTSEMIPSSS
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSS

Query:  SGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA----------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL
        SGNSESCS NDAVK DKDSP NLIGKRDG TFGKDS+EETTTAP+A          +REDIVK WPNEEEKEQ SPVSVLDFPFEDEDQD + SSF+CNL
Subjt:  SGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA----------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL

Query:  HLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAG-
        HL++GKKQKH  K KRFE GVE EPLDLKKRFADI   R+ F  IS+KE QREQKAFELL LVKSTTT        TENLLLDFFHEKLEE++A AR G 
Subjt:  HLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAG-

Query:  DFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        DFDQ QVLK TEDWING+AGE    GWE PEGR LYI+DME AGKWRS AGE+ ELAAE EAEVW+SL +ELLI++S
Subjt:  DFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

A0A6J1HZ34 uncharacterized protein LOC1114689621.9e-18776.1Show/hide
Query:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA
        MAS  SS W++++     KP SFMLKDYLLDD SSCSSNGFRSFPRRQCC TTVRFLLEIDLKVKDSS+TKRF+PRTASRKIALSTISTLQ+ASDAVVRA
Subjt:  MASTVSSDWTLMA----AKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSS
        FK+FPLPS RKPF  RS SRK+ILRAFWKK + VD NTRR KSFQEFLDEKEPP S    +DSA+CTA+ V GRNSIS CSNSISWTESEFTSE IPSSS
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSS

Query:  SGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA----------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL
        SGNSESCS NDAVK DKDSP NLIGKRDG TFGKDS+EETTTAP+A          +REDIVK+WPNEEEKEQ SPVSVLDFPFEDEDQD + SSF+CNL
Subjt:  SGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAA----------YREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNL

Query:  HLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAG-
        HL++GKK  H QK+KRFE GVE EPLDLKKRFADI    + FSLIS+KE+QREQKAFELL LVKSTTT        TENLLLDFFHEKLEE++A AR G 
Subjt:  HLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAG-

Query:  DFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS
        D DQ QVLK TEDWING+AGE  V GWE PEGR LYI+DME+AGKWRS  GE+ ELAAE EAEVWISL +ELLI++S
Subjt:  DFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWISLLNELLIEIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G00770.1 unknown protein6.1e-1626.26Show/hide
Query:  RSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFKQF---PLPSSRKPFLPRS
        RS MLKD LL+D +SCSSNGF+S PRR                           P    RK           A  AV+ A K      + S+    LPRS
Subjt:  RSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFKQF---PLPSSRKPFLPRS

Query:  ISRKLILRAFWKKSEIVDSNT-------RRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSSSGNSESCSVN
        +SR+L  +    K+E   S T        RW S ++  ++     S   P            G ++ S  S S SW++ +FTSE +PSS   N E C   
Subjt:  ISRKLILRAFWKKSEIVDSNT-------RRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSSSGNSESCSVN

Query:  DAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAAYREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLIEGKKQKHCQKTKRFEKG
         +VK++     + +G            E++ TA      ++  E   + EKE  SPVSV +   E+ D + S SSFS  L  +E  KQK  Q  +RFE  
Subjt:  DAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAAYREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLIEGKKQKHCQKTKRFEKG

Query:  VELEPLDLKKRFA-----------------DIGHRRRDFSLISKKEY--QREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAGDF
          + P +L +  +                 D           S+ EY  + E+KA +L N VK    +        E+L++D+F ++L +   +      
Subjt:  VELEPLDLKKRFA-----------------DIGHRRRDFSLISKKEY--QREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAGDF

Query:  DQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGE-LAAEVEAEVWISLLNELLIEIS
         + Q++   + W+ G+       G    + R    +++E          EE E +  ++E E++  L++E L  +S
Subjt:  DQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGE-LAAEVEAEVWISLLNELLIEIS

AT4G11780.1 unknown protein1.9e-3330.27Show/hide
Query:  MASTV-SSDWTLMAAKPR--SFMLKDYLLDDMSSCSSNGFRSFPRRQ--CCTTTVRFLLEIDLK----------VKDSSVTKRFIPRTASRKIALSTIST
        MAS + SSD  L  +K R    +L+DYLLDD+SSCSSNGF+SFPRRQ    ++TVR LL+ ++K           K   +T+R    T    I+      
Subjt:  MASTV-SSDWTLMAAKPR--SFMLKDYLLDDMSSCSSNGFRSFPRRQ--CCTTTVRFLLEIDLK----------VKDSSVTKRFIPRTASRKIALSTIST

Query:  LQKASDAVVRAFKQFPLPSS---RKPFLPRSISRKLILRAFWKKSEI----------VDSNTRRWKS--FQEFLDEKEPPSSDQNPADSALCTAIAVAGR
        + KAS A +   K  P PSS   ++    RS S++L+  +FW+K  +           D   + W+S  ++E LD++    S  +  D  +  + + A  
Subjt:  LQKASDAVVRAFKQFPLPSS---RKPFLPRSISRKLILRAFWKKSEI----------VDSNTRRWKS--FQEFLDEKEPPSSDQNPADSALCTAIAVAGR

Query:  NSISCCSNSISWTESEFTSEM---------------IPSSSSGNSESCSVN-DAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAAYREDIVKEWPNE
         +I+     IS   S + SE                  SSSSG SE  S   DAV+D K+S  + +   DG     D            R++ V      
Subjt:  NSISCCSNSISWTESEFTSEM---------------IPSSSSGNSESCSVN-DAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAAYREDIVKEWPNE

Query:  EEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFS--LISKKEYQREQKAFELLNLVKST
        EEKEQ SPVS+L+ PF+D+D+D   +  +          +K  +K++R    V LEPLDL KR      R+ ++S   +  +E + E +A  L  LVK  
Subjt:  EEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFS--LISKKEYQREQKAFELLNLVKST

Query:  TTKSQCFIFK--TENLLLDFFHEKLEEHEATARAGDFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGE-LAAEVEAEV
          ++   +     +NLLLD+  E           G  ++  ++K  EDW+ G   EM  M WE+   R +Y+++M    KW    G+E E +  E+    
Subjt:  TTKSQCFIFK--TENLLLDFFHEKLEEHEATARAGDFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGE-LAAEVEAEV

Query:  WISLLNELLIEI
        + S ++E + ++
Subjt:  WISLLNELLIEI

AT4G23020.1 unknown protein4.8e-2929.82Show/hide
Query:  MASTVSSDWTLMAAKPR--SFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFK
        M S  SSD  L  +K R    +L+D+LLDD+SSCSSNGF+SFPR          LL  +++        R I         L+    + KAS A++ A K
Subjt:  MASTVSSDWTLMAAKPR--SFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFK

Query:  QFPLPSS-RKPFLPRSISRKLILRAFWKK----------------SEIVDSNTRRWKSFQEFLDEKEPPSSDQ----NPADSALCTAIAVAGRNSISCCS
          P PSS +     R   + L  R+FWKK                ++  +   +R +SF EFL E +   SDQ    +P D  L +  A   ++++   S
Subjt:  QFPLPSS-RKPFLPRSISRKLILRAFWKK----------------SEIVDSNTRRWKSFQEFLDEKEPPSSDQ----NPADSALCTAIAVAGRNSISCCS

Query:  NSISWTESEFTSEMIPSSSSGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAAYREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDI
        +S S  +SE T      SSSG        D V       ++L           D+ EE                   EEKEQ SP+S+LD PF+D+    
Subjt:  NSISWTESEFTSEMIPSSSSGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAAYREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDI

Query:  SSSSFSCNLHLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFS--LISKKEYQREQKAFELLNLVKSTTTKSQCFIFKT---ENLLLDFFH
           + S   H  E  ++K  +K +R E  V LEP+DL+KR       R+D+   +I  +E Q E +A  L  LVKS   + Q  +  +   +N+LLDFF 
Subjt:  SSSSFSCNLHLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFS--LISKKEYQREQKAFELLNLVKSTTTKSQCFIFKT---ENLLLDFFH

Query:  EKLEEHEATARAGDFDQPQVLKLTEDWINGEAGE--MTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWI-SLLNELLIEIS
        E              D+ +++++ E+W+     +     M W++ E R +Y+++M    KW    G+E E   E     ++ SL++EL+ +IS
Subjt:  EKLEEHEATARAGDFDQPQVLKLTEDWINGEAGE--MTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWI-SLLNELLIEIS

AT4G23020.2 unknown protein5.7e-3029.64Show/hide
Query:  MASTVSSDWTLMAAKPR--SFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFK
        M S  SSD  L  +K R    +L+D+LLDD+SSCSSNGF+SFPR          LL  +++        R I         L+    + KAS A++ A K
Subjt:  MASTVSSDWTLMAAKPR--SFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFK

Query:  QFPLPSS-RKPFLPRSISRKLILRAFWKK----------------SEIVDSNTRRWKSFQEFLDEKEPPSSDQ----NPADSALCTAIAVAGRNSISCCS
          P PSS +     R   + L  R+FWKK                ++  +   +R +SF EFL E +   SDQ    +P D  L +  A   ++++   S
Subjt:  QFPLPSS-RKPFLPRSISRKLILRAFWKK----------------SEIVDSNTRRWKSFQEFLDEKEPPSSDQ----NPADSALCTAIAVAGRNSISCCS

Query:  NSISWTESEFT---SEMIPSSSSGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAAYREDIVKEWPNEEEKEQFSPVSVLDFPFEDED
        +S S  +SE T   S +I    SG+     V+D    + ++   L G   G      S+         +  +        EEKEQ SP+S+LD PF+D+ 
Subjt:  NSISWTESEFT---SEMIPSSSSGNSESCSVNDAVKDDKDSPANLIGKRDGETFGKDSIEETTTAPAAYREDIVKEWPNEEEKEQFSPVSVLDFPFEDED

Query:  QDISSSSFSCNLHLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFS--LISKKEYQREQKAFELLNLVKSTTTKSQCFIFKT---ENLLLD
              + S   H  E  ++K  +K +R E  V LEP+DL+KR       R+D+   +I  +E Q E +A  L  LVKS   + Q  +  +   +N+LLD
Subjt:  QDISSSSFSCNLHLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFS--LISKKEYQREQKAFELLNLVKSTTTKSQCFIFKT---ENLLLD

Query:  FFHEKLEEHEATARAGDFDQPQVLKLTEDWINGEAGE--MTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWI-SLLNELLIEIS
        FF E              D+ +++++ E+W+     +     M W++ E R +Y+++M    KW    G+E E   E     ++ SL++EL+ +IS
Subjt:  FFHEKLEEHEATARAGDFDQPQVLKLTEDWINGEAGE--MTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGELAAEVEAEVWI-SLLNELLIEIS

AT5G03670.1 unknown protein7.9e-0830.22Show/hide
Query:  EEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLIEGKKQKH--CQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKS
        EEEKEQ SPVSVLD PF+D+D+DI     +        +K KH   QK  RFE+   L+P++L+KR +D            + E + E++  E+ +L   
Subjt:  EEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLIEGKKQKH--CQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSLISKKEYQREQKAFELLNLVKS

Query:  TTTKSQCFIFKTENLLLDFFHEKLEEHEAT-ARAGDFDQPQVLKLTEDWINGEAGEMTV--------MGWELPEGRNLYIQ-----DMEMAGKWRS-FAG
              C I  T+ +L  +F E +E  E   A   D    +   L  D I+GEA    V          W   E   + +        E  G WRS    
Subjt:  TTTKSQCFIFKTENLLLDFFHEKLEEHEAT-ARAGDFDQPQVLKLTEDWINGEAGEMTV--------MGWELPEGRNLYIQ-----DMEMAGKWRS-FAG

Query:  EEGELAAEVEAEVWISLLNELLIEI
        +  E   ++E E++  L+ EL  +I
Subjt:  EEGELAAEVEAEVWISLLNELLIEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCACGGTTTCTTCTGATTGGACTCTCATGGCGGCAAAACCTAGGTCGTTCATGCTCAAGGATTATCTTCTTGACGATATGAGCTCTTGCTCCTCCAATGGCTT
CCGGTCCTTTCCGCGCCGCCAATGCTGCACCACAACCGTCCGATTTCTCCTCGAAATCGATCTCAAAGTGAAAGATTCTTCCGTAACTAAAAGATTCATTCCTCGAACGG
CCTCCAGGAAAATCGCTCTCTCCACGATCTCCACTTTGCAAAAGGCGTCCGACGCCGTCGTCAGAGCCTTCAAGCAATTTCCCCTGCCTTCTTCGCGGAAGCCGTTTCTG
CCGAGGAGTATTTCGCGGAAACTGATTCTCAGAGCGTTCTGGAAGAAATCGGAAATCGTTGATTCCAACACCAGACGGTGGAAATCGTTTCAGGAATTTCTCGATGAGAA
AGAACCGCCGTCGTCTGACCAGAATCCCGCCGATTCCGCCCTGTGCACCGCCATTGCCGTTGCTGGAAGAAACTCGATCAGTTGCTGTAGTAACAGTATCAGTTGGACGG
AGAGCGAATTTACATCGGAGATGATTCCGTCGTCTTCGAGCGGTAATTCCGAGAGTTGCAGCGTAAACGACGCCGTTAAGGACGATAAGGATTCGCCTGCTAATCTCATC
GGCAAAAGAGATGGCGAAACGTTCGGAAAAGATTCCATCGAAGAAACAACCACCGCCCCCGCCGCTTACCGGGAGGATATCGTTAAGGAATGGCCAAATGAAGAAGAAAA
GGAGCAGTTCAGTCCTGTTTCAGTGTTGGATTTTCCATTCGAAGATGAAGATCAAGACATCTCCTCCTCATCTTTCAGTTGCAACCTTCATCTCATCGAAGGAAAGAAGC
AGAAACATTGTCAGAAGACGAAGCGGTTCGAGAAAGGAGTCGAATTGGAGCCTCTAGACTTGAAAAAGCGATTCGCAGACATAGGCCATCGCCGTCGCGATTTCAGCTTA
ATATCCAAAAAAGAATACCAAAGAGAACAGAAGGCATTCGAGCTTCTAAACCTCGTGAAATCCACGACAACCAAATCGCAATGCTTCATATTCAAAACAGAGAATCTTCT
TCTCGATTTCTTCCACGAGAAGCTCGAAGAACACGAAGCAACTGCAAGAGCAGGCGATTTCGATCAGCCACAGGTTTTGAAATTGACCGAAGATTGGATCAATGGGGAGG
CCGGAGAAATGACGGTAATGGGTTGGGAGTTGCCGGAGGGACGGAATTTGTACATTCAGGATATGGAGATGGCCGGAAAATGGAGAAGTTTCGCCGGAGAAGAAGGAGAA
TTGGCGGCGGAGGTTGAAGCTGAGGTTTGGATTTCTTTGCTTAATGAGCTATTGATTGAAATCTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCCACGGTTTCTTCTGATTGGACTCTCATGGCGGCAAAACCTAGGTCGTTCATGCTCAAGGATTATCTTCTTGACGATATGAGCTCTTGCTCCTCCAATGGCTT
CCGGTCCTTTCCGCGCCGCCAATGCTGCACCACAACCGTCCGATTTCTCCTCGAAATCGATCTCAAAGTGAAAGATTCTTCCGTAACTAAAAGATTCATTCCTCGAACGG
CCTCCAGGAAAATCGCTCTCTCCACGATCTCCACTTTGCAAAAGGCGTCCGACGCCGTCGTCAGAGCCTTCAAGCAATTTCCCCTGCCTTCTTCGCGGAAGCCGTTTCTG
CCGAGGAGTATTTCGCGGAAACTGATTCTCAGAGCGTTCTGGAAGAAATCGGAAATCGTTGATTCCAACACCAGACGGTGGAAATCGTTTCAGGAATTTCTCGATGAGAA
AGAACCGCCGTCGTCTGACCAGAATCCCGCCGATTCCGCCCTGTGCACCGCCATTGCCGTTGCTGGAAGAAACTCGATCAGTTGCTGTAGTAACAGTATCAGTTGGACGG
AGAGCGAATTTACATCGGAGATGATTCCGTCGTCTTCGAGCGGTAATTCCGAGAGTTGCAGCGTAAACGACGCCGTTAAGGACGATAAGGATTCGCCTGCTAATCTCATC
GGCAAAAGAGATGGCGAAACGTTCGGAAAAGATTCCATCGAAGAAACAACCACCGCCCCCGCCGCTTACCGGGAGGATATCGTTAAGGAATGGCCAAATGAAGAAGAAAA
GGAGCAGTTCAGTCCTGTTTCAGTGTTGGATTTTCCATTCGAAGATGAAGATCAAGACATCTCCTCCTCATCTTTCAGTTGCAACCTTCATCTCATCGAAGGAAAGAAGC
AGAAACATTGTCAGAAGACGAAGCGGTTCGAGAAAGGAGTCGAATTGGAGCCTCTAGACTTGAAAAAGCGATTCGCAGACATAGGCCATCGCCGTCGCGATTTCAGCTTA
ATATCCAAAAAAGAATACCAAAGAGAACAGAAGGCATTCGAGCTTCTAAACCTCGTGAAATCCACGACAACCAAATCGCAATGCTTCATATTCAAAACAGAGAATCTTCT
TCTCGATTTCTTCCACGAGAAGCTCGAAGAACACGAAGCAACTGCAAGAGCAGGCGATTTCGATCAGCCACAGGTTTTGAAATTGACCGAAGATTGGATCAATGGGGAGG
CCGGAGAAATGACGGTAATGGGTTGGGAGTTGCCGGAGGGACGGAATTTGTACATTCAGGATATGGAGATGGCCGGAAAATGGAGAAGTTTCGCCGGAGAAGAAGGAGAA
TTGGCGGCGGAGGTTGAAGCTGAGGTTTGGATTTCTTTGCTTAATGAGCTATTGATTGAAATCTCCTAG
Protein sequenceShow/hide protein sequence
MASTVSSDWTLMAAKPRSFMLKDYLLDDMSSCSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSVTKRFIPRTASRKIALSTISTLQKASDAVVRAFKQFPLPSSRKPFL
PRSISRKLILRAFWKKSEIVDSNTRRWKSFQEFLDEKEPPSSDQNPADSALCTAIAVAGRNSISCCSNSISWTESEFTSEMIPSSSSGNSESCSVNDAVKDDKDSPANLI
GKRDGETFGKDSIEETTTAPAAYREDIVKEWPNEEEKEQFSPVSVLDFPFEDEDQDISSSSFSCNLHLIEGKKQKHCQKTKRFEKGVELEPLDLKKRFADIGHRRRDFSL
ISKKEYQREQKAFELLNLVKSTTTKSQCFIFKTENLLLDFFHEKLEEHEATARAGDFDQPQVLKLTEDWINGEAGEMTVMGWELPEGRNLYIQDMEMAGKWRSFAGEEGE
LAAEVEAEVWISLLNELLIEIS