; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000336 (gene) of Snake gourd v1 genome

Gene IDTan0000336
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein piccolo isoform X2
Genome locationLG08:665345..670908
RNA-Seq ExpressionTan0000336
SyntenyTan0000336
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008459318.1 PREDICTED: uncharacterized protein LOC103498484 isoform X2 [Cucumis melo]1.9e-20971.11Show/hide
Query:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA
        MPG I+LSVLEF+DLPELLPS ISIKV MGKR YETS+KG+FSFPLTTLRDDVILI+QD GG+EISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQF+
Subjt:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA

Query:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG
        LSEDDRSRIR MRETALR+KQVE QDR+L+SSGSN ASSFYLNPELS                      DSQ CLLQ+GDLSAK  A +S S STE+IP 
Subjt:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG

Query:  EKSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKT
        +K  TEK N+++LDQNDADR+ D+  TIP LQ +D NKPKVNNT LVER++ +SPH NK S TI S+ENLF SQ SELSNS +K EEKTD+ E PSRR+ 
Subjt:  EKSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKT

Query:  PSKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIH---DIKQKEQKRKFIEASDGTKLSEDL
        P  VK ++SAFE+SLTQDTKP IKPT+R+ QHSVVEKQ SLKVNQSK           +   I  PLA E+ H   DIKQKEQKRKFIEA  G KL E  
Subjt:  PSKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIH---DIKQKEQKRKFIEASDGTKLSEDL

Query:  RQPLKLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMV-GG
         + LKLKGKKNQVGGENL EKDKMHKERD IDAKNDESYQK +PEK D DRNS+TGES+SR KDEQFPSKRSGGWIFP+ERRRLCVTT  +Q+ D+  GG
Subjt:  RQPLKLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMV-GG

Query:  RTSYTFSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK
        R SYTF  KGEM+ISTEE+RG SETKAN  K +HQ++IKP+SSDD KP EG LANA K+AIM+GFGTLVL TRQRKKK
Subjt:  RTSYTFSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK

XP_022937772.1 uncharacterized protein LOC111444070 isoform X2 [Cucurbita moschata]3.7e-20872.47Show/hide
Query:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA
        MPG IR+SVLEFMDLPELLP PI IK+ MGKRQYETSEKG+FSFPLTTLRDDVILIIQDAGG+E+SRAGVQAKSIVEKGYWDDLFPLEGGGRVHL+FQFA
Subjt:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA

Query:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG
        LSEDDRSRIRTMRETA+R+KQVE QDR+L SSGS+LASSFYLNPELS                      DSQKCLLQVGDLSAKEAAH+SSSASTED+P 
Subjt:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG

Query:  EKSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPHNKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKTP
        +K +TEKTND +LDQNDAD N D+PS IP +                         NKPS TI SE+ +F SQ SELS SPAKDE+KT S +    RKTP
Subjt:  EKSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPHNKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKTP

Query:  SKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSKTGTTIDRPLAGELIHDIKQKEQKRKFIEASDGTKLSEDLRQPLKLKGKKNQVGG
        S VK VISAFE++LTQDT P IKPT+RSTQ  VVEKQ SLKV QSK GTTIDRPLA EL HD KQ EQKRKFIEASDGTKLSE+  Q LKLKGKKNQVGG
Subjt:  SKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSKTGTTIDRPLAGELIHDIKQKEQKRKFIEASDGTKLSEDLRQPLKLKGKKNQVGG

Query:  ENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQM-PDMVGGRTSYTFSCKGEMRIS
        ENL EKD+M+K+ D IDAKND +YQKS+PEKPDY             KDEQF SKRSGGWIFP+E+RR+CVTTGG+QM  DMVGGRTSYTFS K EM+IS
Subjt:  ENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQM-PDMVGGRTSYTFSCKGEMRIS

Query:  TEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK
        TEE+R  SET+AN  KHDHQ++IKPESSDDVKPSEGPLANAFK+A+MIGFGTLVLLTRQRKKK
Subjt:  TEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK

XP_038890373.1 uncharacterized protein LOC120079962 isoform X1 [Benincasa hispida]5.4e-22074.31Show/hide
Query:  LSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFALSEDDR
        LSVLEF+DLPELLPSPISIKV MGKR YETSEKGEFSFPLTTLRDDV+LI+QDAGG+EISRAGVQAKSIVEKGYWDDLFPLEGGG VHLQFQFALSEDDR
Subjt:  LSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFALSEDDR

Query:  SRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPGEKSITE
        SRIR MRETALR+K VEHQD++LKSSGSNLASSFYLN ELS                      DSQKCLLQ+GDLSAKEAA +S SASTE+IP E  +TE
Subjt:  SRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPGEKSITE

Query:  KTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKTPSKVKT
        KTN+++LDQNDADRN  SPSTI  LQEVD NKPKVNNT LVERM  +SPH NK S  I  EENLF SQ SELS+SP+K EEKTD+ E PSRR+ P  VK 
Subjt:  KTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKTPSKVKT

Query:  VISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIHD-----IKQKEQKRKFIEASDGTKLSEDLRQPL
        VISAFE+SLTQDTKP IKPT+R+  HSV EKQ SLKVNQSK           + T I  P AG+L HD     IKQKEQKRKFI+ SDGTK+SED RQ L
Subjt:  VISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIHD-----IKQKEQKRKFIEASDGTKLSEDLRQPL

Query:  KLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMVGGRTSYT
        KLKGKKNQVGGE LS+KDKMHKERD I++KNDESYQK +PEKPD DRNSVTGES+SR KDEQFPS+RSGGWIFP+ERRRLCVTTGG+ +    GGRTSYT
Subjt:  KLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMVGGRTSYT

Query:  FSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKKNQE
        F+ K EM+IS EENRG SETKAN  K DHQ++IKPESSDDVKPSEGPLANA K+AIM+GFGTLVL TRQRKKK  E
Subjt:  FSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKKNQE

XP_038890374.1 uncharacterized protein LOC120079962 isoform X2 [Benincasa hispida]1.5e-22274.23Show/hide
Query:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA
        MPG I+LSVLEF+DLPELLPSPISIKV MGKR YETSEKGEFSFPLTTLRDDV+LI+QDAGG+EISRAGVQAKSIVEKGYWDDLFPLEGGG VHLQFQFA
Subjt:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA

Query:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG
        LSEDDRSRIR MRETALR+K VEHQD++LKSSGSNLASSFYLN ELS                      DSQKCLLQ+GDLSAKEAA +S SASTE+IP 
Subjt:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG

Query:  EKSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKT
        E  +TEKTN+++LDQNDADRN  SPSTI  LQEVD NKPKVNNT LVERM  +SPH NK S  I  EENLF SQ SELS+SP+K EEKTD+ E PSRR+ 
Subjt:  EKSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKT

Query:  PSKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIHD-----IKQKEQKRKFIEASDGTKLSE
        P  VK VISAFE+SLTQDTKP IKPT+R+  HSV EKQ SLKVNQSK           + T I  P AG+L HD     IKQKEQKRKFI+ SDGTK+SE
Subjt:  PSKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIHD-----IKQKEQKRKFIEASDGTKLSE

Query:  DLRQPLKLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMVG
        D RQ LKLKGKKNQVGGE LS+KDKMHKERD I++KNDESYQK +PEKPD DRNSVTGES+SR KDEQFPS+RSGGWIFP+ERRRLCVTTGG+ +    G
Subjt:  DLRQPLKLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMVG

Query:  GRTSYTFSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKKNQE
        GRTSYTF+ K EM+IS EENRG SETKAN  K DHQ++IKPESSDDVKPSEGPLANA K+AIM+GFGTLVL TRQRKKK  E
Subjt:  GRTSYTFSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKKNQE

XP_038890375.1 uncharacterized protein LOC120079962 isoform X3 [Benincasa hispida]3.5e-21974.52Show/hide
Query:  LSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFALSEDDR
        LSVLEF+DLPELLPSPISIKV MGKR YETSEKGEFSFPLTTLRDDV+LI+QDAGG+EISRAGVQAKSIVEKGYWDDLFPLEGGG VHLQFQFALSEDDR
Subjt:  LSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFALSEDDR

Query:  SRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPGEKSITE
        SRIR MRETALR+K VEHQD++LKSSGSNLASSFYLN ELS                      DSQKCLLQ+GDLSAKEAA +S SASTE+IP E  +TE
Subjt:  SRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPGEKSITE

Query:  KTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKTPSKVKT
        KTN+++LDQNDADRN  SPSTI  LQEVD NKPKVNNT LVERM  +SPH NK S  I  EENLF SQ SELS+SP+K EEKTD+ E PSRR+ P  VK 
Subjt:  KTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKTPSKVKT

Query:  VISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIHD-----IKQKEQKRKFIEASDGTKLSEDLRQPL
        VISAFE+SLTQDTKP IKPT+R+  HSV EKQ SLKVNQSK           + T I  P AG+L HD     IKQKEQKRKFI+ SDGTK+SED RQ L
Subjt:  VISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIHD-----IKQKEQKRKFIEASDGTKLSEDLRQPL

Query:  KLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMVGGRTSYT
        KLKGKKNQVGGE LS+KDKMHKERD I++KNDESYQK +PEKPD DRNSVTGES+SR KDEQFPS+RSGGWIFP+ERRRLCVTTGG+ +    GGRTSYT
Subjt:  KLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMVGGRTSYT

Query:  FSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK
        F+ K EM+IS EENRG SETKAN  K DHQ++IKPESSDDVKPSEGPLANA K+AIM+GFGTLVL TRQRKKK
Subjt:  FSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK

TrEMBL top hitse value%identityAlignment
A0A0A0LHS4 Uncharacterized protein3.0e-20069.03Show/hide
Query:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA
        MPG IRLSVLEF+DLPELLPS ISIKV MGKR YETS+KG+FSFPLTTLRDDVILI+QDAGG+EISRAGVQAKSIVEKGYWDDLFPLEGGG VHLQFQFA
Subjt:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA

Query:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG
        LSEDDRSRIR MRETALR+KQVE QDR+L+SSGSN+ SSFY NPELS                      DSQKCLLQ+GDLSAK    +S+S STE+I  
Subjt:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG

Query:  EKSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKT
         K ITE+TN+++LDQNDA+R   +PST P LQEVD NKPKVNNT LVER++++SPH NK S TI SEENLF SQ +ELSNS +K EEKTD+   PSRR+ 
Subjt:  EKSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKT

Query:  PSKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIH---DIKQKEQKRKFIEASDGTKLSEDL
        P  VK ++SAFE+SLTQDTKP IKPT+R+ QHSVVEKQ SL+VNQSK           + T I  PLAGEL H   DIKQKEQKRKFIEA DGTK+ ED 
Subjt:  PSKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIH---DIKQKEQKRKFIEASDGTKLSEDL

Query:  RQPLKLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMV-GG
         Q LKLKGKKNQVGGENL EKDKMHKERD ID KNDESYQ                   SR +D+QF SKRSGGWIFP+ERRRLCVTT  +Q+ D+  GG
Subjt:  RQPLKLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMV-GG

Query:  RTSYTFSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK
        R SYTF  KGEM+ISTEE+RG SETKAN  K +HQ++IKP+SSDDVKP EG LA A K+AIM+GFGTLVL TRQRKKK
Subjt:  RTSYTFSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK

A0A1S3C9Z4 uncharacterized protein LOC103498484 isoform X29.4e-21071.11Show/hide
Query:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA
        MPG I+LSVLEF+DLPELLPS ISIKV MGKR YETS+KG+FSFPLTTLRDDVILI+QD GG+EISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQF+
Subjt:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA

Query:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG
        LSEDDRSRIR MRETALR+KQVE QDR+L+SSGSN ASSFYLNPELS                      DSQ CLLQ+GDLSAK  A +S S STE+IP 
Subjt:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG

Query:  EKSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKT
        +K  TEK N+++LDQNDADR+ D+  TIP LQ +D NKPKVNNT LVER++ +SPH NK S TI S+ENLF SQ SELSNS +K EEKTD+ E PSRR+ 
Subjt:  EKSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKT

Query:  PSKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIH---DIKQKEQKRKFIEASDGTKLSEDL
        P  VK ++SAFE+SLTQDTKP IKPT+R+ QHSVVEKQ SLKVNQSK           +   I  PLA E+ H   DIKQKEQKRKFIEA  G KL E  
Subjt:  PSKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIH---DIKQKEQKRKFIEASDGTKLSEDL

Query:  RQPLKLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMV-GG
         + LKLKGKKNQVGGENL EKDKMHKERD IDAKNDESYQK +PEK D DRNS+TGES+SR KDEQFPSKRSGGWIFP+ERRRLCVTT  +Q+ D+  GG
Subjt:  RQPLKLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMV-GG

Query:  RTSYTFSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK
        R SYTF  KGEM+ISTEE+RG SETKAN  K +HQ++IKP+SSDD KP EG LANA K+AIM+GFGTLVL TRQRKKK
Subjt:  RTSYTFSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK

A0A1S4DZH1 uncharacterized protein LOC103498484 isoform X13.3e-20770.71Show/hide
Query:  PGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFAL
        P    LSVLEF+DLPELLPS ISIKV MGKR YETS+KG+FSFPLTTLRDDVILI+QD GG+EISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQF+L
Subjt:  PGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFAL

Query:  SEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPGE
        SEDDRSRIR MRETALR+KQVE QDR+L+SSGSN ASSFYLNPELS                      DSQ CLLQ+GDLSAK  A +S S STE+IP +
Subjt:  SEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPGE

Query:  KSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKTP
        K  TEK N+++LDQNDADR+ D+  TIP LQ +D NKPKVNNT LVER++ +SPH NK S TI S+ENLF SQ SELSNS +K EEKTD+ E PSRR+ P
Subjt:  KSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPH-NKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKTP

Query:  SKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIH---DIKQKEQKRKFIEASDGTKLSEDLR
          VK ++SAFE+SLTQDTKP IKPT+R+ QHSVVEKQ SLKVNQSK           +   I  PLA E+ H   DIKQKEQKRKFIEA  G KL E   
Subjt:  SKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSK-----------TGTTIDRPLAGELIH---DIKQKEQKRKFIEASDGTKLSEDLR

Query:  QPLKLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMV-GGR
        + LKLKGKKNQVGGENL EKDKMHKERD IDAKNDESYQK +PEK D DRNS+TGES+SR KDEQFPSKRSGGWIFP+ERRRLCVTT  +Q+ D+  GGR
Subjt:  QPLKLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMV-GGR

Query:  TSYTFSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK
         SYTF  KGEM+ISTEE+RG SETKAN  K +HQ++IKP+SSDD KP EG LANA K+AIM+GFGTLVL TRQRKKK
Subjt:  TSYTFSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK

A0A6J1FBB0 uncharacterized protein LOC111444070 isoform X12.2e-20671.58Show/hide
Query:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA
        MPG IR+SVLEFMDLPELLP PI IK+ MGKRQYETSEKG+FSFPLTTLRDDVILIIQDAGG+E+SRAGVQAKSIVEKGYWDDLFPLEGGGRVHL+FQFA
Subjt:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA

Query:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG
        LSEDDRSRIRTMRETA+R+KQVE QDR+L SSGS+LASSFYLNPELS                      DSQKCLLQVGDLSAKEAAH+SSSASTED+P 
Subjt:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG

Query:  EKSITEKTNDIRLDQ-------NDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPHNKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEA
        +K +TEKTND +LDQ       NDAD N D+PS IP +                         NKPS TI SE+ +F SQ SELS SPAKDE+KT S + 
Subjt:  EKSITEKTNDIRLDQ-------NDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPHNKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEA

Query:  PSRRKTPSKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSKTGTTIDRPLAGELIHDIKQKEQKRKFIEASDGTKLSEDLRQPLKLKG
           RKTPS VK VISAFE++LTQDT P IKPT+RSTQ  VVEKQ SLKV QSK GTTIDRPLA EL HD KQ EQKRKFIEASDGTKLSE+  Q LKLKG
Subjt:  PSRRKTPSKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSKTGTTIDRPLAGELIHDIKQKEQKRKFIEASDGTKLSEDLRQPLKLKG

Query:  KKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQM-PDMVGGRTSYTFSC
        KKNQVGGENL EKD+M+K+ D IDAKND +YQKS+PEKPDY             KDEQF SKRSGGWIFP+E+RR+CVTTGG+QM  DMVGGRTSYTFS 
Subjt:  KKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQM-PDMVGGRTSYTFSC

Query:  KGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK
        K EM+ISTEE+R  SET+AN  KHDHQ++IKPESSDDVKPSEGPLANAFK+A+MIGFGTLVLLTRQRKKK
Subjt:  KGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK

A0A6J1FC61 uncharacterized protein LOC111444070 isoform X21.8e-20872.47Show/hide
Query:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA
        MPG IR+SVLEFMDLPELLP PI IK+ MGKRQYETSEKG+FSFPLTTLRDDVILIIQDAGG+E+SRAGVQAKSIVEKGYWDDLFPLEGGGRVHL+FQFA
Subjt:  MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFA

Query:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG
        LSEDDRSRIRTMRETA+R+KQVE QDR+L SSGS+LASSFYLNPELS                      DSQKCLLQVGDLSAKEAAH+SSSASTED+P 
Subjt:  LSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPG

Query:  EKSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPHNKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKTP
        +K +TEKTND +LDQNDAD N D+PS IP +                         NKPS TI SE+ +F SQ SELS SPAKDE+KT S +    RKTP
Subjt:  EKSITEKTNDIRLDQNDADRNVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPHNKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKTP

Query:  SKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSKTGTTIDRPLAGELIHDIKQKEQKRKFIEASDGTKLSEDLRQPLKLKGKKNQVGG
        S VK VISAFE++LTQDT P IKPT+RSTQ  VVEKQ SLKV QSK GTTIDRPLA EL HD KQ EQKRKFIEASDGTKLSE+  Q LKLKGKKNQVGG
Subjt:  SKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSKTGTTIDRPLAGELIHDIKQKEQKRKFIEASDGTKLSEDLRQPLKLKGKKNQVGG

Query:  ENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQM-PDMVGGRTSYTFSCKGEMRIS
        ENL EKD+M+K+ D IDAKND +YQKS+PEKPDY             KDEQF SKRSGGWIFP+E+RR+CVTTGG+QM  DMVGGRTSYTFS K EM+IS
Subjt:  ENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQM-PDMVGGRTSYTFSCKGEMRIS

Query:  TEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK
        TEE+R  SET+AN  KHDHQ++IKPESSDDVKPSEGPLANAFK+A+MIGFGTLVLLTRQRKKK
Subjt:  TEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLTRQRKKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G56850.1 unknown protein5.7e-2626.74Show/hide
Query:  MPGIIRLSVLEFMDLPELLP--SPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQ
        MPG I++SVL  +D+    P  S  SIKV MGK +Y+TS+ G++ FP+T LR+++I+ + D  G++I +  ++ + I+E G+ ++     G G V L+ Q
Subjt:  MPGIIRLSVLEFMDLPELLP--SPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQ

Query:  FALSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDI
        F LSE+DR+RIR +R++ALRKK  E        +GS+   S  +  +LS   ++   + V      A +S  +   L Q   L         SS      
Subjt:  FALSEDDRSRIRTMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDI

Query:  PGEKSITEKTNDIRLDQNDADRNVDSP-----STIPLLQEVD--VNKP-----KVNNTRLVERMKR--KSPHNK-PSLTIPSEENLFRSQASELSNSPAK
        P  K   EKTN I+L  +D   +   P      ++ L+++ D  +NKP     +  +  LV++  +    P  K      P   +L + + + LS    K
Subjt:  PGEKSITEKTNDIRLDQNDADRNVDSP-----STIPLLQEVD--VNKP-----KVNNTRLVERMKR--KSPHNK-PSLTIPSEENLFRSQASELSNSPAK

Query:  DEEKTDSAEAPSRRKTPSKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSKTGTTIDRPLAGELIHDIKQKEQKRKFIE--------A
         + K  S          + V+ +IS FE  +TQDTK  I+     T         S K  + KT     +P A   ++   +K ++RK I          
Subjt:  DEEKTDSAEAPSRRKTPSKVKTVISAFETSLTQDTKPCIKPTVRSTQHSVVEKQISLKVNQSKTGTTIDRPLAGELIHDIKQKEQKRKFIE--------A

Query:  SDGTKLSEDL------RQPLKLKGKKNQVGGENLSEKDKMHKERD--VIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERR
        SD T   +DL       + + ++ K  +    +    D +  +R   V++ ++DE  Q       D    +  G  +               WIFP+E +
Subjt:  SDGTKLSEDL------RQPLKLKGKKNQVGGENLSEKDKMHKERD--VIDAKNDESYQKSLPEKPDYDRNSVTGESISRGKDEQFPSKRSGGWIFPEERR

Query:  RLCVTTG-GDQMPDMVGGRTSYTFSCKGEMRISTEENRGASETKA-NMRK---------HDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLT
         LC  T  G +  D+     +     +  +  ST EN G   ++  N+ K          +  K  K E+S + + S  P+    +  I++GF  LV LT
Subjt:  RLCVTTG-GDQMPDMVGGRTSYTFSCKGEMRISTEENRGASETKA-NMRK---------HDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFGTLVLLT

Query:  RQ
        RQ
Subjt:  RQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGGAATCATCCGACTTTCAGTTCTGGAGTTCATGGACCTTCCTGAGCTATTGCCTTCCCCAATTTCGATTAAGGTTTTCATGGGTAAAAGACAGTATGAGACCTC
GGAGAAAGGAGAGTTTTCATTTCCATTGACGACGCTTCGAGATGATGTGATTCTTATAATTCAGGATGCTGGAGGGAGTGAGATATCACGTGCAGGTGTTCAGGCGAAAT
CAATAGTTGAGAAGGGTTACTGGGATGATCTTTTTCCTCTTGAAGGAGGCGGCCGTGTGCATTTGCAGTTCCAGTTTGCTCTTAGTGAAGATGATCGCAGTCGAATTCGA
ACAATGAGAGAAACTGCTTTGAGAAAAAAACAAGTTGAGCATCAAGATAGAGATCTCAAAAGCTCTGGAAGCAACCTTGCTTCATCTTTTTACCTCAACCCCGAGCTCTC
AGTTTTCTTTACATTGGTGTTTTCTGAACGGGTTTTTCTTGATTTTCTGAGTGCTCCGTCTTCTCAAGATTCGCAAAAATGTCTTCTGCAAGTTGGAGATCTAAGTGCTA
AAGAAGCAGCTCACCGATCTTCATCAGCATCAACTGAAGATATCCCTGGTGAGAAATCGATTACTGAAAAAACAAACGACATTCGGCTTGATCAGAATGATGCAGATAGA
AATGTGGATAGTCCATCAACAATTCCTCTGTTGCAAGAAGTTGATGTTAATAAACCAAAAGTAAATAATACTCGATTAGTTGAGAGGATGAAGAGAAAGTCACCTCATAA
CAAACCTTCACTGACCATTCCTTCAGAAGAAAATTTATTCCGTTCTCAAGCCTCAGAGTTATCCAATTCTCCAGCAAAAGATGAAGAGAAGACCGATTCTGCGGAGGCAC
CATCACGTAGGAAAACTCCCAGCAAGGTTAAAACTGTAATAAGTGCCTTTGAAACTAGCCTGACTCAGGATACAAAACCTTGCATTAAACCTACCGTAAGAAGTACTCAA
CACAGTGTGGTAGAAAAGCAAATATCTCTAAAAGTTAATCAGTCTAAAACAGGAACAACAATAGATCGTCCTCTTGCAGGGGAACTGATCCATGATATCAAGCAAAAGGA
ACAGAAACGGAAATTCATCGAGGCTTCAGATGGGACTAAATTATCTGAGGACCTTAGGCAACCACTAAAGTTGAAGGGGAAAAAGAATCAAGTTGGAGGAGAAAATTTGA
GTGAAAAGGATAAAATGCATAAGGAGCGGGATGTTATAGATGCAAAGAATGATGAATCGTATCAAAAGTCGTTGCCTGAGAAGCCGGACTATGATAGAAATTCAGTAACA
GGTGAATCAATTTCACGTGGCAAAGATGAGCAATTTCCTTCCAAAAGGTCTGGTGGCTGGATATTTCCAGAAGAAAGAAGACGGCTGTGTGTCACAACAGGTGGTGATCA
GATGCCAGATATGGTGGGAGGTCGCACTAGCTATACATTTAGCTGTAAAGGGGAAATGAGGATTTCCACGGAAGAGAATAGAGGTGCCTCAGAAACGAAGGCAAATATGC
GCAAGCATGATCATCAAAAAGTTATCAAACCAGAGAGTTCAGATGATGTAAAACCTTCTGAAGGACCACTTGCCAATGCCTTCAAAGTTGCAATAATGATTGGGTTTGGG
ACTCTTGTTCTCCTTACGAGACAAAGGAAAAAGAAGAATCAAGAATATATCAGACTACTGTCTGATACACATGGCTTTATTTATGGACTTGCAGAAATAACCAGTTGGGT
GAAGGATCCACACCAATTGTATTCAAGGGATAGAAGTGCGGTTGCTCCGCCGTGGCGGTGGCGCCGCCTTGCCCTCATTAGTTTAAGAATGAGGCTTTGCTCTCGGAGAG
GGAGGTTCCCTCCGCTTCGAATGCTTGTAGAGCCAATGATTTCTACGCCTCTACCATCTCATCACTATCATCATCATCATCATCATCTTCTTCTTCTTCTGCTTCTGCTG
CATGGCGACTCGGGGACGAGTCTTTGTGGATTTACATTCCACAGAGACTGTTTGCTAATGTTCGCAGCCACGAATCGTCGCAGCTCTCTCTCTGTTTCTGAGAAAAGGCG
AATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCGGGAATCATCCGACTTTCAGTTCTGGAGTTCATGGACCTTCCTGAGCTATTGCCTTCCCCAATTTCGATTAAGGTTTTCATGGGTAAAAGACAGTATGAGACCTC
GGAGAAAGGAGAGTTTTCATTTCCATTGACGACGCTTCGAGATGATGTGATTCTTATAATTCAGGATGCTGGAGGGAGTGAGATATCACGTGCAGGTGTTCAGGCGAAAT
CAATAGTTGAGAAGGGTTACTGGGATGATCTTTTTCCTCTTGAAGGAGGCGGCCGTGTGCATTTGCAGTTCCAGTTTGCTCTTAGTGAAGATGATCGCAGTCGAATTCGA
ACAATGAGAGAAACTGCTTTGAGAAAAAAACAAGTTGAGCATCAAGATAGAGATCTCAAAAGCTCTGGAAGCAACCTTGCTTCATCTTTTTACCTCAACCCCGAGCTCTC
AGTTTTCTTTACATTGGTGTTTTCTGAACGGGTTTTTCTTGATTTTCTGAGTGCTCCGTCTTCTCAAGATTCGCAAAAATGTCTTCTGCAAGTTGGAGATCTAAGTGCTA
AAGAAGCAGCTCACCGATCTTCATCAGCATCAACTGAAGATATCCCTGGTGAGAAATCGATTACTGAAAAAACAAACGACATTCGGCTTGATCAGAATGATGCAGATAGA
AATGTGGATAGTCCATCAACAATTCCTCTGTTGCAAGAAGTTGATGTTAATAAACCAAAAGTAAATAATACTCGATTAGTTGAGAGGATGAAGAGAAAGTCACCTCATAA
CAAACCTTCACTGACCATTCCTTCAGAAGAAAATTTATTCCGTTCTCAAGCCTCAGAGTTATCCAATTCTCCAGCAAAAGATGAAGAGAAGACCGATTCTGCGGAGGCAC
CATCACGTAGGAAAACTCCCAGCAAGGTTAAAACTGTAATAAGTGCCTTTGAAACTAGCCTGACTCAGGATACAAAACCTTGCATTAAACCTACCGTAAGAAGTACTCAA
CACAGTGTGGTAGAAAAGCAAATATCTCTAAAAGTTAATCAGTCTAAAACAGGAACAACAATAGATCGTCCTCTTGCAGGGGAACTGATCCATGATATCAAGCAAAAGGA
ACAGAAACGGAAATTCATCGAGGCTTCAGATGGGACTAAATTATCTGAGGACCTTAGGCAACCACTAAAGTTGAAGGGGAAAAAGAATCAAGTTGGAGGAGAAAATTTGA
GTGAAAAGGATAAAATGCATAAGGAGCGGGATGTTATAGATGCAAAGAATGATGAATCGTATCAAAAGTCGTTGCCTGAGAAGCCGGACTATGATAGAAATTCAGTAACA
GGTGAATCAATTTCACGTGGCAAAGATGAGCAATTTCCTTCCAAAAGGTCTGGTGGCTGGATATTTCCAGAAGAAAGAAGACGGCTGTGTGTCACAACAGGTGGTGATCA
GATGCCAGATATGGTGGGAGGTCGCACTAGCTATACATTTAGCTGTAAAGGGGAAATGAGGATTTCCACGGAAGAGAATAGAGGTGCCTCAGAAACGAAGGCAAATATGC
GCAAGCATGATCATCAAAAAGTTATCAAACCAGAGAGTTCAGATGATGTAAAACCTTCTGAAGGACCACTTGCCAATGCCTTCAAAGTTGCAATAATGATTGGGTTTGGG
ACTCTTGTTCTCCTTACGAGACAAAGGAAAAAGAAGAATCAAGAATATATCAGACTACTGTCTGATACACATGGCTTTATTTATGGACTTGCAGAAATAACCAGTTGGGT
GAAGGATCCACACCAATTGTATTCAAGGGATAGAAGTGCGGTTGCTCCGCCGTGGCGGTGGCGCCGCCTTGCCCTCATTAGTTTAAGAATGAGGCTTTGCTCTCGGAGAG
GGAGGTTCCCTCCGCTTCGAATGCTTGTAGAGCCAATGATTTCTACGCCTCTACCATCTCATCACTATCATCATCATCATCATCATCTTCTTCTTCTTCTGCTTCTGCTG
CATGGCGACTCGGGGACGAGTCTTTGTGGATTTACATTCCACAGAGACTGTTTGCTAATGTTCGCAGCCACGAATCGTCGCAGCTCTCTCTCTGTTTCTGAGAAAAGGCG
AATCTGA
Protein sequenceShow/hide protein sequence
MPGIIRLSVLEFMDLPELLPSPISIKVFMGKRQYETSEKGEFSFPLTTLRDDVILIIQDAGGSEISRAGVQAKSIVEKGYWDDLFPLEGGGRVHLQFQFALSEDDRSRIR
TMRETALRKKQVEHQDRDLKSSGSNLASSFYLNPELSVFFTLVFSERVFLDFLSAPSSQDSQKCLLQVGDLSAKEAAHRSSSASTEDIPGEKSITEKTNDIRLDQNDADR
NVDSPSTIPLLQEVDVNKPKVNNTRLVERMKRKSPHNKPSLTIPSEENLFRSQASELSNSPAKDEEKTDSAEAPSRRKTPSKVKTVISAFETSLTQDTKPCIKPTVRSTQ
HSVVEKQISLKVNQSKTGTTIDRPLAGELIHDIKQKEQKRKFIEASDGTKLSEDLRQPLKLKGKKNQVGGENLSEKDKMHKERDVIDAKNDESYQKSLPEKPDYDRNSVT
GESISRGKDEQFPSKRSGGWIFPEERRRLCVTTGGDQMPDMVGGRTSYTFSCKGEMRISTEENRGASETKANMRKHDHQKVIKPESSDDVKPSEGPLANAFKVAIMIGFG
TLVLLTRQRKKKNQEYIRLLSDTHGFIYGLAEITSWVKDPHQLYSRDRSAVAPPWRWRRLALISLRMRLCSRRGRFPPLRMLVEPMISTPLPSHHYHHHHHHLLLLLLLL
HGDSGTSLCGFTFHRDCLLMFAATNRRSSLSVSEKRRI