; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001188 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001188
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr4:26247031..26257135
RNA-Seq ExpressionLag0001188
SyntenyLag0001188
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022926214.1 uncharacterized protein LOC111433394 [Cucurbita moschata]1.8e-12554.34Show/hide
Query:  LEQNGQQNNQAKNPILVAN-------------DRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDS
        +E   Q N + +NP ++AN             DR+RAIRAY  P V+ELNP I+   ++AT+FE+KPVMF ML+ +GQFHGLPSEDPHLHLKSFLGVSDS
Subjt:  LEQNGQQNNQAKNPILVAN-------------DRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDS

Query:  FVIEGVSRDALRLILFPYSLRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIE
        F  + V +D +RL LFPYSLRDGAK+WLN+ A  +I +WN   EKFL KYFPPTR+A+ R+EIV F+Q E++T SEAWERFKE+LRKCPHH  P+CIQ+E
Subjt:  FVIEGVSRDALRLILFPYSLRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIE

Query:  IFYNGLNIATQCMVDASVGGALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT------VVSHVQQPPVVN
         FYNGLNIAT+ +VDAS  GA+L+K++NEAYEILERI++ +CQW+DVR +  +K + VLEVD +S+I A +  + N L+N+       + + V    V+N
Subjt:  IFYNGLNIATQCMVDASVGGALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT------VVSHVQQPPVVN

Query:  QVTEESCVYCGEEHNYLFCPNNPTSVFFAGNQ------RNNPYSKFYNSGWRNHPNFAWGGQGS-NLQVPQAQQ-----KVVNQSGFAKSQELPQQNKQA
        Q   ESCVYCGEEH +  CP+NP S+F+ GNQ      +NNP+S  YN GWRNHPNF+W GQGS N Q+P          + NQ  ++  Q   Q     
Subjt:  QVTEESCVYCGEEHNYLFCPNNPTSVFFAGNQ------RNNPYSKFYNSGWRNHPNFAWGGQGS-NLQVPQAQQ-----KVVNQSGFAKSQELPQQNKQA

Query:  LPQQNLGNSLESMMKDYMARNDVIILSQQASLRALELQ
          Q  LG S+ES++K+YMA+NDV+I +QQASLR LE+Q
Subjt:  LPQQNLGNSLESMMKDYMARNDVIILSQQASLRALELQ

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]1.2e-14042.86Show/hide
Query:  KLEEESCDAANLPLEQNGQQNNQA---------KNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHL
        K+ E++     L  + N +  N A          NPI +A+DR+RAIRAY  P V+ELNP I+   I+ T+FE+KPVMF ML+ +GQFHGLP EDPHLHL
Subjt:  KLEEESCDAANLPLEQNGQQNNQA---------KNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHL

Query:  KSFLGV-------SDSFVIEGVSRDALRLILFPYSLRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKEL
        KSFLGV       SDSF  +GV +D +RL LFPY LRDGAK+WLN+ AP +I +WN  AE FL KYFPPTR+A+ ++EIV F+Q E+ET SEA ERFKE+
Subjt:  KSFLGV-------SDSFVIEGVSRDALRLILFPYSLRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKEL

Query:  LRKCPHHCFPYCIQIEIFYNGLNIATQCMVDASVGGALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT--
        LRKCPHH  P+CIQ+E FYNGLNI T+ +VDAS  GA+L+K++NEAYEILERI++ +CQW+DVR +  +K + VLEVD +S+I A +  + N L+N+   
Subjt:  LRKCPHHCFPYCIQIEIFYNGLNIATQCMVDASVGGALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT--

Query:  ----VVSHVQQPPVVNQVTEESCVYCGEEHNYLFCPNNPTSVFFAGNQ------RNNPYSKFYNSGWRNHPNFAWGGQG-SNLQVPQAQQ-----KVVNQ
            + + V     +NQ   ESCVYCGEEH +  CP+NP S+F+ GNQ      +NNP+S  YN GWRNHPNF+W GQ   N Q+P         ++ NQ
Subjt:  ----VVSHVQQPPVVNQVTEESCVYCGEEHNYLFCPNNPTSVFFAGNQ------RNNPYSKFYNSGWRNHPNFAWGGQG-SNLQVPQAQQ-----KVVNQ

Query:  SGFAKSQELPQQNKQALPQQNLGNSLESMMKDYMARNDVIILSQQASLRALELQVGQLANELKARPQGNIPSHIEHPIREGKKQGAGGSNNVS-------
          ++  Q   Q       Q     S+ES++K+YMA+ND +I SQQASLR LE+Q+G   N  +        +  +    E   Q     +          
Subjt:  SGFAKSQELPQQNKQALPQQNLGNSLESMMKDYMARNDVIILSQQASLRALELQVGQLANELKARPQGNIPSHIEHPIREGKKQGAGGSNNVS-------

Query:  GASSSVPDVEPPYVPPPPY-------------------------NPPLPFPKRQRP--------------------------------------KNQD--
          ++S       Y P PP+                         N PL    +Q P                                      K +D  
Subjt:  GASSSVPDVEPPYVPPPPY-------------------------NPPLPFPKRQRP--------------------------------------KNQD--

Query:  --------GGKELGRAHCDLGASVNLMPLSVYRKLGIGEARPTTVTFRLADRFITYPVGKIEDVLVQKSK--------------------NQNGQFLTTS
                GGKELGRA CDLGA++NLMPLS+Y+KLGIGEARPTTVT +LADR ITYP GKIED+L+Q  K                         FL   
Subjt:  --------GGKELGRAHCDLGASVNLMPLSVYRKLGIGEARPTTVTFRLADRFITYPVGKIEDVLVQKSK--------------------NQNGQFLTTS

Query:  RSLMDVQQGELTMKVLDEIVE
        R+L+DV +G +T+++  + VE
Subjt:  RSLMDVQQGELTMKVLDEIVE

XP_022947838.1 uncharacterized protein LOC111451598 [Cucurbita moschata]1.3e-12655.46Show/hide
Query:  QNNQAKNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDSFVIEGVSRDALRLILFPYS
        Q     N I VA+DR+RAIRAY  P V+ELNP I+   ++AT+FE+KPVMF ML+ +GQFHGL S+DPHLHLKSFLGVSDSF  +GV +D +RL  F YS
Subjt:  QNNQAKNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDSFVIEGVSRDALRLILFPYS

Query:  LRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIEIFYNGLNIATQCMVDASVG
        LRDGAK+WLN  A   I +WN  AEKFL KYFPPTRSA+ R+EIV F++ E ET SEAWERFKE LRKCPHH  P+CIQIE FYNGLN AT+ +VDAS  
Subjt:  LRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIEIFYNGLNIATQCMVDASVG

Query:  GALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT------VVSHVQQPPVVNQVTEESCVYCGEEHNYLFC
        G +L+K++NEAYEILERI++ +CQW DVR +  KK + VLEVD +S+I A +  + N L+N+       + +      V+ Q   ESCVYCGE+H +  C
Subjt:  GALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT------VVSHVQQPPVVNQVTEESCVYCGEEHNYLFC

Query:  PNNPTSVFFAGNQ------RNNPYSKFYNSGWRNHPNFAWGGQGS-NLQVPQAQQKVVNQSGFAKSQELPQQNKQALPQ-------QNL-GNSLESMMKD
        P+NP S+F+ GNQ      + NP S  YN GWRNHPNF   GQGS N Q+P    K     GF    +L   ++QA  Q       Q++ G  LES++K+
Subjt:  PNNPTSVFFAGNQ------RNNPYSKFYNSGWRNHPNFAWGGQGS-NLQVPQAQQKVVNQSGFAKSQELPQQNKQALPQ-------QNL-GNSLESMMKD

Query:  YMARNDVIILSQQASLRALELQVGQLANELKARPQGNIPSHIEHPIREG
        YMARND +I SQQ SLR LE+QVGQLANEL+ RP G +P+  E P REG
Subjt:  YMARNDVIILSQQASLRALELQVGQLANELKARPQGNIPSHIEHPIREG

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]4.1e-12550.78Show/hide
Query:  GQQNNQAKNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDSFVIEGVSRDALRLILFP
        G  +N+A NPI +A+DR RAIR Y  P+ +ELNP IV   I+A  FE+KPVMF ML+ VGQF G P+EDPHLH++SFL VSDSF ++GVS +ALRL LFP
Subjt:  GQQNNQAKNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDSFVIEGVSRDALRLILFP

Query:  YSLRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIEIFYNGLNIATQCMVDAS
        +SLRD A+AWLN+  P S++ WN  AEKFL KYFPPTR+AK RSEI+ F+Q+E+ET S+AWERFKELLRKCPHH  P+CIQ+E FYNGLN A++ ++DAS
Subjt:  YSLRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIEIFYNGLNIATQCMVDAS

Query:  VGGALLAKSFNEAYEILERISTYSCQWSDVRG-SNKKVKSVLEVDGVSTIRADIVELANALKNVTVVSHVQQPPVVNQVTEESCVYCGEEHNYLFCPNNP
          GA+L+KS+NEA+EILERI++ + QWS  R  +++KV  VLEVD ++ + A +  + N LKN+ +   V QP    Q  + SCVYCG+ H +  CP+N 
Subjt:  VGGALLAKSFNEAYEILERISTYSCQWSDVRG-SNKKVKSVLEVDGVSTIRADIVELANALKNVTVVSHVQQPPVVNQVTEESCVYCGEEHNYLFCPNNP

Query:  TSVFFAGNQ----RNNPYSKFYNSGWRNHPNFAWGGQGSNLQVPQAQQKVVNQSGFAKSQELPQQNKQALPQQNLGNSLESMMKDYMARNDVIILSQQAS
         SV + GNQ     NNPYS  YN  W++HPNF+WGGQG     P          GF++ Q  PQQ  Q  PQ +  +SLES+M+DYMA+ND +I SQ AS
Subjt:  TSVFFAGNQ----RNNPYSKFYNSGWRNHPNFAWGGQGSNLQVPQAQQKVVNQSGFAKSQELPQQNKQALPQQNLGNSLESMMKDYMARNDVIILSQQAS

Query:  LRALELQVGQLANELKARPQGNIPSHIEHPIREGK--------KQGAGGSNNVSGASSSVPD----------------VE-PPYVPPPPYN---------
        LR LE+Q+GQLAN+LK RPQG +PS  E+P R+GK        + G    +NV+   S  P                 VE PP V     +         
Subjt:  LRALELQVGQLANELKARPQGNIPSHIEHPIREGK--------KQGAGGSNNVSGASSSVPD----------------VE-PPYVPPPPYN---------

Query:  PPLPFPKRQRPKNQDG
        PP PFP+R + +  DG
Subjt:  PPLPFPKRQRPKNQDG

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]2.1e-12942.31Show/hide
Query:  QAKNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDSFVIEGVSRDALRLILFPYSLRD
        Q  +PI++ +DR RAIR Y  P+ +ELNP IV   I+A  FE+KPVMF ML+ VGQF  +P+EDPHLHL+SFL +SDSF I+GVS +  RL LFP+SLRD
Subjt:  QAKNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDSFVIEGVSRDALRLILFPYSLRD

Query:  GAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIEIFYNGLNIATQCMVDASVGGAL
         A++WLN+ +P S++ WN FAEKFL KYFPPTR+AK RSEI+ F Q E+E+ S+AWERFKELLRKCPHH  P+CIQ+E FYNGLN  +Q ++DAS  GA+
Subjt:  GAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIEIFYNGLNIATQCMVDASVGGAL

Query:  LAKSFNEAYEILERISTYSCQWSDVRG-SNKKVKSVLEVDGVSTIRADIVELANALKNVTVVSHVQ-QPPVVNQVTEESCVYCGEEHNYLFCPNNPTSVF
        L+KS+NEA+EILE I++ + QWS+ R   ++KV  VLEVD ++ +   +  + N LKN+++ +    QP    Q  + SCV+C E H +  CP+NP SV 
Subjt:  LAKSFNEAYEILERISTYSCQWSDVRG-SNKKVKSVLEVDGVSTIRADIVELANALKNVTVVSHVQ-QPPVVNQVTEESCVYCGEEHNYLFCPNNPTSVF

Query:  FAGNQ----RNNPYSKFYNSGWRNHPNFAWGGQGSNLQVPQAQQKVVNQSGFAKSQELPQQNKQALPQQNLGNSLESMMKDYMARNDVIILSQQASLRAL
        + GNQ     N  +S  YN  W+NHPN +WG +       Q +Q      GF++    PQ  + + P     +SLES+M+DYMA+ND +I SQ A LR L
Subjt:  FAGNQ----RNNPYSKFYNSGWRNHPNFAWGGQGSNLQVPQAQQKVVNQSGFAKSQELPQQNKQALPQQNLGNSLESMMKDYMARNDVIILSQQASLRAL

Query:  ELQVGQLANELKARPQGNIPSHIEHPIREGKKQ--------------------GAG--------------------GSNNVSGASSSVPDVEPPYVPPPP
        ELQ+G LANELKARPQG++PS  E+P R+GK+Q                    G+G                     +  V  A+    + +        
Subjt:  ELQVGQLANELKARPQGNIPSHIEHPIREGKKQ--------------------GAG--------------------GSNNVSGASSSVPDVEPPYVPPPP

Query:  YNPPLPFPKRQRPKNQDG--------------------------------------GKELGRAHCDLGASVNLM-----------------PLSV-YRKL
          PPLPFP+R + + QDG                                       + LG     L      M                 P+S+  R L
Subjt:  YNPPLPFPKRQRPKNQDG--------------------------------------GKELGRAHCDLGASVNLM-----------------PLSV-YRKL

Query:  GIGEARPTTVTFRLADRFITYPVGKIEDVLVQKSK--------------------NQNGQFLTTSRSLMDVQQGELTMKVLDE
        GIGEARPTTVT +LADR + +P GKIEDVLVQ  K                         FL T R+L+DV++GELTM+  DE
Subjt:  GIGEARPTTVTFRLADRFITYPVGKIEDVLVQKSK--------------------NQNGQFLTTSRSLMDVQQGELTMKVLDE

TrEMBL top hitse value%identityAlignment
A0A6J1DY39 uncharacterized protein LOC1110256533.5e-10636.22Show/hide
Query:  NQAKNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDSFVIEGVSRDALRLILFPYSLR
        N   NPI VA+ RDRA+R Y   I+++LN  ++        FE KP+M  ML  +GQF GL  EDP  HLKSF+ V+++F + G+S DALRL LFP+S+ 
Subjt:  NQAKNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDSFVIEGVSRDALRLILFPYSLR

Query:  DGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIEIFYNGLNIATQCMVDASVGGA
          A AWLN+F   +I+TW+   +KFL KYFPPTR+A +R EI+ FRQ E E  + AWERFK+L+  CP+   P C+QIE F+ G +I T+ M++ +  G 
Subjt:  DGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIEIFYNGLNIATQCMVDASVGGA

Query:  LLAKSFNEAYEILERISTYSCQWSDVRGSNKKVKS----VLEVDGVSTIRADIVELANALKNV-------TVVSHVQQPPVVNQVTEESCVYCGEEHNYL
          +KSFNE  EIL+++S ++ QW   +   +  ++    VL +D +++++  I  +   LKN+               P  V Q+ E +C YCG+ H   
Subjt:  LLAKSFNEAYEILERISTYSCQWSDVRGSNKKVKS----VLEVDGVSTIRADIVELANALKNV-------TVVSHVQQPPVVNQVTEESCVYCGEEHNYL

Query:  FCPNNPTSVFFAG---NQRNNPYSKFYNSGWRNHPNFAWGGQGSNLQVPQAQQ--KVVNQSGFAKSQELP--------QQNKQALPQQNLGN--------
         CP+NP+S+++ G    Q+ NPYS  YN GW+ HPNF+W GQGS+      QQ  +     GF  S   P        Q+N     QQNL N        
Subjt:  FCPNNPTSVFFAG---NQRNNPYSKFYNSGWRNHPNFAWGGQGSNLQVPQAQQ--KVVNQSGFAKSQELP--------QQNKQALPQQNLGN--------

Query:  ------SLESMM-------------KDYMARNDVIILSQQASLRALELQVGQLANELKARPQGNIPSHIEHPIREGKK-------------QGAGGSNNV
              +++ +M             KDYM RNDV       ++R LE+Q+GQL NE++ RPQG++PS  E P R GK+             +G    +  
Subjt:  ------SLESMM-------------KDYMARNDVIILSQQASLRALELQVGQLANELKARPQGNIPSHIEHPIREGKK-------------QGAGGSNNV

Query:  SGASS------SVPD--VEPPY-VPPPPY----NPPLPFPKRQRPKNQD---------------------------------------------------
        S + S      +VPD  VEP   VP  P      PP PFP+R   KNQD                                                   
Subjt:  SGASS------SVPD--VEPPY-VPPPPY----NPPLPFPKRQRPKNQD---------------------------------------------------

Query:  ----------------------------GGKELGRAHCDLGASVNLMPLSVYRKLGIGEARPTTVTFRLADRFITYPVGKIEDVLVQKSK----------
                                    GGK++GRA CDLGAS+NLMPLS+++K  IG+A PTTVT +LADR IT P GKIEDVLV+  K          
Subjt:  ----------------------------GGKELGRAHCDLGASVNLMPLSVYRKLGIGEARPTTVTFRLADRFITYPVGKIEDVLVQKSK----------

Query:  ----------NQNGQFLTTSRSLMDVQQGELTMKVLDEIV
                       FL T  +L+DV++GELTM+V D+ V
Subjt:  ----------NQNGQFLTTSRSLMDVQQGELTMKVLDEIV

A0A6J1EEI2 uncharacterized protein LOC1114333949.0e-12654.34Show/hide
Query:  LEQNGQQNNQAKNPILVAN-------------DRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDS
        +E   Q N + +NP ++AN             DR+RAIRAY  P V+ELNP I+   ++AT+FE+KPVMF ML+ +GQFHGLPSEDPHLHLKSFLGVSDS
Subjt:  LEQNGQQNNQAKNPILVAN-------------DRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDS

Query:  FVIEGVSRDALRLILFPYSLRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIE
        F  + V +D +RL LFPYSLRDGAK+WLN+ A  +I +WN   EKFL KYFPPTR+A+ R+EIV F+Q E++T SEAWERFKE+LRKCPHH  P+CIQ+E
Subjt:  FVIEGVSRDALRLILFPYSLRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIE

Query:  IFYNGLNIATQCMVDASVGGALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT------VVSHVQQPPVVN
         FYNGLNIAT+ +VDAS  GA+L+K++NEAYEILERI++ +CQW+DVR +  +K + VLEVD +S+I A +  + N L+N+       + + V    V+N
Subjt:  IFYNGLNIATQCMVDASVGGALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT------VVSHVQQPPVVN

Query:  QVTEESCVYCGEEHNYLFCPNNPTSVFFAGNQ------RNNPYSKFYNSGWRNHPNFAWGGQGS-NLQVPQAQQ-----KVVNQSGFAKSQELPQQNKQA
        Q   ESCVYCGEEH +  CP+NP S+F+ GNQ      +NNP+S  YN GWRNHPNF+W GQGS N Q+P          + NQ  ++  Q   Q     
Subjt:  QVTEESCVYCGEEHNYLFCPNNPTSVFFAGNQ------RNNPYSKFYNSGWRNHPNFAWGGQGS-NLQVPQAQQ-----KVVNQSGFAKSQELPQQNKQA

Query:  LPQQNLGNSLESMMKDYMARNDVIILSQQASLRALELQ
          Q  LG S+ES++K+YMA+NDV+I +QQASLR LE+Q
Subjt:  LPQQNLGNSLESMMKDYMARNDVIILSQQASLRALELQ

A0A6J1EQ90 uncharacterized protein LOC1114364115.8e-14142.86Show/hide
Query:  KLEEESCDAANLPLEQNGQQNNQA---------KNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHL
        K+ E++     L  + N +  N A          NPI +A+DR+RAIRAY  P V+ELNP I+   I+ T+FE+KPVMF ML+ +GQFHGLP EDPHLHL
Subjt:  KLEEESCDAANLPLEQNGQQNNQA---------KNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHL

Query:  KSFLGV-------SDSFVIEGVSRDALRLILFPYSLRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKEL
        KSFLGV       SDSF  +GV +D +RL LFPY LRDGAK+WLN+ AP +I +WN  AE FL KYFPPTR+A+ ++EIV F+Q E+ET SEA ERFKE+
Subjt:  KSFLGV-------SDSFVIEGVSRDALRLILFPYSLRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKEL

Query:  LRKCPHHCFPYCIQIEIFYNGLNIATQCMVDASVGGALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT--
        LRKCPHH  P+CIQ+E FYNGLNI T+ +VDAS  GA+L+K++NEAYEILERI++ +CQW+DVR +  +K + VLEVD +S+I A +  + N L+N+   
Subjt:  LRKCPHHCFPYCIQIEIFYNGLNIATQCMVDASVGGALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT--

Query:  ----VVSHVQQPPVVNQVTEESCVYCGEEHNYLFCPNNPTSVFFAGNQ------RNNPYSKFYNSGWRNHPNFAWGGQG-SNLQVPQAQQ-----KVVNQ
            + + V     +NQ   ESCVYCGEEH +  CP+NP S+F+ GNQ      +NNP+S  YN GWRNHPNF+W GQ   N Q+P         ++ NQ
Subjt:  ----VVSHVQQPPVVNQVTEESCVYCGEEHNYLFCPNNPTSVFFAGNQ------RNNPYSKFYNSGWRNHPNFAWGGQG-SNLQVPQAQQ-----KVVNQ

Query:  SGFAKSQELPQQNKQALPQQNLGNSLESMMKDYMARNDVIILSQQASLRALELQVGQLANELKARPQGNIPSHIEHPIREGKKQGAGGSNNVS-------
          ++  Q   Q       Q     S+ES++K+YMA+ND +I SQQASLR LE+Q+G   N  +        +  +    E   Q     +          
Subjt:  SGFAKSQELPQQNKQALPQQNLGNSLESMMKDYMARNDVIILSQQASLRALELQVGQLANELKARPQGNIPSHIEHPIREGKKQGAGGSNNVS-------

Query:  GASSSVPDVEPPYVPPPPY-------------------------NPPLPFPKRQRP--------------------------------------KNQD--
          ++S       Y P PP+                         N PL    +Q P                                      K +D  
Subjt:  GASSSVPDVEPPYVPPPPY-------------------------NPPLPFPKRQRP--------------------------------------KNQD--

Query:  --------GGKELGRAHCDLGASVNLMPLSVYRKLGIGEARPTTVTFRLADRFITYPVGKIEDVLVQKSK--------------------NQNGQFLTTS
                GGKELGRA CDLGA++NLMPLS+Y+KLGIGEARPTTVT +LADR ITYP GKIED+L+Q  K                         FL   
Subjt:  --------GGKELGRAHCDLGASVNLMPLSVYRKLGIGEARPTTVTFRLADRFITYPVGKIEDVLVQKSK--------------------NQNGQFLTTS

Query:  RSLMDVQQGELTMKVLDEIVE
        R+L+DV +G +T+++  + VE
Subjt:  RSLMDVQQGELTMKVLDEIVE

A0A6J1G7Q6 uncharacterized protein LOC1114515986.2e-12755.46Show/hide
Query:  QNNQAKNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDSFVIEGVSRDALRLILFPYS
        Q     N I VA+DR+RAIRAY  P V+ELNP I+   ++AT+FE+KPVMF ML+ +GQFHGL S+DPHLHLKSFLGVSDSF  +GV +D +RL  F YS
Subjt:  QNNQAKNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDSFVIEGVSRDALRLILFPYS

Query:  LRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIEIFYNGLNIATQCMVDASVG
        LRDGAK+WLN  A   I +WN  AEKFL KYFPPTRSA+ R+EIV F++ E ET SEAWERFKE LRKCPHH  P+CIQIE FYNGLN AT+ +VDAS  
Subjt:  LRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIEIFYNGLNIATQCMVDASVG

Query:  GALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT------VVSHVQQPPVVNQVTEESCVYCGEEHNYLFC
        G +L+K++NEAYEILERI++ +CQW DVR +  KK + VLEVD +S+I A +  + N L+N+       + +      V+ Q   ESCVYCGE+H +  C
Subjt:  GALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT------VVSHVQQPPVVNQVTEESCVYCGEEHNYLFC

Query:  PNNPTSVFFAGNQ------RNNPYSKFYNSGWRNHPNFAWGGQGS-NLQVPQAQQKVVNQSGFAKSQELPQQNKQALPQ-------QNL-GNSLESMMKD
        P+NP S+F+ GNQ      + NP S  YN GWRNHPNF   GQGS N Q+P    K     GF    +L   ++QA  Q       Q++ G  LES++K+
Subjt:  PNNPTSVFFAGNQ------RNNPYSKFYNSGWRNHPNFAWGGQGS-NLQVPQAQQKVVNQSGFAKSQELPQQNKQALPQ-------QNL-GNSLESMMKD

Query:  YMARNDVIILSQQASLRALELQVGQLANELKARPQGNIPSHIEHPIREG
        YMARND +I SQQ SLR LE+QVGQLANEL+ RP G +P+  E P REG
Subjt:  YMARNDVIILSQQASLRALELQVGQLANELKARPQGNIPSHIEHPIREG

A0A6J1H7E4 uncharacterized protein LOC1114611682.6e-11757.97Show/hide
Query:  LEQNGQQNNQAKNPILVAN-------------DRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDS
        ++Q  Q N + +NP+++AN             DR+RAIRAY  P VDELNP I+   ++AT+FE+KPVMF ML+ +GQFHGLPSEDPHLHLKSFLGVSDS
Subjt:  LEQNGQQNNQAKNPILVAN-------------DRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGVSDS

Query:  FVIEGVSRDALRLILFPYSLRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIE
        F  +GV +D +RL LFPYSLRDGAK+WLN+ AP +I +WN  AEKFL KYFPPTR+A+ R+EIV F+Q E+ET SEAWERFKE+LRKCPHH  P+CIQ+E
Subjt:  FVIEGVSRDALRLILFPYSLRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIE

Query:  IFYNGLNIATQCMVDASVGGALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT------VVSHVQQPPVVN
         FYNGLNIAT+ +VDAS  GA+L+K++NEAYEILERI++ +CQW+DVR +  KK + VLEVD +S+I A +  + N L+N+       + +      V+ 
Subjt:  IFYNGLNIATQCMVDASVGGALLAKSFNEAYEILERISTYSCQWSDVRGS-NKKVKSVLEVDGVSTIRADIVELANALKNVT------VVSHVQQPPVVN

Query:  QVTEESCVYCGEEHNYLFCPNNPTSVFFA------GNQRNNPYSKFYNSGWRNHPNFAWGGQGS
        Q   ESCVYCGEEH +  CP NP S+ +       GNQ+NNP S  YN GWRNHPNF+W GQGS
Subjt:  QVTEESCVYCGEEHNYLFCPNNPTSVFFA------GNQRNNPYSKFYNSGWRNHPNFAWGGQGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGATTGTCCTTTGATTTGTACGGGTGAGAGTGGCCAGTTCGCCGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAATGGGGAGCTGGGAACGTAGTCTTAC
AAGATGGAATTCACTCCTTCCTGATATGAGGGTAAGTAGAGGTCCATTAGGTCCCACCGGTAGCTCATTTGGGGCGTTGAGCGAAAAAGGAATTGCAAGTTGCTGTATTT
TTGAGGAAAAATTCAGAGAATTCGGCGAAAAGTTCAAGGATTTCTTCAAAGAAAATACTGTTGAGCGACGTGAGGGAGCAAAATCTGTGCTGCAGCAAAGCTGGGAGAAA
AACTGCCACGTCACAACTCATTTTCAAGAAGTTCAAGTGAAACTCGAAGAGGAATCATGCGATGCAGCAAATCTGCCGCTAGAGCAAAATGGACAGCAAAATAATCAGGC
TAAGAATCCTATCCTTGTAGCAAATGATAGGGACAGAGCCATTAGAGCGTATGTTTTCCCAATAGTTGACGAGTTAAATCCAGTGATAGTATGTCTAGTTATTGAGGCAA
CAAGCTTTGAAATGAAACCAGTCATGTTTTACATGTTAGAGATGGTTGGCCAATTCCATGGTTTACCATCCGAGGACCCTCATCTTCATCTTAAGTCTTTTCTAGGAGTT
AGTGATTCATTTGTCATCGAGGGAGTGTCGAGAGATGCCCTTAGATTAATCCTATTTCCTTATTCCCTTAGAGATGGAGCAAAGGCGTGGTTGAATTCTTTTGCTCCAAC
ATCGATAAGTACGTGGAACAAGTTCGCAGAGAAATTTCTTAGTAAGTATTTTCCACCAACTAGGAGTGCCAAGTTGAGGAGTGAGATAGTGGGATTTAGGCAAAATGAGG
AAGAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTCTGCGAAAGTGTCCCCACCATTGTTTTCCATATTGTATTCAAATAGAAATATTTTACAATGGGTTAAAC
ATAGCAACCCAATGTATGGTTGATGCTTCTGTGGGAGGGGCTCTTTTGGCAAAATCTTTTAATGAAGCTTATGAGATTTTAGAGAGAATATCAACCTACAGTTGTCAATG
GTCAGATGTTAGAGGCTCTAATAAAAAAGTTAAGAGTGTGTTAGAAGTTGATGGTGTGTCTACCATTAGGGCCGATATTGTAGAATTAGCTAACGCTCTTAAAAATGTAA
CAGTGGTTAGTCATGTCCAGCAGCCGCCAGTGGTAAACCAAGTTACAGAAGAATCATGTGTTTATTGTGGTGAAGAGCATAATTATTTGTTTTGTCCTAACAATCCAACT
TCTGTGTTTTTTGCAGGTAATCAAAGGAATAACCCTTATTCTAAGTTTTATAATTCAGGTTGGCGCAACCACCCCAACTTTGCGTGGGGAGGGCAAGGAAGCAATTTGCA
AGTTCCCCAAGCACAGCAAAAGGTAGTGAATCAGTCCGGATTTGCTAAATCACAGGAATTGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTTGGGTAATTCTC
TGGAGTCAATGATGAAGGATTATATGGCTCGTAATGATGTCATAATCCTAAGTCAGCAAGCTTCATTGAGAGCCCTAGAGTTGCAAGTGGGTCAGCTAGCTAATGAGTTG
AAGGCACGACCTCAAGGGAACATTCCTTCACATATTGAACACCCTATAAGGGAAGGTAAGAAGCAGGGTGCTGGAGGCAGCAATAATGTTTCTGGAGCATCTAGTTCTGT
TCCAGATGTGGAACCACCTTATGTACCACCCCCACCCTATAATCCACCCTTACCTTTTCCAAAAAGGCAAAGACCTAAGAATCAAGATGGTGGAAAGGAGTTGGGGAGAG
CACATTGTGATTTAGGTGCAAGTGTAAACCTTATGCCTCTTTCGGTTTATAGAAAGTTAGGTATAGGTGAAGCTAGGCCTACAACAGTCACATTCCGATTAGCAGATAGG
TTTATCACGTATCCTGTAGGTAAAATTGAGGATGTTCTAGTCCAGAAGTCTAAGAACCAGAATGGTCAATTTTTGACAACTAGTAGATCATTGATGGATGTCCAACAAGG
GGAGCTTACAATGAAGGTGTTGGATGAGATTGTTGAGGACCGCTTTGAGAAGGAATTGATGGAGTACCATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAA
TAGGTAGTTTGGAGTATGAACATAAAGATGTAGACATCACTAATCACGGTTGGCAATGGTTGTGCGCAAAGCCAGAGCCCACTGTTGATCAACTAGAACGGGAGTTCTAT
GCCAACGTTGATGAAAATGAAGGATTCTTGGTTATTGTTCACGGAGTTGCTCTCAACTGGAGCCCTAGAGTGATTAATTCTCTATTTAATTTGCAAGACTCCCTCCACGC
TGTTTTCAACGAAATATTAGTTGTTTCCTCATATGAGCAACTAAATGCCCAGTGGAGGTTGTCTAGGACGAAGGCAAGAGCATTCCAGTCAGCATACCTGAAGTGTGAGG
CTAACACTTGGCTCAGCTTTGTCAAGCAGAGATTATTGCCTGCAACATACGCCTCCACTATCCCTCGTGATCGAGTACACTTGAAGAGAGGTGAGGTGCACCAGGGTGGT
TTGTTGTGCGGTGTCCATAGAGTGCTGAAGCAACTTGAGATGGTTCTCAAAACCTTTTGCTTGAGCATTCCTTCTAGCCTCGTCATGACTGCAGCAAAAGTTCTAAGCAG
TGCTTGGTTTTGTAGGATGCTCAGGACGAGCCGTCAAAGACAAAGGCATTTACGATTTGGAGACCGAATCAATCGGCCGGAGGCCGAATTGACGGCAGAATGGAGATCTA
AGAAGACCATTTGCGATTCCACTCAAGGAGACGAAGTTCTTCCATGTTCTTGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGGATTGTCCTTTGATTTGTACGGGTGAGAGTGGCCAGTTCGCCGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAATGGGGAGCTGGGAACGTAGTCTTAC
AAGATGGAATTCACTCCTTCCTGATATGAGGGTAAGTAGAGGTCCATTAGGTCCCACCGGTAGCTCATTTGGGGCGTTGAGCGAAAAAGGAATTGCAAGTTGCTGTATTT
TTGAGGAAAAATTCAGAGAATTCGGCGAAAAGTTCAAGGATTTCTTCAAAGAAAATACTGTTGAGCGACGTGAGGGAGCAAAATCTGTGCTGCAGCAAAGCTGGGAGAAA
AACTGCCACGTCACAACTCATTTTCAAGAAGTTCAAGTGAAACTCGAAGAGGAATCATGCGATGCAGCAAATCTGCCGCTAGAGCAAAATGGACAGCAAAATAATCAGGC
TAAGAATCCTATCCTTGTAGCAAATGATAGGGACAGAGCCATTAGAGCGTATGTTTTCCCAATAGTTGACGAGTTAAATCCAGTGATAGTATGTCTAGTTATTGAGGCAA
CAAGCTTTGAAATGAAACCAGTCATGTTTTACATGTTAGAGATGGTTGGCCAATTCCATGGTTTACCATCCGAGGACCCTCATCTTCATCTTAAGTCTTTTCTAGGAGTT
AGTGATTCATTTGTCATCGAGGGAGTGTCGAGAGATGCCCTTAGATTAATCCTATTTCCTTATTCCCTTAGAGATGGAGCAAAGGCGTGGTTGAATTCTTTTGCTCCAAC
ATCGATAAGTACGTGGAACAAGTTCGCAGAGAAATTTCTTAGTAAGTATTTTCCACCAACTAGGAGTGCCAAGTTGAGGAGTGAGATAGTGGGATTTAGGCAAAATGAGG
AAGAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTCTGCGAAAGTGTCCCCACCATTGTTTTCCATATTGTATTCAAATAGAAATATTTTACAATGGGTTAAAC
ATAGCAACCCAATGTATGGTTGATGCTTCTGTGGGAGGGGCTCTTTTGGCAAAATCTTTTAATGAAGCTTATGAGATTTTAGAGAGAATATCAACCTACAGTTGTCAATG
GTCAGATGTTAGAGGCTCTAATAAAAAAGTTAAGAGTGTGTTAGAAGTTGATGGTGTGTCTACCATTAGGGCCGATATTGTAGAATTAGCTAACGCTCTTAAAAATGTAA
CAGTGGTTAGTCATGTCCAGCAGCCGCCAGTGGTAAACCAAGTTACAGAAGAATCATGTGTTTATTGTGGTGAAGAGCATAATTATTTGTTTTGTCCTAACAATCCAACT
TCTGTGTTTTTTGCAGGTAATCAAAGGAATAACCCTTATTCTAAGTTTTATAATTCAGGTTGGCGCAACCACCCCAACTTTGCGTGGGGAGGGCAAGGAAGCAATTTGCA
AGTTCCCCAAGCACAGCAAAAGGTAGTGAATCAGTCCGGATTTGCTAAATCACAGGAATTGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTTGGGTAATTCTC
TGGAGTCAATGATGAAGGATTATATGGCTCGTAATGATGTCATAATCCTAAGTCAGCAAGCTTCATTGAGAGCCCTAGAGTTGCAAGTGGGTCAGCTAGCTAATGAGTTG
AAGGCACGACCTCAAGGGAACATTCCTTCACATATTGAACACCCTATAAGGGAAGGTAAGAAGCAGGGTGCTGGAGGCAGCAATAATGTTTCTGGAGCATCTAGTTCTGT
TCCAGATGTGGAACCACCTTATGTACCACCCCCACCCTATAATCCACCCTTACCTTTTCCAAAAAGGCAAAGACCTAAGAATCAAGATGGTGGAAAGGAGTTGGGGAGAG
CACATTGTGATTTAGGTGCAAGTGTAAACCTTATGCCTCTTTCGGTTTATAGAAAGTTAGGTATAGGTGAAGCTAGGCCTACAACAGTCACATTCCGATTAGCAGATAGG
TTTATCACGTATCCTGTAGGTAAAATTGAGGATGTTCTAGTCCAGAAGTCTAAGAACCAGAATGGTCAATTTTTGACAACTAGTAGATCATTGATGGATGTCCAACAAGG
GGAGCTTACAATGAAGGTGTTGGATGAGATTGTTGAGGACCGCTTTGAGAAGGAATTGATGGAGTACCATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAA
TAGGTAGTTTGGAGTATGAACATAAAGATGTAGACATCACTAATCACGGTTGGCAATGGTTGTGCGCAAAGCCAGAGCCCACTGTTGATCAACTAGAACGGGAGTTCTAT
GCCAACGTTGATGAAAATGAAGGATTCTTGGTTATTGTTCACGGAGTTGCTCTCAACTGGAGCCCTAGAGTGATTAATTCTCTATTTAATTTGCAAGACTCCCTCCACGC
TGTTTTCAACGAAATATTAGTTGTTTCCTCATATGAGCAACTAAATGCCCAGTGGAGGTTGTCTAGGACGAAGGCAAGAGCATTCCAGTCAGCATACCTGAAGTGTGAGG
CTAACACTTGGCTCAGCTTTGTCAAGCAGAGATTATTGCCTGCAACATACGCCTCCACTATCCCTCGTGATCGAGTACACTTGAAGAGAGGTGAGGTGCACCAGGGTGGT
TTGTTGTGCGGTGTCCATAGAGTGCTGAAGCAACTTGAGATGGTTCTCAAAACCTTTTGCTTGAGCATTCCTTCTAGCCTCGTCATGACTGCAGCAAAAGTTCTAAGCAG
TGCTTGGTTTTGTAGGATGCTCAGGACGAGCCGTCAAAGACAAAGGCATTTACGATTTGGAGACCGAATCAATCGGCCGGAGGCCGAATTGACGGCAGAATGGAGATCTA
AGAAGACCATTTGCGATTCCACTCAAGGAGACGAAGTTCTTCCATGTTCTTGGTAG
Protein sequenceShow/hide protein sequence
MRDCPLICTGESGQFADSISLPFWGQDRMGSWERSLTRWNSLLPDMRVSRGPLGPTGSSFGALSEKGIASCCIFEEKFREFGEKFKDFFKENTVERREGAKSVLQQSWEK
NCHVTTHFQEVQVKLEEESCDAANLPLEQNGQQNNQAKNPILVANDRDRAIRAYVFPIVDELNPVIVCLVIEATSFEMKPVMFYMLEMVGQFHGLPSEDPHLHLKSFLGV
SDSFVIEGVSRDALRLILFPYSLRDGAKAWLNSFAPTSISTWNKFAEKFLSKYFPPTRSAKLRSEIVGFRQNEEETFSEAWERFKELLRKCPHHCFPYCIQIEIFYNGLN
IATQCMVDASVGGALLAKSFNEAYEILERISTYSCQWSDVRGSNKKVKSVLEVDGVSTIRADIVELANALKNVTVVSHVQQPPVVNQVTEESCVYCGEEHNYLFCPNNPT
SVFFAGNQRNNPYSKFYNSGWRNHPNFAWGGQGSNLQVPQAQQKVVNQSGFAKSQELPQQNKQALPQQNLGNSLESMMKDYMARNDVIILSQQASLRALELQVGQLANEL
KARPQGNIPSHIEHPIREGKKQGAGGSNNVSGASSSVPDVEPPYVPPPPYNPPLPFPKRQRPKNQDGGKELGRAHCDLGASVNLMPLSVYRKLGIGEARPTTVTFRLADR
FITYPVGKIEDVLVQKSKNQNGQFLTTSRSLMDVQQGELTMKVLDEIVEDRFEKELMEYHTQKFGEIQIEDLEIGSLEYEHKDVDITNHGWQWLCAKPEPTVDQLEREFY
ANVDENEGFLVIVHGVALNWSPRVINSLFNLQDSLHAVFNEILVVSSYEQLNAQWRLSRTKARAFQSAYLKCEANTWLSFVKQRLLPATYASTIPRDRVHLKRGEVHQGG
LLCGVHRVLKQLEMVLKTFCLSIPSSLVMTAAKVLSSAWFCRMLRTSRQRQRHLRFGDRINRPEAELTAEWRSKKTICDSTQGDEVLPCSW