; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034011 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034011
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr3:3695952..3696629
RNA-Seq ExpressionLag0034011
SyntenyLag0034011
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587646.1 hypothetical protein SDJN03_16211, partial [Cucurbita argyrosperma subsp. sororia]1.3e-8576.11Show/hide
Query:  MEGCIDSRKRLRDESNDSLFNFIGSKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESA
        ME CIDSRKR+RDESN+SLFNF+GSKI+R DSAE +FISPD+DDAP+ SVSSD +SI SKQ G +H +DSGLDS Q   IQEDLLKILDEAD SIDRE A
Subjt:  MEGCIDSRKRLRDESNDSLFNFIGSKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESA

Query:  IQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNRGEEEVVA
        I DLDSVI SFEKEI   VPVP VQPELGYLLEASDDELGLPPA  K E+E VNF  ++SG  GMKGFLGFEDE VPNYCWLENLSSE E NR EEEV A
Subjt:  IQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNRGEEEVVA

Query:  LGGLFDH-TETTAELPPYRSETMYCL
        LGGL DH T+   ELPPYRSETM+CL
Subjt:  LGGLFDH-TETTAELPPYRSETMYCL

KAG6589552.1 hypothetical protein SDJN03_14975, partial [Cucurbita argyrosperma subsp. sororia]2.5e-7875Show/hide
Query:  MEGCIDSRKRLRDESNDSLFNFIG--SKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRE
        ME C+DSRKRLRDESNDSLFNFIG  SK +RLDSA L     DVDDAPI SVSSD KSIDS          SGLDS QA+ IQ+DLLKILD+ DA IDRE
Subjt:  MEGCIDSRKRLRDESNDSLFNFIG--SKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRE

Query:  SAIQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNR-GEEE
          IQDLDSVIRSFEKEIQ  VPVP VQPELG+LLEASDDELGLPPAGEK E EAVNFAA+F G GGMKG LG EDE VPNYCWLENL SENE NR  EEE
Subjt:  SAIQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNR-GEEE

Query:  VVALGGLFDHTETTAELPPYRSETMYCL
        VV LGGLFDHT+   EL  YRSETM CL
Subjt:  VVALGGLFDHTETTAELPPYRSETMYCL

KAG7021606.1 hypothetical protein SDJN02_15332, partial [Cucurbita argyrosperma subsp. argyrosperma]3.7e-8575.66Show/hide
Query:  MEGCIDSRKRLRDESNDSLFNFIGSKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESA
        ME CIDSRKR+RDESN+SLFNF+GSKI+R DSAE +FISPD+DDAP+ SVSSD +SI SKQ G +H +DSGLDS Q   IQEDLLKIL+EAD SIDRE A
Subjt:  MEGCIDSRKRLRDESNDSLFNFIGSKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESA

Query:  IQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNRGEEEVVA
        I DLDSVI SFEKEI   VPVP VQPELGYLLEASDDELGLPPA  K E+E VNF  ++SG  GMKGFLGFEDE VPNYCWLENLSSE E NR EEEV A
Subjt:  IQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNRGEEEVVA

Query:  LGGLFDH-TETTAELPPYRSETMYCL
        LGGL DH T+   ELPPYRSETM+CL
Subjt:  LGGLFDH-TETTAELPPYRSETMYCL

XP_023516749.1 uncharacterized protein LOC111780554 [Cucurbita pepo subsp. pepo]1.7e-7774.56Show/hide
Query:  MEGCIDSRKRLRDESNDSLFNFIG--SKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRE
        ME C+DSRKRLRDESNDSLFNFIG  SK +RLDSA L     DVDDAP  SVSSD KSIDS          SGLDS QA+ IQ+DLLKILD+ DA IDRE
Subjt:  MEGCIDSRKRLRDESNDSLFNFIG--SKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRE

Query:  SAIQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENE-SNRGEEE
        S IQDLDSVIRSFEKEIQ  VPVP VQPELG+LLEASDDELGLPPAGEK E EAVNFAA+F G GGMKG LG EDE VPNYCWLENL SENE S   EEE
Subjt:  SAIQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENE-SNRGEEE

Query:  VVALGGLFDHTETTAELPPYRSETMYCL
        VV LGGLFDHT+   EL  YRSETM CL
Subjt:  VVALGGLFDHTETTAELPPYRSETMYCL

XP_023531746.1 uncharacterized protein LOC111793909 [Cucurbita pepo subsp. pepo]1.4e-8474.78Show/hide
Query:  MEGCIDSRKRLRDESNDSLFNFIGSKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESA
        ME CIDSRKR+RDESN+SLFNF+GSKI+R DSAE +FISPD DDAP+ SVSSD +SI SKQ G +H +DSGLDS Q   I+EDLLKILDEAD SIDRE A
Subjt:  MEGCIDSRKRLRDESNDSLFNFIGSKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESA

Query:  IQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNRGEEEVVA
        I DLDSVI SFEKEI   VPVP VQPELGYLLEASDDELGLPPA  K E+E VNF  ++SG  GMKGFLGFEDE VPNYCWLENLSSE E NR E+EV A
Subjt:  IQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNRGEEEVVA

Query:  LGGLFDH-TETTAELPPYRSETMYCL
        LGGL DH T+   E+PPYRSETM+CL
Subjt:  LGGLFDH-TETTAELPPYRSETMYCL

TrEMBL top hitse value%identityAlignment
A0A0A0LS21 Uncharacterized protein2.7e-5762.23Show/hide
Query:  MEGCIDSRKRLRDESNDSLFNFIG--SKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRE
        ME  +DSRKRLRD+SNDSLFN IG  SK +RL++   S       DAP   VS  T S  S                  H IQEDLLKILD+ DASIDRE
Subjt:  MEGCIDSRKRLRDESNDSLFNFIG--SKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRE

Query:  SAIQDLDSVIRSFEKEIQVP--VPVPVVQPELGYLLEASDDELGLPP-AGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENE--SNR
        + IQDLDSVIRSFEKEI+VP  V VPVVQPELG+LLEASDDELGLPP AGEK EIE     A+FSG GG+KG LGFEDE+V NYCW +NL  E +  S  
Subjt:  SAIQDLDSVIRSFEKEIQVP--VPVPVVQPELGYLLEASDDELGLPP-AGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENE--SNR

Query:  GEEEVVALGGLFDHTETTAELP-PYRSETMYCL
         EEEVVALGGLFDHT+  AELP  YRSE M CL
Subjt:  GEEEVVALGGLFDHTETTAELP-PYRSETMYCL

A0A1S3BWV1 uncharacterized protein LOC1034943333.2e-5862.5Show/hide
Query:  MEGCIDSRKRLRDESN-DSLFNFIG--SKIIRLDS-AELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASID
        ME C+D+RKRLRD+SN DSLFN IG  SK +RL++ A+ +F      DAP   VS  T S  S                  H IQEDLLKILD+ DASID
Subjt:  MEGCIDSRKRLRDESN-DSLFNFIG--SKIIRLDS-AELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASID

Query:  RESAIQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENES--NRG
        RE+AIQDLDSVIRSFEKEI+  VPVPVVQPELG+LLEASDDELGLPPAGEK EIE     A+FSG GG+KG LGFEDE+V NYCW +NL  E +      
Subjt:  RESAIQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENES--NRG

Query:  EEEVVALGGLFDHTETTAELP-PYRSETMYCL
        EEEVVALGGLFDHT+  AELP  YRSE M CL
Subjt:  EEEVVALGGLFDHTETTAELP-PYRSETMYCL

A0A5A7UZW3 Uncharacterized protein3.2e-5862.5Show/hide
Query:  MEGCIDSRKRLRDESN-DSLFNFIG--SKIIRLDS-AELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASID
        ME C+D+RKRLRD+SN DSLFN IG  SK +RL++ A+ +F      DAP   VS  T S  S                  H IQEDLLKILD+ DASID
Subjt:  MEGCIDSRKRLRDESN-DSLFNFIG--SKIIRLDS-AELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASID

Query:  RESAIQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENES--NRG
        RE+AIQDLDSVIRSFEKEI+  VPVPVVQPELG+LLEASDDELGLPPAGEK EIE     A+FSG GG+KG LGFEDE+V NYCW +NL  E +      
Subjt:  RESAIQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENES--NRG

Query:  EEEVVALGGLFDHTETTAELP-PYRSETMYCL
        EEEVVALGGLFDHT+  AELP  YRSE M CL
Subjt:  EEEVVALGGLFDHTETTAELP-PYRSETMYCL

A0A5D3DXI3 Uncharacterized protein1.1e-5862.93Show/hide
Query:  MEGCIDSRKRLRDESN-DSLFNFIG--SKIIRLDS-AELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASID
        ME C+D+RKRLRD+SN DSLFN IG  SK +RL++ A+ +F      DAP   VS  T S  S                  H IQEDLLKILD+ DASID
Subjt:  MEGCIDSRKRLRDESN-DSLFNFIG--SKIIRLDS-AELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASID

Query:  RESAIQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENE--SNRG
        RE+AIQDLDSVIRSFEKEI+  VPVPVVQPELG+LLEASDDELGLPPAGEK EIE     A+FSG GG+KG LGFEDE+V NYCW +NL  E +  S   
Subjt:  RESAIQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENE--SNRG

Query:  EEEVVALGGLFDHTETTAELP-PYRSETMYCL
        EEEVVALGGLFDHT+  AELP  YRSE M CL
Subjt:  EEEVVALGGLFDHTETTAELP-PYRSETMYCL

A0A6J1E5H5 uncharacterized protein LOC1114296741.4e-7472.37Show/hide
Query:  MEGCIDSRKRLRDESNDSLFNFIG--SKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRE
        ME C+DSRKRLRDESNDSLFNFIG  SK +RLDSA L     DVDDAPI SVSSD KSI               DS QA+ IQ+DLLKILD+ DA IDRE
Subjt:  MEGCIDSRKRLRDESNDSLFNFIG--SKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRE

Query:  SAIQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNR-GEEE
        S IQDLDSVIRSFEKEIQVPVP    QPELG+LLEASDDELGLPPAGEK E EAVNFAA+F G G MKG LG EDE VPNYCWLENL SENE NR  EEE
Subjt:  SAIQDLDSVIRSFEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNR-GEEE

Query:  VVALGGLFDHTETTAELPPYRSETMYCL
        VV LGGLFDHT+   EL  YRSETM CL
Subjt:  VVALGGLFDHTETTAELPPYRSETMYCL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13360.1 unknown protein1.9e-1534.21Show/hide
Query:  TKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESAIQDLDSVIRSFEKEIQVPVPVPV--------VQPELGYLLEASDDELGLPP--
        T S + K++     D++ LDS +   +++DL  +LD++D     E   QDLDSV++SFE E+                 QP+LGYLLEASDDELGLPP  
Subjt:  TKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESAIQDLDSVIRSFEKEIQVPVPVPV--------VQPELGYLLEASDDELGLPP--

Query:  -------AGEKAEIEAV-NFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNRGEEEVVALGGLFDHTE---TTAELPPYRSETM
               A E+   E V +     S   G+    GFED  V NY  L+  S   +      + VA+ GLF+ ++    + +L  +RSE++
Subjt:  -------AGEKAEIEAV-NFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNRGEEEVVALGGLFDHTE---TTAELPPYRSETM

AT1G13360.2 unknown protein4.7e-1435.26Show/hide
Query:  TKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESAIQDLDSVIRSFEKEIQVPVPVPV--------VQPELGYLLEASDDELGLPP--
        T S + K++     D++ LDS +   +++DL  +LD++D     E   QDLDSV++SFE E+                 QP+LGYLLEASDDELGLPP  
Subjt:  TKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESAIQDLDSVIRSFEKEIQVPVPVPV--------VQPELGYLLEASDDELGLPP--

Query:  -------AGEKAEIEAV-NFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNRGEEEVVALGGLFDHT
               A E+   E V +     S   G+    GFED  V NY  L+  S   +      + VA+ G F +T
Subjt:  -------AGEKAEIEAV-NFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNRGEEEVVALGGLFDHT

AT1G13360.3 unknown protein3.1e-1337.16Show/hide
Query:  TKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESAIQDLDSVIRSFEKEIQVPVPVPV--------VQPELGYLLEASDDELGLPP--
        T S + K++     D++ LDS +   +++DL  +LD++D     E   QDLDSV++SFE E+                 QP+LGYLLEASDDELGLPP  
Subjt:  TKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESAIQDLDSVIRSFEKEIQVPVPVPV--------VQPELGYLLEASDDELGLPP--

Query:  -------AGEKAEIEAV-NFAAQFSGCGGMKGFLGFEDEVVPNYCWLE
               A E+   E V +     S   G+    GFED  V NY  L+
Subjt:  -------AGEKAEIEAV-NFAAQFSGCGGMKGFLGFEDEVVPNYCWLE

AT3G25870.1 unknown protein1.7e-0836.03Show/hide
Query:  TKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESAIQDLDSVIRSFEKEIQVPVPV---PVVQPELGYLLEASDDELG----------
        T S+ +K++     D   LDS     +++DL       D+ +D  S  QDLDSV++SFE E+            QP+LGYL EASDDELG          
Subjt:  TKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESAIQDLDSVIRSFEKEIQVPVPV---PVVQPELGYLLEASDDELG----------

Query:  -LPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEV
         LPP+ E+   E V  ++  S  G +    GFED V
Subjt:  -LPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGCTGCATTGACAGCAGGAAGCGACTACGCGACGAATCCAATGATTCTTTATTCAATTTCATCGGATCCAAGATCATCCGACTTGATTCCGCTGAATTGAGCTT
CATCTCGCCCGATGTGGACGATGCACCGATTGGTTCCGTTTCATCGGATACAAAATCGATCGATTCCAAACAGATTGGAATCGTTCACGGTGATGATTCAGGCCTGGACT
CGCTTCAGGCACACGCGATTCAGGAAGACCTGCTGAAGATTCTTGACGAAGCCGACGCTTCCATAGATCGCGAGTCGGCGATTCAAGATCTCGACTCGGTGATCAGAAGC
TTCGAGAAGGAAATTCAGGTGCCGGTGCCGGTGCCTGTGGTTCAGCCTGAACTTGGATACCTTCTAGAAGCCTCGGACGATGAATTAGGGCTTCCGCCGGCCGGCGAGAA
AGCGGAGATTGAGGCAGTTAATTTTGCCGCGCAATTTTCAGGTTGTGGCGGTATGAAAGGGTTTTTAGGGTTTGAGGATGAAGTTGTTCCGAATTACTGTTGGCTTGAAA
ATTTGAGCAGTGAGAACGAATCGAATCGGGGAGAAGAAGAGGTGGTGGCGTTGGGTGGATTGTTCGATCATACGGAGACGACGGCGGAGTTGCCGCCGTATCGATCGGAG
ACGATGTACTGTTTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGCTGCATTGACAGCAGGAAGCGACTACGCGACGAATCCAATGATTCTTTATTCAATTTCATCGGATCCAAGATCATCCGACTTGATTCCGCTGAATTGAGCTT
CATCTCGCCCGATGTGGACGATGCACCGATTGGTTCCGTTTCATCGGATACAAAATCGATCGATTCCAAACAGATTGGAATCGTTCACGGTGATGATTCAGGCCTGGACT
CGCTTCAGGCACACGCGATTCAGGAAGACCTGCTGAAGATTCTTGACGAAGCCGACGCTTCCATAGATCGCGAGTCGGCGATTCAAGATCTCGACTCGGTGATCAGAAGC
TTCGAGAAGGAAATTCAGGTGCCGGTGCCGGTGCCTGTGGTTCAGCCTGAACTTGGATACCTTCTAGAAGCCTCGGACGATGAATTAGGGCTTCCGCCGGCCGGCGAGAA
AGCGGAGATTGAGGCAGTTAATTTTGCCGCGCAATTTTCAGGTTGTGGCGGTATGAAAGGGTTTTTAGGGTTTGAGGATGAAGTTGTTCCGAATTACTGTTGGCTTGAAA
ATTTGAGCAGTGAGAACGAATCGAATCGGGGAGAAGAAGAGGTGGTGGCGTTGGGTGGATTGTTCGATCATACGGAGACGACGGCGGAGTTGCCGCCGTATCGATCGGAG
ACGATGTACTGTTTATAA
Protein sequenceShow/hide protein sequence
MEGCIDSRKRLRDESNDSLFNFIGSKIIRLDSAELSFISPDVDDAPIGSVSSDTKSIDSKQIGIVHGDDSGLDSLQAHAIQEDLLKILDEADASIDRESAIQDLDSVIRS
FEKEIQVPVPVPVVQPELGYLLEASDDELGLPPAGEKAEIEAVNFAAQFSGCGGMKGFLGFEDEVVPNYCWLENLSSENESNRGEEEVVALGGLFDHTETTAELPPYRSE
TMYCL