; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0042170 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0042170
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF4283 domain-containing protein
Genome locationchr13:37755568..37759438
RNA-Seq ExpressionLag0042170
SyntenyLag0042170
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051433.1 hypothetical protein E6C27_scaffold55G001850 [Cucumis melo var. makuwa]7.8e-1841.32Show/hide
Query:  KNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLIVN
        + YGGW+ I+NLPL+ W R + E I  + GGL   +S TLNL + +EA+I+V+KN CGF+P  I +    +    L FGD   L  +     S    + +
Subjt:  KNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLIVN

Query:  DFSNSLDVIRVKQVILDEEVD
        DF+NS+D +RV+ V+LDE++D
Subjt:  DFSNSLDVIRVKQVILDEEVD

KAA0062494.1 uncharacterized protein E6C27_scaffold130G00900 [Cucumis melo var. makuwa]3.5e-1839.55Show/hide
Query:  IYKNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLI
        + K Y GW+ I+NLPL+ W R +FE IG +L GL+ I+  TLNL +CS+A I+V+KN CGF+     +    +    L FGD   L   N    S   ++
Subjt:  IYKNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLI

Query:  VNDFSNSLDVIRVKQVILDEEVDLVNEGVRLNES
        ++DF NS+D ++++ V+LDE++D+ +    LN S
Subjt:  VNDFSNSLDVIRVKQVILDEEVDLVNEGVRLNES

KAA0063414.1 uncharacterized protein E6C27_scaffold508G00510 [Cucumis melo var. makuwa]1.0e-2230.47Show/hide
Query:  WFFECAVWPSTGRRRTIQVPTGLNKKANDSEGQVFPNSYAEVGKRGGSMKNSFSLEDSGRNVKFVNEEAYWVCKNWDVLKIDLESSLVV-----SRLMAH
        W   C VWP +G R  + +P G N+     +G +   S+A +      +K+  S+  S  +++          K  D+L++   S  ++         + 
Subjt:  WFFECAVWPSTGRRRTIQVPTGLNKKANDSEGQVFPNSYAEVGKRGGSMKNSFSLEDSGRNVKFVNEEAYWVCKNWDVLKIDLESSLVV-----SRLMAH

Query:  YSWKNVKIALEDFFKSSVFIDPFMDDKAVIQVADAKIYKNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPV
        +  KN ++  EDF             K+++ V      K YGGWI+I+NLPL+ W  D ++AIG   GG  SIS  T+NL++CSEA I+V +N CGF+P 
Subjt:  YSWKNVKIALEDFFKSSVFIDPFMDDKAVIQVADAKIYKNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPV

Query:  DINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLIVNDFSNSLDVIRVKQVILDE
         + ++   +    L FGDI  LE   +       L ++  +N +D++R+ QV+LDE
Subjt:  DINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLIVNDFSNSLDVIRVKQVILDE

XP_031738083.1 uncharacterized protein LOC116402658 [Cucumis sativus]1.5e-1629.74Show/hide
Query:  IYKNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLI
        + K YGGW+ ++NLPL+LW R  FEAIG + GG + I++ TLNL +CSEA I+V++N CGF+     +    +    L FG+   L     +   S   +
Subjt:  IYKNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLI

Query:  VNDFSNSLDVIRVKQVILDEEVDL------------VNEGVRLNESPFISCYHEEFNAAMGSPKVASMHDEHINFTGC---IVSPSKMINDS----NCNV
         + + NSLD +R+++ + DE  DL            V   +  + SPF    H  F A   +P+  +   + +           PS +IND+    +C  
Subjt:  VNDFSNSLDVIRVKQVILDEEVDL------------VNEGVRLNESPFISCYHEEFNAAMGSPKVASMHDEHINFTGC---IVSPSKMINDS----NCNV

Query:  KDDIQQLLGSSNISSENGLIQSKDFKELSDQI
        K      +  + I+S  G++  +  KE +  I
Subjt:  KDDIQQLLGSSNISSENGLIQSKDFKELSDQI

XP_038904899.1 uncharacterized protein LOC120091119 isoform X2 [Benincasa hispida]8.6e-1743.86Show/hide
Query:  KNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLIVN
        + YGGWI+I+NLPL+ W + +FEAIGK  GGL SI+   LNL+   +A I+V++N CGF+P  I V    +    L FGDI+    SN        L  +
Subjt:  KNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLIVN

Query:  DFSNSLDVIRVKQV
        DF+N +D+IR+ +V
Subjt:  DFSNSLDVIRVKQV

TrEMBL top hitse value%identityAlignment
A0A5A7TQZ7 Copper-transporting ATPase PAA13.9e-1533.86Show/hide
Query:  KIDLESSLVVSRLMAHYSWKNVKIALEDFFKSSVFIDPFMDDKAVIQVAD-----AKIYKNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLN
        KI   SS + S  +   S +++K  LED F  ++ I+P   DKAVI+  D     A     +GG ++++NLP NL  +D F+AIG+   GL  I  + LN
Subjt:  KIDLESSLVVSRLMAHYSWKNVKIALEDFFKSSVFIDPFMDDKAVIQVAD-----AKIYKNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLN

Query:  LLDCSEAFIE--------------VEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLIVNDFSNSLDVIRVKQVIL
        L+DCS+A I+              V KN CGF+P    +K   +  F L FGDI  +E   L      + I+ D S+    + ++QV++
Subjt:  LLDCSEAFIE--------------VEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLIVNDFSNSLDVIRVKQVIL

A0A5A7U6A9 Uncharacterized protein3.8e-1841.32Show/hide
Query:  KNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLIVN
        + YGGW+ I+NLPL+ W R + E I  + GGL   +S TLNL + +EA+I+V+KN CGF+P  I +    +    L FGD   L  +     S    + +
Subjt:  KNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLIVN

Query:  DFSNSLDVIRVKQVILDEEVD
        DF+NS+D +RV+ V+LDE++D
Subjt:  DFSNSLDVIRVKQVILDEEVD

A0A5A7V878 DUF4283 domain-containing protein5.1e-2330.47Show/hide
Query:  WFFECAVWPSTGRRRTIQVPTGLNKKANDSEGQVFPNSYAEVGKRGGSMKNSFSLEDSGRNVKFVNEEAYWVCKNWDVLKIDLESSLVV-----SRLMAH
        W   C VWP +G R  + +P G N+     +G +   S+A +      +K+  S+  S  +++          K  D+L++   S  ++         + 
Subjt:  WFFECAVWPSTGRRRTIQVPTGLNKKANDSEGQVFPNSYAEVGKRGGSMKNSFSLEDSGRNVKFVNEEAYWVCKNWDVLKIDLESSLVV-----SRLMAH

Query:  YSWKNVKIALEDFFKSSVFIDPFMDDKAVIQVADAKIYKNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPV
        +  KN ++  EDF             K+++ V      K YGGWI+I+NLPL+ W  D ++AIG   GG  SIS  T+NL++CSEA I+V +N CGF+P 
Subjt:  YSWKNVKIALEDFFKSSVFIDPFMDDKAVIQVADAKIYKNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPV

Query:  DINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLIVNDFSNSLDVIRVKQVILDE
         + ++   +    L FGDI  LE   +       L ++  +N +D++R+ QV+LDE
Subjt:  DINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLIVNDFSNSLDVIRVKQVILDE

A0A5D3DVS9 Uncharacterized protein1.7e-1839.55Show/hide
Query:  IYKNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLI
        + K Y GW+ I+NLPL+ W R +FE IG +L GL+ I+  TLNL +CS+A I+V+KN CGF+     +    +    L FGD   L   N    S   ++
Subjt:  IYKNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLI

Query:  VNDFSNSLDVIRVKQVILDEEVDLVNEGVRLNES
        ++DF NS+D ++++ V+LDE++D+ +    LN S
Subjt:  VNDFSNSLDVIRVKQVILDEEVDLVNEGVRLNES

A0A6J1D6X4 uncharacterized protein LOC1110181869.6e-1428.03Show/hide
Query:  NWDVLKIDLESSLVVSRLMAHYSWKNVKIALEDFFKSSVFIDPFMDDKAVIQVAD---------------------------------AKIYKNYGGWIA
        N +V +++ E ++V++R   H  W  +   +++  +SS  I+PF  DKA+++                                    A ++ +YG W+ 
Subjt:  NWDVLKIDLESSLVVSRLMAHYSWKNVKIALEDFFKSSVFIDPFMDDKAVIQVAD---------------------------------AKIYKNYGGWIA

Query:  IRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDIN
        IRN+PL+LW   +F+AIG  LGG +    N    ++C +  I+V+ N+CGFIP +I+
Subjt:  IRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPVDIN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAAAGATGAAATAGATGGGGAACTCCAAGTAGATGGGTCTATCCCCCACCCACTTATCCTCCCAAAAGTACGTGTCTTTCTCGTCCCCCACCACACAATGAACAA
GGCTAGAGAAAGAAGGGAGCTCCATGACAATATCTTTCCAAAGATTTCTGGAAGTGCCCCTAACCTCACCCAACAACCACTCGAAAGGATGAACGTCGTATTTACTCACA
ATTATCTATACCATAAGGATAAAGACTCATGGGGAAAATACCAAAGCCATTTAGCTAACAGAGTTTTGTCACGGATCCTAAGGGTCCGTTTGTGGTTTGAAAATGTTCTA
GTTGAGTTATTGCAGAATCCTGTTTCTTCTTTCTTTCATCAGAAAATTAAGGAAGACTTTGGAGTCATTAGGTTGATTAAGTTCTTTTCGGATAATGAATGGTTTTTTGA
ATGTGCTGTTTGGCCTTCCACGGGTAGGAGGAGGACTATTCAAGTTCCAACGGGTTTGAATAAGAAAGCTAATGACTCAGAAGGTCAAGTCTTTCCTAACTCCTATGCTG
AAGTGGGTAAGCGAGGTGGCTCTATGAAGAATTCATTCTCCCTGGAAGATTCAGGTAGAAATGTTAAGTTTGTTAATGAAGAAGCTTACTGGGTTTGTAAGAATTGGGAT
GTGCTGAAAATAGATTTGGAAAGTTCACTCGTTGTTTCTAGATTGATGGCCCATTATTCTTGGAAGAATGTTAAGATTGCCCTTGAGGATTTCTTTAAATCTTCAGTCTT
TATCGACCCTTTTATGGATGATAAAGCTGTGATTCAGGTGGCAGATGCCAAAATTTATAAAAACTACGGAGGTTGGATTGCAATCAGAAATTTACCTTTGAATTTGTGGC
ATCGTGATTCCTTTGAAGCTATTGGAAAGAACCTTGGAGGGTTGGTTAGTATTTCATCTAATACGCTTAATTTATTAGATTGTTCTGAAGCCTTCATTGAAGTAGAAAAG
AATTTTTGTGGCTTTATCCCTGTTGATATTAATGTTAAGATTGGTAATAAGTTTGAATTCTCTTTAAGATTTGGTGATATTAATGCATTAGAGGACAGTAATTTGAAGTT
TGATTCAAGTAGAAAGTTAATAGTCAATGACTTTTCAAATTCCCTGGATGTAATTAGGGTTAAGCAAGTGATATTGGATGAAGAAGTGGACCTTGTAAATGAAGGGGTAA
GGTTGAATGAATCGCCATTTATTTCTTGTTATCATGAGGAATTTAATGCGGCAATGGGTTCTCCAAAAGTTGCTTCAATGCATGATGAGCATATTAATTTCACAGGCTGT
ATTGTATCTCCCTCCAAGATGATTAATGACAGCAATTGCAATGTGAAAGATGATATACAGCAGCTACTGGGATCGTCAAATATCAGTTCTGAAAATGGGTTAATTCAGTC
TAAGGATTTTAAGGAATTGTCTGACCAAATTCCAAGGGAGAGAGACCATTTTAATGAGGTATTGGGTTCTCCAAAAGGTGCTTCATTACATGATAAGTGTATTAATATTG
CTGGCTGTAAAGATATTAATGTTAGCATTAATGAGACGGTTTCTTTTCTCTCTCCTTCTAAAGATGTTAATGCTATTAACAGTCTCCCCTCCTTCAAAGATGTTGATGCT
ATTAATGAGGCGTTGGGTCCTCAAGTTCAGCCGCCCCAGATTTATGAAACTCCTTCCAAGAATTTTAATGTCGTTACTTGCAATTTAAATAAGGATGTCCAGCACGTAAC
ATTAAAGACCTATTCTCGAAAAAAGGCTTCTCATTCATTGGCTGTTAAGTCCAACGTTAATGCTAATCAATTGGAAGCTGAATGCACTCATCTAATTGTTGCAAATAAGG
CTGTGGGATCTGCAAATGTCAGTGCTGGAAATGGGTTGTTTCAGGCCAAGGAATTTAAAGAATCTTCTGTTCGAATTCCAGGGGGAGTAATATTTTCGTCAGAGGTATTG
GCAATTCTTATAAACATAGTACTCATTCTCCAGTGGAATCAGACTTTCGTCAGAGGTATTGGCAATTCTTATAAACATAGTACTCATTCTCCAGTGGAATCAGACGATGA
GTCTACTGTTAGTGTAAGCAGTGAGGACTCTGATCACTTGTTGGATAAAGAGGAATGTGAGGAGCTCTTTTTAGAAGACCAAATTGGTTTGTCGAAGCTTCTAGTCTCAA
GTGCAATGGGATGCATGCGGATGGATATGGTATCATGGGATCAGTTTTTGGAGGAAAATGTTTGGGATTCTTCAGATGCCAGTTATTATCAGAGTCATTCAGCATTTGGA
TAG
mRNA sequenceShow/hide mRNA sequence
ATGGACAAAGATGAAATAGATGGGGAACTCCAAGTAGATGGGTCTATCCCCCACCCACTTATCCTCCCAAAAGTACGTGTCTTTCTCGTCCCCCACCACACAATGAACAA
GGCTAGAGAAAGAAGGGAGCTCCATGACAATATCTTTCCAAAGATTTCTGGAAGTGCCCCTAACCTCACCCAACAACCACTCGAAAGGATGAACGTCGTATTTACTCACA
ATTATCTATACCATAAGGATAAAGACTCATGGGGAAAATACCAAAGCCATTTAGCTAACAGAGTTTTGTCACGGATCCTAAGGGTCCGTTTGTGGTTTGAAAATGTTCTA
GTTGAGTTATTGCAGAATCCTGTTTCTTCTTTCTTTCATCAGAAAATTAAGGAAGACTTTGGAGTCATTAGGTTGATTAAGTTCTTTTCGGATAATGAATGGTTTTTTGA
ATGTGCTGTTTGGCCTTCCACGGGTAGGAGGAGGACTATTCAAGTTCCAACGGGTTTGAATAAGAAAGCTAATGACTCAGAAGGTCAAGTCTTTCCTAACTCCTATGCTG
AAGTGGGTAAGCGAGGTGGCTCTATGAAGAATTCATTCTCCCTGGAAGATTCAGGTAGAAATGTTAAGTTTGTTAATGAAGAAGCTTACTGGGTTTGTAAGAATTGGGAT
GTGCTGAAAATAGATTTGGAAAGTTCACTCGTTGTTTCTAGATTGATGGCCCATTATTCTTGGAAGAATGTTAAGATTGCCCTTGAGGATTTCTTTAAATCTTCAGTCTT
TATCGACCCTTTTATGGATGATAAAGCTGTGATTCAGGTGGCAGATGCCAAAATTTATAAAAACTACGGAGGTTGGATTGCAATCAGAAATTTACCTTTGAATTTGTGGC
ATCGTGATTCCTTTGAAGCTATTGGAAAGAACCTTGGAGGGTTGGTTAGTATTTCATCTAATACGCTTAATTTATTAGATTGTTCTGAAGCCTTCATTGAAGTAGAAAAG
AATTTTTGTGGCTTTATCCCTGTTGATATTAATGTTAAGATTGGTAATAAGTTTGAATTCTCTTTAAGATTTGGTGATATTAATGCATTAGAGGACAGTAATTTGAAGTT
TGATTCAAGTAGAAAGTTAATAGTCAATGACTTTTCAAATTCCCTGGATGTAATTAGGGTTAAGCAAGTGATATTGGATGAAGAAGTGGACCTTGTAAATGAAGGGGTAA
GGTTGAATGAATCGCCATTTATTTCTTGTTATCATGAGGAATTTAATGCGGCAATGGGTTCTCCAAAAGTTGCTTCAATGCATGATGAGCATATTAATTTCACAGGCTGT
ATTGTATCTCCCTCCAAGATGATTAATGACAGCAATTGCAATGTGAAAGATGATATACAGCAGCTACTGGGATCGTCAAATATCAGTTCTGAAAATGGGTTAATTCAGTC
TAAGGATTTTAAGGAATTGTCTGACCAAATTCCAAGGGAGAGAGACCATTTTAATGAGGTATTGGGTTCTCCAAAAGGTGCTTCATTACATGATAAGTGTATTAATATTG
CTGGCTGTAAAGATATTAATGTTAGCATTAATGAGACGGTTTCTTTTCTCTCTCCTTCTAAAGATGTTAATGCTATTAACAGTCTCCCCTCCTTCAAAGATGTTGATGCT
ATTAATGAGGCGTTGGGTCCTCAAGTTCAGCCGCCCCAGATTTATGAAACTCCTTCCAAGAATTTTAATGTCGTTACTTGCAATTTAAATAAGGATGTCCAGCACGTAAC
ATTAAAGACCTATTCTCGAAAAAAGGCTTCTCATTCATTGGCTGTTAAGTCCAACGTTAATGCTAATCAATTGGAAGCTGAATGCACTCATCTAATTGTTGCAAATAAGG
CTGTGGGATCTGCAAATGTCAGTGCTGGAAATGGGTTGTTTCAGGCCAAGGAATTTAAAGAATCTTCTGTTCGAATTCCAGGGGGAGTAATATTTTCGTCAGAGGTATTG
GCAATTCTTATAAACATAGTACTCATTCTCCAGTGGAATCAGACTTTCGTCAGAGGTATTGGCAATTCTTATAAACATAGTACTCATTCTCCAGTGGAATCAGACGATGA
GTCTACTGTTAGTGTAAGCAGTGAGGACTCTGATCACTTGTTGGATAAAGAGGAATGTGAGGAGCTCTTTTTAGAAGACCAAATTGGTTTGTCGAAGCTTCTAGTCTCAA
GTGCAATGGGATGCATGCGGATGGATATGGTATCATGGGATCAGTTTTTGGAGGAAAATGTTTGGGATTCTTCAGATGCCAGTTATTATCAGAGTCATTCAGCATTTGGA
TAG
Protein sequenceShow/hide protein sequence
MDKDEIDGELQVDGSIPHPLILPKVRVFLVPHHTMNKARERRELHDNIFPKISGSAPNLTQQPLERMNVVFTHNYLYHKDKDSWGKYQSHLANRVLSRILRVRLWFENVL
VELLQNPVSSFFHQKIKEDFGVIRLIKFFSDNEWFFECAVWPSTGRRRTIQVPTGLNKKANDSEGQVFPNSYAEVGKRGGSMKNSFSLEDSGRNVKFVNEEAYWVCKNWD
VLKIDLESSLVVSRLMAHYSWKNVKIALEDFFKSSVFIDPFMDDKAVIQVADAKIYKNYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEK
NFCGFIPVDINVKIGNKFEFSLRFGDINALEDSNLKFDSSRKLIVNDFSNSLDVIRVKQVILDEEVDLVNEGVRLNESPFISCYHEEFNAAMGSPKVASMHDEHINFTGC
IVSPSKMINDSNCNVKDDIQQLLGSSNISSENGLIQSKDFKELSDQIPRERDHFNEVLGSPKGASLHDKCINIAGCKDINVSINETVSFLSPSKDVNAINSLPSFKDVDA
INEALGPQVQPPQIYETPSKNFNVVTCNLNKDVQHVTLKTYSRKKASHSLAVKSNVNANQLEAECTHLIVANKAVGSANVSAGNGLFQAKEFKESSVRIPGGVIFSSEVL
AILINIVLILQWNQTFVRGIGNSYKHSTHSPVESDDESTVSVSSEDSDHLLDKEECEELFLEDQIGLSKLLVSSAMGCMRMDMVSWDQFLEENVWDSSDASYYQSHSAFG