CuGenDBv2

Lag0035164 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035164
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: DUF4283 domain-containing protein
Genome location: chr3:15994728..15998899
RNA-Seq Expression: Lag0035164
Synteny: Lag0035164
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR025558 - Domain of unknown function DUF4283
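
As a rough illustration of how the genome location field can be read programmatically, the sketch below splits `chr3:15994728..15998899` into a sequence name and start/end coordinates and computes the span. The function name `parse_location` is hypothetical, and the span calculation assumes both coordinates are inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("chr3:15994728..15998899")
# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption the gene spans 4172 bp on chr3.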


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039967.1 hypothetical protein E6C27_scaffold122G002490 [Cucumis melo var. makuwa]; e value 3.5e-18; identity 33.51%
Query:  SVLLVESVENGPSNSNSYANLVKLGVS-SMKSIPLDGSAKNPRLRAHYSWRDVKMVLEEYFHSSISINPFMDDKALIQVAVGVSEFSLVGK---WNKYGD
        S  +++   +   +S+   N+ K G S ++ S   + +    R   H  W  +   L E   +++   PF  DKALI       +  L+ K   W   G 
Subjt:  SVLLVESVENGPSNSNSYANLVKLGVS-SMKSIPLDGSAKNPRLRAHYSWRDVKMVLEEYFHSSISINPFMDDKALIQVAVGVSEFSLVGK---WNKYGD

Query:  LHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFS
         ++K E WS + ++  K + SYGGWI +R +PL+ WN  SF  IG   GG V ++ +T  L D  EA I+I+ N  GFIPAYI  K+ +K E S
Subjt:  LHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFS

KAA0040039.1 hypothetical protein E6C27_scaffold366G00060 [Cucumis melo var. makuwa]; e value 5.4e-19; identity 32.99%
Query:  KGYSVLLVESVENGPSNSNSYANLVKLGVSSMK-SIPLDGSAKNPRLRAHYSWRDVKMVLEEYFHSSISINPFMDDKALIQVAVGVSEFSLVGK---WNK
        + Y+  +++   +   +++   N+ K G S+   S   + +A   R   H  W  +   L E   +++   PF  DKALI       +  L+ K   W  
Subjt:  KGYSVLLVESVENGPSNSNSYANLVKLGVSSMK-SIPLDGSAKNPRLRAHYSWRDVKMVLEEYFHSSISINPFMDDKALIQVAVGVSEFSLVGK---WNK

Query:  YGDLHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFS
         G  ++K E WS + ++  K + SYGGWI +R +PL+ WN  SF  IG   GG V ++ +T  L D  EA I+I+ N  GFIPAYI  K+ +K E S
Subjt:  YGDLHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFS

KAA0063414.1 uncharacterized protein E6C27_scaffold508G00510 [Cucumis melo var. makuwa]; e value 7.0e-19; identity 40.15%
Query:  LHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFSLRYGDI
        +H   E+   +  S    +K YGGWI+I+NLPL+ W+   ++AIG   GG  SIS KT+NL++CSEA I++ +N CGF+PA + ++   +    L +GDI
Subjt:  LHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFSLRYGDI

Query:  NTLEARNSKFDLSKDLSANAFLNSLDILRVKQVVLDE
          LEA      + + L  +   N +D+LR+ QV+LDE
Subjt:  NTLEARNSKFDLSKDLSANAFLNSLDILRVKQVVLDE

TYK30603.1 Ulp1-like peptidase [Cucumis melo var. makuwa]; e value 2.7e-18; identity 43.59%
Query:  LEEYFHSSISINPFMDDKALIQVAVGVSEFSLV--GKWNKYGDLHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSK
        LE+ F   I+INPF D++A  ++  G  E  +V   KW  YG  HL  E W   N+S+   +K + GW++I+NLPL+LW  A FE IG + GGL S +  
Subjt:  LEEYFHSSISINPFMDDKALIQVAVGVSEFSLV--GKWNKYGDLHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSK

Query:  TLNLLDCSEAFIEIEKN
        TLNL+ C++A I++ KN
Subjt:  TLNLLDCSEAFIEIEKN

XP_038904899.1 uncharacterized protein LOC120091119 isoform X2 [Benincasa hispida]; e value 6.6e-25; identity 40.97%
Query:  SLVGKWNKYGDLHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGN
        ++ GKW K+G  HLK E W++  + +  Y++ YGGWI+I+NLPL+ W   +FEAIGK  GGL SI+ + LNL+   +A I++++N CGF+PA I V    
Subjt:  SLVGKWNKYGDLHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGN

Query:  KLEFSLRYGDINTLEARNSKFDLSKDLSANAFLNSLDILRVKQV
        +    L +GDI+T    N    +  DL  + F N +D++R+ +V
Subjt:  KLEFSLRYGDINTLEARNSKFDLSKDLSANAFLNSLDILRVKQV

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TEK8 DUF4283 domain-containing protein; e value 1.7e-18; identity 33.51%
Query:  SVLLVESVENGPSNSNSYANLVKLGVS-SMKSIPLDGSAKNPRLRAHYSWRDVKMVLEEYFHSSISINPFMDDKALIQVAVGVSEFSLVGK---WNKYGD
        S  +++   +   +S+   N+ K G S ++ S   + +    R   H  W  +   L E   +++   PF  DKALI       +  L+ K   W   G 
Subjt:  SVLLVESVENGPSNSNSYANLVKLGVS-SMKSIPLDGSAKNPRLRAHYSWRDVKMVLEEYFHSSISINPFMDDKALIQVAVGVSEFSLVGK---WNKYGD

Query:  LHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFS
         ++K E WS + ++  K + SYGGWI +R +PL+ WN  SF  IG   GG V ++ +T  L D  EA I+I+ N  GFIPAYI  K+ +K E S
Subjt:  LHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFS

A0A5A7TFK7 DUF4283 domain-containing protein; e value 2.6e-19; identity 32.99%
Query:  KGYSVLLVESVENGPSNSNSYANLVKLGVSSMK-SIPLDGSAKNPRLRAHYSWRDVKMVLEEYFHSSISINPFMDDKALIQVAVGVSEFSLVGK---WNK
        + Y+  +++   +   +++   N+ K G S+   S   + +A   R   H  W  +   L E   +++   PF  DKALI       +  L+ K   W  
Subjt:  KGYSVLLVESVENGPSNSNSYANLVKLGVSSMK-SIPLDGSAKNPRLRAHYSWRDVKMVLEEYFHSSISINPFMDDKALIQVAVGVSEFSLVGK---WNK

Query:  YGDLHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFS
         G  ++K E WS + ++  K + SYGGWI +R +PL+ WN  SF  IG   GG V ++ +T  L D  EA I+I+ N  GFIPAYI  K+ +K E S
Subjt:  YGDLHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFS

A0A5A7V878 DUF4283 domain-containing protein; e value 3.4e-19; identity 40.15%
Query:  LHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFSLRYGDI
        +H   E+   +  S    +K YGGWI+I+NLPL+ W+   ++AIG   GG  SIS KT+NL++CSEA I++ +N CGF+PA + ++   +    L +GDI
Subjt:  LHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFSLRYGDI

Query:  NTLEARNSKFDLSKDLSANAFLNSLDILRVKQVVLDE
          LEA      + + L  +   N +D+LR+ QV+LDE
Subjt:  NTLEARNSKFDLSKDLSANAFLNSLDILRVKQVVLDE

A0A5D3DLP0 DUF4283 domain-containing protein; e value 2.2e-18; identity 33.51%
Query:  SVLLVESVENGPSNSNSYANLVKLGVS-SMKSIPLDGSAKNPRLRAHYSWRDVKMVLEEYFHSSISINPFMDDKALIQVAVGVSEFSLVGK---WNKYGD
        S  +++   +   +S+   N+ K G S ++ S   + +    R   H  W  +   L E   +++   PF  DKALI       +  L+ K   W   G 
Subjt:  SVLLVESVENGPSNSNSYANLVKLGVS-SMKSIPLDGSAKNPRLRAHYSWRDVKMVLEEYFHSSISINPFMDDKALIQVAVGVSEFSLVGK---WNKYGD

Query:  LHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFS
         ++K E WS + ++  K + SYGGWI +R +PL+ WN  SF  IG   GG V ++ +T  L D  EA I+I+ N  GFIPAYI  K+ +K E S
Subjt:  LHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFS

A0A5D3E3A5 Ulp1-like peptidase; e value 1.3e-18; identity 43.59%
Query:  LEEYFHSSISINPFMDDKALIQVAVGVSEFSLV--GKWNKYGDLHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSK
        LE+ F   I+INPF D++A  ++  G  E  +V   KW  YG  HL  E W   N+S+   +K + GW++I+NLPL+LW  A FE IG + GGL S +  
Subjt:  LEEYFHSSISINPFMDDKALIQVAVGVSEFSLV--GKWNKYGDLHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSK

Query:  TLNLLDCSEAFIEIEKN
        TLNL+ C++A I++ KN
Subjt:  TLNLLDCSEAFIEIEKN

SwissProt top hits (e value, % identity, alignment)
P92555 Uncharacterized mitochondrial protein AtMg01250; e value 1.7e-04; identity 45.31%
Query:  PKGKIYASRGLRQLDLLSLFLFLLVVDVLSRRVFRGVEGGGIMGGFKVGKECISLSHLQFADDT
        P+G +  SRGLRQ D LS +LF+L  +VLS    R  E G  + G +V      ++HL FADDT
Subjt:  PKGKIYASRGLRQLDLLSLFLFLLVVDVLSRRVFRGVEGGGIMGGFKVGKECISLSHLQFADDT

Arabidopsis top hits (e value, % identity, alignment)
ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase); e value 1.2e-05; identity 45.31%
Query:  PKGKIYASRGLRQLDLLSLFLFLLVVDVLSRRVFRGVEGGGIMGGFKVGKECISLSHLQFADDT
        P+G +  SRGLRQ D LS +LF+L  +VLS    R  E G  + G +V      ++HL FADDT
Subjt:  PKGKIYASRGLRQLDLLSLFLFLLVVDVLSRRVFRGVEGGGIMGGFKVGKECISLSHLQFADDT
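
The % identity figures listed with each hit can be related to the alignment blocks. The sketch below is one plausible way to do that, assuming the unlabeled middle line follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch). The helper `percent_identity` is hypothetical, and dividing by the query segment length is only valid here because the AtMg01250 alignment contains no gaps.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned query positions marked as identical in the midline."""
    # Assumption: letters in the midline mark identical residues;
    # '+' and spaces mark non-identical positions.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Query segment and midline copied from the AtMg01250 alignment.
query = "PKGKIYASRGLRQLDLLSLFLFLLVVDVLSRRVFRGVEGGGIMGGFKVGKECISLSHLQFADDT"
midline = "P+G +  SRGLRQ D LS +LF+L  +VLS    R  E G  + G +V      ++HL FADDT"
print(percent_identity(query, midline))
```

For this alignment the midline carries 29 letters over 64 aligned positions, which gives the 45.31 listed for both the SwissProt and Arabidopsis hits.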


Sequences
CDS sequence
ATGATTAGTGATTTTATTCTCAAAATTCATGCTTCTGAAATTTGGCAAAATCCGCCTTTGTCAAGAAATTTGAAGGGTTATTCGGTTTTACTTGTTGAATCAGTAGAAAA
TGGCCCTTCAAATTCAAATTCATACGCTAATCTGGTCAAGCTAGGTGTCTCTAGTATGAAATCCATTCCTTTGGATGGCTCAGCCAAAAATCCAAGATTGAGGGCCCATT
ACTCTTGGAGAGATGTTAAGATGGTCCTTGAGGAGTATTTTCATTCTTCAATCTCGATCAACCCTTTTATGGACGATAAAGCCTTAATTCAGGTGGCTGTTGGCGTCTCT
GAATTTTCTTTGGTTGGTAAATGGAACAAATATGGGGACCTTCATTTGAAACTTGAACTTTGGTCTTCTGAGAATTACTCCCAATCGAAATATATGAAAAGCTATGGAGG
ATGGATTGCAATTAGGAATTTACCTTTAAATTTGTGGAATTGTGCATCCTTTGAAGCTATTGGAAAGAACCTTGGAGGTTTGGTTAGTATTTCATCTAAAACTCTTAATT
TATTAGATTGCTCCGAAGCTTTCATTGAAATAGAAAAAAATTGTTGTGGCTTTATTCCTGCTTATATCAATGTTAAGATCGGTAATAAGCTTGAATTCTCTCTTCGTTAT
GGTGATATTAACACGCTAGAGGCTAGGAATTCTAAGTTTGATTTAAGTAAAGATTTATCAGCTAATGCCTTTTTGAATTCCCTGGATATATTAAGGGTCAAGCAAGTTGT
GTTGGATGAAGAATTGACTATTTTTAATGAAGGAGAGAGGGAGACCGAATTGCCATTAATTTCTTATTATCAGGAGGAATTTAATGAAGTGTTGGGTTCTCCTAAAGGTG
CATCGTTGCATGATGAGTGTATTAGTAACACTGGTTGTAAAGATATTAATGTTAGCATTCATGAGCCGGTTTCCTTTCTCTCTCCTTCTAAAGATGTTACTGCTATTATC
GTTGCCAGCCCTCATGAAGTCCAACAGCCTCAGTTCTTTGAACCTCCTTCCAAGAAGATTAATGACGTTATTTGCAATTTAAAAGATGATATCCAGCAGTCCATGGAATT
TGAGGTAGCTTCTGTTCAGATTCAAGGGGCGAGAGACCATTTGATCAGAGGTATTGTCAATTCTCCCAGTCATAAAATTCATTCTCCGGTGGAATCAGATGATGAGTCTG
TACTTAGTGTAAGCAGTGAGGATTCTGATCAGTTATATGATAAAGAGGATTGTGTGGAGCACTTTTCAGAAGATCAAATTGGTGAGTCTTTAGTTTCTATTTTTCTTGAG
AATGTTGATGGTCTAGGTTCTCAAGGTTCAATTATTCACGAGTCTTTACTATCTCCTTCTCAAATTCCTAACCAATTCTCTTCGTTAGTTGAAATTTGTGGACTTCAATT
GTGCAAGATTTCACCCCAGTCATCTAAAGCGGGTATTGGATGGTCTTTTGTGGAAGCTTATGGAAAATCAGGAGGTCTTCTTATTATGTGGGATGAAAGTAAACTGTCAG
TGTTGGAATTCTTAAAGGGCGGCTATTCTCTTTGGTCAAATGCCTTACTCTTTGTAAAACAGTTTGTTGGGTATCTAATGTGTATGGTCCAAATGATTACAAAGAAAGGA
GATTTCTATGGCCCGAATTACGCTCTCTTTCTTATTATTGAACGGATCTTTGGTGTATTGGTGGTGACTTTAATATTACTCGATGGGTTCATGAAAGATTTCCTGTGGGA
AGGCAAACAAAAGAATCGTTATAATTGGCACACTTCTGGTAAGGTAGGCCCGAGTCTTCGAAGTCCTTGCATTAATATTTCTAAGGTTCTTCAAAGGAGGTTCCCGTCTC
ATGTTTTATCTCCTTCTATCTGCCCTTTATGCTTGAATGCCAGTGAATCTTCGCAACATTTGTTCTTCGATTGTGTTTATTCTTATCAGTGTTGGGGGAAGTTATTGTCT
ATCTTCAAGCTTCAATGGGTTTTGGATCGGTCATTCAAAGGAAATGTGCTCTTGGTCCGGTTCGTTGATTTAGTTAAGTTTTTGCCCACCCCTAGCTGTCCCAAAGGGAA
GATTTATGCTTCTCGAGGCCTTAGGCAATTAGACCTGCTATCTCTCTTCCTCTTTCTTCTCGTGGTAGATGTGCTGAGTAGAAGGGTCTTTAGAGGGGTGGAGGGAGGAG
GTATCATGGGAGGTTTTAAGGTGGGTAAGGAGTGCATTTCTTTATCTCATCTTCAGTTTGCCGATGACACTATATTCTCTTCTCAGGGAATGAAGATTCCTTTCTTAACT
TAA
mRNA sequence
ATGATTAGTGATTTTATTCTCAAAATTCATGCTTCTGAAATTTGGCAAAATCCGCCTTTGTCAAGAAATTTGAAGGGTTATTCGGTTTTACTTGTTGAATCAGTAGAAAA
TGGCCCTTCAAATTCAAATTCATACGCTAATCTGGTCAAGCTAGGTGTCTCTAGTATGAAATCCATTCCTTTGGATGGCTCAGCCAAAAATCCAAGATTGAGGGCCCATT
ACTCTTGGAGAGATGTTAAGATGGTCCTTGAGGAGTATTTTCATTCTTCAATCTCGATCAACCCTTTTATGGACGATAAAGCCTTAATTCAGGTGGCTGTTGGCGTCTCT
GAATTTTCTTTGGTTGGTAAATGGAACAAATATGGGGACCTTCATTTGAAACTTGAACTTTGGTCTTCTGAGAATTACTCCCAATCGAAATATATGAAAAGCTATGGAGG
ATGGATTGCAATTAGGAATTTACCTTTAAATTTGTGGAATTGTGCATCCTTTGAAGCTATTGGAAAGAACCTTGGAGGTTTGGTTAGTATTTCATCTAAAACTCTTAATT
TATTAGATTGCTCCGAAGCTTTCATTGAAATAGAAAAAAATTGTTGTGGCTTTATTCCTGCTTATATCAATGTTAAGATCGGTAATAAGCTTGAATTCTCTCTTCGTTAT
GGTGATATTAACACGCTAGAGGCTAGGAATTCTAAGTTTGATTTAAGTAAAGATTTATCAGCTAATGCCTTTTTGAATTCCCTGGATATATTAAGGGTCAAGCAAGTTGT
GTTGGATGAAGAATTGACTATTTTTAATGAAGGAGAGAGGGAGACCGAATTGCCATTAATTTCTTATTATCAGGAGGAATTTAATGAAGTGTTGGGTTCTCCTAAAGGTG
CATCGTTGCATGATGAGTGTATTAGTAACACTGGTTGTAAAGATATTAATGTTAGCATTCATGAGCCGGTTTCCTTTCTCTCTCCTTCTAAAGATGTTACTGCTATTATC
GTTGCCAGCCCTCATGAAGTCCAACAGCCTCAGTTCTTTGAACCTCCTTCCAAGAAGATTAATGACGTTATTTGCAATTTAAAAGATGATATCCAGCAGTCCATGGAATT
TGAGGTAGCTTCTGTTCAGATTCAAGGGGCGAGAGACCATTTGATCAGAGGTATTGTCAATTCTCCCAGTCATAAAATTCATTCTCCGGTGGAATCAGATGATGAGTCTG
TACTTAGTGTAAGCAGTGAGGATTCTGATCAGTTATATGATAAAGAGGATTGTGTGGAGCACTTTTCAGAAGATCAAATTGGTGAGTCTTTAGTTTCTATTTTTCTTGAG
AATGTTGATGGTCTAGGTTCTCAAGGTTCAATTATTCACGAGTCTTTACTATCTCCTTCTCAAATTCCTAACCAATTCTCTTCGTTAGTTGAAATTTGTGGACTTCAATT
GTGCAAGATTTCACCCCAGTCATCTAAAGCGGGTATTGGATGGTCTTTTGTGGAAGCTTATGGAAAATCAGGAGGTCTTCTTATTATGTGGGATGAAAGTAAACTGTCAG
TGTTGGAATTCTTAAAGGGCGGCTATTCTCTTTGGTCAAATGCCTTACTCTTTGTAAAACAGTTTGTTGGGTATCTAATGTGTATGGTCCAAATGATTACAAAGAAAGGA
GATTTCTATGGCCCGAATTACGCTCTCTTTCTTATTATTGAACGGATCTTTGGTGTATTGGTGGTGACTTTAATATTACTCGATGGGTTCATGAAAGATTTCCTGTGGGA
AGGCAAACAAAAGAATCGTTATAATTGGCACACTTCTGGTAAGGTAGGCCCGAGTCTTCGAAGTCCTTGCATTAATATTTCTAAGGTTCTTCAAAGGAGGTTCCCGTCTC
ATGTTTTATCTCCTTCTATCTGCCCTTTATGCTTGAATGCCAGTGAATCTTCGCAACATTTGTTCTTCGATTGTGTTTATTCTTATCAGTGTTGGGGGAAGTTATTGTCT
ATCTTCAAGCTTCAATGGGTTTTGGATCGGTCATTCAAAGGAAATGTGCTCTTGGTCCGGTTCGTTGATTTAGTTAAGTTTTTGCCCACCCCTAGCTGTCCCAAAGGGAA
GATTTATGCTTCTCGAGGCCTTAGGCAATTAGACCTGCTATCTCTCTTCCTCTTTCTTCTCGTGGTAGATGTGCTGAGTAGAAGGGTCTTTAGAGGGGTGGAGGGAGGAG
GTATCATGGGAGGTTTTAAGGTGGGTAAGGAGTGCATTTCTTTATCTCATCTTCAGTTTGCCGATGACACTATATTCTCTTCTCAGGGAATGAAGATTCCTTTCTTAACT
TAA
Protein sequence
MISDFILKIHASEIWQNPPLSRNLKGYSVLLVESVENGPSNSNSYANLVKLGVSSMKSIPLDGSAKNPRLRAHYSWRDVKMVLEEYFHSSISINPFMDDKALIQVAVGVS
EFSLVGKWNKYGDLHLKLELWSSENYSQSKYMKSYGGWIAIRNLPLNLWNCASFEAIGKNLGGLVSISSKTLNLLDCSEAFIEIEKNCCGFIPAYINVKIGNKLEFSLRY
GDINTLEARNSKFDLSKDLSANAFLNSLDILRVKQVVLDEELTIFNEGERETELPLISYYQEEFNEVLGSPKGASLHDECISNTGCKDINVSIHEPVSFLSPSKDVTAII
VASPHEVQQPQFFEPPSKKINDVICNLKDDIQQSMEFEVASVQIQGARDHLIRGIVNSPSHKIHSPVESDDESVLSVSSEDSDQLYDKEDCVEHFSEDQIGESLVSIFLE
NVDGLGSQGSIIHESLLSPSQIPNQFSSLVEICGLQLCKISPQSSKAGIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLWSNALLFVKQFVGYLMCMVQMITKKG
DFYGPNYALFLIIERIFGVLVVTLILLDGFMKDFLWEGKQKNRYNWHTSGKVGPSLRSPCINISKVLQRRFPSHVLSPSICPLCLNASESSQHLFFDCVYSYQCWGKLLS
IFKLQWVLDRSFKGNVLLVRFVDLVKFLPTPSCPKGKIYASRGLRQLDLLSLFLFLLVVDVLSRRVFRGVEGGGIMGGFKVGKECISLSHLQFADDTIFSSQGMKIPFLT
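
The protein sequence corresponds to the CDS translated codon by codon. As a minimal sketch, assuming the standard nuclear genetic code (this record does not say which table was used), the snippet below translates the first 30 nucleotides of the CDS. The names `CODON_TABLE` and `translate` are illustrative, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGATTAGTGATTTTATTCTCAAAATTCAT"))  # MISDFILKIH
```

The result matches the first ten residues of the protein sequence. Passing the full CDS with its line breaks removed would be the natural next check.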