CuGenDBv2

Spg026221 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg026221
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: scaffold13:14485360..14488706
RNA-Seq Expression: Spg026221
Synteny: Spg026221
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
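The genome location field packs the scaffold name and both coordinates into one string. A minimal sketch of splitting it and computing the span of the gene model follows; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


scaffold, start, end = parse_location("scaffold13:14485360..14488706")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(scaffold, start, end, span)
```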


Homology
GenBank top hits (e value, % identity, alignment)
KAA0048674.1 Ankyrin repeat protein [Cucumis melo var. makuwa]; e value 3.2e-27; identity 43.88%
Query:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC
        ++  CCA+V ATF D TGVV+ GLNHQFSFR FG    G P V G++  +V  Q   EAP GIAPVGL+T I+K+ KA S+ L+I           I + 
Subjt:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC

Query:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINLKNGG
        KG   +S+ HE +  +G NI  S+GGKVS+KS GSV  S+GG I+      + K L SFG++ KK    S G + H+ K  + VS+   I++   G
Subjt:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINLKNGG

KAA0048677.1 Ankyrin repeat protein [Cucumis melo var. makuwa]; e value 1.9e-24; identity 41.4%
Query:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC
        ++  CCA+V ATF+D TGVV+ GLNHQFSFR FG    G P V GS+  +V  Q   EAP GIAPVGLETPI+K +K  S   +I           I + 
Subjt:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC

Query:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSL-FSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINLKNGGINA
        KG   VS+ HE    SG NI  S  GKVS+KS  SV  S+GG I+      + K L   FG+E KK    S G + HK           G+    G +N 
Subjt:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSL-FSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINLKNGGINA

Query:  KSKGSHRLSLNNKAS
               +S+N + S
Subjt:  KSKGSHRLSLNNKAS

KAE8646351.1 hypothetical protein Csa_023818, partial [Cucumis sativus]; e value 1.8e-25; identity 45.41%
Query:  ATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKCKGDIVVSHGH
        ATFNDVTGVV+L LNH FSFR FG    G P   GS+A +V+  +  EAP GIAPVGL+TPI+K +K  +FGLEI           I + +  +VVS  H
Subjt:  ATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKCKGDIVVSHGH

Query:  EISKKSGVNIDISNGGKVSMKSSGSV----------LSNGGNIDVSKG-----GKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINLKNGG
        EI+ KSG NI +S+GGK+S+K  G V          L + GN+ VS G      K+ K L SFG+E KK    S G +  K K    VSH   I++K  G
Subjt:  EISKKSGVNIDISNGGKVSMKSSGSV----------LSNGGNIDVSKG-----GKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINLKNGG

Query:  INAKSKG
         +  SKG
Subjt:  INAKSKG

KAE8646352.1 hypothetical protein Csa_015951, partial [Cucumis sativus]; e value 3.4e-29; identity 48.6%
Query:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC
        ++  CCA+V ATF+D TGVV+ GLNHQFSFR FG    G P V GS+  +V+ Q   EAP GIAPVGLET I+K++K  S+ L+I K +           
Subjt:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC

Query:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSLFSFGLETKKSMTLSHGQRTHKSK
         G  VVSH HE +  SG NI  S+GGKVS+KS GSV  S+GG I+  K   H     SFG+E KK    + G + HK K
Subjt:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSLFSFGLETKKSMTLSHGQRTHKSK

XP_023004881.1 uncharacterized protein LOC111498059 [Cucurbita maxima]; e value 1.4e-17; identity 33.86%
Query:  CCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAPGIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKCKGDIV
        CCA++ ATFNDVTGV+  G NH+FSFR FG H  GI K   +   S                  + +S D+KA SF  EIKKG    +G E+PK  G   
Subjt:  CCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAPGIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKCKGDIV

Query:  VSHGHEISKKSGVNIDIS--------------NGGKVSMKSSGSVLSNGGNIDVSKGGKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINL
         S GHEI +K G NI  S              NGG+ S K  G+ + +G NI +    K   ++     +   +  L  G+   KS G  V SH  GI+ 
Subjt:  VSHGHEISKKSGVNIDIS--------------NGGKVSMKSSGSVLSNGGNIDVSKGGKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINL

Query:  KNGGINAKSKGSHRLSLNNKASRANSVPRIKKAILYLRNPDYSHLLCKNKG
         N G  A S G H +S N+  + A+         L     D  H +  + G
Subjt:  KNGGINAKSKGSHRLSLNNKASRANSVPRIKKAILYLRNPDYSHLLCKNKG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K5J0 Uncharacterized protein; e value 9.1e-28; identity 45.75%
Query:  CAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKCKGDIV
        C KV ATFNDVTGVV+L LNH FSFR FG    G P   GS+A +V+  +  EAP GIAPVGL+TPI+K +K  +FGLEI           I + +  +V
Subjt:  CAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKCKGDIV

Query:  VSHGHEISKKSGVNIDISNGGKVSMKSSGSV----------LSNGGNIDVSKG-----GKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGIN
        VS  HEI+ KSG NI +S+GGK+S+K  G V          L + GN+ VS G      K+ K L SFG+E KK    S G +  K K    VSH   I+
Subjt:  VSHGHEISKKSGVNIDISNGGKVSMKSSGSV----------LSNGGNIDVSKG-----GKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGIN

Query:  LKNGGINAKSKG
        +K  G +  SKG
Subjt:  LKNGGINAKSKG

A0A0A0K7W7 Uncharacterized protein; e value 8.8e-31; identity 46.94%
Query:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC
        ++  CCA+V ATF+D TGVV+ GLNHQFSFR FG    G P V GS+  +V  Q   EAP GIAPVGLET I+K++K  S+ L+I K +           
Subjt:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC

Query:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINLKNGG
         G  VVSH HE +  SG NI  S+GGKVS+KS GSV  S+GG I+  K   H     SFG+E KK    + G + HK K  + VSH   I++   G
Subjt:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINLKNGG

A0A0A0KAZ8 Uncharacterized protein; e value 3.7e-29; identity 47.89%
Query:  TATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKCKGDIVVSHG
        +ATF+D TGVV+ GLNHQFSFR FG    G P V GS+  +V+ Q   EAP GIAPVGLET I+K++K  S+ L+I K +            G  VVSH 
Subjt:  TATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKCKGDIVVSHG

Query:  HEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINL-KNGGIN
        HE +  SG NI  S+GGKVS+KS GSV  S+GG I+  K   H     SFG+E KK    + G + HK K  + VSH   I++ K G IN
Subjt:  HEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINL-KNGGIN

A0A5A7U057 Ankyrin repeat protein; e value 1.6e-27; identity 43.88%
Query:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC
        ++  CCA+V ATF D TGVV+ GLNHQFSFR FG    G P V G++  +V  Q   EAP GIAPVGL+T I+K+ KA S+ L+I           I + 
Subjt:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC

Query:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINLKNGG
        KG   +S+ HE +  +G NI  S+GGKVS+KS GSV  S+GG I+      + K L SFG++ KK    S G + H+ K  + VS+   I++   G
Subjt:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINLKNGG

A0A5A7U318 Ankyrin repeat protein; e value 9.4e-25; identity 41.4%
Query:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC
        ++  CCA+V ATF+D TGVV+ GLNHQFSFR FG    G P V GS+  +V  Q   EAP GIAPVGLETPI+K +K  S   +I           I + 
Subjt:  SFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAP-GIAPVGLETPISKDNKALSFGLEIKKGLTTSLGFEIPKC

Query:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSL-FSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINLKNGGINA
        KG   VS+ HE    SG NI  S  GKVS+KS  SV  S+GG I+      + K L   FG+E KK    S G + HK           G+    G +N 
Subjt:  KGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVL-SNGGNIDVSKGGKHQKSL-FSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINLKNGGINA

Query:  KSKGSHRLSLNNKAS
               +S+N + S
Subjt:  KSKGSHRLSLNNKAS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value 7.9e-08; identity 30.39%
Query:  PNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHW--KPNSQRHI--LIQSFFTATLWTLWNERNSRIFKGISRSSAQIR
        P  C LC    +S  +LF  C  +  +W FF  +T    P  L      M+  +W   P+ +++I  +I+  F + ++ +W ERN R+  G+SRS+  I 
Subjt:  PNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHW--KPNSQRHI--LIQSFFTATLWTLWNERNSRIFKGISRSSAQIR

Query:  ED
        +D
Subjt:  ED

AT4G05095.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT4G04650.1); e value 5.7e-06; identity 28.28%
Query:  PNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHWKPNSQR----HILIQSFFTATLWTLWNERNSRIFKGISRSSAQI
        P+ C LC  + E  ++LF  C +++ +W        F+    L     +M+   W  +  R     ++I+  F A+++ LW ERN R+    SRSS  I
Subjt:  PNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHWKPNSQR----HILIQSFFTATLWTLWNERNSRIFKGISRSSAQI

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value 1.1e-04; identity 28.12%
Query:  PNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHWKPNSQRHILIQSFFTATLWTLWNERNSRIFKGISRSSAQIR
        P+   LC    E+  +LF  C  +  IW FF  A+ F    P  +      I      S    +++    + ++ +W ERN+RIF  IS S++ +R
Subjt:  PNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHWKPNSQRHILIQSFFTATLWTLWNERNSRIFKGISRSSAQIR
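
The identity percentages listed for each hit can be cross-checked against the match line printed between the query and subject rows, where a letter marks an identical residue and `+` marks a conservative substitution. The sketch below does this for the AT5G18880.1 alignment directly above. The function name `percent_identity` is hypothetical, and it assumes the alignment length equals the length of the query row (gap characters included), which is how BLAST-style reports are conventionally read.

```python
def percent_identity(query_row, match_line):
    """Return (identical residues, percent identity) for one aligned block.

    Letters in the match line are identities; '+' and spaces are not.
    The alignment length is taken from the query row (assumption).
    """
    identities = sum(1 for ch in match_line if ch.isalpha())
    return identities, 100.0 * identities / len(query_row)


# Query row and match line copied from the AT5G18880.1 alignment.
query = ("PNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHW"
         "KPNSQRHILIQSFFTATLWTLWNERNSRIFKGISRSSAQIR")
match = ("P+   LC    E+  +LF  C  +  IW FF  A+ F    P  +      I      "
         "S    +++    + ++ +W ERN+RIF  IS S++ +R")

identities, percent = percent_identity(query, match)
print(identities, len(query), percent)
```

Under these assumptions the count comes to 27 identities over 96 aligned positions, which is consistent with the 28.12 reported for this hit. For hits whose alignment spans several blocks, the counts would need to be summed over all blocks before dividing.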


Sequences
CDS sequence
ATGATTAGCCCGAACGGGTGTTTCCTTTGCTACAAAAGTTTGGAAAGTATGGACTATCTCTTTATTCGTTGTGACCTTGCTTATCCCATTTGGGTGTTCTTCCATCAGGC
CACTGGTTTCCTTTTGCCGATTCCTCTCCATGTGGATCAATTTTATATGGAGATCTTTCATTGGAAGCCAAATTCTCAGAGGCATATTCTTATCCAATCATTCTTTACTG
CCACTTTGTGGACTTTGTGGAATGAGCGCAACAGTCGAATCTTCAAAGGGATCTCTCGCAGCTCTGCTCAAATTAGGGAGGATAGCTTTGCTCTTTCTGGATTTTGGCCG
TGTACCTCAAGCCTTTTTTGTAATTATGAAGCCTCTTCTATTTTCCTTATAATTGGGAGGCTTTCATGTAGCTTTTCCTTGTGTTGTGCAAAGGTAACTGCCACTTTCAA
CGATGTCACTGGTGTTGTCGATCTCGGCCTCAATCATCAATTTTCCTTCCGTGTATTTGGATCACATGGTTTTGGGATCCCAAAAGTTGGTGGTAGTGTCGCACCAAGTG
TAGACATGCAGAGTCCTGTAGAAGCACCAGGGATTGCCCCTGTTGGGCTTGAAACTCCAATAAGCAAAGACAATAAAGCTTTGTCTTTTGGTCTTGAAATTAAGAAGGGT
CTCACTACATCCCTTGGTTTTGAAATTCCCAAGTGTAAGGGCGACATTGTTGTTTCTCATGGTCACGAAATAAGTAAGAAGAGTGGCGTCAACATTGATATTTCCAATGG
TGGAAAAGTTAGTATGAAGAGCAGTGGTAGTGTTCTTTCAAATGGTGGCAACATTGATGTTTCCAAAGGTGGAAAACACCAAAAAAGCCTTTTCTCTTTTGGTCTTGAAA
CAAAGAAGAGCATGACCTTATCTCATGGTCAAAGAACTCACAAGAGTAAAGGTGGCATTGTTGTTTCTCATTGTCGTGGAATTAATCTTAAGAATGGAGGTATTAATGCA
AAGAGCAAAGGATCACATCGTCTGTCTTTAAATAACAAAGCGAGTCGCGCCAATAGTGTCCCCAGGATTAAGAAAGCAATCCTCTACCTCCGAAACCCTGATTATTCTCA
TCTTCTTTGCAAAAATAAGGGATGGGTCACAGTGGGAAATTTTTATGTTAAGTTCGAACGATGGGACTCTGAAATGCACGTTGCTGCTAAACTTGTTCCCAAATTACAGA
TGATCAAGGGTAATGCTTCACGGTTCGTACGGTGTTGCCGAACAAAGGAAAATGGATGGTTTGCTGAAACCCTAAGATCCATGGAACGTTTACTCGAGAAGCGTCTTTGG
AATATGATGAATTTGACGTCAAATCTGAATCATTTCTGTTCAGTGGAAATGAGGCATGCAGAGTGA
mRNA sequence
ATGATTAGCCCGAACGGGTGTTTCCTTTGCTACAAAAGTTTGGAAAGTATGGACTATCTCTTTATTCGTTGTGACCTTGCTTATCCCATTTGGGTGTTCTTCCATCAGGC
CACTGGTTTCCTTTTGCCGATTCCTCTCCATGTGGATCAATTTTATATGGAGATCTTTCATTGGAAGCCAAATTCTCAGAGGCATATTCTTATCCAATCATTCTTTACTG
CCACTTTGTGGACTTTGTGGAATGAGCGCAACAGTCGAATCTTCAAAGGGATCTCTCGCAGCTCTGCTCAAATTAGGGAGGATAGCTTTGCTCTTTCTGGATTTTGGCCG
TGTACCTCAAGCCTTTTTTGTAATTATGAAGCCTCTTCTATTTTCCTTATAATTGGGAGGCTTTCATGTAGCTTTTCCTTGTGTTGTGCAAAGGTAACTGCCACTTTCAA
CGATGTCACTGGTGTTGTCGATCTCGGCCTCAATCATCAATTTTCCTTCCGTGTATTTGGATCACATGGTTTTGGGATCCCAAAAGTTGGTGGTAGTGTCGCACCAAGTG
TAGACATGCAGAGTCCTGTAGAAGCACCAGGGATTGCCCCTGTTGGGCTTGAAACTCCAATAAGCAAAGACAATAAAGCTTTGTCTTTTGGTCTTGAAATTAAGAAGGGT
CTCACTACATCCCTTGGTTTTGAAATTCCCAAGTGTAAGGGCGACATTGTTGTTTCTCATGGTCACGAAATAAGTAAGAAGAGTGGCGTCAACATTGATATTTCCAATGG
TGGAAAAGTTAGTATGAAGAGCAGTGGTAGTGTTCTTTCAAATGGTGGCAACATTGATGTTTCCAAAGGTGGAAAACACCAAAAAAGCCTTTTCTCTTTTGGTCTTGAAA
CAAAGAAGAGCATGACCTTATCTCATGGTCAAAGAACTCACAAGAGTAAAGGTGGCATTGTTGTTTCTCATTGTCGTGGAATTAATCTTAAGAATGGAGGTATTAATGCA
AAGAGCAAAGGATCACATCGTCTGTCTTTAAATAACAAAGCGAGTCGCGCCAATAGTGTCCCCAGGATTAAGAAAGCAATCCTCTACCTCCGAAACCCTGATTATTCTCA
TCTTCTTTGCAAAAATAAGGGATGGGTCACAGTGGGAAATTTTTATGTTAAGTTCGAACGATGGGACTCTGAAATGCACGTTGCTGCTAAACTTGTTCCCAAATTACAGA
TGATCAAGGGTAATGCTTCACGGTTCGTACGGTGTTGCCGAACAAAGGAAAATGGATGGTTTGCTGAAACCCTAAGATCCATGGAACGTTTACTCGAGAAGCGTCTTTGG
AATATGATGAATTTGACGTCAAATCTGAATCATTTCTGTTCAGTGGAAATGAGGCATGCAGAGTGA
Protein sequence
MISPNGCFLCYKSLESMDYLFIRCDLAYPIWVFFHQATGFLLPIPLHVDQFYMEIFHWKPNSQRHILIQSFFTATLWTLWNERNSRIFKGISRSSAQIREDSFALSGFWP
CTSSLFCNYEASSIFLIIGRLSCSFSLCCAKVTATFNDVTGVVDLGLNHQFSFRVFGSHGFGIPKVGGSVAPSVDMQSPVEAPGIAPVGLETPISKDNKALSFGLEIKKG
LTTSLGFEIPKCKGDIVVSHGHEISKKSGVNIDISNGGKVSMKSSGSVLSNGGNIDVSKGGKHQKSLFSFGLETKKSMTLSHGQRTHKSKGGIVVSHCRGINLKNGGINA
KSKGSHRLSLNNKASRANSVPRIKKAILYLRNPDYSHLLCKNKGWVTVGNFYVKFERWDSEMHVAAKLVPKLQMIKGNASRFVRCCRTKENGWFAETLRSMERLLEKRLW
NMMNLTSNLNHFCSVEMRHAE
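
The protein sequence above should be the conceptual translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code (the page does not say which table was used) and using only the first 30 and last 18 nucleotides of the CDS to keep the example short. The helper name `translate` is hypothetical.

```python
# Standard genetic code, laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGATTAGCCCGAACGGGTGTTTCCTTTGC"  # first 30 nt of the CDS
cds_end = "ATGAGGCATGCAGAGTGA"                # last 18 nt of the CDS

print(translate(cds_start))  # MISPNGCFLC
print(translate(cds_end))    # MRHAE*
```

Both fragments agree with the start (`MISPNGCFLC`) and end (`MRHAE`, followed by the stop codon) of the listed protein. Joining the CDS lines and passing the full string to `translate` would extend the same check to the whole sequence.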