CuGenDBv2

Spg037058 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg037058
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Enzymatic polyprotein
Genome location: scaffold6:28261976..28267262
RNA-Seq Expression: Spg037058
Synteny: Spg037058
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: NA
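
The genome location field can be split into scaffold, start and end programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (this record does not state the convention); `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical helper)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based inclusive coordinates; not confirmed by the record.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'scaffold6:28261976..28267262'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("scaffold6:28261976..28267262")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span is the end minus the start plus one.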


Homology
GenBank top hits    e value    %identity    Alignment
KAA0037024.1 polyprotein [Cucumis melo var. makuwa]    5.4e-39    52.06
Query:  MDIDEETKQSLLRILNSNEVSSEEESF-FEK--DLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREEL
        M+IDE+ KQSL +++ + E+SSEEE F FEK  DL+N ++EESS   S S NE+D +EAI   GCINVLT+ QK L D+I+E+  +  RKK+LL+LRE+L
Subjt:  MDIDEETKQSLLRILNSNEVSSEEESF-FEK--DLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREEL

Query:  EAPEQTYKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINK
        E P+Q +KDPMNFS+Q V+N L +EHAAP+K+  L HEI+ LK+E+ +NKQ++S L+ A VAIQE    +II     S E++ + S D N+I++
Subjt:  EAPEQTYKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINK

KAA0050194.1 polyprotein [Cucumis melo var. makuwa]    2.3e-37    49.47
Query:  TKQSLLRILNSNEVSSEEESFF---EKDLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREELEAPEQT
        T  S  +++ + E+SSEEE F    E+D++N ++EESS+  S S NE+D  EAIPC GCINVLT+ Q+ L D+I+E+  +  RK +LL+LRE+LE P+Q 
Subjt:  TKQSLLRILNSNEVSSEEESFF---EKDLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREELEAPEQT

Query:  YKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINK
        +KDPMNFS+Q V+N L +EHAAP+K++ LQHEIK LK+E+ +N+Q++S LE A V IQE    +II     S E++ + S D N+I++
Subjt:  YKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINK

KAA0050625.1 Enzymatic polyprotein [Cucumis melo var. makuwa]    5.8e-41    51.61
Query:  MDIDEETKQSLLRILNSNEVSSEEESFF---EKDLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREEL
        M+IDE+ KQSLL+++ + E+SSEE+ F    E+DL+N ++EESS   S S NE+D +E I C GCI+VLT+ QK L D+I+E+  +  RK +LL+LRE+L
Subjt:  MDIDEETKQSLLRILNSNEVSSEEESFF---EKDLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREEL

Query:  EAPEQTYKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSS
        E P+Q +KDPMNFS+QNV+N L +EHAAP+K++ LQHEIK LK EV +NKQ++S LE A VAIQE   ++ +++       L + S
Subjt:  EAPEQTYKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSS

TYJ98215.1 polyprotein [Cucumis melo var. makuwa]    2.3e-37    49.47
Query:  TKQSLLRILNSNEVSSEEESFF---EKDLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREELEAPEQT
        T  S  +++ + E+SSEEE F    E+D++N ++EESS+  S S NE+D  EAIPC GCINVLT+ Q+ L D+I+E+  +  RK +LL+LRE+LE P+Q 
Subjt:  TKQSLLRILNSNEVSSEEESFF---EKDLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREELEAPEQT

Query:  YKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINK
        +KDPMNFS+Q V+N L +EHAAP+K++ LQHEIK LK+E+ +N+Q++S LE A V IQE    +II     S E++ + S D N+I++
Subjt:  YKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINK

TYK23160.1 hypothetical protein E5676_scaffold142G001850 [Cucumis melo var. makuwa]    1.5e-44    37.65
Query:  SNEVSSEEESF-FEK--DLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREELEAPEQTYKDPMNFSYQ
        + E+SSEEE   FEK  DL+N ++EESS   S S NE+D +EAIPC  CINVLT+ QK L D+I+E+  +  RKK+LL+L E+LE P Q +KDPMNFS+Q
Subjt:  SNEVSSEEESF-FEK--DLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREELEAPEQTYKDPMNFSYQ

Query:  NVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINKVFNQNCDTIFITNSRLASRMRL
         V N L +          L H ++  K                               +P + E                          +SR      +
Subjt:  NVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINKVFNQNCDTIFITNSRLASRMRL

Query:  QHAKSLTVYARRKRVPEGKLMIFSHLGINGSINQSMRYKIQDYKLETIALIDSGADQNVIQEGIIPSRPAIRPLPPPQASQNVASSSGASSSRGKRPITQ
        Q  KSL         P+G                          +  ++L+D  A                + +    A+  +ASSS  SSS GK PI+Q
Subjt:  QHAKSLTVYARRKRVPEGKLMIFSHLGINGSINQSMRYKIQDYKLETIALIDSGADQNVIQEGIIPSRPAIRPLPPPQASQNVASSSGASSSRGKRPITQ

Query:  TSAPSPMSADNYEMDLGFELLSRRRPGSSSRSLSIRPDVSLPPHPSTTLLRPSGVSNPNVRPPAPSNLVPRRNTQSYARTVRPAVFMPRPPVIGYQNKTT
        TSA +PM A+NY MDL F+++SRRR GSS R+++I         PST LLRP  V+         SN    R  QSYARTV+PAVFMPRPPV GYQ KTT
Subjt:  TSAPSPMSADNYEMDLGFELLSRRRPGSSSRSLSIRPDVSLPPHPSTTLLRPSGVSNPNVRPPAPSNLVPRRNTQSYARTVRPAVFMPRPPVIGYQNKTT

Query:  LEDVIIEPEFDGPSVKE
        LEDV+IEPEFDGP + E
Subjt:  LEDVIIEPEFDGPSVKE

TrEMBL top hits    e value    %identity    Alignment
A0A5A7U4M8 Polyprotein    1.1e-37    49.47
Query:  TKQSLLRILNSNEVSSEEESFF---EKDLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREELEAPEQT
        T  S  +++ + E+SSEEE F    E+D++N ++EESS+  S S NE+D  EAIPC GCINVLT+ Q+ L D+I+E+  +  RK +LL+LRE+LE P+Q 
Subjt:  TKQSLLRILNSNEVSSEEESFF---EKDLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREELEAPEQT

Query:  YKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINK
        +KDPMNFS+Q V+N L +EHAAP+K++ LQHEIK LK+E+ +N+Q++S LE A V IQE    +II     S E++ + S D N+I++
Subjt:  YKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINK

A0A5A7UAJ7 Enzymatic polyprotein    2.8e-41    51.61
Query:  MDIDEETKQSLLRILNSNEVSSEEESFF---EKDLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREEL
        M+IDE+ KQSLL+++ + E+SSEE+ F    E+DL+N ++EESS   S S NE+D +E I C GCI+VLT+ QK L D+I+E+  +  RK +LL+LRE+L
Subjt:  MDIDEETKQSLLRILNSNEVSSEEESFF---EKDLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREEL

Query:  EAPEQTYKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSS
        E P+Q +KDPMNFS+QNV+N L +EHAAP+K++ LQHEIK LK EV +NKQ++S LE A VAIQE   ++ +++       L + S
Subjt:  EAPEQTYKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSS

A0A5D3BI97 Polyprotein    1.1e-37    49.47
Query:  TKQSLLRILNSNEVSSEEESFF---EKDLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREELEAPEQT
        T  S  +++ + E+SSEEE F    E+D++N ++EESS+  S S NE+D  EAIPC GCINVLT+ Q+ L D+I+E+  +  RK +LL+LRE+LE P+Q 
Subjt:  TKQSLLRILNSNEVSSEEESFF---EKDLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREELEAPEQT

Query:  YKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINK
        +KDPMNFS+Q V+N L +EHAAP+K++ LQHEIK LK+E+ +N+Q++S LE A V IQE    +II     S E++ + S D N+I++
Subjt:  YKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINK

A0A5D3C457 Polyprotein    2.6e-39    52.06
Query:  MDIDEETKQSLLRILNSNEVSSEEESF-FEK--DLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREEL
        M+IDE+ KQSL +++ + E+SSEEE F FEK  DL+N ++EESS   S S NE+D +EAI   GCINVLT+ QK L D+I+E+  +  RKK+LL+LRE+L
Subjt:  MDIDEETKQSLLRILNSNEVSSEEESF-FEK--DLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREEL

Query:  EAPEQTYKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINK
        E P+Q +KDPMNFS+Q V+N L +EHAAP+K+  L HEI+ LK+E+ +NKQ++S L+ A VAIQE    +II     S E++ + S D N+I++
Subjt:  EAPEQTYKDPMNFSYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINK

A0A5D3DIF5 Uncharacterized protein    7.1e-45    37.65
Query:  SNEVSSEEESF-FEK--DLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREELEAPEQTYKDPMNFSYQ
        + E+SSEEE   FEK  DL+N ++EESS   S S NE+D +EAIPC  CINVLT+ QK L D+I+E+  +  RKK+LL+L E+LE P Q +KDPMNFS+Q
Subjt:  SNEVSSEEESF-FEK--DLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREELEAPEQTYKDPMNFSYQ

Query:  NVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINKVFNQNCDTIFITNSRLASRMRL
         V N L +          L H ++  K                               +P + E                          +SR      +
Subjt:  NVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINKVFNQNCDTIFITNSRLASRMRL

Query:  QHAKSLTVYARRKRVPEGKLMIFSHLGINGSINQSMRYKIQDYKLETIALIDSGADQNVIQEGIIPSRPAIRPLPPPQASQNVASSSGASSSRGKRPITQ
        Q  KSL         P+G                          +  ++L+D  A                + +    A+  +ASSS  SSS GK PI+Q
Subjt:  QHAKSLTVYARRKRVPEGKLMIFSHLGINGSINQSMRYKIQDYKLETIALIDSGADQNVIQEGIIPSRPAIRPLPPPQASQNVASSSGASSSRGKRPITQ

Query:  TSAPSPMSADNYEMDLGFELLSRRRPGSSSRSLSIRPDVSLPPHPSTTLLRPSGVSNPNVRPPAPSNLVPRRNTQSYARTVRPAVFMPRPPVIGYQNKTT
        TSA +PM A+NY MDL F+++SRRR GSS R+++I         PST LLRP  V+         SN    R  QSYARTV+PAVFMPRPPV GYQ KTT
Subjt:  TSAPSPMSADNYEMDLGFELLSRRRPGSSSRSLSIRPDVSLPPHPSTTLLRPSGVSNPNVRPPAPSNLVPRRNTQSYARTVRPAVFMPRPPVIGYQNKTT

Query:  LEDVIIEPEFDGPSVKE
        LEDV+IEPEFDGP + E
Subjt:  LEDVIIEPEFDGPSVKE

SwissProt top hits    e value    %identity    Alignment
No hits found

Arabidopsis top hits    e value    %identity    Alignment
No hits found

Sequences
CDS sequence
ATGGATATCGACGAAGAAACAAAGCAATCTCTTCTTCGAATCCTTAATTCCAATGAGGTATCCTCTGAAGAAGAATCCTTTTTTGAAAAGGATTTGATCAATGAGATAAT
TGAAGAATCTTCCAACAATTCTTCATTATCCAGCAACGAATCAGACGAAGATGAAGCCATCCCCTGCGGAGGATGCATCAACGTCCTAACAACGTATCAAAAGAATCTCT
TTGATCTTATCGATGAGATTCCATACCAAGGAACCAGGAAGAAAATGCTTTTAAAGCTCCGGGAAGAGCTTGAAGCACCTGAGCAAACTTACAAAGATCCTATGAACTTC
AGCTACCAGAATGTGCTGAATCGTTTAAGGGAAGAACATGCTGCACCTATCAAGATTTCTTATCTACAGCATGAAATCAAAACCCTAAAAAGGGAAGTTGCTGATAATAA
GCAACGTCTATCTGATCTTGAATTTGCCTTCGTAGCAATTCAAGAGTCCACAGCTGCAAGGATCATTGTTGAAGAACCAAAATCTGCTGAGGATTTAGGTTCCTCCTCTA
AAGATATCAACATCATCAATAAAGTCTTTAATCAAAATTGCGATACTATTTTCATTACTAACTCCCGCTTGGCCAGCCGGATGCGGTTGCAGCATGCTAAATCCCTGACA
GTTTATGCCAGACGCAAAAGAGTACCGGAAGGGAAGCTTATGATATTTTCTCACTTAGGCATTAATGGATCAATTAATCAAAGTATGCGCTACAAGATTCAAGATTACAA
GCTCGAGACTATAGCTCTTATAGATTCTGGAGCAGATCAAAATGTTATTCAAGAAGGAATAATTCCATCAAGGCCTGCAATAAGGCCTCTGCCTCCTCCCCAGGCAAGCC
AAAATGTAGCTAGCTCTTCTGGGGCTAGTTCTTCAAGGGGCAAACGCCCTATTACTCAAACCTCTGCCCCATCTCCGATGAGTGCAGATAATTACGAAATGGATCTCGGT
TTTGAACTATTATCCAGACGTCGCCCAGGTTCCTCTTCAAGGAGCCTATCAATAAGACCGGATGTGAGTCTCCCTCCACATCCTTCTACAACTTTGTTACGCCCTTCCGG
TGTATCAAATCCAAACGTGCGACCTCCCGCACCTTCAAATTTAGTTCCTCGAAGGAACACTCAATCTTATGCCCGTACCGTCAGGCCAGCGGTATTTATGCCACGACCTC
CGGTAATCGGCTATCAAAACAAAACAACTCTGGAAGATGTTATCATTGAACCAGAGTTTGACGGACCTTCAGTCAAGGAGTGGTTCAAATCCAAAGCGCATCTTTCTCGA
GAAGAAGAAGACCAATTTCTTCTTGCCAAGAATTCAATCATGACGTCACTAGCAGGTGCAACTTCTGAGTCTGACCTCCAAGCCATCATTCAAACAGTTGTCCAAACTCT
CTCAGACAACGACGACCTTCAAGAAGAAGAAAACCCTGAAGCATTAGAAGCAGCTTCAGAATCTTCTGTCAATGATGTTGAAGATGAATACGACTCGTATCTCAATTTCA
ACATTCTTGATCCTTACCATGATTCCTAG
mRNA sequence
ATGGATATCGACGAAGAAACAAAGCAATCTCTTCTTCGAATCCTTAATTCCAATGAGGTATCCTCTGAAGAAGAATCCTTTTTTGAAAAGGATTTGATCAATGAGATAAT
TGAAGAATCTTCCAACAATTCTTCATTATCCAGCAACGAATCAGACGAAGATGAAGCCATCCCCTGCGGAGGATGCATCAACGTCCTAACAACGTATCAAAAGAATCTCT
TTGATCTTATCGATGAGATTCCATACCAAGGAACCAGGAAGAAAATGCTTTTAAAGCTCCGGGAAGAGCTTGAAGCACCTGAGCAAACTTACAAAGATCCTATGAACTTC
AGCTACCAGAATGTGCTGAATCGTTTAAGGGAAGAACATGCTGCACCTATCAAGATTTCTTATCTACAGCATGAAATCAAAACCCTAAAAAGGGAAGTTGCTGATAATAA
GCAACGTCTATCTGATCTTGAATTTGCCTTCGTAGCAATTCAAGAGTCCACAGCTGCAAGGATCATTGTTGAAGAACCAAAATCTGCTGAGGATTTAGGTTCCTCCTCTA
AAGATATCAACATCATCAATAAAGTCTTTAATCAAAATTGCGATACTATTTTCATTACTAACTCCCGCTTGGCCAGCCGGATGCGGTTGCAGCATGCTAAATCCCTGACA
GTTTATGCCAGACGCAAAAGAGTACCGGAAGGGAAGCTTATGATATTTTCTCACTTAGGCATTAATGGATCAATTAATCAAAGTATGCGCTACAAGATTCAAGATTACAA
GCTCGAGACTATAGCTCTTATAGATTCTGGAGCAGATCAAAATGTTATTCAAGAAGGAATAATTCCATCAAGGCCTGCAATAAGGCCTCTGCCTCCTCCCCAGGCAAGCC
AAAATGTAGCTAGCTCTTCTGGGGCTAGTTCTTCAAGGGGCAAACGCCCTATTACTCAAACCTCTGCCCCATCTCCGATGAGTGCAGATAATTACGAAATGGATCTCGGT
TTTGAACTATTATCCAGACGTCGCCCAGGTTCCTCTTCAAGGAGCCTATCAATAAGACCGGATGTGAGTCTCCCTCCACATCCTTCTACAACTTTGTTACGCCCTTCCGG
TGTATCAAATCCAAACGTGCGACCTCCCGCACCTTCAAATTTAGTTCCTCGAAGGAACACTCAATCTTATGCCCGTACCGTCAGGCCAGCGGTATTTATGCCACGACCTC
CGGTAATCGGCTATCAAAACAAAACAACTCTGGAAGATGTTATCATTGAACCAGAGTTTGACGGACCTTCAGTCAAGGAGTGGTTCAAATCCAAAGCGCATCTTTCTCGA
GAAGAAGAAGACCAATTTCTTCTTGCCAAGAATTCAATCATGACGTCACTAGCAGGTGCAACTTCTGAGTCTGACCTCCAAGCCATCATTCAAACAGTTGTCCAAACTCT
CTCAGACAACGACGACCTTCAAGAAGAAGAAAACCCTGAAGCATTAGAAGCAGCTTCAGAATCTTCTGTCAATGATGTTGAAGATGAATACGACTCGTATCTCAATTTCA
ACATTCTTGATCCTTACCATGATTCCTAG
Protein sequence
MDIDEETKQSLLRILNSNEVSSEEESFFEKDLINEIIEESSNNSSLSSNESDEDEAIPCGGCINVLTTYQKNLFDLIDEIPYQGTRKKMLLKLREELEAPEQTYKDPMNF
SYQNVLNRLREEHAAPIKISYLQHEIKTLKREVADNKQRLSDLEFAFVAIQESTAARIIVEEPKSAEDLGSSSKDINIINKVFNQNCDTIFITNSRLASRMRLQHAKSLT
VYARRKRVPEGKLMIFSHLGINGSINQSMRYKIQDYKLETIALIDSGADQNVIQEGIIPSRPAIRPLPPPQASQNVASSSGASSSRGKRPITQTSAPSPMSADNYEMDLG
FELLSRRRPGSSSRSLSIRPDVSLPPHPSTTLLRPSGVSNPNVRPPAPSNLVPRRNTQSYARTVRPAVFMPRPPVIGYQNKTTLEDVIIEPEFDGPSVKEWFKSKAHLSR
EEEDQFLLAKNSIMTSLAGATSESDLQAIIQTVVQTLSDNDDLQEEENPEALEAASESSVNDVEDEYDSYLNFNILDPYHDS
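
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; `translate` and `CODON_TABLE` are illustrative names, and only the first 60 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides of the CDS sequence listed in this record.
cds_start = "ATGGATATCGACGAAGAAACAAAGCAATCTCTTCTTCGAATCCTTAATTCCAATGAGGTA"
print(translate(cds_start))
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the protein sequence gives a quick consistency check for the record.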