; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005766 (gene) of Snake gourd v1 genome

Gene IDTan0005766
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG09:47361669..47362448
RNA-Seq ExpressionTan0005766
SyntenyTan0005766
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]6.0e-9070.2Show/hide
Query:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------------------------------------
        MQNSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMY MLCTRPDICYA GI                                       
Subjt:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------------------------------------

Query:  ----------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK
                  T+KDSRKSTSGSVFTLNGGAVVWRSIK GCI D  MEAEYVA+CEAAKEAVWLRKF+  LEVV +MNLP+TL+CDNSGAVANSKE RSHK
Subjt:  ----------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK

Query:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR
        RGKHIERKYHLIREIV RGDV VT+IA EHN+ADPFTK LT KVFEGHLESLGLR
Subjt:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-9070.59Show/hide
Query:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------------------------------------
        MQNSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMYVMLCTRPDICYA GI                                       
Subjt:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------------------------------------

Query:  ----------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK
                  T+KDSRKSTSGSVFTLNGGAVVWRSIK GCI D  MEAEYVA+CEAAKEAVWLRKF+  LEVV +MNLP+TL+CDNSGAVANSKE RSHK
Subjt:  ----------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK

Query:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR
        RGKHIERKYHLIREIV RGDV VT+IA EHN+ADPFTK LT KVFEGHLESLGLR
Subjt:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]6.0e-9070.2Show/hide
Query:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------------------------------------
        MQNSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMY MLCTRPDICYA GI                                       
Subjt:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------------------------------------

Query:  ----------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK
                  T+KDSRKSTSGSVFTLNGGAVVWRSIK GCI D  MEAEYVA+CEAAKEAVWLRKF+  LEVV +MNLP+TL+CDNSGAVANSKE RSHK
Subjt:  ----------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK

Query:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR
        RGKHIERKYHLIREIV RGDV VT+IA EHN+ADPFTK LT KVFEGHLESLGLR
Subjt:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR

TYK11909.1 gag/pol protein [Cucumis melo var. makuwa]6.0e-9078.9Show/hide
Query:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGITEK------------DSRKSTSGSVFTLNGGAVVWRSIK
        MQNSKK LLPFRHGVHLSK+QCPKTPQEVEDMRRIPYAS VGSLMYVML TRPDICYA GI  +            DS+KSTSGSVFTLNGGAVVWRSIK
Subjt:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGITEK------------DSRKSTSGSVFTLNGGAVVWRSIK

Query:  MGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHKRGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFT
         GCIVD  MEAEYVA+CEAAKEA+WLRKF+  LEVV +MNL +TL+CDNSGAV +SKE RSHK+GKHIERKYHLIREIV RGDV VT+IA EHN+ADPFT
Subjt:  MGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHKRGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFT

Query:  KVLTDKVFEGHLESLGLR
        K LT KVF GHLESLGLR
Subjt:  KVLTDKVFEGHLESLGLR

TYK11959.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-9175.21Show/hide
Query:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI----------------------------TEKDSRKSTSG
        MQNSKK LLPFRHGVHLSK+QCPKTPQEVEDMRRIPYAS VGSLMY MLCTRPDICYA GI                            T+KDSRKS SG
Subjt:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI----------------------------TEKDSRKSTSG

Query:  SVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHKRGKHIERKYHLIREIVHRGDV
        S+FTLNGGAVVWRSIK GCI D  MEAEYV +CEAAKEAVWLRKF+  LEVV +MNLP+TL+CDNSGAVANSKE RSHK+GKHIERKYHLIREIV R DV
Subjt:  SVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHKRGKHIERKYHLIREIVHRGDV

Query:  TVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR
         VT+IALEHN+ADPFTK LT KVFEGHLESLGLR
Subjt:  TVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR

TrEMBL top hitse value%identityAlignment
A0A5A7TKM4 Gag/pol protein1.0e-9070.59Show/hide
Query:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------------------------------------
        MQNSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMYVMLCTRPDICYA GI                                       
Subjt:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------------------------------------

Query:  ----------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK
                  T+KDSRKSTSGSVFTLNGGAVVWRSIK GCI D  MEAEYVA+CEAAKEAVWLRKF+  LEVV +MNLP+TL+CDNSGAVANSKE RSHK
Subjt:  ----------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK

Query:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR
        RGKHIERKYHLIREIV RGDV VT+IA EHN+ADPFTK LT KVFEGHLESLGLR
Subjt:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR

A0A5A7TZD0 Gag/pol protein2.9e-9070.2Show/hide
Query:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------------------------------------
        MQNSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMY MLCTRPDICYA GI                                       
Subjt:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------------------------------------

Query:  ----------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK
                  T+KDSRKSTSGSVFTLNGGAVVWRSIK GCI D  MEAEYVA+CEAAKEAVWLRKF+  LEVV +MNLP+TL+CDNSGAVANSKE RSHK
Subjt:  ----------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK

Query:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR
        RGKHIERKYHLIREIV RGDV VT+IA EHN+ADPFTK LT KVFEGHLESLGLR
Subjt:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR

A0A5A7UYE8 Gag/pol protein2.9e-9070.2Show/hide
Query:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------------------------------------
        MQNSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMY MLCTRPDICYA GI                                       
Subjt:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------------------------------------

Query:  ----------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK
                  T+KDSRKSTSGSVFTLNGGAVVWRSIK GCI D  MEAEYVA+CEAAKEAVWLRKF+  LEVV +MNLP+TL+CDNSGAVANSKE RSHK
Subjt:  ----------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK

Query:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR
        RGKHIERKYHLIREIV RGDV VT+IA EHN+ADPFTK LT KVFEGHLESLGLR
Subjt:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR

A0A5D3BZ66 Gag/pol protein2.9e-9079.07Show/hide
Query:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGC
        MQNSKK LLPFRHGVHLSK+QCPKT QE+EDMRRIPYAS VGSLMY MLCTRP ICYA  I         T+KDSRKSTSGSVFTLNGG VVWRSIK GC
Subjt:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI---------TEKDSRKSTSGSVFTLNGGAVVWRSIKMGC

Query:  IVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHKRGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVL
        I + +MEAEYVA+CEAAK+A+WLRKF+  LEVV +MNLP TL+CDNSGAVANSKE  SHKRGKHIERKYHLIREIV RGD  VT+IA EHN+ADPFTK L
Subjt:  IVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHKRGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVL

Query:  TDKVFEGHLESLGLR
        T KVFEGHLESLGLR
Subjt:  TDKVFEGHLESLGLR

A0A5D3CNV3 Gag/pol protein7.0e-9275.21Show/hide
Query:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI----------------------------TEKDSRKSTSG
        MQNSKK LLPFRHGVHLSK+QCPKTPQEVEDMRRIPYAS VGSLMY MLCTRPDICYA GI                            T+KDSRKS SG
Subjt:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGI----------------------------TEKDSRKSTSG

Query:  SVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHKRGKHIERKYHLIREIVHRGDV
        S+FTLNGGAVVWRSIK GCI D  MEAEYV +CEAAKEAVWLRKF+  LEVV +MNLP+TL+CDNSGAVANSKE RSHK+GKHIERKYHLIREIV R DV
Subjt:  SVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHKRGKHIERKYHLIREIVHRGDV

Query:  TVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR
         VT+IALEHN+ADPFTK LT KVFEGHLESLGLR
Subjt:  TVTQIALEHNVADPFTKVLTDKVFEGHLESLGLR

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.4e-1727.8Show/hide
Query:  PYASVVGSLMYVMLCTRPDICYANGITEKDS----------------------------------------------------RKSTSGSVFTL-NGGAV
        P  S++G LMY+MLCTRPD+  A  I  + S                                                    RKST+G +F + +   +
Subjt:  PYASVVGSLMYVMLCTRPDICYANGITEKDS----------------------------------------------------RKSTSGSVFTL-NGGAV

Query:  VWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHKRGKHIERKYHLIREIVHRGDVTVTQIALEHN
         W + +   +     EAEY+A  EA +EA+WL+  + ++ +   +  P+ ++ DN G ++ +     HKR KHI+ KYH  RE V    + +  I  E+ 
Subjt:  VWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHKRGKHIERKYHLIREIVHRGDVTVTQIALEHN

Query:  VADPFTKVLTDKVFEGHLESLGL
        +AD FTK L    F    + LGL
Subjt:  VADPFTKVLTDKVFEGHLESLGL

P0CV72 Secreted RxLR effector protein 1616.4e-1031.58Show/hide
Query:  MRRIPYASVVGSLMYVMLCTRPDICYANGITEK--------------------------------------------------DSRKSTSGSVFTLNGGA
        M+ +PY S VG++MY+M+ TRPD+  A G+  +                                                  +SR+STSG +F LNGG 
Subjt:  MRRIPYASVVGSLMYVMLCTRPDICYANGITEK--------------------------------------------------DSRKSTSGSVFTLNGGA

Query:  VVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWL
        V WRS K   +     E EY+A  EA +EAVWL
Subjt:  VVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-3232.68Show/hide
Query:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGITEK------------------------------------
        M+N+K    P    + LSK  CP T +E  +M ++PY+S VGSLMY M+CTRPDI +A G+  +                                    
Subjt:  MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGITEK------------------------------------

Query:  -------------DSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK
                     D+RKS++G +FT +GGA+ W+S    C+     EAEY+A+ E  KE +WL++F+  L + +       ++CD+  A+  SK    H 
Subjt:  -------------DSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHK

Query:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGL
        R KHI+ +YH IRE+V    + V +I+   N AD  TKV+    FE   E +G+
Subjt:  RGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.0e-1136.27Show/hide
Query:  KDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHKRGKHIERKYHLI
        KD+R+ST+G    L    + W+S K   +     EAEY A   A  E +WL +F   L++   ++ P  LFCDN+ A+  +     H+R KHIE   H +
Subjt:  KDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKEAVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHKRGKHIERKYHLI

Query:  RE
        RE
Subjt:  RE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAACTCCAAGAAGTGTTTACTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAGTGTCCTAAGACGCCTCAAGAGGTTGAGGATATGAGACGGATTCCTTA
TGCTTCAGTTGTAGGGAGCCTGATGTACGTCATGTTGTGTACTAGGCCCGACATCTGTTATGCAAATGGGATTACCGAGAAGGATTCTAGGAAGTCCACATCGGGGTCAG
TGTTCACTCTTAACGGAGGAGCTGTAGTGTGGCGAAGCATCAAGATGGGATGCATCGTTGATTTCATGATGGAAGCCGAGTATGTTGCTTCTTGTGAAGCTGCAAAGGAA
GCTGTTTGGCTTAGGAAGTTCATGTTAGCTTTGGAAGTTGTTCGAGATATGAACTTGCCAGTGACGTTGTTTTGTGACAACAGCGGTGCAGTAGCCAACTCGAAAGAACG
TCGAAGTCATAAGAGGGGCAAGCATATTGAACGCAAGTATCACTTGATACGGGAGATTGTGCACCGTGGAGACGTGACCGTCACGCAGATAGCTTTGGAGCACAACGTTG
CTGATCCATTTACAAAGGTCCTTACAGATAAGGTGTTTGAGGGTCACCTAGAGAGTCTAGGTCTTCGAGTTCTTCCTGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGAACTCCAAGAAGTGTTTACTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAGTGTCCTAAGACGCCTCAAGAGGTTGAGGATATGAGACGGATTCCTTA
TGCTTCAGTTGTAGGGAGCCTGATGTACGTCATGTTGTGTACTAGGCCCGACATCTGTTATGCAAATGGGATTACCGAGAAGGATTCTAGGAAGTCCACATCGGGGTCAG
TGTTCACTCTTAACGGAGGAGCTGTAGTGTGGCGAAGCATCAAGATGGGATGCATCGTTGATTTCATGATGGAAGCCGAGTATGTTGCTTCTTGTGAAGCTGCAAAGGAA
GCTGTTTGGCTTAGGAAGTTCATGTTAGCTTTGGAAGTTGTTCGAGATATGAACTTGCCAGTGACGTTGTTTTGTGACAACAGCGGTGCAGTAGCCAACTCGAAAGAACG
TCGAAGTCATAAGAGGGGCAAGCATATTGAACGCAAGTATCACTTGATACGGGAGATTGTGCACCGTGGAGACGTGACCGTCACGCAGATAGCTTTGGAGCACAACGTTG
CTGATCCATTTACAAAGGTCCTTACAGATAAGGTGTTTGAGGGTCACCTAGAGAGTCTAGGTCTTCGAGTTCTTCCTGACTAG
Protein sequenceShow/hide protein sequence
MQNSKKCLLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYVMLCTRPDICYANGITEKDSRKSTSGSVFTLNGGAVVWRSIKMGCIVDFMMEAEYVASCEAAKE
AVWLRKFMLALEVVRDMNLPVTLFCDNSGAVANSKERRSHKRGKHIERKYHLIREIVHRGDVTVTQIALEHNVADPFTKVLTDKVFEGHLESLGLRVLPD