CuGenDBv2

Lcy06g012910 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy06g012910
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr06:17836408..17837311
RNA-Seq Expression: Lcy06g012910
Synteny: Lcy06g012910
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
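
The genome location listed above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span can be parsed and its length computed as shown here. `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a string such as 'Chr06:17836408..17837311' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


chrom, start, end = parse_location("Chr06:17836408..17837311")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 904 bp on Chr06.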


Homology
GenBank top hits (e value, % identity, alignment)
CAB4274760.1 unnamed protein product [Prunus armeniaca]; e value: 5.4e-17; % identity: 23.76
Query:  ETTSHLFWGCKVTKGLWLKYFHPTDLVYFIGREDLAPMDYLHGIWKISNSGVREKDKMCKSLILCWQIWSYRNQIHHRNQHPD----IKFLQSQVDKY--
        ET +H+F+ C   +  W       D+    G + ++   ++       NS     + +   +   W+IW  RN         D    +  L  QV ++  
Subjt:  ETTSHLFWGCKVTKGLWLKYFHPTDLVYFIGREDLAPMDYLHGIWKISNSGVREKDKMCKSLILCWQIWSYRNQIHHRNQHPD----IKFLQSQVDKY--

Query:  ----MEELLERTV----------WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGL-KALSIESPPVQIE
            +++L  + +          W +P  G+ K+N DAAWS +   GG+GW +  S G LL AG +        + +E LA+   L +  +     + +E
Subjt:  ----MEELLERTV----------WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGL-KALSIESPPVQIE

Query:  TDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLARKASLSGCSNNWTHSFPDWLLNLNVGDL
        +D+ + + +LNGR   +++++  + + + L+            ++ NQ AH++A  AS  G ++ W H  PDWL N    D+
Subjt:  TDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLARKASLSGCSNNWTHSFPDWLLNLNVGDL

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]; e value: 2.0e-24; % identity: 31.19
Query:  KSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEELLERTV------------------------WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQ
        +S+I+ WQIW  RN+   +  HP+ + +Q  +D+Y+     R                          W  P S  WKLN++AAW  + N GGIGW L  
Subjt:  KSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEELLERTV------------------------WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQ

Query:  SDGSLLSAGFRAISRRWKILRLESLAVCEGLKALSIE-SPPVQIETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLAR
          G ++ A  R I     I  LE +A+CEGL+A+  E   P+ +E+D+L+ +HLL+ + +D+TE+   ++E   ++ + ++  +  +++  N++AH LAR
Subjt:  SDGSLLSAGFRAISRRWKILRLESLAVCEGLKALSIE-SPPVQIETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLAR

Query:  KA
        +A
Subjt:  KA

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]; e value: 7.8e-24; % identity: 30.26
Query:  ERRETTSHLFWGCKVTKGLWLKYFHPTDLVYFIGREDLAPMDYLHGIWKISNSGVREKDKMCKSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEE
        ++ ETT H+ W CKV K +W+         ++I R +    +Y    W +  +G  E+    +S+I+  QIW  RN+   +  H + + +Q  +D+Y+  
Subjt:  ERRETTSHLFWGCKVTKGLWLKYFHPTDLVYFIGREDLAPMDYLHGIWKISNSGVREKDKMCKSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEE

Query:  LL----------------------ERTVWVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGLKALSIE-SP
                                 R  W  P S  WKLN+DAAW  + N  GIGW L    G ++  G R I     I  LE +A+CEGL+A+  E   
Subjt:  LL----------------------ERTVWVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGLKALSIE-SP

Query:  PVQIETDALQVVHLLNGREEDETEVKLF
        P+ +E+D+L+ +HLL+     + ++KLF
Subjt:  PVQIETDALQVVHLLNGREEDETEVKLF

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]; e value: 5.0e-23; % identity: 33.49
Query:  LILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEE----------LLERTV-----WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFR
        LI  W IW++RN +  R +H     +  Q+ K++ E          +L +T+     W  PP  +W LN+DA+WS+  +RGGIGW +   DG ++ AG R
Subjt:  LILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEE----------LLERTV-----WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFR

Query:  AISRRWKILRLESLAVCEGLKALSIES--PPVQIETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLARKASLSGCSNN
         +     +  LE+ A+ EGL+ L+      P+ IETD+ +V  LLN + ED T+    ++E  +L    ++     V +  N  AH+LA++AS+   S  
Subjt:  AISRRWKILRLESLAVCEGLKALSIES--PPVQIETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLARKASLSGCSNN

Query:  WTHSFPDWL
        W   FP+WL
Subjt:  WTHSFPDWL

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]; e value: 2.3e-23; % identity: 30.48
Query:  KSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEELLERTV------------------------WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQ
        +S+I+ WQIW  RN+   +  H + + +Q  +D+Y+     R                          W  P S  WKLN+DAAW  + N GGIGW L  
Subjt:  KSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEELLERTV------------------------WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQ

Query:  SDGSLLSAGFRAISRRWKILRLESLAVCEGLKALSIE---------SPPVQIETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLN
          G ++ A  R I     I  LE +A+CEGL+A+  E           P+ +E+D+L+ +HLL+ + +D+TE+   ++E   ++ + K+  +  +++  N
Subjt:  SDGSLLSAGFRAISRRWKILRLESLAVCEGLKALSIE---------SPPVQIETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLN

Query:  QLAHNLARKA
        ++AH+LAR+A
Subjt:  QLAHNLARKA

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CP26 uncharacterized protein LOC111013412; e value: 9.9e-25; % identity: 31.19
Query:  KSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEELLERTV------------------------WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQ
        +S+I+ WQIW  RN+   +  HP+ + +Q  +D+Y+     R                          W  P S  WKLN++AAW  + N GGIGW L  
Subjt:  KSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEELLERTV------------------------WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQ

Query:  SDGSLLSAGFRAISRRWKILRLESLAVCEGLKALSIE-SPPVQIETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLAR
          G ++ A  R I     I  LE +A+CEGL+A+  E   P+ +E+D+L+ +HLL+ + +D+TE+   ++E   ++ + ++  +  +++  N++AH LAR
Subjt:  SDGSLLSAGFRAISRRWKILRLESLAVCEGLKALSIE-SPPVQIETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLAR

Query:  KA
        +A
Subjt:  KA

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1; e value: 3.8e-24; % identity: 30.26
Query:  ERRETTSHLFWGCKVTKGLWLKYFHPTDLVYFIGREDLAPMDYLHGIWKISNSGVREKDKMCKSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEE
        ++ ETT H+ W CKV K +W+         ++I R +    +Y    W +  +G  E+    +S+I+  QIW  RN+   +  H + + +Q  +D+Y+  
Subjt:  ERRETTSHLFWGCKVTKGLWLKYFHPTDLVYFIGREDLAPMDYLHGIWKISNSGVREKDKMCKSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEE

Query:  LL----------------------ERTVWVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGLKALSIE-SP
                                 R  W  P S  WKLN+DAAW  + N  GIGW L    G ++  G R I     I  LE +A+CEGL+A+  E   
Subjt:  LL----------------------ERTVWVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGLKALSIE-SP

Query:  PVQIETDALQVVHLLNGREEDETEVKLF
        P+ +E+D+L+ +HLL+     + ++KLF
Subjt:  PVQIETDALQVVHLLNGREEDETEVKLF

A0A6J1DNV9 uncharacterized protein LOC111022403; e value: 2.4e-23; % identity: 33.49
Query:  LILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEE----------LLERTV-----WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFR
        LI  W IW++RN +  R +H     +  Q+ K++ E          +L +T+     W  PP  +W LN+DA+WS+  +RGGIGW +   DG ++ AG R
Subjt:  LILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEE----------LLERTV-----WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFR

Query:  AISRRWKILRLESLAVCEGLKALSIES--PPVQIETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLARKASLSGCSNN
         +     +  LE+ A+ EGL+ L+      P+ IETD+ +V  LLN + ED T+    ++E  +L    ++     V +  N  AH+LA++AS+   S  
Subjt:  AISRRWKILRLESLAVCEGLKALSIES--PPVQIETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLARKASLSGCSNN

Query:  WTHSFPDWL
        W   FP+WL
Subjt:  WTHSFPDWL

A0A6J1DSV1 uncharacterized protein LOC111023608; e value: 1.1e-23; % identity: 30.48
Query:  KSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEELLERTV------------------------WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQ
        +S+I+ WQIW  RN+   +  H + + +Q  +D+Y+     R                          W  P S  WKLN+DAAW  + N GGIGW L  
Subjt:  KSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEELLERTV------------------------WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQ

Query:  SDGSLLSAGFRAISRRWKILRLESLAVCEGLKALSIE---------SPPVQIETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLN
          G ++ A  R I     I  LE +A+CEGL+A+  E           P+ +E+D+L+ +HLL+ + +D+TE+   ++E   ++ + K+  +  +++  N
Subjt:  SDGSLLSAGFRAISRRWKILRLESLAVCEGLKALSIE---------SPPVQIETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLN

Query:  QLAHNLARKA
        ++AH+LAR+A
Subjt:  QLAHNLARKA

A0A803PWX1 Uncharacterized protein; e value: 4.0e-18; % identity: 26.89
Query:  ETTSHLFWGCKVTKGLWLKYFHPTDLVYFIGR-EDLAPMDYLHGIWKISNSGVREKDKMCKSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEELL
        E   H  WGC+V + +W       +   F GR +     D L  + ++S+  +  KD+    LIL W +W  RN ++H    P    +     K++ E  
Subjt:  ETTSHLFWGCKVTKGLWLKYFHPTDLVYFIGR-EDLAPMDYLHGIWKISNSGVREKDKMCKSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEELL

Query:  ERTV------------WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGLK-ALSIESPPVQIETDALQVV
        E  V            W  P  G +K+N DA      N  G+   +   +G ++ A  R + ++   L+ E  A+  G++  +    P   +E+D LQ V
Subjt:  ERTV------------WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGLK-ALSIESPPVQIETDALQVV

Query:  HLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLARKASLSGCSNNWTHSFP
        +L+   EE   +V   I + K L+   +V GV+ V +  NQ+AH LA  A ++  S  W    P
Subjt:  HLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLARKASLSGCSNNWTHSFP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G52990.1 thioredoxin family protein; e value: 9.5e-04; % identity: 26.92
Query:  PSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGLKALS-IESPPVQIETDALQVVHLLNGREEDETEVKLFIDET
        PS   K N DA+  E     G+GW +  S G++L  G      R      E  A+   ++A S      V  E D   V  L+N  + D   +K ++D  
Subjt:  PSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGLKALS-IESPPVQIETDALQVVHLLNGREEDETEVKLFIDET

Query:  KSLIFERKVEGVYLVNKNLNQLAHNLARKASLSGCSNNWTHSFPDWLLNLNVGDLG
        KS I           ++  NQ A  L +KA  S    +  +  P +L  + +  LG
Subjt:  KSLIFERKVEGVYLVNKNLNQLAHNLARKASLSGCSNNWTHSFPDWLLNLNVGDLG

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e value: 1.8e-10; % identity: 23.47
Query:  ERRETTSHLFWGCKVTKGLWLKYFHPTDLVYFIGREDLAPMDYLHGIWKISNSGVREKDKMCKSLI--LCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYM
        + RET +HL + C   + +W     P          +     Y +  W ++      K     +L+  L W++W  RN++  + +  D   +  +  +  
Subjt:  ERRETTSHLFWGCKVTKGLWLKYFHPTDLVYFIGREDLAPMDYLHGIWKISNSGVREKDKMCKSLI--LCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYM

Query:  EE--------------LLERTV---WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGLKALS-IESPPVQ
        EE               +ER +   W  PP    K N+DA W  E  R GIGW L    G +L  G RA+ R   +L  E  A+   +  +S      + 
Subjt:  EE--------------LLERTV---WVRPPSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGLKALS-IESPPVQ

Query:  IETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLARKA-SLSGCSNNWTHSFPDWL
         E+DA  +V+LLN  +   T ++  +++ + L+   +        +  N++A  +AR++ S S          P WL
Subjt:  IETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLARKA-SLSGCSNNWTHSFPDWL

AT4G29090.1 Ribonuclease H-like superfamily protein; e value: 2.1e-11; % identity: 23.64
Query:  RETTSHLFWGCKVTKGLWLKYFHPTDLVYFIGREDLAPMDYLHGIWKISNSGVREKDKMCKSLI--LCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEE
        +ET +HL + C   +  W     P  L       + A   Y++  W  +      + +    L+  L W++W  RN++  R +  + + +  + +  +EE
Subjt:  RETTSHLFWGCKVTKGLWLKYFHPTDLVYFIGREDLAPMDYLHGIWKISNSGVREKDKMCKSLI--LCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEE

Query:  LLERTV-----------------WVRPPSGMW-KLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGLKALS-IESPPVQI
           RT                  W RPP   W K N+DA W+ +  R GIGW L    G +   G RA+ +   +L  E  A+   + +LS  +   V  
Subjt:  LLERTV-----------------WVRPPSGMW-KLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGLKALS-IESPPVQI

Query:  ETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLARKA
        E+D+  ++ +LN  +E    +K  I + + L+ +        + +  N LA  +AR++
Subjt:  ETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEGVYLVNKNLNQLAHNLARKA


Sequences
CDS sequence
GGAAAGGAGGGAGACGACATCTCACCTGTTTTGGGGTTGCAAAGTTACAAAAGGTCTGTGGCTGAAATATTTTCACCCTACTGATTTAGTCTATTTCATTGGTAGGGAAG
ATCTGGCGCCTATGGATTATCTACATGGTATTTGGAAGATCTCAAACTCGGGAGTGAGGGAGAAGGACAAGATGTGCAAGAGCCTAATTCTATGTTGGCAGATATGGTCT
TACAGAAATCAAATCCATCACAGAAATCAACATCCAGATATCAAATTTCTTCAATCTCAAGTTGACAAATACATGGAAGAATTGTTAGAGAGGACAGTTTGGGTCAGACC
TCCAAGTGGGATGTGGAAGCTCAATAGTGACGCGGCCTGGTCTGAAGAGTTTAATCGCGGTGGGATTGGGTGGGCTCTTCTGCAATCGGATGGGTCTCTTCTGTCTGCCG
GATTTCGAGCGATATCAAGGCGATGGAAGATTTTGCGTCTAGAGTCATTGGCGGTTTGCGAAGGTCTCAAAGCCTTGTCGATCGAGTCACCTCCTGTCCAAATCGAAACA
GATGCTCTTCAGGTGGTCCATCTTCTAAACGGACGTGAGGAAGACGAGACAGAAGTGAAGCTCTTCATAGATGAAACTAAATCCCTAATTTTCGAGAGGAAGGTTGAAGG
TGTGTACCTTGTGAACAAAAATCTTAACCAACTGGCTCATAATTTGGCCAGAAAAGCGAGTTTAAGTGGGTGTTCAAATAACTGGACCCACTCCTTTCCTGATTGGCTAC
TTAATCTAAATGTAGGAGATTTAGGTGGTTTTGATACCAATAGTGGGGGTTCTGTCCCACCACCAATGGTTTCCATCCTTTAA
mRNA sequence
GGAAAGGAGGGAGACGACATCTCACCTGTTTTGGGGTTGCAAAGTTACAAAAGGTCTGTGGCTGAAATATTTTCACCCTACTGATTTAGTCTATTTCATTGGTAGGGAAG
ATCTGGCGCCTATGGATTATCTACATGGTATTTGGAAGATCTCAAACTCGGGAGTGAGGGAGAAGGACAAGATGTGCAAGAGCCTAATTCTATGTTGGCAGATATGGTCT
TACAGAAATCAAATCCATCACAGAAATCAACATCCAGATATCAAATTTCTTCAATCTCAAGTTGACAAATACATGGAAGAATTGTTAGAGAGGACAGTTTGGGTCAGACC
TCCAAGTGGGATGTGGAAGCTCAATAGTGACGCGGCCTGGTCTGAAGAGTTTAATCGCGGTGGGATTGGGTGGGCTCTTCTGCAATCGGATGGGTCTCTTCTGTCTGCCG
GATTTCGAGCGATATCAAGGCGATGGAAGATTTTGCGTCTAGAGTCATTGGCGGTTTGCGAAGGTCTCAAAGCCTTGTCGATCGAGTCACCTCCTGTCCAAATCGAAACA
GATGCTCTTCAGGTGGTCCATCTTCTAAACGGACGTGAGGAAGACGAGACAGAAGTGAAGCTCTTCATAGATGAAACTAAATCCCTAATTTTCGAGAGGAAGGTTGAAGG
TGTGTACCTTGTGAACAAAAATCTTAACCAACTGGCTCATAATTTGGCCAGAAAAGCGAGTTTAAGTGGGTGTTCAAATAACTGGACCCACTCCTTTCCTGATTGGCTAC
TTAATCTAAATGTAGGAGATTTAGGTGGTTTTGATACCAATAGTGGGGGTTCTGTCCCACCACCAATGGTTTCCATCCTTTAA
Protein sequence
ERRETTSHLFWGCKVTKGLWLKYFHPTDLVYFIGREDLAPMDYLHGIWKISNSGVREKDKMCKSLILCWQIWSYRNQIHHRNQHPDIKFLQSQVDKYMEELLERTVWVRP
PSGMWKLNSDAAWSEEFNRGGIGWALLQSDGSLLSAGFRAISRRWKILRLESLAVCEGLKALSIESPPVQIETDALQVVHLLNGREEDETEVKLFIDETKSLIFERKVEG
VYLVNKNLNQLAHNLARKASLSGCSNNWTHSFPDWLLNLNVGDLGGFDTNSGGSVPPPMVSIL
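
The protein sequence appears to correspond to the CDS read from its second nucleotide: the leading codons GAA AGG AGG GAG give ERRE, and the codons just before the terminal TAA give VSIL. The sketch below illustrates this on the first and last stretches of the CDS using the standard genetic code. The frame offset of 1 is inferred from the sequences rather than stated on the page, and `translate` with its `offset` parameter is an illustrative helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, offset=0):
    """Translate DNA from `offset`, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(offset, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 28 and last 15 nucleotides of the CDS listed on this page.
cds_start = "GGAAAGGAGGGAGACGACATCTCACCTG"
cds_end = "GTTTCCATCCTTTAA"

print(translate(cds_start, offset=1))  # leading residues, frame offset 1 (assumed)
print(translate(cds_end))              # trailing residues up to the TAA stop
```

The two fragments translate to the first nine and last four residues of the protein sequence shown above (ERRETTSHL and VSIL).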