CuGenDBv2

Lag0035786 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035786
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr3:30179136..30179984
RNA-Seq Expression: Lag0035786
Synteny: Lag0035786
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits
XP_023888364.1 uncharacterized protein LOC112000452 [Quercus suber] (e-value: 3.8e-47; identity: 36.11%)
Query:  VPSKGKSGGLMLLW-KDSWLKIESYLEGHIDGMVKSDL-GWWRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQI
        VPS+G SGGL L W K+  + I++Y   HID  V   + GWW  T FYGNP T +R ESW L+K LS    LPW++  DFNE++ ++EK GG +RP +Q+
Subjt:  VPSKGKSGGLMLLW-KDSWLKIESYLEGHIDGMVKSDL-GWWRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQI

Query:  NAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFED
          F++V+D CGLKDLGF+G ++TW   + + ++I+ERLDR LA S++ S F   +  HL   +SDH P+   F    K    R   +P+RFE  W   E 
Subjt:  NAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFED

Query:  CKDIIGNHWRYTRRATIDSFQRKVTDCIEQ----LKNWSKNRLKGSIKSAMVKKEQEIQTLERSSDRGAV---VKKVELELDSLLDEE
        C +++   W       + +    +  C+E     L +W+K   +  +   + + ++ ++ LE      ++   ++K  +EL+  LD+E
Subjt:  CKDIIGNHWRYTRRATIDSFQRKVTDCIEQ----LKNWSKNRLKGSIKSAMVKKEQEIQTLERSSDRGAV---VKKVELELDSLLDEE

XP_023889179.1 uncharacterized protein LOC112001232 isoform X1 [Quercus suber] (e-value: 3.3e-51; identity: 42.11%)
Query:  VPSKGKSGGLMLLW-KDSWLKIESYLEGHIDGMV-KSDLGW-WRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQ
        VPS+G  GGL LLW +D+ L+I+S+ + HID ++ +S  G+ WRFTGFYG+P TH R ESWKL+  L+  FN+ W    DFNE+L +NEK GGV RPQ+Q
Subjt:  VPSKGKSGGLMLLW-KDSWLKIESYLEGHIDGMV-KSDLGW-WRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQ

Query:  INAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFE
        +++FR+VV+ CG KDLG+ G  YTW   +   N I  RLDR LA SE+   F+D  + HL   +SDH  +L T   S  F      RR   FE  W   +
Subjt:  INAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFE

Query:  DCKDIIGNHWRY-TRRATIDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQ--TLERSSDRGAVVKKVELELDSLLDEE
        DC+++I   W   T  AT +     +  C   L NW++N + G+I+  + +K++ +   T+E S   GA + ++  E++ LLD E
Subjt:  DCKDIIGNHWRY-TRRATIDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQ--TLERSSDRGAVVKKVELELDSLLDEE

XP_030922765.1 uncharacterized protein LOC115949628 [Quercus lobata] (e-value: 1.4e-49; identity: 39.37%)
Query:  MFCVPSKGKSGGLMLLWK-DSWLKIESYLEGHIDGMV-KSDLGWWRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQ
        M  V    + GG+ + WK +    +++Y   HID +V K     WRFTGFYG P T+ R ESW  ++ L   +++PW+   DFNE+   +EK GG  RP 
Subjt:  MFCVPSKGKSGGLMLLWK-DSWLKIESYLEGHIDGMV-KSDLGWWRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQ

Query:  AQINAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAA
         Q+ AFREV+DECG KDLGFVGSKYTW+R  G +N I ERLDR +A +++  LF  T++ HL   SSDHKPI+         G  +  ++P RFE+ W  
Subjt:  AQINAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAA

Query:  FEDCKDIIGNHW-RYTRRATIDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQTLERSSDRGAV--VKKVELELDSLLDEE
           CK+I+ + W R+     +D  + K+ +C ++L  W+++      KS   KKEQ  +  E +   G +  V K++ E++ LL +E
Subjt:  FEDCKDIIGNHW-RYTRRATIDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQTLERSSDRGAV--VKKVELELDSLLDEE

XP_030922943.1 uncharacterized protein LOC115949807 [Quercus lobata] (e-value: 2.2e-47; identity: 39.79%)
Query:  VPSKGKSGGLMLLW-KDSWLKIESYLEGHIDGMV-KSDLGW-WRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQ
        VPS+G+SGGL LLW  D+ L+I+SY   HID ++ ++D G  WRFT FYG   TH R+ESWKL+  L+  FNLPW    DFNE+L++ EK GG  R Q+Q
Subjt:  VPSKGKSGGLMLLW-KDSWLKIESYLEGHIDGMV-KSDLGW-WRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQ

Query:  INAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIR---FEEGWA
        ++ FR +V++CG KDLG+ G  YTW   +   N I  RLDR  AN+E+   F+   + HL   +SDH  +  T +L         TR+ IR   F+  W 
Subjt:  INAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIR---FEEGWA

Query:  AFEDCKDIIGNHWRY-TRRATIDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQ---EIQTLERSSDRGAVVKKVELELDSLLDEE
          EDC ++I   W+  +  AT +     +  C   L +W++  + GSI   + +K +    I T ++  DRGA + ++  E++ LLD E
Subjt:  AFEDCKDIIGNHWRY-TRRATIDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQ---EIQTLERSSDRGAVVKKVELELDSLLDEE

XP_030967653.1 uncharacterized protein LOC115988147 [Quercus lobata] (e-value: 4.1e-49; identity: 41.61%)
Query:  VPSKGKSGGLMLLW-KDSWLKIESYLEGHIDGMV-KSDLGW-WRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQ
        VPS+G+ GGL LLW  D+ L+I+SY + HID ++ +S  G+ WRFTGFYG+P TH R++SWKL+  L+  FN PW    DFNE+L++ EK GG  R Q Q
Subjt:  VPSKGKSGGLMLLW-KDSWLKIESYLEGHIDGMV-KSDLGW-WRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQ

Query:  INAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFE
        ++ FR+VV+ CG KDLGF G  +TW   +   + I  RLDR  ANSE+ + F+D  + HL + +SDH  IL T + SR    N+  RR   FE  W   +
Subjt:  INAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFE

Query:  DCKDIIGNHWRY-TRRATIDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQTLERSSDRG---AVVKKVELELDSLLDEE
        DC+++I   W   T   T D     +  C   L  W++N + G+I   + +K++ + +L    DRG   A V ++  E++ LLD E
Subjt:  DCKDIIGNHWRY-TRRATIDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQTLERSSDRG---AVVKKVELELDSLLDEE

TrEMBL top hits
A0A2N9H6V4 RNase H domain-containing protein (e-value: 8.3e-48; identity: 38.89%)
Query:  FCVPSKGKSGGLMLLWK-DSWLKIESYLEGHIDGMVKSDLGW-WRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQA
        F VPS+G+SGGL + W+ +  + I SY   HID ++  D    WRFTGFYG+P    +  +W L++ L     LPW+ G DFNE+L   EK G VAR  +
Subjt:  FCVPSKGKSGGLMLLWK-DSWLKIESYLEGHIDGMVKSDLGW-WRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQA

Query:  QINAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAF
        Q+ AFR VVDECG  DLGFVGS YTW+  +     + ERLDR LA +++   F  + + HL    SDH+P+    ++S     +R +R+  RFEE W   
Subjt:  QINAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAF

Query:  EDCKDIIGNHWRYTRRATIDSFQ--RKVTDCIEQLKNWSKNRLKGSIKSAMVKKE---QEIQTLERSSDRGAVVKKVELELDSLLDEE
        + C+D I   W    R T   FQ   K+  C E LK WS  +  GSI++A+  K    Q+ + L   +    +++++  EL  L  +E
Subjt:  EDCKDIIGNHWRYTRRATIDSFQ--RKVTDCIEQLKNWSKNRLKGSIKSAMVKKE---QEIQTLERSSDRGAVVKKVELELDSLLDEE

A0A2N9HKV4 Uncharacterized protein (e-value: 4.5e-46; identity: 37.06%)
Query:  VPSKGKSGGLMLLWK-DSWLKIESYLEGHIDGMVK--SDLGWWRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQ
        VP + + GGL+L WK D  + I+S+   HID ++   ++L  WRFTGFYG P T  R  SW +++ L   F+LPW    DFNE+++  EK+GG  RP AQ
Subjt:  VPSKGKSGGLMLLWK-DSWLKIESYLEGHIDGMVK--SDLGWWRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQ

Query:  INAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFE
        + AFR V+D+CG +DLGF G ++TW  +      I  RLDR++ N+E+   F+D+ + H+P   SDH P+     LS    G+   R+  RFE  W   E
Subjt:  INAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFE

Query:  DCKDIIGNHWRYTRRAT-IDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQTLERSSDRG---AVVKKVELELDSLLDEE
         C+  + + WR     + +     +V DC  +L+ WS+N   GS++ A+ +K ++++  E  S  G   + V  +  EL  LL+ E
Subjt:  DCKDIIGNHWRYTRRAT-IDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQTLERSSDRG---AVVKKVELELDSLLDEE

A0A2N9I921 Reverse transcriptase domain-containing protein (e-value: 9.8e-49; identity: 37.19%)
Query:  VPSKGKSGGLMLLWKDSW-LKIESYLEGHIDGMVKSDLGW-WRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQI
        VP + K GGL L WK +  L+I SY   HID +V +  G  WRFT FYG P +H+R+ SW L++ L   F+LPW  G DFNE++ + EK+G +++P++Q+
Subjt:  VPSKGKSGGLMLLWKDSW-LKIESYLEGHIDGMVKSDLGW-WRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQI

Query:  NAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFED
         +FRE +D+CG  DLG++G+ +TW  +      + ERLDR +A++ + S F    + HL    SDHKP+     LS     NR   +P RFEE W     
Subjt:  NAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFED

Query:  CKDIIGNHWRYTRRA-TIDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQTLERSSDRGA---VVKKVELELDSLLDEE
        C + I   W+      ++     K+  C  QLKNWSK+   GS++  + +K +E++  E  S +G    ++  +  E+  LL +E
Subjt:  CKDIIGNHWRYTRRA-TIDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQTLERSSDRGA---VVKKVELELDSLLDEE

A0A2N9J109 Uncharacterized protein (e-value: 3.5e-46; identity: 36.84%)
Query:  VPSKGKSGGLMLLWK-DSWLKIESYLEGHIDGMV-KSDLGWWRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQI
        VP + K GGL L WK D  L+I SY   HID +V  +    W FTGFYG P THKR+ESW L++ L   ++LPW  G DFNE++ + EK+G +++P +Q+
Subjt:  VPSKGKSGGLMLLWK-DSWLKIESYLEGHIDGMV-KSDLGWWRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQI

Query:  NAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFED
          FR+ +D CG  DLG++G+ +TW  +      + ERLD+ +A SE+ ++F    + HL    SDHKP+     LS K   N    +P  FEE W +   
Subjt:  NAFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFED

Query:  CKDIIGNHWRYTRRAT-IDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQTLERSSDRG---AVVKKVELELDSLLDEE
        C + I N W+ +     +     K+  C + LK WSK+   GSI+  +  K +E++  E ++ +G     +  +  E+  LL +E
Subjt:  CKDIIGNHWRYTRRAT-IDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQTLERSSDRG---AVVKKVELELDSLLDEE

A0A6J1DUG8 uncharacterized protein LOC111024135 (e-value: 4.1e-47; identity: 40.31%)
Query:  VPSKGKSGGLMLLW-KDSWLKIESYLEGHIDGMVKSDLGWWRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQIN
        V S GKSGGLMLLW  DS ++I+S   GHID ++    G WRFTGFYGNP T+KR  SWKL++ L+   +LPWIIG DFNE++++ EK GGV R ++Q+ 
Subjt:  VPSKGKSGGLMLLW-KDSWLKIESYLEGHIDGMVKSDLGWWRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQIN

Query:  AFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFEDC
                C                       I ERLDR+L N    +   + ++ HL   SSDH+PILA+++           +R IRFEE W   + C
Subjt:  AFREVVDECGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFEDC

Query:  KDIIGNHWRYTRRATIDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQTLER
        +DII   W       I++FQ K+  C+ +L  W+K RL  S+K A+  KE+E++ L +
Subjt:  KDIIGNHWRYTRRATIDSFQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQTLER

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTTCTGCGTTCCTAGCAAGGGCAAGAGTGGAGGGCTCATGCTTCTCTGGAAGGATTCATGGTTGAAAATAGAATCCTATTTAGAGGGGCATATCGATGGGATGGTTAA
AAGTGATTTGGGATGGTGGAGGTTTACTGGTTTTTATGGGAACCCGATTACTCACAAACGCAAGGAATCGTGGAAGCTTATCAAATGTCTGTCGATGTGCTTCAATCTCC
CGTGGATTATTGGGAGGGATTTTAATGAAGTGTTGGCTGTCAATGAGAAGAGGGGAGGGGTAGCCAGACCTCAAGCTCAAATTAACGCGTTCAGGGAGGTGGTGGATGAA
TGCGGCTTGAAAGATCTGGGGTTTGTGGGCAGTAAATACACATGGTTCAGATCAGAGGGGGAAGATAATGAGATTAAAGAAAGGCTCGATCGTTACCTAGCAAACTCTGA
GTTCACAAGTTTGTTTAGAGATACTGAAATCAAACACCTCCCCAAACATAGCTCAGATCATAAACCAATCCTTGCCACGTTCAATCTGTCCAGGAAATTTGGGGGCAATC
GGACTACTAGGAGACCAATCCGATTTGAAGAAGGGTGGGCTGCTTTTGAAGATTGCAAAGACATTATCGGCAACCATTGGAGGTACACTAGAAGGGCTACTATTGACTCG
TTTCAAAGGAAAGTCACTGATTGTATTGAGCAATTGAAGAATTGGAGTAAAAACAGATTGAAAGGATCGATCAAGTCGGCCATGGTGAAGAAAGAGCAAGAGATTCAAAC
CCTTGAGAGGAGTTCAGATAGGGGGGCAGTAGTTAAAAAAGTTGAGTTGGAGCTCGATAGTCTTCTTGATGAAGAATAA
mRNA sequence
ATGTTCTGCGTTCCTAGCAAGGGCAAGAGTGGAGGGCTCATGCTTCTCTGGAAGGATTCATGGTTGAAAATAGAATCCTATTTAGAGGGGCATATCGATGGGATGGTTAA
AAGTGATTTGGGATGGTGGAGGTTTACTGGTTTTTATGGGAACCCGATTACTCACAAACGCAAGGAATCGTGGAAGCTTATCAAATGTCTGTCGATGTGCTTCAATCTCC
CGTGGATTATTGGGAGGGATTTTAATGAAGTGTTGGCTGTCAATGAGAAGAGGGGAGGGGTAGCCAGACCTCAAGCTCAAATTAACGCGTTCAGGGAGGTGGTGGATGAA
TGCGGCTTGAAAGATCTGGGGTTTGTGGGCAGTAAATACACATGGTTCAGATCAGAGGGGGAAGATAATGAGATTAAAGAAAGGCTCGATCGTTACCTAGCAAACTCTGA
GTTCACAAGTTTGTTTAGAGATACTGAAATCAAACACCTCCCCAAACATAGCTCAGATCATAAACCAATCCTTGCCACGTTCAATCTGTCCAGGAAATTTGGGGGCAATC
GGACTACTAGGAGACCAATCCGATTTGAAGAAGGGTGGGCTGCTTTTGAAGATTGCAAAGACATTATCGGCAACCATTGGAGGTACACTAGAAGGGCTACTATTGACTCG
TTTCAAAGGAAAGTCACTGATTGTATTGAGCAATTGAAGAATTGGAGTAAAAACAGATTGAAAGGATCGATCAAGTCGGCCATGGTGAAGAAAGAGCAAGAGATTCAAAC
CCTTGAGAGGAGTTCAGATAGGGGGGCAGTAGTTAAAAAAGTTGAGTTGGAGCTCGATAGTCTTCTTGATGAAGAATAA
Protein sequence
MFCVPSKGKSGGLMLLWKDSWLKIESYLEGHIDGMVKSDLGWWRFTGFYGNPITHKRKESWKLIKCLSMCFNLPWIIGRDFNEVLAVNEKRGGVARPQAQINAFREVVDE
CGLKDLGFVGSKYTWFRSEGEDNEIKERLDRYLANSEFTSLFRDTEIKHLPKHSSDHKPILATFNLSRKFGGNRTTRRPIRFEEGWAAFEDCKDIIGNHWRYTRRATIDS
FQRKVTDCIEQLKNWSKNRLKGSIKSAMVKKEQEIQTLERSSDRGAVVKKVELELDSLLDEE
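
The record's coordinates and sequences can be cross-checked with a few lines of code. The following is a minimal sketch, not part of the database record. It assumes the genome location uses 1-based, inclusive coordinates and that the CDS is translated with the standard genetic code. The helper names `span_length` and `translate` are hypothetical and chosen only for illustration.

```python
import re

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def span_length(location: str) -> int:
    """Length of a 'chrom:start..end' span, assuming 1-based inclusive coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # Genome location as given in the record.
    print(span_length("chr3:30179136..30179984"))       # 849
    # First 30 nucleotides of the CDS listed above.
    print(translate("ATGTTCTGCGTTCCTAGCAAGGGCAAGAGT"))  # MFCVPSKGKS
    # Last 12 nucleotides of the CDS, ending in the TAA stop codon.
    print(translate("GATGAAGAATAA"))                    # DEE
```

Under these assumptions the span is 849 nt, that is 283 codons, which is consistent with the 282-residue protein sequence plus one stop codon.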