CuGenDBv2

Lag0033986 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0033986
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of unknown function (DUF789)
Genome location: chr3:3497965..3500976
RNA-Seq Expression: Lag0033986
Synteny: Lag0033986
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
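
The genome location field above packs a sequence name and a coordinate pair into one string. A minimal sketch of splitting it and deriving the span length follows; the function name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str):
    """Split a 'seqid:start..end' string into its parts.

    Assumption (not stated in the record): coordinates are 1-based and
    inclusive, so the span length is end - start + 1.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        # Keep the pair ordered so the length is never negative.
        start, end = end, start
    return seqid, start, end, end - start + 1


print(parse_location("chr3:3497965..3500976"))
# ('chr3', 3497965, 3500976, 3012)
```

Under that assumption the gene spans 3012 bp on chr3.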


Homology
GenBank top hits (e value, % identity, alignment)
KAG6589615.1 hypothetical protein SDJN03_15038, partial [Cucurbita argyrosperma subsp. sororia] (e value 4.5e-203, 87.03% identity)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD
        MLGTALQFGGIKGEDRFYIPVRARK+YNQQKPSRRP KTDETE+PSS+    VVASTT PSKPL PQ+KSNLERFLDAT+PSV AQ FSKTTMRGW TCD
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE AA+RSDSKSRLA+EDSDLDSS+DTSS+GSI+YEFGKSCNLSREQW+H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH

Query:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT
         HLACE+ + +RKTSL DEHS  +EGFSSDDGDA  PRS LLFQFLEQDLPYQRVPLADKIFDLAYQ+PGLKTLRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRG GHGQAP MIYPND DGI KVSLPVF LASYKLKGSIWAQN V EH+MANSLMQAA+KWLR  QVNQPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW

Query:  K
        +
Subjt:  K

XP_022134722.1 uncharacterized protein LOC111006925 [Momordica charantia] (e value 1.4e-207, 88.53% identity)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD
        M GTALQFGGIKGEDRFYIPV+ARKNYNQQKPSRRP K+DETESPSSD+KTKVVASTTKPSKPL PQ KSNLERFLDAT PSVPAQ FSKTTMRGW TCD
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH
        IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE +AMRSDSK RLA EDSDLDSSRDTSSDGSIEYE GK+  +SREQW+H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH

Query:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT
        D L CEN +K+R  S+ DEH M++EGFSSDDGDAGNPRSVLLFQF EQDLPYQRVPLADKIFDLAYQYPGLK+LRSCDIQPASWVSVAWYPIYRIPTGPT
Subjt:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRG GHGQAPVMIYPND+DG+ KVSLPVF LASYKLKGSIWAQNGV EH+MANSLMQAAD WLR  QV+QPDFQFFASHGTYW
Subjt:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW

Query:  K
        +
Subjt:  K

XP_022921943.1 uncharacterized protein LOC111430050 [Cucurbita moschata] (e value 5.3e-204, 87.28% identity)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD
        MLGTALQFGGIKGEDRFYIPVRARK+YNQQKPSRRP KTDETE+PSS+    VVASTT PSKPL PQ+KSNLERFLDAT+PSVPAQ FSKTTMRGW TCD
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE AA+RSDSKSRLA+EDSDLDSS+DTSS+GSI+YEFGKSCNLSREQW+H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH

Query:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT
         HLACE+ + +RKTSL DEHS  +EGFSSDDGDA  PRS LLFQFLEQDLPYQRVPLADKIFDLAYQ+PGLKTLRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRG GHGQAP MIYPND DGI KVSLPVF LASYKLKGSIWAQN V EH+MANSLMQAA+KWLR  QVNQPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW

Query:  K
        +
Subjt:  K

XP_022987436.1 uncharacterized protein LOC111484983 [Cucurbita maxima] (e value 2.9e-202, 87.03% identity)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD
        MLGTALQFGGIKGEDRFYIPVRARK YNQQKPSRRP KTDETE+PSS    KVVASTT PSKPL PQ+KSNLERFLDAT+PSVPAQ FSKTTMRGW TCD
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE AA+RSDSKSRLA+EDSDLDSSRDTSS+GSI+YEFGKSCNLSREQW+H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH

Query:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT
         HLAC++ + +RKTSL DEHS  +EGFSSDDGDA  PRS LLFQFLEQDLPYQRVPLADKIFDLAYQ+PGLKTLRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRG GHGQAP MIYPND DGI KVSLPVF LASYKLKGSIWAQN V E++M NSLMQAA+KWLR  QVNQPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW

Query:  K
        +
Subjt:  K

XP_023516127.1 uncharacterized protein LOC111780081 [Cucurbita pepo subsp. pepo] (e value 7.0e-204, 87.53% identity)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD
        MLGTALQFGGIKGEDRFYIPVRARK+YNQQKPSRRP KTDETE+PSS    KVVASTT PSKPL PQ+KSNLERFLDAT+PSVPAQ FSKTTMRGW TCD
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE AA+RSDSKSRLA+EDSDLDSSRDTSS+GSI+YEFGKSCNLSREQW+H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH

Query:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT
         HLACE+ + +RKTSL DEHS  +EGFSSDDGDA  PRS LLFQFLEQDLPYQRVPLADKIFDLAYQ+PGLKTLRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRG GHGQAP MIYPND DGI KVSLPVF LASYKLKGSIWAQN V EH+MANSLMQAA+KWLR  QVNQPDFQFFAS+ TYW
Subjt:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW

Query:  K
        +
Subjt:  K

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BX10 uncharacterized protein LOC103494138 isoform X1 (e value 2.1e-193, 83.04% identity)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD
        MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRP KTDETES SS    KVV  TTKP + L PQ+KSNLERFL+ATRPSVPAQ FSKTTMR W TCD
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH
        IEFQPYFILNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLSGIQIYGE AA+RSDS  RLA EDSDLDSSRDTSSDGSI+Y+ GKS NLSREQW H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH

Query:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT
         HLACEN+ K+RKTSL+DE  M++EGF SDDGDAG PRS LLFQFLEQDLPYQRVPLADKIF+LAYQ+PGLKTLRSCDI PASWVSVAWYPIYRIPTGPT
Subjt:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTP +G  H   PVM+YP DID I K+SLPVF +ASYKLKGSIW QNG+++H+ ANSLMQAADKWLR  QV+QPDFQFF+SHGTYW
Subjt:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW

Query:  K
        +
Subjt:  K

A0A5A7USF1 Uncharacterized protein (e value 2.1e-193, 83.04% identity)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD
        MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRP KTDETES SS    KVV  TTKP + L PQ+KSNLERFL+ATRPSVPAQ FSKTTMR W TCD
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH
        IEFQPYFILNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLSGIQIYGE AA+RSDS  RLA EDSDLDSSRDTSSDGSI+Y+ GKS NLSREQW H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH

Query:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT
         HLACEN+ K+RKTSL+DE  M++EGF SDDGDAG PRS LLFQFLEQDLPYQRVPLADKIF+LAYQ+PGLKTLRSCDI PASWVSVAWYPIYRIPTGPT
Subjt:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTP +G  H   PVM+YP DID I K+SLPVF +ASYKLKGSIW QNG+++H+ ANSLMQAADKWLR  QV+QPDFQFF+SHGTYW
Subjt:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW

Query:  K
        +
Subjt:  K

A0A6J1C0E1 uncharacterized protein LOC111006925 (e value 6.6e-208, 88.53% identity)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD
        M GTALQFGGIKGEDRFYIPV+ARKNYNQQKPSRRP K+DETESPSSD+KTKVVASTTKPSKPL PQ KSNLERFLDAT PSVPAQ FSKTTMRGW TCD
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH
        IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE +AMRSDSK RLA EDSDLDSSRDTSSDGSIEYE GK+  +SREQW+H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH

Query:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT
        D L CEN +K+R  S+ DEH M++EGFSSDDGDAGNPRSVLLFQF EQDLPYQRVPLADKIFDLAYQYPGLK+LRSCDIQPASWVSVAWYPIYRIPTGPT
Subjt:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRG GHGQAPVMIYPND+DG+ KVSLPVF LASYKLKGSIWAQNGV EH+MANSLMQAAD WLR  QV+QPDFQFFASHGTYW
Subjt:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW

Query:  K
        +
Subjt:  K

A0A6J1E577 uncharacterized protein LOC111430050 (e value 2.6e-204, 87.28% identity)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD
        MLGTALQFGGIKGEDRFYIPVRARK+YNQQKPSRRP KTDETE+PSS+    VVASTT PSKPL PQ+KSNLERFLDAT+PSVPAQ FSKTTMRGW TCD
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE AA+RSDSKSRLA+EDSDLDSS+DTSS+GSI+YEFGKSCNLSREQW+H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH

Query:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT
         HLACE+ + +RKTSL DEHS  +EGFSSDDGDA  PRS LLFQFLEQDLPYQRVPLADKIFDLAYQ+PGLKTLRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRG GHGQAP MIYPND DGI KVSLPVF LASYKLKGSIWAQN V EH+MANSLMQAA+KWLR  QVNQPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW

Query:  K
        +
Subjt:  K

A0A6J1JE68 uncharacterized protein LOC111484983 (e value 1.4e-202, 87.03% identity)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD
        MLGTALQFGGIKGEDRFYIPVRARK YNQQKPSRRP KTDETE+PSS    KVVASTT PSKPL PQ+KSNLERFLDAT+PSVPAQ FSKTTMRGW TCD
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE AA+RSDSKSRLA+EDSDLDSSRDTSS+GSI+YEFGKSCNLSREQW+H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLH

Query:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT
         HLAC++ + +RKTSL DEHS  +EGFSSDDGDA  PRS LLFQFLEQDLPYQRVPLADKIFDLAYQ+PGLKTLRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  DHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRG GHGQAP MIYPND DGI KVSLPVF LASYKLKGSIWAQN V E++M NSLMQAA+KWLR  QVNQPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYW

Query:  K
        +
Subjt:  K

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) (e value 2.5e-90, 54.1% identity)
Query:  TKSNLERFLDATRPSVPAQNFSKTTMRGWGTCDIEFQ-PYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLA
        + SN+ERFLD+  PSVPA   SKT +R  G  D+E Q PYF+L D+WESF EWSAYG GVPL LN   D V QYYVP LSGIQ+Y +  A+ S  ++R  
Subjt:  TKSNLERFLDATRPSVPAQNFSKTTMRGWGTCDIEFQ-PYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLA

Query:  SEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLHDHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLA
         E+S+ D  RD+SS+GS   E  +    S+EQ            ++ K SL  EH   +E  SSDDG+  + +  L+F++LE+DLPY R P ADK+ DLA
Subjt:  SEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLHDHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLA

Query:  YQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGTGHGQAPV-MIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVD
         ++P LKTLRSCD+ P+SW SVAWYPIY+IPTGPTLKDLDACFLTYHSL TP +G G     + ++ P   + + K+ LPVF LASYKL+GS+W   G  
Subjt:  YQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGTGHGQAPV-MIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVD

Query:  EHRMANSLMQAADKWLRCRQVNQPDFQFF
         H++ANSL QAAD WLR RQVN PDF FF
Subjt:  EHRMANSLMQAADKWLRCRQVNQPDFQFF

AT2G01260.1 Protein of unknown function (DUF789) (e value 1.3e-86, 47.86% identity)
Query:  MLGTALQF-GGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTC
        MLG   Q   G  G+D FY   + R+  NQ+    R  ++D +  PSS        S  K     +  + SNL+RFL++  PSVPAQ  SKT +R     
Subjt:  MLGTALQF-GGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTC

Query:  D--IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSRE
        D   +  PYF+L D+W+SF EWSAYG GVPLVLN   D V+QYYVP LS IQIY    A+ S  KSR   + SD D  RD+SSD S +         S  
Subjt:  D--IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSRE

Query:  QWLHDHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIP
        + +   + C         SL D+H   +E  SSDDG+    +  L+F++LE+DLPY R P ADK+ DLA Q+P L TLRSCD+  +SW SVAWYPIYRIP
Subjt:  QWLHDHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIP

Query:  TGPTLKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFF
        TGPTLKDLDACFLTYHSL T   G G  Q+  +  P + +   K+SLPVF LASYK +GS+W   G  EH++ NSL QAADKWL    V+ PDF FF
Subjt:  TGPTLKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFF

AT2G01260.2 Protein of unknown function (DUF789) (e value 2.1e-65, 47.53% identity)
Query:  MLGTALQF-GGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTC
        MLG   Q   G  G+D FY   + R+  NQ+    R  ++D +  PSS        S  K     +  + SNL+RFL++  PSVPAQ  SKT +R     
Subjt:  MLGTALQF-GGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTC

Query:  D--IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSRE
        D   +  PYF+L D+W+SF EWSAYG GVPLVLN   D V+QYYVP LS IQIY    A+ S  KSR   + SD D  RD+SSD S +         S  
Subjt:  D--IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSRE

Query:  QWLHDHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIP
        + +   + C         SL D+H   +E  SSDDG+    +  L+F++LE+DLPY R P ADK+ DLA Q+P L TLRSCD+  +SW SVAWYPIYRIP
Subjt:  QWLHDHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIP

Query:  TGPTLKDLDACFLTYHSLSTPIRG
        TGPTLKDLDACFLTYHSL T   G
Subjt:  TGPTLKDLDACFLTYHSLSTPIRG

AT4G16100.1 Protein of unknown function (DUF789) (e value 6.5e-83, 44.36% identity)
Query:  IKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSS----DIKTKV-----------------VASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFS
        I+GE+RFY P   RK   +++  R   +  E E   +    D K KV                 V S    +      T SNL RFLD T P V  Q+  
Subjt:  IKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSS----DIKTKV-----------------VASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFS

Query:  KTTMRGWGTCDIEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGK
         T+ +GW T + E++PYF+LNDLW+SF+EWSAYG GVPL+LNG DSVVQYYVPYLSGIQ+Y +P+  R+ +  R   E+SD DS RD SSDGS       
Subjt:  KTTMRGWGTCDIEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGK

Query:  SCNLSREQWLHDHLACENIVKLRKTSLSDEHSMMREGFSSDDGDA-GNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVA
         C              E    L + SL ++  +   G SSD+ +A  N    L+F++LE  +P+ R PL DKI +L+ Q+P L+T RSCD+ P+SWVSVA
Subjt:  SCNLSREQWLHDHLACENIVKLRKTSLSDEHSMMREGFSSDDGDA-GNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVA

Query:  WYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGTGH--GQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWA-QNGVDEHRMANSLMQAADKWLRCRQV
        WYPIYRIP G +L++LDACFLT+HSLSTP RGT +  GQ+      +     AK+ LP F LASYK K S W+ ++ VDE++   +L++ A++WLR  +V
Subjt:  WYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGTGH--GQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWA-QNGVDEHRMANSLMQAADKWLRCRQV

Query:  NQPDFQFFASH-GTYWK
          PDF+ F SH G+ W+
Subjt:  NQPDFQFFASH-GTYWK

AT5G49220.1 Protein of unknown function (DUF789) (e value 4.4e-71, 39.26% identity)
Query:  GTALQFGGIKGEDRFYIPVRARKNYN----QQKPSRRPNKTDETE-------------SPS--------SDIKTKVVASTTKPSKPLAPQTK--------
        G ++    I+GE+RFY P   R+       QQ+   +  + DE E             +P         S+ K++VV S ++     +  +         
Subjt:  GTALQFGGIKGEDRFYIPVRARKNYN----QQKPSRRPNKTDETE-------------SPS--------SDIKTKVVASTTKPSKPLAPQTK--------

Query:  -SNLERFLDATRPSVPAQNFSKTTMRGWGTCDIEFQPYFILNDLWESFKEWSAYGAGV-----PLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSR
         SNL+RFL+ T P VPA+ F   +     T + +   YF+L DLWESF EWSAYGAGV     PL ++G DS VQYYVPYLSGIQ+Y +P       K R
Subjt:  -SNLERFLDATRPSVPAQNFSKTTMRGWGTCDIEFQPYFILNDLWESFKEWSAYGAGV-----PLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSR

Query:  LASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLHDHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFD
            D++  SS  +S+  ++  +                    ++ +L + SL D+   +    SS + +  NP+  LLF++LE + P+ R PLA+KI D
Subjt:  LASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLHDHLACENIVKLRKTSLSDEHSMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFD

Query:  LAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGV
        LA + P L T RSCD+ P+SWVSV+WYPIYRIP GPTL++LDACFLT+HSLST    +  G        +D     K+ LP F LASYKLK S+W QN +
Subjt:  LAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGTGHGQAPVMIYPNDIDGIAKVSLPVFALASYKLKGSIWAQNGV

Query:  DEHRMANSLMQAADKWLRCRQVNQPDFQFFASH
         E +   SL+QAADKWL+  QV+ PD++FF S+
Subjt:  DEHRMANSLMQAADKWLRCRQVNQPDFQFFASH
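
The alignment blocks above follow the usual BLAST-style layout: the unlabeled middle line repeats a residue letter where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. A minimal sketch of tallying those symbols across blocks is given below. The function name `alignment_stats` is hypothetical, and the sketch assumes each midline is aligned column for column with its query segment, with trailing blanks possibly trimmed by the page.

```python
def alignment_stats(blocks):
    """Tally identities and positives over BLAST-style alignment blocks.

    blocks: iterable of (query_segment, midline) string pairs, one pair
    per displayed block. Assumption: the midline starts in the same
    column as the query segment; trailing blanks may have been trimmed.
    """
    length = identities = positives = 0
    for query, midline in blocks:
        # Restore trimmed trailing blanks so every column is counted.
        midline = midline.ljust(len(query))[:len(query)]
        length += len(query)
        for ch in midline:
            if ch.isalpha():
                identities += 1
                positives += 1
            elif ch == "+":
                positives += 1
    percent = round(100.0 * identities / length, 2) if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": percent,
    }


# Toy input, not taken from the record: two short blocks.
print(alignment_stats([("MLGTAL", "ML TA+"), ("K", "+")]))
```

Percent identity here is identities divided by the total number of alignment columns. The figure reported by the database may be computed over a differently defined length, so small differences would not be surprising.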


Sequences
CDS sequence
ATGTTGGGAACTGCCTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTAAGGGCCCGAAAGAATTATAATCAGCAAAAGCCATCGAGGAGACCTAA
CAAGACCGATGAAACTGAGAGCCCATCTTCAGATATTAAGACTAAAGTTGTAGCTTCTACTACAAAGCCTTCTAAGCCATTAGCTCCTCAGACTAAGAGCAACTTAGAGA
GATTCTTGGACGCCACAAGGCCTTCAGTTCCAGCGCAGAACTTCTCTAAGACAACTATGAGGGGTTGGGGGACTTGTGATATTGAGTTTCAACCTTATTTCATTCTGAAT
GATCTGTGGGAGTCTTTCAAGGAGTGGAGTGCATACGGTGCGGGAGTTCCTTTAGTACTTAATGGAGGTGACTCTGTTGTTCAATATTACGTTCCATATTTGTCTGGTAT
CCAAATATATGGTGAACCTGCAGCAATGAGATCAGATTCTAAGTCCAGGCTGGCTAGTGAGGATAGTGATCTTGACTCTTCTAGAGATACAAGCAGCGATGGTAGCATTG
AATATGAATTTGGAAAGAGCTGTAACCTTTCTAGAGAACAGTGGCTTCATGACCATTTAGCTTGTGAAAACATAGTTAAATTGAGAAAGACGTCTTTAAGTGATGAACAT
AGCATGATGCGAGAAGGTTTTTCGAGTGATGATGGGGATGCGGGAAATCCTCGGAGTGTTTTGCTCTTTCAGTTTCTTGAGCAAGATCTTCCTTATCAACGTGTACCATT
GGCTGATAAGATATTTGATCTTGCTTACCAATATCCTGGTTTGAAAACTTTAAGAAGTTGTGATATCCAGCCAGCCAGTTGGGTCTCTGTAGCATGGTACCCAATATACC
GTATACCCACTGGTCCGACATTAAAAGATTTGGATGCTTGCTTTTTAACATACCATTCCCTTTCCACACCCATTAGAGGTACTGGACATGGCCAGGCACCAGTGATGATA
TATCCAAATGACATTGATGGTATTGCAAAGGTCTCCCTGCCTGTTTTTGCATTGGCTTCTTATAAGCTGAAAGGCTCAATTTGGGCGCAAAATGGCGTGGACGAGCATCG
AATGGCAAATTCCCTCATGCAGGCAGCAGATAAGTGGCTGAGGTGCCGACAGGTCAATCAACCCGATTTTCAGTTCTTTGCATCGCATGGTACATACTGGAAATGA
mRNA sequence
ATGTTGGGAACTGCCTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTAAGGGCCCGAAAGAATTATAATCAGCAAAAGCCATCGAGGAGACCTAA
CAAGACCGATGAAACTGAGAGCCCATCTTCAGATATTAAGACTAAAGTTGTAGCTTCTACTACAAAGCCTTCTAAGCCATTAGCTCCTCAGACTAAGAGCAACTTAGAGA
GATTCTTGGACGCCACAAGGCCTTCAGTTCCAGCGCAGAACTTCTCTAAGACAACTATGAGGGGTTGGGGGACTTGTGATATTGAGTTTCAACCTTATTTCATTCTGAAT
GATCTGTGGGAGTCTTTCAAGGAGTGGAGTGCATACGGTGCGGGAGTTCCTTTAGTACTTAATGGAGGTGACTCTGTTGTTCAATATTACGTTCCATATTTGTCTGGTAT
CCAAATATATGGTGAACCTGCAGCAATGAGATCAGATTCTAAGTCCAGGCTGGCTAGTGAGGATAGTGATCTTGACTCTTCTAGAGATACAAGCAGCGATGGTAGCATTG
AATATGAATTTGGAAAGAGCTGTAACCTTTCTAGAGAACAGTGGCTTCATGACCATTTAGCTTGTGAAAACATAGTTAAATTGAGAAAGACGTCTTTAAGTGATGAACAT
AGCATGATGCGAGAAGGTTTTTCGAGTGATGATGGGGATGCGGGAAATCCTCGGAGTGTTTTGCTCTTTCAGTTTCTTGAGCAAGATCTTCCTTATCAACGTGTACCATT
GGCTGATAAGATATTTGATCTTGCTTACCAATATCCTGGTTTGAAAACTTTAAGAAGTTGTGATATCCAGCCAGCCAGTTGGGTCTCTGTAGCATGGTACCCAATATACC
GTATACCCACTGGTCCGACATTAAAAGATTTGGATGCTTGCTTTTTAACATACCATTCCCTTTCCACACCCATTAGAGGTACTGGACATGGCCAGGCACCAGTGATGATA
TATCCAAATGACATTGATGGTATTGCAAAGGTCTCCCTGCCTGTTTTTGCATTGGCTTCTTATAAGCTGAAAGGCTCAATTTGGGCGCAAAATGGCGTGGACGAGCATCG
AATGGCAAATTCCCTCATGCAGGCAGCAGATAAGTGGCTGAGGTGCCGACAGGTCAATCAACCCGATTTTCAGTTCTTTGCATCGCATGGTACATACTGGAAATGA
Protein sequence
MLGTALQFGGIKGEDRFYIPVRARKNYNQQKPSRRPNKTDETESPSSDIKTKVVASTTKPSKPLAPQTKSNLERFLDATRPSVPAQNFSKTTMRGWGTCDIEFQPYFILN
DLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEPAAMRSDSKSRLASEDSDLDSSRDTSSDGSIEYEFGKSCNLSREQWLHDHLACENIVKLRKTSLSDEH
SMMREGFSSDDGDAGNPRSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGTGHGQAPVMI
YPNDIDGIAKVSLPVFALASYKLKGSIWAQNGVDEHRMANSLMQAADKWLRCRQVNQPDFQFFASHGTYWK
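
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below uses the standard genetic code (NCBI translation table 1), which is an assumption for this nuclear gene rather than something the record states; the helper name `translate` is hypothetical, and the sequence is expected with line breaks already removed.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS listed above.
print(translate("ATGTTGGGAACTGCCTTGCAGTTTGGGGGA"))  # MLGTALQFGG
```

Joining the CDS lines into a single string and passing it to `translate` should give a protein that can be compared against the listed protein sequence.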