; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015455 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015455
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF1262)
Genome locationChr02:26784210..26791155
RNA-Seq ExpressionHG10015455
SyntenyHG10015455
Gene Ontology termsGO:0035821 - modification of morphology or physiology of other organism (biological process)
GO:0016798 - hydrolase activity, acting on glycosyl bonds (molecular function)
GO:0090729 - toxin activity (molecular function)
InterPro domainsIPR010683 - Protein of unknown function DUF1262


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446516.1 PREDICTED: uncharacterized protein LOC103489223 [Cucumis melo]3.4e-15879.17Show/hide
Query:  MYVTRPLSLNRNSPSALSIAPPEGPNLGILVIKDEAAESKWLFGMPKDETILVPPFPQNKKLWLSHTMVVGTSSFVDYIYALFIPVLNQPLSSNQYYIIK
        MYVTRPLS+ RNSPSALS+APPEGPN GILVIKDEAAESKWLFGM KDET+LVPPFPQNKK+WLSHTMVVGT+S V+YIYAL IPVLNQPLSSNQYYIIK
Subjt:  MYVTRPLSLNRNSPSALSIAPPEGPNLGILVIKDEAAESKWLFGMPKDETILVPPFPQNKKLWLSHTMVVGTSSFVDYIYALFIPVLNQPLSSNQYYIIK

Query:  SNGSDKG-------EENRSRLLGVADTPPQKFDPTNIYQQFEISNDMYCGKPNGFNFKSVASNGTTPYSMTHKNWRAYCKTLKTFQPTTEALGLDALLRA
        SNGSDKG       EEN SRLLGVADTPPQKFDPTNIYQ+FEISNDMY GKPNGFNFKSVASNG TPY+MTHKNWRAYCKTLKTFQPTTEALGLD  LRA
Subjt:  SNGSDKG-------EENRSRLLGVADTPPQKFDPTNIYQQFEISNDMYCGKPNGFNFKSVASNGTTPYSMTHKNWRAYCKTLKTFQPTTEALGLDALLRA

Query:  RLPPLIFSLPCKSSSP-VVVGKWYCPFIFVRE----------------------------------GRGVQLDVLVEKEVVSVAGRAMAGINVGDGAAWF
        RLPPL FSLPCK SSP VVVGKWYCPFIFVRE                                  GRGV+LDVLVEKEVVSV GRA+ G+N+GDGAAWF
Subjt:  RLPPLIFSLPCKSSSP-VVVGKWYCPFIFVRE----------------------------------GRGVQLDVLVEKEVVSVAGRAMAGINVGDGAAWF

Query:  GSSRVGLSLAIVERIKWEEERGGFEWVRDGGEKEVKVKRREEFEGVGMWKRFGCYVLVER
        GSSRVGLS+AIVERI+WEEERGGFEWVR+GGEK VKV+RREEF+GVGMW+RFGCYVLVER
Subjt:  GSSRVGLSLAIVERIKWEEERGGFEWVRDGGEKEVKVKRREEFEGVGMWKRFGCYVLVER

XP_008446517.1 PREDICTED: uncharacterized protein LOC103489225 [Cucumis melo]2.8e-15271.92Show/hide
Query:  SSSSSASGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAA--DRRRH--YYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKT
        S+S+S   EGPNTGVL IEDE SE RWFFGL+KD RV+ PPFPQNKLLELRY K   A   R+RH  +  TDYF+AMLIPVLNQP NSNQYYVINSRGK+
Subjt:  SSSSSASGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAA--DRRRH--YYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKT

Query:  KGLACTSSKEDDMTSCCCFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTL-ENFDATPALGVDAALRAR
        KGLACTSSKEDDM SCCCFN IDDI PQ FDP NPYQ+FQI  KY+YF +P+ F S SMAPDGVPPYFLRHQGWTA+T+TL + FD TPALG+D  LRAR
Subjt:  KGLACTSSKEDDMTSCCCFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTL-ENFDATPALGVDAALRAR

Query:  LPE---------PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNNNDNGDVAVDVCVRREVVLVGRALLAVETVVVSDGIMWF-DSSSVG
        LPE           +W+    +  +G+VG QMRDSTYYEMTLQQ+WE+IF CYNN+    V VDVCVRRE VLVG ALLA E VVVSDGIMW   SS VG
Subjt:  LPE---------PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNNNDNGDVAVDVCVRREVVLVGRALLAVETVVVSDGIMWF-DSSSVG

Query:  LSMAIVERVKWEEERCGFVWGSDHEKEIERVVRREKFEGKGVWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE
        LSMAIVERVKWEEER GFVWG +HE+EIERVVRRE+F+GKG+W+  +CYV+VE+FVLKRM+GTLVLTWEFRH HQIRTKWE
Subjt:  LSMAIVERVKWEEERCGFVWGSDHEKEIERVVRREKFEGKGVWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE

XP_011655744.1 uncharacterized protein LOC105435579 [Cucumis sativus]5.6e-16180.11Show/hide
Query:  MYVTRPLSLNRNSPSALSIAPPEGPNLGILVIKDEAAESKWLFGMPKDETILVPPFPQNKKLWLSHTMVVGTSSFVDYIYALFIPVLNQPLSSNQYYIIK
        MYVTRPLS+ RNSPSALS+APPEGPN GILVIKDEAAESKWLFGM KDET+LVPPFPQNKK+WLSHTMVVGT+S VDYIYAL IPVLNQPLSSNQYYIIK
Subjt:  MYVTRPLSLNRNSPSALSIAPPEGPNLGILVIKDEAAESKWLFGMPKDETILVPPFPQNKKLWLSHTMVVGTSSFVDYIYALFIPVLNQPLSSNQYYIIK

Query:  SNGSDKG-------EENRSRLLGVADTPPQKFDPTNIYQQFEISNDMYCGKPNGFNFKSVASNGTTPYSMTHKNWRAYCKTLKTFQPTTEALGLDALLRA
        SNGSDKG       EEN SRLLG+ADTPPQKFDPTNIYQ+FEISNDMY GKPNGFNFKSVASNG TPY+MTHKNWRAYCKTLKTFQPT EALGLD  LRA
Subjt:  SNGSDKG-------EENRSRLLGVADTPPQKFDPTNIYQQFEISNDMYCGKPNGFNFKSVASNGTTPYSMTHKNWRAYCKTLKTFQPTTEALGLDALLRA

Query:  RLPPLIFSLPCKSSSPVVVGKWYCPFIFVRE--------------------------------GRGVQLDVLVEKEVVSVAGRAMAGINVGDGAAWFGSS
        RLPPL FSLPC  SSP+VVGKWYCPFIFVRE                                GRGV+LDVLVEKEVVSVAGR +AG+N+GDGAAWFGSS
Subjt:  RLPPLIFSLPCKSSSPVVVGKWYCPFIFVRE--------------------------------GRGVQLDVLVEKEVVSVAGRAMAGINVGDGAAWFGSS

Query:  RVGLSLAIVERIKWEEERGGFEWVRDGGEKEVKVKRREEFEGVGMWKRFGCYVLVER
        RVGLS+AIVERIKWEEE+GGFEWVR+GGEKEVKVKRREEFEGVGMW+RFGCYVLVER
Subjt:  RVGLSLAIVERIKWEEERGGFEWVRDGGEKEVKVKRREEFEGVGMWKRFGCYVLVER

XP_011655745.2 uncharacterized protein LOC105435580 [Cucumis sativus]1.5e-15072.09Show/hide
Query:  SSSSSSSASGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKTKG
        S+S+S+S   EGPNTGVLVIEDE SE RWFFGL+KD  V+ PPFPQNKLLELRY K  A  R R +  TDYF+ MLIPVLNQP NSNQYYVI+SRGK+KG
Subjt:  SSSSSSSASGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKTKG

Query:  LACTSSKEDDMTSCCCFNFIDDIPPQLFDPRNPYQQFQITNKYTYF-GSPHSFESKSMAPDGVPPYFLRHQGWTADTKTL-ENFDATPALGVDAALRARL
        LACTSSKEDDM SCCCFN IDDI PQLFDP NPYQQFQI +KY++F  S + F S SMAPDGVPPYFLRHQGWTA+T+ L + FDA PALG+DAALRARL
Subjt:  LACTSSKEDDMTSCCCFNFIDDIPPQLFDPRNPYQQFQITNKYTYF-GSPHSFESKSMAPDGVPPYFLRHQGWTADTKTL-ENFDATPALGVDAALRARL

Query:  PE----------PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNNNDNGD-VAVDVCVRREVVLVGRALLAVETVVVSDGIMWFDSSS--
        PE            +W+    +  +G+VG QMRDSTYYEMTLQQ+WE+IF CYNNND G  V VDVCVR E VLVG ALLA E VVVSDGIMWF  SS  
Subjt:  PE----------PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNNNDNGD-VAVDVCVRREVVLVGRALLAVETVVVSDGIMWFDSSS--

Query:  -VGLSMAIVERVKWEEERCGFVWG-SDHEKEIE-RVVRREKFEGKG-VWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE
         VGLSMAIVERVKWEEER GFVWG +DHE++IE R VRRE+FEGKG +W+ FRCYV+VE+FVLKRM+GTLVLTWEFRH HQIRTKWE
Subjt:  -VGLSMAIVERVKWEEERCGFVWG-SDHEKEIE-RVVRREKFEGKG-VWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE

XP_038893050.1 uncharacterized protein LOC120081938 [Benincasa hispida]7.8e-16376.68Show/hide
Query:  RDSSSSSSSSA-SGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYP-TDYFHAMLIPVLNQPSNSNQYYVINSR
        RD+SSSSSSSA   EGPNTGVLVIEDEASE RWFFGL+KDT V VPPFPQNKLLELRYLKH+AA  RRH++  TDYF+A LIPVLNQP NSNQYY+INSR
Subjt:  RDSSSSSSSSA-SGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYP-TDYFHAMLIPVLNQPSNSNQYYVINSR

Query:  GKTKGLACTSSKEDDMTSCCCFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTLENFDATPALGVDAALR
        GKTKGLACTSSKEDD+ SCCCFNFIDDI PQLFDPR+PYQQFQITNKYTYFGS   F SKSMA DGVPPYFLRHQGWTA+TKTLENF+A PALG++  LR
Subjt:  GKTKGLACTSSKEDDMTSCCCFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTLENFDATPALGVDAALR

Query:  ARLP-----EPT---QWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNNN---DNGDVAVDVCVRREVVLVGRALLAVETVVVSDGIMWF-DS
        ARLP     +P    +W+    +  +GE+GAQMRDSTYYEMTLQQ+W++IF CYNNN    NGDV VDVCV+RE VLV   LL  + VVVSDGIMWF  S
Subjt:  ARLP-----EPT---QWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNNN---DNGDVAVDVCVRREVVLVGRALLAVETVVVSDGIMWF-DS

Query:  SSVGLSMAIVERVKWEEERCGFVWGSDHE-KEIERVVRREKFEGKGVWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE
        S VGLSMAIVERV+WEEER GFVWG  HE +EIERVVRRE F+GKGVWK F CYV+VERFVLKRMDGTL LTWEFRH HQIRTKWE
Subjt:  SSVGLSMAIVERVKWEEERCGFVWGSDHE-KEIERVVRREKFEGKGVWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE

TrEMBL top hitse value%identityAlignment
A0A0A0KQZ6 Toxin_10 domain-containing protein1.2e-15680.12Show/hide
Query:  RNSPSALSIAPPEGPNLGILVIKDEAAESKWLFGMPKDETILVPPFPQNKKLWLSHTMVVGTSSFVDYIYALFIPVLNQPLSSNQYYIIKSNGSDKG---
        RNSPSALS+APPEGPN GILVIKDEAAESKWLFGM KDET+LVPPFPQNKK+WLSHTMVVGT+S VDYIYAL IPVLNQPLSSNQYYIIKSNGSDKG   
Subjt:  RNSPSALSIAPPEGPNLGILVIKDEAAESKWLFGMPKDETILVPPFPQNKKLWLSHTMVVGTSSFVDYIYALFIPVLNQPLSSNQYYIIKSNGSDKG---

Query:  ----EENRSRLLGVADTPPQKFDPTNIYQQFEISNDMYCGKPNGFNFKSVASNGTTPYSMTHKNWRAYCKTLKTFQPTTEALGLDALLRARLPPLIFSLP
            EEN SRLLG+ADTPPQKFDPTNIYQ+FEISNDMY GKPNGFNFKSVASNG TPY+MTHKNWRAYCKTLKTFQPT EALGLD  LRARLPPL FSLP
Subjt:  ----EENRSRLLGVADTPPQKFDPTNIYQQFEISNDMYCGKPNGFNFKSVASNGTTPYSMTHKNWRAYCKTLKTFQPTTEALGLDALLRARLPPLIFSLP

Query:  CKSSSPVVVGKWYCPFIFVRE--------------------------------GRGVQLDVLVEKEVVSVAGRAMAGINVGDGAAWFGSSRVGLSLAIVE
        C  SSP+VVGKWYCPFIFVRE                                GRGV+LDVLVEKEVVSVAGR +AG+N+GDGAAWFGSSRVGLS+AIVE
Subjt:  CKSSSPVVVGKWYCPFIFVRE--------------------------------GRGVQLDVLVEKEVVSVAGRAMAGINVGDGAAWFGSSRVGLSLAIVE

Query:  RIKWEEERGGFEWVRDGGEKEVKVKRREEFEGVGMWKRFGCYVLVER
        RIKWEEE+GGFEWVR+GGEKEVKVKRREEFEGVGMW+RFGCYVLVER
Subjt:  RIKWEEERGGFEWVRDGGEKEVKVKRREEFEGVGMWKRFGCYVLVER

A0A0A0KWB8 Uncharacterized protein7.4e-15172.09Show/hide
Query:  SSSSSSSASGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKTKG
        S+S+S+S   EGPNTGVLVIEDE SE RWFFGL+KD  V+ PPFPQNKLLELRY K  A  R R +  TDYF+ MLIPVLNQP NSNQYYVI+SRGK+KG
Subjt:  SSSSSSSASGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKTKG

Query:  LACTSSKEDDMTSCCCFNFIDDIPPQLFDPRNPYQQFQITNKYTYF-GSPHSFESKSMAPDGVPPYFLRHQGWTADTKTL-ENFDATPALGVDAALRARL
        LACTSSKEDDM SCCCFN IDDI PQLFDP NPYQQFQI +KY++F  S + F S SMAPDGVPPYFLRHQGWTA+T+ L + FDA PALG+DAALRARL
Subjt:  LACTSSKEDDMTSCCCFNFIDDIPPQLFDPRNPYQQFQITNKYTYF-GSPHSFESKSMAPDGVPPYFLRHQGWTADTKTL-ENFDATPALGVDAALRARL

Query:  PE----------PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNNNDNGD-VAVDVCVRREVVLVGRALLAVETVVVSDGIMWFDSSS--
        PE            +W+    +  +G+VG QMRDSTYYEMTLQQ+WE+IF CYNNND G  V VDVCVR E VLVG ALLA E VVVSDGIMWF  SS  
Subjt:  PE----------PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNNNDNGD-VAVDVCVRREVVLVGRALLAVETVVVSDGIMWFDSSS--

Query:  -VGLSMAIVERVKWEEERCGFVWG-SDHEKEIE-RVVRREKFEGKG-VWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE
         VGLSMAIVERVKWEEER GFVWG +DHE++IE R VRRE+FEGKG +W+ FRCYV+VE+FVLKRM+GTLVLTWEFRH HQIRTKWE
Subjt:  -VGLSMAIVERVKWEEERCGFVWG-SDHEKEIE-RVVRREKFEGKG-VWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE

A0A1S3BER1 uncharacterized protein LOC1034892231.6e-15879.17Show/hide
Query:  MYVTRPLSLNRNSPSALSIAPPEGPNLGILVIKDEAAESKWLFGMPKDETILVPPFPQNKKLWLSHTMVVGTSSFVDYIYALFIPVLNQPLSSNQYYIIK
        MYVTRPLS+ RNSPSALS+APPEGPN GILVIKDEAAESKWLFGM KDET+LVPPFPQNKK+WLSHTMVVGT+S V+YIYAL IPVLNQPLSSNQYYIIK
Subjt:  MYVTRPLSLNRNSPSALSIAPPEGPNLGILVIKDEAAESKWLFGMPKDETILVPPFPQNKKLWLSHTMVVGTSSFVDYIYALFIPVLNQPLSSNQYYIIK

Query:  SNGSDKG-------EENRSRLLGVADTPPQKFDPTNIYQQFEISNDMYCGKPNGFNFKSVASNGTTPYSMTHKNWRAYCKTLKTFQPTTEALGLDALLRA
        SNGSDKG       EEN SRLLGVADTPPQKFDPTNIYQ+FEISNDMY GKPNGFNFKSVASNG TPY+MTHKNWRAYCKTLKTFQPTTEALGLD  LRA
Subjt:  SNGSDKG-------EENRSRLLGVADTPPQKFDPTNIYQQFEISNDMYCGKPNGFNFKSVASNGTTPYSMTHKNWRAYCKTLKTFQPTTEALGLDALLRA

Query:  RLPPLIFSLPCKSSSP-VVVGKWYCPFIFVRE----------------------------------GRGVQLDVLVEKEVVSVAGRAMAGINVGDGAAWF
        RLPPL FSLPCK SSP VVVGKWYCPFIFVRE                                  GRGV+LDVLVEKEVVSV GRA+ G+N+GDGAAWF
Subjt:  RLPPLIFSLPCKSSSP-VVVGKWYCPFIFVRE----------------------------------GRGVQLDVLVEKEVVSVAGRAMAGINVGDGAAWF

Query:  GSSRVGLSLAIVERIKWEEERGGFEWVRDGGEKEVKVKRREEFEGVGMWKRFGCYVLVER
        GSSRVGLS+AIVERI+WEEERGGFEWVR+GGEK VKV+RREEF+GVGMW+RFGCYVLVER
Subjt:  GSSRVGLSLAIVERIKWEEERGGFEWVRDGGEKEVKVKRREEFEGVGMWKRFGCYVLVER

A0A1S3BF83 uncharacterized protein LOC1034892251.3e-15271.92Show/hide
Query:  SSSSSASGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAA--DRRRH--YYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKT
        S+S+S   EGPNTGVL IEDE SE RWFFGL+KD RV+ PPFPQNKLLELRY K   A   R+RH  +  TDYF+AMLIPVLNQP NSNQYYVINSRGK+
Subjt:  SSSSSASGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAA--DRRRH--YYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKT

Query:  KGLACTSSKEDDMTSCCCFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTL-ENFDATPALGVDAALRAR
        KGLACTSSKEDDM SCCCFN IDDI PQ FDP NPYQ+FQI  KY+YF +P+ F S SMAPDGVPPYFLRHQGWTA+T+TL + FD TPALG+D  LRAR
Subjt:  KGLACTSSKEDDMTSCCCFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTL-ENFDATPALGVDAALRAR

Query:  LPE---------PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNNNDNGDVAVDVCVRREVVLVGRALLAVETVVVSDGIMWF-DSSSVG
        LPE           +W+    +  +G+VG QMRDSTYYEMTLQQ+WE+IF CYNN+    V VDVCVRRE VLVG ALLA E VVVSDGIMW   SS VG
Subjt:  LPE---------PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNNNDNGDVAVDVCVRREVVLVGRALLAVETVVVSDGIMWF-DSSSVG

Query:  LSMAIVERVKWEEERCGFVWGSDHEKEIERVVRREKFEGKGVWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE
        LSMAIVERVKWEEER GFVWG +HE+EIERVVRRE+F+GKG+W+  +CYV+VE+FVLKRM+GTLVLTWEFRH HQIRTKWE
Subjt:  LSMAIVERVKWEEERCGFVWGSDHEKEIERVVRREKFEGKGVWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE

A0A5A7SZ71 Uncharacterized protein1.8e-14978.65Show/hide
Query:  IAPPEGPNLGILVIKDEAAESKWLFGMPKDETILVPPFPQNKKLWLSHTMVVGTSSFVDYIYALFIPVLNQPLSSNQYYIIKSNGSDKG-------EENR
        +APPEGPN GILVIKDEAAESKWLFGM KDET+LVPPFPQNKK+WLSHTMVVGT+S V+YIYAL IPVLNQPLSSNQYYIIKSNGSDKG       EEN 
Subjt:  IAPPEGPNLGILVIKDEAAESKWLFGMPKDETILVPPFPQNKKLWLSHTMVVGTSSFVDYIYALFIPVLNQPLSSNQYYIIKSNGSDKG-------EENR

Query:  SRLLGVADTPPQKFDPTNIYQQFEISNDMYCGKPNGFNFKSVASNGTTPYSMTHKNWRAYCKTLKTFQPTTEALGLDALLRARLPPLIFSLPCKSSSP-V
        SRLLGVADTPPQKFDPTNIYQ+FEISNDMY GKPNGFNFKSVASNG TPY+MTHKNWRAYCKTLKTFQPTTEALGLD  LRARLPPL FSLPCK SSP V
Subjt:  SRLLGVADTPPQKFDPTNIYQQFEISNDMYCGKPNGFNFKSVASNGTTPYSMTHKNWRAYCKTLKTFQPTTEALGLDALLRARLPPLIFSLPCKSSSP-V

Query:  VVGKWYCPFIFVRE----------------------------------GRGVQLDVLVEKEVVSVAGRAMAGINVGDGAAWFGSSRVGLSLAIVERIKWE
        VVGKWYCPFIFVRE                                  GRGV+LDVLVEKEVVSV GRA+ G+N+GDGAAWFGSSRVGLS+AIVERI+WE
Subjt:  VVGKWYCPFIFVRE----------------------------------GRGVQLDVLVEKEVVSVAGRAMAGINVGDGAAWFGSSRVGLSLAIVERIKWE

Query:  EERGGFEWVRDGGEKEVKVKRREEFEGVGMWKRFGCYVLVER
        EERGGFEWVR+GGEK VKV+RREEF+GVGMW+RFGCYVLVER
Subjt:  EERGGFEWVRDGGEKEVKVKRREEFEGVGMWKRFGCYVLVER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13480.1 Protein of unknown function (DUF1262)2.8e-6538.89Show/hide
Query:  EGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKTKGLACTSSKEDD
        EGPN+G++VI+DE S++   FG   + R+K  PFPQN  L   Y       +  H  P      + IPVL+QP +SN YYV+  RGK  G A  S+ E++
Subjt:  EGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKTKGLACTSSKEDD

Query:  -MTSCCCFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTLENFD-ATPALGVDAALR--------ARLPE
         ++SC CF++I D  PQ  DP + YQQF+I  + +   S   + + S+A DGVPP +L+ + W     T  +F     A G++  LR         +  E
Subjt:  -MTSCCCFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTLENFD-ATPALGVDAALR--------ARLPE

Query:  PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYN-NNDNGDVAVDVCVRRE-VVLVGRALLAVETVVVSDGIMWF-------DSSSVGLSMA
          +W+    +  +G+V  QM  ST+Y +TLQQ WE++F C N  N++ +V VDV V  E V L G+     E     DG++WF       D   +GL   
Subjt:  PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYN-NNDNGDVAVDVCVRRE-VVLVGRALLAVETVVVSDGIMWF-------DSSSVGLSMA

Query:  IVERVKWEEERCGFVWGSDHEKEIERVVRREKFE-GKGVWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE
        +VER+KWEEER G++   +   E   + R E+FE G   WK +RCYV++E F L RMDG+LVLT+EFRH+ ++++KW+
Subjt:  IVERVKWEEERCGFVWGSDHEKEIERVVRREKFE-GKGVWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE

AT1G13500.1 Protein of unknown function (DUF1262)1.1e-6141.01Show/hide
Query:  EGPNTGVLVIEDEASES--RWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKTKGLACTSSKE
        EGPN+GVLVI+DE S    +  F    D  +   PFPQN     R +         +  P  +     IPVL+QP +SN YYVI   GK  G AC S+KE
Subjt:  EGPNTGVLVIEDEASES--RWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKTKGLACTSSKE

Query:  DDMTSCC-CFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTLENFD---ATPALG-VDAALRARLPE---
         D  SCC CF +I +  P+  DP +  QQF+I   +    S   F + S+A DG+PP FL  +GW+      E  D      A G VDA LR  LP+   
Subjt:  DDMTSCC-CFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTLENFD---ATPALG-VDAALRARLPE---

Query:  ---PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYN--NNDNGDVAVDVCVRREVVLVGRALLAVETV-VVSDGIMWFDSS---SVGLSMA
             +W+    +  +G+   QM  S YY MTLQQ +E++F C N  NN   +V VDV V  EVV +    +A ET  V SDG++WF++S    +GL   
Subjt:  ---PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYN--NNDNGDVAVDVCVRREVVLVGRALLAVETV-VVSDGIMWFDSS---SVGLSMA

Query:  IVERVKWEEERCGFVWGSDHEKEIERVVRREKFEGKG-VWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE
        ++ER+KWEEER G  W +   ++   + R E+FEG G  WK +RCYV+VE F LKR DG+LVLT+EF+H+ ++++KW+
Subjt:  IVERVKWEEERCGFVWGSDHEKEIERVVRREKFEGKG-VWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE

AT1G13520.1 Protein of unknown function (DUF1262)5.4e-6938.48Show/hide
Query:  LVERDSSSSSSSSASGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINS
        L E   +SS  + +  EGPN+GVLVI+DE S+    FG   D  +K  PFPQN  L + Y      DRR +     +   + IPVL+QP  SN+YYVI  
Subjt:  LVERDSSSSSSSSASGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINS

Query:  RGKTKGLACTSSKEDDMTSCC-CFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTLENFDAT-PALGVDA
        RGK  G A  S+KE+D   CC CF+++ ++ PQ  DP + YQQF++  +    G    + + S+AP+G+PP FL+ + WT +    ++F  T  A G++ 
Subjt:  RGKTKGLACTSSKEDDMTSCC-CFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTLENFDAT-PALGVDA

Query:  ALRARLPEPT-------QWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNNNDN---GDVAVDVCVRREVV-LVGRALLAVETVVVSDGIMWF
         LR++LP          +W+    +  + +   Q++ S YY MTL+Q WE+++ C N N N   G+V VDV V  +VV L G+  ++ ET     G +WF
Subjt:  ALRARLPEPT-------QWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNNNDN---GDVAVDVCVRREVV-LVGRALLAVETVVVSDGIMWF

Query:  -------DSSSVGLSMAIVERVKWEEERCGFVWGSDHEKEIERVVRREKFE-GKGVWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE
                   +GL   +VER+KWEEER G++    +  E   + R E+FE G   WK +RC V++E F LKRMDG+LVLT+EF H+ ++++KW+
Subjt:  -------DSSSVGLSMAIVERVKWEEERCGFVWGSDHEKEIERVVRREKFE-GKGVWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE

AT1G13530.1 Protein of unknown function (DUF1262)1.2e-6840.62Show/hide
Query:  LVERDSSSSSSSSASGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINS
        L E   + S  +    EGPN+GVLVI+DE S+    FG      +   PFPQN  + ++Y   +  DR              IPVL+QP +SN YYVI  
Subjt:  LVERDSSSSSSSSASGEGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINS

Query:  RGKTKGLACTSSKEDDMTSCC-CFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTLENFD-ATPALG-VD
        RGK  G AC S+KE+D  SCC CF ++ +  P+L DP + YQQF+I  + +   S   F + S+A DG+PP FLR +GWT      +++     A G VD
Subjt:  RGKTKGLACTSSKEDDMTSCC-CFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTLENFD-ATPALG-VD

Query:  AALRARLPE------PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYN-NNDNGDVAVDVCVRREVVLVGRALLAVETV-VVSDGIMWF--
        A LR  LP+        +W+    +  +G+   QM+ S YY MTL Q +E++F C N +N N +V VDV V  EVV +    +  ET  V SDG++WF  
Subjt:  AALRARLPE------PTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYN-NNDNGDVAVDVCVRREVVLVGRALLAVETV-VVSDGIMWF--

Query:  -DSSSVGLSMAIVERVKWEEERCGFVWGSDHEKEIERVVRREKFEGKG-VWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE
         ++  +G+   ++ER+KWEEER G  W     +    + R EKFEG G  WK +RCYV+VE F LK+ DG+LVLT+EFRH+ ++++KW+
Subjt:  -DSSSVGLSMAIVERVKWEEERCGFVWGSDHEKEIERVVRREKFEGKG-VWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE

AT1G13540.1 Protein of unknown function (DUF1262)2.2e-6238.87Show/hide
Query:  EGPNTGVLVIEDEASESR--WFFG--LIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKTKGLACTSS
        EGPN+G+LVI+D+ S +R    FG  L+ D+ +   P PQN  L + +      D  R          + IPVL++P +SN YY I   GK  G A  S+
Subjt:  EGPNTGVLVIEDEASESR--WFFG--LIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKTKGLACTSS

Query:  KEDDMTSCC-CFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTLENFD-ATPALGVDAALRARLP-----
        KEDD+ SCC C +F+ +  P+  DP + YQQF+I  K     S   + + S+APDGVPP+FLR + WT +    ++F+    A G+   LR  LP     
Subjt:  KEDDMTSCC-CFNFIDDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTLENFD-ATPALGVDAALRARLP-----

Query:  -EPTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNN-NDNGDVAVDVCVRREVVLVGRALLAVETVVVSDGIMWF--DSSSVGLSMAIVER
            +W+    +  + EV  Q+++S YY MTL+Q W+++F   N+ ++N DV VDV V  EVV +   +  +E  + ++G +WF      +GL   +VER
Subjt:  -EPTQWW----WENDGEVGAQMRDSTYYEMTLQQSWEKIFECYNN-NDNGDVAVDVCVRREVVLVGRALLAVETVVVSDGIMWF--DSSSVGLSMAIVER

Query:  VKWEEERCGFVWGSDHEKEIERVVRREKFEGKGVWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE
        +KWEEER G+    D+E+ +  V R EK     +WK + CYV++E FVLKRMD +LVLT+EF H+ +++TKW+
Subjt:  VKWEEERCGFVWGSDHEKEIERVVRREKFEGKGVWKGFRCYVMVERFVLKRMDGTLVLTWEFRHIHQIRTKWE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGTAACAAGGCCACTTTCTCTGAACAGAAACTCGCCGTCGGCGTTATCTATAGCACCGCCGGAGGGTCCAAACTTGGGGATTTTGGTGATCAAAGACGAAGCTGC
AGAATCCAAATGGCTATTCGGAATGCCTAAGGACGAAACAATTCTGGTCCCTCCTTTCCCTCAAAACAAGAAACTCTGGCTTTCCCATACAATGGTTGTCGGAACCAGTT
CCTTCGTTGATTACATTTATGCCCTCTTTATTCCTGTTCTCAATCAGCCCTTGTCTTCTAACCAATATTACATTATCAAATCCAATGGAAGTGATAAAGGAGAAGAAAAT
CGTAGCCGATTGTTGGGCGTTGCTGATACACCACCACAAAAATTCGACCCAACAAACATTTACCAACAATTCGAAATCAGTAATGATATGTATTGTGGGAAACCCAACGG
TTTCAATTTCAAGTCCGTGGCTTCCAATGGAACTACTCCATATTCCATGACACACAAAAACTGGCGAGCTTACTGCAAGACACTCAAGACCTTTCAACCCACCACCGAAG
CACTTGGCCTTGACGCGCTGCTCCGTGCCCGCCTCCCTCCGCTCATCTTCTCCCTGCCCTGCAAATCGTCCAGCCCTGTGGTTGTAGGAAAATGGTACTGCCCTTTCATT
TTCGTCCGAGAAGGGAGAGGAGTGCAGTTGGATGTTTTGGTGGAGAAGGAAGTGGTTTCGGTTGCAGGGCGGGCGATGGCCGGAATAAATGTCGGCGATGGGGCAGCGTG
GTTTGGTTCGTCGAGAGTTGGGTTGAGCTTGGCGATTGTGGAAAGGATAAAATGGGAAGAGGAGAGAGGTGGATTTGAATGGGTTAGAGATGGAGGAGAAAAGGAAGTGA
AGGTGAAAAGAAGGGAAGAGTTTGAAGGGGTGGGAATGTGGAAGAGATTTGGATGTTATGTTTTGGTTGAGAGAGACTCATCTTCTTCTTCTTCCTCTTCGGCTTCGGGT
GAGGGTCCAAATACAGGCGTTTTAGTGATAGAAGACGAAGCTTCAGAATCGAGATGGTTTTTTGGATTGATTAAGGATACGCGTGTGAAGGTGCCGCCATTCCCTCAAAA
CAAGCTACTGGAGCTTCGATATTTGAAACATAATGCTGCAGACAGGCGAAGGCACTACTATCCAACTGATTACTTTCACGCAATGCTAATTCCAGTTCTGAATCAGCCTT
CCAATTCCAACCAATATTATGTCATTAATTCCCGTGGCAAAACTAAAGGGCTAGCGTGTACAAGTTCAAAGGAAGACGACATGACCAGCTGTTGTTGCTTCAATTTTATT
GATGATATACCACCACAACTCTTCGACCCAAGAAATCCTTATCAACAATTCCAAATTACCAATAAATATACTTACTTTGGGAGTCCCCATAGTTTTGAGTCCAAGTCAAT
GGCTCCCGACGGGGTTCCTCCATATTTCCTAAGACACCAAGGCTGGACAGCTGATACCAAGACTCTTGAAAACTTCGACGCCACGCCAGCACTCGGCGTCGATGCTGCGT
TACGAGCCCGCCTCCCTGAGCCGACCCAGTGGTGGTGGGAAAATGATGGAGAGGTTGGAGCCCAAATGAGGGACTCAACGTACTACGAAATGACCCTCCAACAGAGCTGG
GAAAAGATATTTGAATGCTATAATAATAATGATAATGGTGATGTGGCAGTGGATGTTTGTGTTAGAAGGGAGGTGGTTTTAGTTGGAAGGGCGTTGTTGGCGGTAGAGAC
GGTGGTGGTTTCCGATGGGATTATGTGGTTCGATTCATCGTCGGTGGGGCTAAGCATGGCGATTGTGGAGAGGGTGAAATGGGAGGAGGAGAGATGTGGGTTTGTATGGG
GGAGTGATCATGAGAAGGAGATAGAGAGAGTTGTGAGAAGAGAAAAGTTTGAAGGAAAGGGAGTGTGGAAGGGATTTAGGTGTTATGTTATGGTTGAAAGGTTTGTGCTT
AAGAGAATGGATGGAACTTTGGTTCTTACTTGGGAATTTAGGCACATTCATCAAATTAGAACCAAATGGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTATGTAACAAGGCCACTTTCTCTGAACAGAAACTCGCCGTCGGCGTTATCTATAGCACCGCCGGAGGGTCCAAACTTGGGGATTTTGGTGATCAAAGACGAAGCTGC
AGAATCCAAATGGCTATTCGGAATGCCTAAGGACGAAACAATTCTGGTCCCTCCTTTCCCTCAAAACAAGAAACTCTGGCTTTCCCATACAATGGTTGTCGGAACCAGTT
CCTTCGTTGATTACATTTATGCCCTCTTTATTCCTGTTCTCAATCAGCCCTTGTCTTCTAACCAATATTACATTATCAAATCCAATGGAAGTGATAAAGGAGAAGAAAAT
CGTAGCCGATTGTTGGGCGTTGCTGATACACCACCACAAAAATTCGACCCAACAAACATTTACCAACAATTCGAAATCAGTAATGATATGTATTGTGGGAAACCCAACGG
TTTCAATTTCAAGTCCGTGGCTTCCAATGGAACTACTCCATATTCCATGACACACAAAAACTGGCGAGCTTACTGCAAGACACTCAAGACCTTTCAACCCACCACCGAAG
CACTTGGCCTTGACGCGCTGCTCCGTGCCCGCCTCCCTCCGCTCATCTTCTCCCTGCCCTGCAAATCGTCCAGCCCTGTGGTTGTAGGAAAATGGTACTGCCCTTTCATT
TTCGTCCGAGAAGGGAGAGGAGTGCAGTTGGATGTTTTGGTGGAGAAGGAAGTGGTTTCGGTTGCAGGGCGGGCGATGGCCGGAATAAATGTCGGCGATGGGGCAGCGTG
GTTTGGTTCGTCGAGAGTTGGGTTGAGCTTGGCGATTGTGGAAAGGATAAAATGGGAAGAGGAGAGAGGTGGATTTGAATGGGTTAGAGATGGAGGAGAAAAGGAAGTGA
AGGTGAAAAGAAGGGAAGAGTTTGAAGGGGTGGGAATGTGGAAGAGATTTGGATGTTATGTTTTGGTTGAGAGAGACTCATCTTCTTCTTCTTCCTCTTCGGCTTCGGGT
GAGGGTCCAAATACAGGCGTTTTAGTGATAGAAGACGAAGCTTCAGAATCGAGATGGTTTTTTGGATTGATTAAGGATACGCGTGTGAAGGTGCCGCCATTCCCTCAAAA
CAAGCTACTGGAGCTTCGATATTTGAAACATAATGCTGCAGACAGGCGAAGGCACTACTATCCAACTGATTACTTTCACGCAATGCTAATTCCAGTTCTGAATCAGCCTT
CCAATTCCAACCAATATTATGTCATTAATTCCCGTGGCAAAACTAAAGGGCTAGCGTGTACAAGTTCAAAGGAAGACGACATGACCAGCTGTTGTTGCTTCAATTTTATT
GATGATATACCACCACAACTCTTCGACCCAAGAAATCCTTATCAACAATTCCAAATTACCAATAAATATACTTACTTTGGGAGTCCCCATAGTTTTGAGTCCAAGTCAAT
GGCTCCCGACGGGGTTCCTCCATATTTCCTAAGACACCAAGGCTGGACAGCTGATACCAAGACTCTTGAAAACTTCGACGCCACGCCAGCACTCGGCGTCGATGCTGCGT
TACGAGCCCGCCTCCCTGAGCCGACCCAGTGGTGGTGGGAAAATGATGGAGAGGTTGGAGCCCAAATGAGGGACTCAACGTACTACGAAATGACCCTCCAACAGAGCTGG
GAAAAGATATTTGAATGCTATAATAATAATGATAATGGTGATGTGGCAGTGGATGTTTGTGTTAGAAGGGAGGTGGTTTTAGTTGGAAGGGCGTTGTTGGCGGTAGAGAC
GGTGGTGGTTTCCGATGGGATTATGTGGTTCGATTCATCGTCGGTGGGGCTAAGCATGGCGATTGTGGAGAGGGTGAAATGGGAGGAGGAGAGATGTGGGTTTGTATGGG
GGAGTGATCATGAGAAGGAGATAGAGAGAGTTGTGAGAAGAGAAAAGTTTGAAGGAAAGGGAGTGTGGAAGGGATTTAGGTGTTATGTTATGGTTGAAAGGTTTGTGCTT
AAGAGAATGGATGGAACTTTGGTTCTTACTTGGGAATTTAGGCACATTCATCAAATTAGAACCAAATGGGAATGA
Protein sequenceShow/hide protein sequence
MYVTRPLSLNRNSPSALSIAPPEGPNLGILVIKDEAAESKWLFGMPKDETILVPPFPQNKKLWLSHTMVVGTSSFVDYIYALFIPVLNQPLSSNQYYIIKSNGSDKGEEN
RSRLLGVADTPPQKFDPTNIYQQFEISNDMYCGKPNGFNFKSVASNGTTPYSMTHKNWRAYCKTLKTFQPTTEALGLDALLRARLPPLIFSLPCKSSSPVVVGKWYCPFI
FVREGRGVQLDVLVEKEVVSVAGRAMAGINVGDGAAWFGSSRVGLSLAIVERIKWEEERGGFEWVRDGGEKEVKVKRREEFEGVGMWKRFGCYVLVERDSSSSSSSSASG
EGPNTGVLVIEDEASESRWFFGLIKDTRVKVPPFPQNKLLELRYLKHNAADRRRHYYPTDYFHAMLIPVLNQPSNSNQYYVINSRGKTKGLACTSSKEDDMTSCCCFNFI
DDIPPQLFDPRNPYQQFQITNKYTYFGSPHSFESKSMAPDGVPPYFLRHQGWTADTKTLENFDATPALGVDAALRARLPEPTQWWWENDGEVGAQMRDSTYYEMTLQQSW
EKIFECYNNNDNGDVAVDVCVRREVVLVGRALLAVETVVVSDGIMWFDSSSVGLSMAIVERVKWEEERCGFVWGSDHEKEIERVVRREKFEGKGVWKGFRCYVMVERFVL
KRMDGTLVLTWEFRHIHQIRTKWE