; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0027314 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0027314
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationchr04:30672948..30675525
RNA-Seq ExpressionPI0027314
SyntenyPI0027314
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136450.1 uncharacterized protein LOC101212293 [Cucumis sativus]3.0e-20997.88Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQ GSAFPNQNQS PIWPNGVLPVSPRKGRS LRG
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG
        KFRDRPSPLGPNGK TCLSYQSTGSEDSSSKVITENGNVTLCDYQRPV++LQ+VAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG

Query:  PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG
        PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEHSG
Subjt:  PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG

Query:  HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
        HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Subjt:  HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

XP_008466308.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo]2.2e-20797.61Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVL AS NSPCREDGPEQ GSAFPNQNQS PIWPNGVLPVSPRKGRS LRG
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG
        KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQ+LQ+VAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPL FLRG
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG

Query:  PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG
        PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG
Subjt:  PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG

Query:  HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
        HPIQKQQNQGKVLNGMWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Subjt:  HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

XP_022976270.1 uncharacterized protein LOC111476715 [Cucurbita maxima]6.6e-19692.06Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        +SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQASNN+PCRED PEQ GSAFPNQNQSIPIW NGVLPVSPRKGRS LRG
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG
        KFRDRPSPLGPNGK  CLSYQSTG+ED   KVITENGNVT+CDYQRPVQ LQAVAELPENDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPLSFLRG
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG

Query:  PLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHS
        PLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQGLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEH+
Subjt:  PLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHS

Query:  GHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
        GHPIQKQQNQGKV+NGMWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Subjt:  GHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

XP_023536522.1 uncharacterized protein LOC111797673 [Cucurbita pepo subsp. pepo]1.5e-19591.8Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        +SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQASNN+PCRED PEQ GSAFPNQNQSIPIW NGVLPVSPRKGRS LRG
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG
        KFRDRPSPLGPNGK  CLSYQSTG+ED   KVITENGNVT+CDYQRPVQ LQAVAELPENDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPLSFLRG
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG

Query:  PLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHS
        PLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQGL+GVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEH+
Subjt:  PLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHS

Query:  GHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
        GHPIQKQQNQGKV+NGMWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Subjt:  GHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

XP_038899147.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida]1.5e-20394.96Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        +SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPP INASGHAQSVLQ SN SPCR+DGPEQ GSAFPNQNQSIPIW NGVLPVSPRKGRS LRG
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG
        KFRDRPSPLGPNGKITCLSYQSTG+EDS+SKVITENGNVT+CDYQRPVQHLQAVAELPENDIDGAV RPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG

Query:  PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG
        PLLPPLGIPFCSASVGGARKALPV+SSGSSDFLSCYDSIGLSDS TVRKRMEQIA+AQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+G
Subjt:  PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG

Query:  HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
        HPIQKQQNQGKV+N MWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKI MRAFEE
Subjt:  HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

TrEMBL top hitse value%identityAlignment
A0A0A0LGS9 Uncharacterized protein1.5e-20997.88Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQ GSAFPNQNQS PIWPNGVLPVSPRKGRS LRG
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG
        KFRDRPSPLGPNGK TCLSYQSTGSEDSSSKVITENGNVTLCDYQRPV++LQ+VAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG

Query:  PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG
        PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEHSG
Subjt:  PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG

Query:  HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
        HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Subjt:  HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

A0A1S4E5S7 uncharacterized protein LOC1035037571.1e-20797.61Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVL AS NSPCREDGPEQ GSAFPNQNQS PIWPNGVLPVSPRKGRS LRG
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG
        KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQ+LQ+VAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPL FLRG
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG

Query:  PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG
        PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG
Subjt:  PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG

Query:  HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
        HPIQKQQNQGKVLNGMWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Subjt:  HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein1.1e-20797.61Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVL AS NSPCREDGPEQ GSAFPNQNQS PIWPNGVLPVSPRKGRS LRG
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG
        KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQ+LQ+VAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPL FLRG
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG

Query:  PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG
        PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG
Subjt:  PLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG

Query:  HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
        HPIQKQQNQGKVLNGMWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Subjt:  HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

A0A6J1FAS2 uncharacterized protein LOC1114436213.5e-19591.53Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        +SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQASNN+PCRED PEQ GSAFPNQNQ IPIW NGVLPVSPRKGRS LRG
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG
        KFRDRPSPLGPNGK  CLSYQSTG+ED   KVITENGNVT+CDYQRPVQ LQAVAELPENDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPLSFLRG
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG

Query:  PLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHS
        PLLPPLGIPFCSASVGGARKALPVSSSGS  DFL CYDSIGLSDSETVRKRMEQIA+AQGLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEH+
Subjt:  PLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHS

Query:  GHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
        GHPIQKQQNQGKV+NGMWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Subjt:  GHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

A0A6J1IIZ9 uncharacterized protein LOC1114767153.2e-19692.06Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        +SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQASNN+PCRED PEQ GSAFPNQNQSIPIW NGVLPVSPRKGRS LRG
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG
        KFRDRPSPLGPNGK  CLSYQSTG+ED   KVITENGNVT+CDYQRPVQ LQAVAELPENDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPLSFLRG
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRG

Query:  PLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHS
        PLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQGLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEH+
Subjt:  PLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHS

Query:  GHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
        GHPIQKQQNQGKV+NGMWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Subjt:  GHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein6.2e-3532.19Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        +SK EFDK+C + +GRENI LHN+L+RSILKNA VAK+PPP                              +P ++    ++ + V P SPRK RS    
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ--SDPLSFL
        KFRDRPSPLGP GK   L   +T +++S SK                 Q L                         P E   +E+GEEVEQ    P    
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ--SDPLSFL

Query:  RGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH
        R PL  PLG+ F        +     S+    +  +C  S  L D  T+R R+E+    +G++ +SM+  N+LN  L+ Y+++LI+ CL L         
Subjt:  RGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH

Query:  SGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
                                            Q+K    +VS+LDF  AME+NP+ LGE+WP+ LEKI  RA EE
Subjt:  SGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

AT2G24530.1 unknown protein1.6e-9952.36Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        ++K EFDK C+R+LGREN+ LHNQLIRSIL+NA VAK+PPP + +GH+      +N    R DG EQ G+  PN +Q  P+W NGVLP+SPRK RS ++ 
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  -KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTE---AAILEEGEEVEQSDPLS
         K RDRPSPLG NGK+  + +Q    ED+   V  ENG     DYQR  +++        ++ DG   RP EKPRI   E   A  + + +  E+   ++
Subjt:  -KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTE---AAILEEGEEVEQSDPLS

Query:  FLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTF
            PL+ PLGIPFCSASVGG+ + +PVS+  +++ +SCYDS GL D E +RKRME IA AQGLEGVSMEC   LNN LDVYLK+LI SC +LV ARST 
Subjt:  FLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTF

Query:  EHSG-HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
           G   I KQQ+Q K++NG+WPTN L++Q  NG S++ Q+     SVS+LDF+ AMELNP+QLGEDWP L E+IS+R+FEE
Subjt:  EHSG-HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

AT4G31440.1 unknown protein1.0e-6943.86Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        ++K EFDK C R+LGREN+ LHN+LIRSIL+NA +AK+PP ++ SGH    L        +EDGPE+  S  P+  ++     NGVL    R G    R 
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTE--AAILEEGEEVE---QSDPL
          RD+P PLG NGK+                       +    Y RP ++         ++ D A   P+E+  +   +  AA +   +E +    S P 
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTE--AAILEEGEEVE---QSDPL

Query:  SFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARST
             P++ PLGIPFCSASVGG R+ +PVS+S ++  +SCYDS GLSD+E +RKRME IA  QGL GVS EC  +LNN LD+YLK+L+KSC++L  ARS 
Subjt:  SFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARST

Query:  FEHSG-HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
            G H ++KQQ++ +++NG+   N   +Q SN  S++ +E+    SVSLLDF+VAMELNP QLGEDWPLL E+IS+  FEE
Subjt:  FEHSG-HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

AT4G33890.1 unknown protein8.7e-3733.51Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        ++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N                   +Q  P+  +     S RK RS    
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ---SDPLSF
        K RDRPSPLGP GK   L   +T +E+S SK                    Q+  EL          RP       P E   +EEGEEVEQ     P   
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ---SDPLSF

Query:  LRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDF--LSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARST
         R PL  PLG+   S   G  RK++   S  S  F   +C ++  L D+ T+R R+E+    +GL+ ++M+  ++LN+ LDV++++LI+ CL L   R  
Subjt:  LRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDF--LSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARST

Query:  FEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
                               T+ +R  N     +  Q+      VS+ DF+  MELN + LGEDWP+ +EKI  RA ++
Subjt:  FEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

AT4G33890.2 unknown protein8.7e-3733.51Show/hide
Query:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG
        ++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N                   +Q  P+  +     S RK RS    
Subjt:  MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRG

Query:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ---SDPLSF
        K RDRPSPLGP GK   L   +T +E+S SK                    Q+  EL          RP       P E   +EEGEEVEQ     P   
Subjt:  KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ---SDPLSF

Query:  LRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDF--LSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARST
         R PL  PLG+   S   G  RK++   S  S  F   +C ++  L D+ T+R R+E+    +GL+ ++M+  ++LN+ LDV++++LI+ CL L   R  
Subjt:  LRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDF--LSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARST

Query:  FEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
                               T+ +R  N     +  Q+      VS+ DF+  MELN + LGEDWP+ +EKI  RA ++
Subjt:  FEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAAGGTTGAATTTGATAAGGTGTGCGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCTTGTGTAGCCAA
GACCCCGCCACCAATAAATGCTTCAGGACATGCACAATCTGTGTTACAAGCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACAAATTGGATCTGCCTTTCCAA
ATCAGAATCAGAGTATACCAATTTGGCCAAATGGAGTTCTTCCAGTATCCCCACGGAAGGGTAGATCTGCCTTACGTGGAAAGTTTAGGGATAGGCCAAGTCCGCTTGGT
CCAAATGGAAAAATCACATGTCTTTCATATCAATCAACTGGCTCTGAAGATAGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCTTGTGTGACTATCAGAGGCC
AGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAAAATGACATAGATGGAGCAGTTCAACGGCCATCAGAAAAACCAAGGATACATCCAACAGAAGCAGCTATTCTTG
AAGAAGGAGAGGAGGTGGAACAGTCGGATCCCTTAAGCTTCCTGAGAGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGGTGGGGCACGCAAG
GCCTTGCCAGTCAGCAGTAGTGGCAGTAGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGCAAATTGCATCTGC
ACAAGGACTTGAAGGTGTTTCTATGGAATGTCCTAACATCTTGAATAATACTCTGGATGTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAGCAAGGT
CTACATTTGAACATTCAGGGCACCCTATCCAGAAGCAACAAAATCAAGGGAAGGTTTTAAATGGCATGTGGCCCACTAACCACCTACGTGTACAGAACAGCAATGGGCGA
TCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGACTGGCCTTTGCTGTT
GGAGAAAATCTCTATGCGTGCCTTTGAGGAATAA
mRNA sequenceShow/hide mRNA sequence
TCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTACCTTTTCTTCTTCTTCTTCTTCCATTTACATAACTTCATTCTTCTTCTTCTTCA
ATTAATCCCCTTTTTTCCACTTTCCTCATCATCCACTTTACTGATCTTTCCATCATTTTTACTTCCAACTTCAACCCACATCCCAATTCTCTTTTACGAACAAGGGTTTC
TTCTATTTCTTACTCTCCTTCTGCAATTCGCGTCCTTTGCCTTTCTTCTTCCAGGCTGCTGTAAAACCCCCACTTTTTTGCTTGGTGCGCGTATTATGTATATGTGTGTG
TATATATATATAATATATTTATATATATATATATTCCTTCAAATGGTGCGGCGATTTAGACTTGATTTGTTCTGAATGACGTGGAAGTATAAATTCTGGGGGACTTTTGT
ATTGGTTCAATACCACAATTCAGCTTCAATTTTAGTGACTGGGGTTTGGAGCTACAGTTTATAGTGTATGAAACCAACCGGCCTCTGGAGAAATGCAACCTCAGCACAGC
TCCAGAATTGATTTAGGTGACTTGAAAGCTCAGATAGTTAAAAACTTGGAAATGATAAGTCCAAGCGGTACTTCTTCTTCTTGAGCAGATTCTTGGGTCAGAAGATGAGC
AAGGTTGAATTTGATAAGGTGTGCGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCTTGTGTAGCCAAGACCCC
GCCACCAATAAATGCTTCAGGACATGCACAATCTGTGTTACAAGCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACAAATTGGATCTGCCTTTCCAAATCAGA
ATCAGAGTATACCAATTTGGCCAAATGGAGTTCTTCCAGTATCCCCACGGAAGGGTAGATCTGCCTTACGTGGAAAGTTTAGGGATAGGCCAAGTCCGCTTGGTCCAAAT
GGAAAAATCACATGTCTTTCATATCAATCAACTGGCTCTGAAGATAGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCTTGTGTGACTATCAGAGGCCAGTACA
GCATCTCCAAGCAGTAGCTGAGCTACCTGAAAATGACATAGATGGAGCAGTTCAACGGCCATCAGAAAAACCAAGGATACATCCAACAGAAGCAGCTATTCTTGAAGAAG
GAGAGGAGGTGGAACAGTCGGATCCCTTAAGCTTCCTGAGAGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGGTGGGGCACGCAAGGCCTTG
CCAGTCAGCAGTAGTGGCAGTAGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGCAAATTGCATCTGCACAAGG
ACTTGAAGGTGTTTCTATGGAATGTCCTAACATCTTGAATAATACTCTGGATGTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAGCAAGGTCTACAT
TTGAACATTCAGGGCACCCTATCCAGAAGCAACAAAATCAAGGGAAGGTTTTAAATGGCATGTGGCCCACTAACCACCTACGTGTACAGAACAGCAATGGGCGATCTGAA
GTTTTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGACTGGCCTTTGCTGTTGGAGAA
AATCTCTATGCGTGCCTTTGAGGAATAAGCAGTTGCAAAGGTTTTATTTATTATTCTACAGCCATCCAAGGCTGGTCTGTTCTGATCCTTGGGGGTTTAAAATTTACCTG
GGATATCATCCCGTTCTTACCTTCTTGCTGGGGTATACTTGATCGCTCCCATTTGGTTCATGCTCCAAGTTTCCATGCTAAAATATTGTAACTTTTTGGCCTGCTCCATT
GAGGAAGTTGAATTAACTCGACTAAGAAAGGGGGAGCTTTCAGCCAGAATGCCCATAAAAGGTTGTTCTGTTGCATGTTTAGCATTCTTCTTCACCCCATTTGCCACTAC
CATGTAACTTAGCTTTTCGTGTAACTTATTTTTGAGAGAATCAAAAAGTTTTGCAGGGATACTGTCATTTAGTCTGGTGAAAGATGAATCACCTGGGTGAGAGCTTCAGG
TGCTCTGTATGTATCTTTAGAATCTGAGAATTTATTGGAATTTCAATATCAATAACAAATTAACATCCATTTCAATGCA
Protein sequenceShow/hide protein sequence
MSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQIGSAFPNQNQSIPIWPNGVLPVSPRKGRSALRGKFRDRPSPLG
PNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK
ALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGR
SEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE