; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003524 (gene) of Snake gourd v1 genome

Gene IDTan0003524
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationLG01:102122696..102125370
RNA-Seq ExpressionTan0003524
SyntenyTan0003524
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136450.1 uncharacterized protein LOC101212293 [Cucumis sativus]1.5e-22092.12Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVC RVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQASNNS 
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS

Query:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQ GSAFPNQNQ+ PIWPNGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQST SED  SKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ
        NDIDGAVQRPSEKPRIHPTEAAILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK LPV SSGSS DFLSCYD+IGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVSMECP+ILNNTLDVYLKQLIKSC ELVRARST +H+G+PIQKQQNQGKV+NGMWP+NHLRVQNSNGRSEVLQEKSLECS SLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

XP_008466308.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo]1.2e-21991.89Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVC RVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NS 
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS

Query:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQ GSAFPNQNQ+ PIWPNGVLPVSPRKGRS+LRGKFRDRPSPLGPNGKITCLSYQST SED  SKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ
        NDIDGAVQRPSEKPRIHPTEAAILE+GEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARK LPV SSGSS DFLSCYD+IGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVSMECPNILNNTLDVYLKQLIKSC ELVRARST +H+G+PIQKQQNQGKV+NGMWP+NHLRVQN+NGRSEVLQEKSLECS SLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

XP_022976270.1 uncharacterized protein LOC111476715 [Cucurbita maxima]1.4e-21589.93Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS
        MQ Q SSRIDLGDLKAQIVKKLGNDKSKRYFFYLS+FLGQKLSKVEFDK+C RVLGRENIQLHNQLIRSILKNACVAKTPP INVSGHAQSVLQASNN+ 
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS

Query:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASEDSKVITENGNVTMCDYQRPVQHLQAVAELPEND
        CRED PEQ GSAFPNQNQ++PIW NGVLPVSPRKGRS+LRGKFRDRPSPLGPNGK  CLSYQST +ED KVITENGNVTMCDYQRPVQ LQAVAELPEND
Subjt:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASEDSKVITENGNVTMCDYQRPVQHLQAVAELPEND

Query:  IDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQGL
        IDG+VQRPS KPRI PTEA+ILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK LPV SSGS  DFLSCYD+IGLSDSETVRKRMEQIATAQGL
Subjt:  IDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQGL

Query:  EGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQLG
        EGVS+ECPNILNNTLDVYLKQLIKSC ELVR RST +H G+PIQKQQNQGKVINGMWP+NHLRVQNSNGRSEVL+EKS ECS SLLDFKVAMELNPKQLG
Subjt:  EGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQLG

Query:  EDWPLLLEKICMRAFEE
        EDWPLLLEKI MRAFEE
Subjt:  EDWPLLLEKICMRAFEE

XP_023524221.1 uncharacterized protein LOC111788192 [Cucurbita pepo subsp. pepo]3.4e-21790.21Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS
        MQPQHSSRID+GDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSK EFDK+C RVLGRENIQLHN+LIRSILKNACVAKTPPPIN SGHAQS+LQASNNS 
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS

Query:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPE  GS FPNQNQ MPIWPNGVLPVSPRKGRS+LRGKFRDRPSPLGPNGKITCLSYQS+ +ED  SKVITENGNV MCDYQRPVQHL+AVAELPE
Subjt:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ
        NDIDGAV RPSEKPRIHPTEAA+LED +EVEQSDPLS LRGPLLPPLGIPFCSASVGGARK LPVGSSGSS DFLSCYD+IGLSDSETVRKRMEQIATAQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSC ELVR+RST++H G+PIQKQQNQGKVINGM PSNH  VQNSNGRSEVLQEKSLECSASLLDFKVAME+NPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPL+LEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

XP_038899147.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida]2.5e-22092.12Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS
        MQPQHSSRIDLGDLKAQIVKKLGND+SKRYFFYLSRFLGQKLSKVEFDKVC RVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQ SN S 
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS

Query:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+DGPEQ GSAFPNQNQ++PIW NGVLPVSPRKGRS+LRGKFRDRPSPLGPNGKITCLSYQST +ED  SKVITENGNVTMCDYQRPVQHLQAVAELPE
Subjt:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ
        NDIDGAV RPSEKPRIHPTEAAILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK LPV SSGSS DFLSCYD+IGLSDS TVRKRMEQIATAQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVSMECPNILNNTLDVYLKQLIKSC ELVRARST +H G+PIQKQQNQGKV+N MWP+NHLRVQNSNGRSEVLQEKSLECS SLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKICMRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

TrEMBL top hitse value%identityAlignment
A0A0A0LGS9 Uncharacterized protein7.1e-22192.12Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVC RVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQASNNS 
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS

Query:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQ GSAFPNQNQ+ PIWPNGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQST SED  SKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ
        NDIDGAVQRPSEKPRIHPTEAAILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK LPV SSGSS DFLSCYD+IGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVSMECP+ILNNTLDVYLKQLIKSC ELVRARST +H+G+PIQKQQNQGKV+NGMWP+NHLRVQNSNGRSEVLQEKSLECS SLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

A0A1S4E5S7 uncharacterized protein LOC1035037576.0e-22091.89Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVC RVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NS 
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS

Query:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQ GSAFPNQNQ+ PIWPNGVLPVSPRKGRS+LRGKFRDRPSPLGPNGKITCLSYQST SED  SKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ
        NDIDGAVQRPSEKPRIHPTEAAILE+GEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARK LPV SSGSS DFLSCYD+IGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVSMECPNILNNTLDVYLKQLIKSC ELVRARST +H+G+PIQKQQNQGKV+NGMWP+NHLRVQN+NGRSEVLQEKSLECS SLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein6.0e-22091.89Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVC RVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NS 
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS

Query:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQ GSAFPNQNQ+ PIWPNGVLPVSPRKGRS+LRGKFRDRPSPLGPNGKITCLSYQST SED  SKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ
        NDIDGAVQRPSEKPRIHPTEAAILE+GEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARK LPV SSGSS DFLSCYD+IGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVSMECPNILNNTLDVYLKQLIKSC ELVRARST +H+G+PIQKQQNQGKV+NGMWP+NHLRVQN+NGRSEVLQEKSLECS SLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

A0A6J1CPD1 uncharacterized protein LOC1110128831.2e-21589.79Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYL+RFLGQKL KVEFDK+C RVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNS 
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS

Query:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRG-KFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELP
        CREDGPEQ GSAFPNQNQT+PIW NGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY ST +ED  SKVITENGNVT+CDYQRPVQHLQAVAELP
Subjt:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRG-KFRDRPSPLGPNGKITCLSYQSTASED--SKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATA
        ENDI+GAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGAR+ LP+   G+SGDF SCYD+IGLSD+ETVRKRMEQIATA
Subjt:  ENDIDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATA

Query:  QGLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQN-SNGRSEVLQEKSLECSASLLDFKVAMELNP
        QGLEGVSMEC NILN+TLD+YLKQLIKSC ELVR+RSTL+H G+PIQKQQNQGKVINGMWPSNHLRVQN SNGR EVLQEKSL+CS SLLDFKVAMELNP
Subjt:  QGLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQN-SNGRSEVLQEKSLECSASLLDFKVAMELNP

Query:  KQLGEDWPLLLEKICMRAFEE
        KQLGEDWPLLLEKICMR FEE
Subjt:  KQLGEDWPLLLEKICMRAFEE

A0A6J1IIZ9 uncharacterized protein LOC1114767156.8e-21689.93Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS
        MQ Q SSRIDLGDLKAQIVKKLGNDKSKRYFFYLS+FLGQKLSKVEFDK+C RVLGRENIQLHNQLIRSILKNACVAKTPP INVSGHAQSVLQASNN+ 
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS

Query:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASEDSKVITENGNVTMCDYQRPVQHLQAVAELPEND
        CRED PEQ GSAFPNQNQ++PIW NGVLPVSPRKGRS+LRGKFRDRPSPLGPNGK  CLSYQST +ED KVITENGNVTMCDYQRPVQ LQAVAELPEND
Subjt:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASEDSKVITENGNVTMCDYQRPVQHLQAVAELPEND

Query:  IDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQGL
        IDG+VQRPS KPRI PTEA+ILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK LPV SSGS  DFLSCYD+IGLSDSETVRKRMEQIATAQGL
Subjt:  IDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQGL

Query:  EGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQLG
        EGVS+ECPNILNNTLDVYLKQLIKSC ELVR RST +H G+PIQKQQNQGKVINGMWP+NHLRVQNSNGRSEVL+EKS ECS SLLDFKVAMELNPKQLG
Subjt:  EGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQLG

Query:  EDWPLLLEKICMRAFEE
        EDWPLLLEKI MRAFEE
Subjt:  EDWPLLLEKICMRAFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein1.5e-4232.2Show/hide
Query:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSSCREDGP
        SR++  ++KA I +K+G+ ++  YF  L +FL  ++SK EFDK+C++ +GRENI LHN+L+RSILKNA VAK+PPP                        
Subjt:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSSCREDGP

Query:  EQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASEDSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGAVQ
              +P ++    ++ + V P SPRK RS    KFRDRPSPLGP GK   L   +T +++S                                     
Subjt:  EQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASEDSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGAVQ

Query:  RPSEKPRIHPTEAAILEDGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQGLEGVS
            K +  P E   +EDGEEVEQ    P    R PL  PLG+ F   S   AR +   G +       +C  +  L D  T+R R+E+    +G++ +S
Subjt:  RPSEKPRIHPTEAAILEDGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQGLEGVS

Query:  MECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQLGEDWP
        M+  N+LN  L+ Y+++LI+ C  L                                             Q+K    + S+LDF  AME+NP+ LGE+WP
Subjt:  MECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQLGEDWP

Query:  LLLEKICMRAFEE
        + LEKIC RA EE
Subjt:  LLLEKICMRAFEE

AT2G24530.1 unknown protein1.1e-11252.36Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS
        MQ     RI L +LK  IVKK G ++S+RYF+YL RFL QKL+K EFDK C R+LGREN+ LHNQLIRSIL+NA VAK+PPP + +GH+      +N   
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS

Query:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRG-KFRDRPSPLGPNGKITCLSYQSTASEDSK--VITENGNVTMCDYQRPVQHLQAVAELP
         R DG EQ+G+  PN +Q  P+W NGVLP+SPRK RS ++  K RDRPSPLG NGK+  + +Q    ED++  V  ENG     DYQR  +++       
Subjt:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRG-KFRDRPSPLGPNGKITCLSYQSTASEDSK--VITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGAVQRPSEKPRIHPTE---AAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQI
         ++ DG   RP EKPRI   E   A  + D +  E+   ++    PL+ PLGIPFCSASVGG+ +T+PV    ++ + +SCYD+ GL D E +RKRME I
Subjt:  ENDIDGAVQRPSEKPRIHPTE---AAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQI

Query:  ATAQGLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAG-YPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAME
        A AQGLEGVSMEC   LNN LDVYLK+LI SCF+LV ARST    G   I KQQ+Q K++NG+WP+N L++Q  NG S++ Q+     S S+LDF+ AME
Subjt:  ATAQGLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAG-YPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAME

Query:  LNPKQLGEDWPLLLEKICMRAFEE
        LNP+QLGEDWP L E+I +R+FEE
Subjt:  LNPKQLGEDWPLLLEKICMRAFEE

AT4G31440.1 unknown protein4.0e-8344.02Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS
        MQ     RIDL +LK  IVKK+G ++S RYF+YL RFL QKL+K EFDK C R+LGREN+ LHN+LIRSIL+NA +AK+PP ++ SGH    L       
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSS

Query:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASEDSKVITENGNVTMCDYQRPVQHLQAVAELPEND
         +EDGPE++ S  P+  +      NGVL    R G    R   RD+P PLG NGK+                     +    Y RP ++         ++
Subjt:  CREDGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASEDSKVITENGNVTMCDYQRPVQHLQAVAELPEND

Query:  IDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQGL
         D A   P+E+  +   +        + +++        P++ PLGIPFCSASVGG R+T+PV +S ++   +SCYD+ GLSD+E +RKRME IA  QGL
Subjt:  IDGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQGL

Query:  EGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAG-YPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQL
         GVS EC  +LNN LD+YLK+L+KSC +L  ARS     G + ++KQQ++ +++NG+  +N   +Q SN  S++ +E+    S SLLDF+VAMELNP QL
Subjt:  EGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAG-YPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQL

Query:  GEDWPLLLEKICMRAFEE
        GEDWPLL E+I +  FEE
Subjt:  GEDWPLLLEKICMRAFEE

AT4G33890.1 unknown protein9.3e-4833.25Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSSCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C + +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N  S   
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSSCRE

Query:  DGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASEDSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDG
                     +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   +T +E+S    ++    +    RP                 
Subjt:  DGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASEDSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDG

Query:  AVQRPSEKPRIHPTEAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKTLP-VGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQG
                    P E   +E+GEEVEQ     P    R PL  PLG+   S   G  RK++  V     S +  +C +N  L D+ T+R R+E+    +G
Subjt:  AVQRPSEKPRIHPTEAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKTLP-VGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQL
        L+ ++M+  ++LN+ LDV++++LI+ C  L   R   D                         RV+  N   +  Q+       S+ DF+  MELN + L
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQL

Query:  GEDWPLLLEKICMRAFEE
        GEDWP+ +EKIC RA ++
Subjt:  GEDWPLLLEKICMRAFEE

AT4G33890.2 unknown protein9.3e-4833.25Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSSCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C + +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N  S   
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSSCRE

Query:  DGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASEDSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDG
                     +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   +T +E+S    ++    +    RP                 
Subjt:  DGPEQAGSAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASEDSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDG

Query:  AVQRPSEKPRIHPTEAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKTLP-VGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQG
                    P E   +E+GEEVEQ     P    R PL  PLG+   S   G  RK++  V     S +  +C +N  L D+ T+R R+E+    +G
Subjt:  AVQRPSEKPRIHPTEAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKTLP-VGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQL
        L+ ++M+  ++LN+ LDV++++LI+ C  L   R   D                         RV+  N   +  Q+       S+ DF+  MELN + L
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQL

Query:  GEDWPLLLEKICMRAFEE
        GEDWP+ +EKIC RA ++
Subjt:  GEDWPLLLEKICMRAFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGCGACCTGAAAGCTCAGATAGTTAAGAAACTTGGAAATGACAAGTCAAAGCGGTACTTCTTCTACTTGAGCAGATT
TTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGACAAGGTGTGCGCCCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATG
CATGTGTAGCCAAGACCCCACCACCAATAAATGTGTCAGGACATGCACAATCTGTGCTGCAAGCTTCAAACAATTCTTCTTGCAGGGAAGATGGCCCTGAACAAGCTGGA
TCTGCTTTTCCAAATCAAAATCAGACTATGCCAATTTGGCCAAATGGAGTTCTTCCAGTATCCCCACGGAAGGGGAGATCTATCTTACGTGGAAAGTTTAGGGATAGGCC
AAGTCCGCTTGGTCCAAATGGAAAAATCACTTGTCTTTCGTATCAATCAACTGCTAGTGAAGATAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTGACTATC
AGAGACCGGTGCAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGACATAGACGGAGCAGTTCAGCGACCATCAGAAAAACCAAGGATACATCCAACAGAGGCAGCT
ATTCTTGAAGATGGAGAAGAGGTTGAACAGTCGGATCCCTTAAGCTTCCTGAGAGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGGTGGGGC
ACGCAAGACCTTGCCAGTTGGTAGTAGTGGTAGTAGTGGTGATTTTCTGAGTTGTTATGACAATATCGGATTGTCTGATTCAGAGACAGTGAGAAAACGAATGGAGCAAA
TTGCAACTGCACAAGGGCTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACTCTGGATGTGTACCTGAAACAATTGATCAAGTCTTGCTTTGAGTTGGTG
AGAGCAAGGTCTACATTAGATCATGCAGGGTACCCTATCCAGAAGCAACAGAATCAAGGGAAGGTTATAAATGGTATGTGGCCTAGTAACCACCTACGTGTACAGAATAG
CAACGGGCGGTCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCAGCGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGATTGGC
CTTTGCTGTTGGAGAAAATTTGTATGCGTGCCTTTGAGGAATAA
mRNA sequenceShow/hide mRNA sequence
AAAAGACCGAGAGAGAGAGAGAGGCAAGAGGGCGAAGCTCCGAGGATTGGGACCCAGTCACTGAAGTTTGGTTTTTCGCTGCAATTCTTCTTCTTCTTCTTCTTCTTCTT
CTTCGAAGCAAGTTTTTTTCGTTGAGGTCAGAGCTTCTTCATAACATACAATACAATTTCCTTCTCTTTCTATTTCTTCCATGTTTCTCAGCTTCTTTTCTTCTTCTTCT
TTTTCTTCTTCTTCTTCTTCTTCATATTCCCTTCTTCAATCTCCTTTTCCAACTTCCCCTTTTCCTAATCCACTTTACTGATCTCCTTCATTCTACTTCCAATTCCCTAC
TTCCAACTTCCCAATTGCGCCTAACGAACGAGGGTTTCTCTTATTGCTTCCTGTTCTTCTGCAATTCGCGCCTCTTCCGTTCTCCTTCCAGGCTGCTATAAACCTACCCT
TTTTGCTTGGTGCACGTATTGTTTTTTTTCCCCCCAAATGGTGTGGCCTCTGAGACTTGATTTGTCCTGAATGACGTGGAAGTATAAATTCTGGGGGATTTTGTATTGGT
ACAGTACCACAATTCAGCTTCAATTCGAGTGACTGGGGTTTGGAGCTAGAGTTTACAGTGTATGAACCGCCCTCTGGAGAAATGCAACCTCAGCACAGCTCCAGAATTGA
TTTAGGCGACCTGAAAGCTCAGATAGTTAAGAAACTTGGAAATGACAAGTCAAAGCGGTACTTCTTCTACTTGAGCAGATTTTTGGGTCAGAAGCTGAGCAAGGTTGAAT
TTGACAAGGTGTGCGCCCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATA
AATGTGTCAGGACATGCACAATCTGTGCTGCAAGCTTCAAACAATTCTTCTTGCAGGGAAGATGGCCCTGAACAAGCTGGATCTGCTTTTCCAAATCAAAATCAGACTAT
GCCAATTTGGCCAAATGGAGTTCTTCCAGTATCCCCACGGAAGGGGAGATCTATCTTACGTGGAAAGTTTAGGGATAGGCCAAGTCCGCTTGGTCCAAATGGAAAAATCA
CTTGTCTTTCGTATCAATCAACTGCTAGTGAAGATAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTGACTATCAGAGACCGGTGCAGCATCTCCAAGCAGTA
GCTGAGCTACCTGAGAATGACATAGACGGAGCAGTTCAGCGACCATCAGAAAAACCAAGGATACATCCAACAGAGGCAGCTATTCTTGAAGATGGAGAAGAGGTTGAACA
GTCGGATCCCTTAAGCTTCCTGAGAGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGGTGGGGCACGCAAGACCTTGCCAGTTGGTAGTAGTG
GTAGTAGTGGTGATTTTCTGAGTTGTTATGACAATATCGGATTGTCTGATTCAGAGACAGTGAGAAAACGAATGGAGCAAATTGCAACTGCACAAGGGCTTGAAGGTGTT
TCTATGGAATGTCCTAACATCCTGAATAATACTCTGGATGTGTACCTGAAACAATTGATCAAGTCTTGCTTTGAGTTGGTGAGAGCAAGGTCTACATTAGATCATGCAGG
GTACCCTATCCAGAAGCAACAGAATCAAGGGAAGGTTATAAATGGTATGTGGCCTAGTAACCACCTACGTGTACAGAATAGCAACGGGCGGTCTGAAGTTTTGCAGGAAA
AGAGTTTAGAATGCTCAGCGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGCTGTTGGAGAAAATTTGTATGCGT
GCCTTTGAGGAATAAGCAGTTGCAAAGGTTTTATTTATCCTACAGCCATCCTTTTGGCTGGTTTGTTCAGGTCCAAAGGGCTTTAAAACTTACCCGGGATGTCTTCCCGG
TCTGCCCTTCTTGATGGGGTATACTTGATCGCACTCCCATTTGGTTCAGGCTCCAAGTTTCCATGCTAAAAATATTGTAACATCATTGCCTGCTCCATTGAGGAAGTTGA
ATTAACTCAACTAAGAAAGAGGAAGCTTTCAGCCAGAATGGCCATAAAAAGGTTGTTCTGTTGCATTTTTAGCATTCATGTTGGCCCCATTTGCCACTACCTTGTAACTT
AGCCTTTTCATGTAACTTATTTTTGAGAGAATCAAAAAAGTTTTGCAGGGATACTGTCATTTAGTTTGGTGAAAGATGAAACACCTGGGTGAGAGCTTCAGGTGCTCTGT
ATGTATCTTTAGAATCTGAGAATTTATTGGAATTTCAATATCAATAACAAATTAACATCCATTTCAA
Protein sequenceShow/hide protein sequence
MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCARVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSSCREDGPEQAG
SAFPNQNQTMPIWPNGVLPVSPRKGRSILRGKFRDRPSPLGPNGKITCLSYQSTASEDSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAA
ILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVGSSGSSGDFLSCYDNIGLSDSETVRKRMEQIATAQGLEGVSMECPNILNNTLDVYLKQLIKSCFELV
RARSTLDHAGYPIQKQQNQGKVINGMWPSNHLRVQNSNGRSEVLQEKSLECSASLLDFKVAMELNPKQLGEDWPLLLEKICMRAFEE