; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G14060 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G14060
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionKAT8 regulatory NSL complex subunit 2
Genome locationChr7:12562472..12565313
RNA-Seq ExpressionCSPI07G14060
SyntenyCSPI07G14060
Gene Ontology termsGO:0043981 - histone H4-K5 acetylation (biological process)
GO:0043982 - histone H4-K8 acetylation (biological process)
GO:0043984 - histone H4-K16 acetylation (biological process)
GO:0044545 - NSL complex (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR025927 - Potential DNA-binding domain
IPR026316 - KAT8 regulatory NSL complex subunit 2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042163.1 INO80 complex subunit D-like isoform X1 [Cucumis melo var. makuwa]2.4e-11997.26Show/hide
Query:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
        MA+SNSPGSFQPPPVTP PILIDGADRDRALA+SM+CSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
        IGENGKLGL S+TGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPDFHVLIAEY
        NVSSTSKLRPDFHVLIAEY
Subjt:  NVSSTSKLRPDFHVLIAEY

XP_004139660.1 INO80 complex subunit D [Cucumis sativus]5.1e-133100Show/hide
Query:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
        MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
        IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN
        NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN
Subjt:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN

XP_008447279.1 PREDICTED: INO80 complex subunit D-like isoform X1 [Cucumis melo]8.1e-13197.51Show/hide
Query:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
        MA+SNSPGSFQPPPVTP PILIDGADRDRALA+SM+CSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
        IGENGKLGL S+TGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN
        NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN
Subjt:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN

XP_038896099.1 uncharacterized protein LOC120084404 isoform X1 [Benincasa hispida]7.9e-12692.95Show/hide
Query:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
        MAES+SPGSFQPPPVTPLPILIDGADRDRALA S +C+RREVLERRSRR KQLCRI K++YWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
        IGENGKLGL S TGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYC GHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN
        NVSSTSKLRPDFHVL+AE+VRQIQSKRRAT++ATA+KIESN
Subjt:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN

XP_038896100.1 INO80 complex subunit D-like isoform X2 [Benincasa hispida]7.9e-12692.95Show/hide
Query:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
        MAES+SPGSFQPPPVTPLPILIDGADRDRALA S +C+RREVLERRSRR KQLCRI K++YWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
        IGENGKLGL S TGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYC GHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN
        NVSSTSKLRPDFHVL+AE+VRQIQSKRRAT++ATA+KIESN
Subjt:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN

TrEMBL top hitse value%identityAlignment
A0A0A0K4I2 KAT8 regulatory NSL complex subunit 22.5e-133100Show/hide
Query:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
        MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
        IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN
        NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN
Subjt:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN

A0A1S3BH19 KAT8 regulatory NSL complex subunit 23.9e-13197.51Show/hide
Query:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
        MA+SNSPGSFQPPPVTP PILIDGADRDRALA+SM+CSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
        IGENGKLGL S+TGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN
        NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN
Subjt:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN

A0A5D3DT82 KAT8 regulatory NSL complex subunit 21.2e-11997.26Show/hide
Query:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
        MA+SNSPGSFQPPPVTP PILIDGADRDRALA+SM+CSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
        IGENGKLGL S+TGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPDFHVLIAEY
        NVSSTSKLRPDFHVLIAEY
Subjt:  NVSSTSKLRPDFHVLIAEY

A0A6J1GNX0 KAT8 regulatory NSL complex subunit 21.2e-11687.14Show/hide
Query:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
        MAESNSPGSFQ PP  P P++IDGA+ D ALA+    +RREVLERRSRR KQLCR+++ELYW L+EELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
        IGENGKLGL S TGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYC GHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN
        NVSSTSKLRPDFHVL+AE VRQIQ KRRA ++ATA+KIESN
Subjt:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN

A0A6J1JMD6 KAT8 regulatory NSL complex subunit 21.2e-11687.14Show/hide
Query:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
        MAESNSPGSFQ PP  P P++IDGA+ D ALA+    +RREVLERRSRR KQLCR+++ELYW L+EELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL
        IGENGKLGL S TGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYC GHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN
        NVSSTSKLRPDFHVL+AE VRQIQ KRRA ++ATA+KIESN
Subjt:  NVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIKIESN

SwissProt top hitse value%identityAlignment
Q54J07 INO80 complex subunit D3.4e-0724.47Show/hide
Query:  LIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLASATGS-----
        L +  D D   A+S + +  E+++RR     +L  ++K+ Y    E L+   R Y  T      + D  + E   +    I  N      +   +     
Subjt:  LIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLASATGS-----

Query:  -------------------DEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHL
                           +E   C    CK K M L+KYC++HIL DK Q+L+  CT+ + + +     C  P+L+  +P  C  HL
Subjt:  -------------------DEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHL

Arabidopsis top hitse value%identityAlignment
AT1G05860.1 unknown protein2.6e-4243.91Show/hide
Query:  IDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKED------EKEAEG-IGDYPEGIGENG-------KL
        I  A  D+ L  S   +R E+L RRS   KQL R +++ YW L+E+LK ++R Y W YG SPFK++       ++ EG  GD  EG G+N        K 
Subjt:  IDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKED------EKEAEG-IGDYPEGIGENG-------KL

Query:  GLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGLNVSSTSK
        G   A GS         GCK+KAMALT YC  HIL DKKQ+LY  CT+V K  QS  + C KP L STVP  C+ H QK +K +AR L+ AG NVSS S+
Subjt:  GLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGLNVSSTSK

Query:  LRPDFHVLIAEYVRQIQSKRRATKRATAIK
          P  H ++A +V  IQ+KR+  ++   +K
Subjt:  LRPDFHVLIAEYVRQIQSKRRATKRATAIK

AT2G31600.1 unknown protein3.3e-4542.68Show/hide
Query:  SPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEA--------EGI---
        +P +   P  +  PI +  +  D  LA S   +R E+L+RRS   KQL + +++ YW L+E++K ++R+Y+W YG S FK++  ++        EG    
Subjt:  SPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEA--------EGI---

Query:  -GDYPEGIGEN--GKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCL
         GD  EG G+N     G+ S   ++      + GCKAKAMALTKYC  HIL D KQ+LY GCT VIK   +GPLLC KP L STVP  C+ H QK +K +
Subjt:  -GDYPEGIGEN--GKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCL

Query:  ARDLRKAGLNVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIK
        A+ L+ AG NVSSTSK  P  HV++A +V  IQ+KR+  ++   +K
Subjt:  ARDLRKAGLNVSSTSKLRPDFHVLIAEYVRQIQSKRRATKRATAIK

AT2G31600.2 unknown protein7.8e-2340.12Show/hide
Query:  SPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEA--------EGI---
        +P +   P  +  PI +  +  D  LA S   +R E+L+RRS   KQL + +++ YW L+E++K ++R+Y+W YG S FK++  ++        EG    
Subjt:  SPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEA--------EGI---

Query:  -GDYPEGIGEN--GKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIK
         GD  EG G+N     G+ S   ++      + GCKAKAMALTKYC  HIL D KQ+LY GCT VIK
Subjt:  -GDYPEGIGEN--GKLGLASATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIK

AT3G53860.1 unknown protein3.1e-4345.97Show/hide
Query:  DRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDY-PEGIGENGKLGLASATGSDEIRRCDVTGC
        D  LA+S   +R E+L RR+   KQL + +K  YW L+E+LK ++R+Y+  YG S FK+++ ++       PEG G+ G  G   A  +     C + GC
Subjt:  DRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDY-PEGIGENGKLGLASATGSDEIRRCDVTGC

Query:  KAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLIAEYVRQIQSK
        KAKAMALTKYC  HIL D KQ+LY GCT VI    +GPLLC KP L STVP  C+ H QK +K +A+ L+ AG NVSSTSK  P  HV++A +V  IQ++
Subjt:  KAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLIAEYVRQIQSK

Query:  RRATKRATAIK
        R+   +   +K
Subjt:  RRATKRATAIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAATCAAACTCTCCTGGTTCGTTTCAACCTCCTCCTGTTACCCCACTTCCTATTCTTATTGATGGGGCCGATCGCGATCGAGCACTTGCCACTTCTATGATTTG
TTCTCGTCGAGAAGTACTCGAGCGCCGGTCCCGAAGAGCGAAACAGCTTTGTCGAATCTTCAAGGAACTGTACTGGTTTTTGTTGGAGGAACTCAAGCGCAAATACAGGG
AGTATTATTGGACATATGGCAAGAGTCCGTTTAAGGAGGATGAGAAGGAAGCTGAGGGCATTGGCGATTATCCAGAGGGCATTGGGGAAAATGGAAAGCTAGGATTGGCT
TCTGCGACTGGGAGCGATGAGATTAGAAGGTGTGATGTCACAGGTTGCAAGGCGAAGGCGATGGCTTTGACAAAATATTGTCATGCTCATATCCTCTCGGATAAAAAGCA
GAGGCTCTACAAGGGTTGCACCTTTGTGATCAAGAGTATGCAGTCAGGGCCACTTCTATGTTCGAAGCCTGTTTTAAGATCAACCGTTCCCTGCTATTGTTCGGGTCATC
TACAAAAAGGCGAAAAGTGTTTAGCTCGAGATTTAAGAAAAGCCGGTCTTAATGTCTCCTCAACTAGTAAGCTTCGTCCTGATTTTCATGTATTGATAGCTGAATATGTC
CGACAAATACAAAGCAAAAGGAGAGCGACGAAAAGAGCAACTGCTATTAAAATTGAGAGTAACTGA
mRNA sequenceShow/hide mRNA sequence
TATTTGTTTTTTAAAAAAAAGGCATCGAGAACCGAGAAGAAGAAACCCTAAATCCTATTCCCTCCTTTCTTCCTTGGCCGCCGCCACCACACACCTATCTGTGTCCATTT
GTTTTTCCCTTCACCTTCTTCTCCATATACCACCTCTCCGCTACACCCTACGCCGGCAAACTTTCTCCGTCGCAGTCGTACGCCGCCTTCTCATCGGCAAACGCCCCTTT
CTTCCTTCTCTCCTTCTCTGCTGCAACCTCGTCGGCACTCACTACTGTGCAAACTTCCTCTACTCGTCTTCAAGGTCATGGCAGAATCAAACTCTCCTGGTTCGTTTCAA
CCTCCTCCTGTTACCCCACTTCCTATTCTTATTGATGGGGCCGATCGCGATCGAGCACTTGCCACTTCTATGATTTGTTCTCGTCGAGAAGTACTCGAGCGCCGGTCCCG
AAGAGCGAAACAGCTTTGTCGAATCTTCAAGGAACTGTACTGGTTTTTGTTGGAGGAACTCAAGCGCAAATACAGGGAGTATTATTGGACATATGGCAAGAGTCCGTTTA
AGGAGGATGAGAAGGAAGCTGAGGGCATTGGCGATTATCCAGAGGGCATTGGGGAAAATGGAAAGCTAGGATTGGCTTCTGCGACTGGGAGCGATGAGATTAGAAGGTGT
GATGTCACAGGTTGCAAGGCGAAGGCGATGGCTTTGACAAAATATTGTCATGCTCATATCCTCTCGGATAAAAAGCAGAGGCTCTACAAGGGTTGCACCTTTGTGATCAA
GAGTATGCAGTCAGGGCCACTTCTATGTTCGAAGCCTGTTTTAAGATCAACCGTTCCCTGCTATTGTTCGGGTCATCTACAAAAAGGCGAAAAGTGTTTAGCTCGAGATT
TAAGAAAAGCCGGTCTTAATGTCTCCTCAACTAGTAAGCTTCGTCCTGATTTTCATGTATTGATAGCTGAATATGTCCGACAAATACAAAGCAAAAGGAGAGCGACGAAA
AGAGCAACTGCTATTAAAATTGAGAGTAACTGAGAAGGTGATGACGAGTATGGAGACGAGACGAGACCCCTCGGTTTGCAAAAGAATATGTTGCAATCCTGTTTGTGTAT
TCAATCCAACCCCTTTCTCTCAATGGTAGAATTTGTTGATAAAATCAATTTTTGTGTATGATCCAATCCCTTCTTCCTTTCCCTCCGTCGATAGTATTTGTTGATAAAAT
CAATCTTTCTCTTGTTGATTAATTCTATATGAAAGCTCAATTACATATAGAATTATTCAATTGTCATTTGATGAACATCCATCGCAGATGCAAATAATGGAGAAAAATGA
TACGTGTTGAATTTAAGGAATTTGAAAATCAGTGACGATGATCTTCCATCGGATGGATCGATCAAACCACAAATTTTTTCATGAACCTCGAAAAGAGACCAGTCC
Protein sequenceShow/hide protein sequence
MAESNSPGSFQPPPVTPLPILIDGADRDRALATSMICSRREVLERRSRRAKQLCRIFKELYWFLLEELKRKYREYYWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGLA
SATGSDEIRRCDVTGCKAKAMALTKYCHAHILSDKKQRLYKGCTFVIKSMQSGPLLCSKPVLRSTVPCYCSGHLQKGEKCLARDLRKAGLNVSSTSKLRPDFHVLIAEYV
RQIQSKRRATKRATAIKIESN