; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020374 (gene) of Snake gourd v1 genome

Gene IDTan0020374
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionKAT8 regulatory NSL complex subunit 2
Genome locationLG05:11212089..11216310
RNA-Seq ExpressionTan0020374
SyntenyTan0020374
Gene Ontology termsGO:0043981 - histone H4-K5 acetylation (biological process)
GO:0043982 - histone H4-K8 acetylation (biological process)
GO:0043984 - histone H4-K16 acetylation (biological process)
GO:0044545 - NSL complex (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR025927 - Potential DNA-binding domain
IPR026316 - KAT8 regulatory NSL complex subunit 2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575969.1 INO80 complex subunit D, partial [Cucurbita argyrosperma subsp. sororia]3.2e-11988.8Show/hide
Query:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG
        MAESNSPGSFQ PP PP PMVIDGA+HD+ALASCEFFTRREVL+RRSRRVKQLCR+YR LYWALM++LKRKYREY WTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
        IGENGKLG GSVT  DEIR+C+VTGCKAKAMALT+YCHAHILSDK+QRLYKGCTF IKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN
        NVSSTSKLRP+FHVL+AE + QIQ+KRRAARKA AVK ESN
Subjt:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN

XP_022150569.1 INO80 complex subunit D-like isoform X1 [Momordica charantia]2.9e-11283.4Show/hide
Query:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG
        MAESNSPGSFQPPPVPP PMVIDG DHDVALAS +F TR+E+L RRSRRVKQL RIY+A YWALM+D KRK+REY WTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
        IGENGK G GS    D+IR+C+VTGCK KAMA+T+YCHAHILSD +QRLYKGCTF IKSM SGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN
        N+SSTSKLRP  HVLL+EY+ QIQLKRRA RKA AVKTE+N
Subjt:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN

XP_022953677.1 INO80 complex subunit D-like [Cucurbita moschata]3.2e-11988.8Show/hide
Query:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG
        MAESNSPGSFQ PP PP PMVIDGA+HD+ALASCEFFTRREVL+RRSRRVKQLCR+YR LYWALM++LKRKYREY WTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
        IGENGKLG GSVT  DEIR+C+VTGCKAKAMALT+YCHAHILSDK+QRLYKGCTF IKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN
        NVSSTSKLRP+FHVL+AE + QIQ+KRRAARKA AVK ESN
Subjt:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN

XP_038896099.1 uncharacterized protein LOC120084404 isoform X1 [Benincasa hispida]1.1e-11184.23Show/hide
Query:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG
        MAES+SPGSFQPPPV PLP++IDGAD D ALA+ E   RREVL+RRSRRVKQLCRI + +YW L+++LKRKYREY WTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
        IGENGKLG GSVT  DEIR+C+VTGCKAKAMALT+YCHAHILSDK+QRLYKGCTF IKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN
        NVSSTSKLRP+FHVL+AE++ QIQ KRRA RKA AVK ESN
Subjt:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN

XP_038896100.1 INO80 complex subunit D-like isoform X2 [Benincasa hispida]1.1e-11184.23Show/hide
Query:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG
        MAES+SPGSFQPPPV PLP++IDGAD D ALA+ E   RREVL+RRSRRVKQLCRI + +YW L+++LKRKYREY WTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
        IGENGKLG GSVT  DEIR+C+VTGCKAKAMALT+YCHAHILSDK+QRLYKGCTF IKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN
        NVSSTSKLRP+FHVL+AE++ QIQ KRRA RKA AVK ESN
Subjt:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN

TrEMBL top hitse value%identityAlignment
A0A0A0K4I2 KAT8 regulatory NSL complex subunit 25.0e-11082.16Show/hide
Query:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG
        MAESNSPGSFQPPPV PLP++IDGAD D ALA+    +RREVL+RRSRR KQLCRI++ LYW L+++LKRKYREY WTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
        IGENGKLG  S T  DEIR+C+VTGCKAKAMALT+YCHAHILSDK+QRLYKGCTF IKSMQSGPLLCSKPVLRSTVPCYC GHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN
        NVSSTSKLRP+FHVL+AEY+ QIQ KRRA ++A A+K ESN
Subjt:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN

A0A1S3BH19 KAT8 regulatory NSL complex subunit 26.5e-11082.16Show/hide
Query:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG
        MA+SNSPGSFQPPPV P P++IDGAD D ALAS    +RREVL+RRSRR KQLCRI++ LYW L+++LKRKYREY WTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
        IGENGKLG GS T  DEIR+C+VTGCKAKAMALT+YCHAHILSDK+QRLYKGCTF IKSMQSGPLLCSKPVLRSTVPCYC GHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN
        NVSSTSKLRP+FHVL+AEY+ QIQ KRRA ++A A+K ESN
Subjt:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN

A0A6J1DAF9 KAT8 regulatory NSL complex subunit 21.4e-11283.4Show/hide
Query:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG
        MAESNSPGSFQPPPVPP PMVIDG DHDVALAS +F TR+E+L RRSRRVKQL RIY+A YWALM+D KRK+REY WTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
        IGENGK G GS    D+IR+C+VTGCK KAMA+T+YCHAHILSD +QRLYKGCTF IKSM SGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN
        N+SSTSKLRP  HVLL+EY+ QIQLKRRA RKA AVKTE+N
Subjt:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN

A0A6J1GNX0 KAT8 regulatory NSL complex subunit 21.5e-11988.8Show/hide
Query:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG
        MAESNSPGSFQ PP PP PMVIDGA+HD+ALASCEFFTRREVL+RRSRRVKQLCR+YR LYWALM++LKRKYREY WTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
        IGENGKLG GSVT  DEIR+C+VTGCKAKAMALT+YCHAHILSDK+QRLYKGCTF IKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN
        NVSSTSKLRP+FHVL+AE + QIQ+KRRAARKA AVK ESN
Subjt:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN

A0A6J1JMD6 KAT8 regulatory NSL complex subunit 21.5e-11988.8Show/hide
Query:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG
        MAESNSPGSFQ PP PP PMVIDGA+HD+ALASCEFFTRREVL+RRSRRVKQLCR+YR LYWALM++LKRKYREY WTYGKSPFKEDEKEAEGIGDYPEG
Subjt:  MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEG

Query:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
        IGENGKLG GSVT  DEIR+C+VTGCKAKAMALT+YCHAHILSDK+QRLYKGCTF IKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL
Subjt:  IGENGKLGSGSVT-VDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGL

Query:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN
        NVSSTSKLRP+FHVL+AE + QIQ+KRRAARKA AVK ESN
Subjt:  NVSSTSKLRPNFHVLLAEYINQIQLKRRAARKAIAVKTESN

SwissProt top hitse value%identityAlignment
Q54J07 INO80 complex subunit D2.4e-0824.59Show/hide
Query:  DHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGSG---------------
        D D   AS    T  E++QRR   + +L  +Y+  Y    + L+   R Y  T      + D  + E   +    I  N    +                
Subjt:  DHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGSG---------------

Query:  ----------SVTVDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHL
                  +   +E   C    CK K M L++YC++HIL DK Q+L+  CT+ + + +     C  P+L+  +P  C  HL
Subjt:  ----------SVTVDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHL

Arabidopsis top hitse value%identityAlignment
AT1G05860.1 unknown protein1.5e-4544.3Show/hide
Query:  PLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKED------EKEAEG-IGDYPEGIGENGKLGS
        P+ M ++    D  L +    TR E+L+RRS  +KQL R YR  YWALM+DLK ++R Y W YG SPFK++       ++ EG  GD  EG G+N    +
Subjt:  PLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKED------EKEAEG-IGDYPEGIGENGKLGS

Query:  GSVTVDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRP
          V       C  +GCK+KAMALT YC  HIL DK+Q+LY  CT+  K  QS  + C KP L STVP  C  H QK +K +AR L+ AG NVSS S+  P
Subjt:  GSVTVDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRP

Query:  NFHVLLAEYINQIQLKRRAARKAIAVKT
          H ++A +++ IQ KR+  RK   +K+
Subjt:  NFHVLLAEYINQIQLKRRAARKAIAVKT

AT2G31600.1 unknown protein3.8e-4644.98Show/hide
Query:  ADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEA--------EGI----GDYPEGIGENGKLGSGSV
        +  D  LA     TR E+L+RRS  +KQL + YR  YWALM+D+K ++R+Y W YG S FK++  ++        EG     GD  EG G+N     G V
Subjt:  ADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEA--------EGI----GDYPEGIGENGKLGSGSV

Query:  TVDEIRKCE----VTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLR
          D+         + GCKAKAMALT+YC  HIL D +Q+LY GCT  IK   +GPLLC KP L STVP  C  H QK +K +A+ L+ AG NVSSTSK  
Subjt:  TVDEIRKCE----VTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLR

Query:  PNFHVLLAEYINQIQLKRRAARKAIAVKT
        P  HV++A +++ IQ KR+  +K   +K+
Subjt:  PNFHVLLAEYINQIQLKRRAARKAIAVKT

AT2G31600.2 unknown protein2.4e-2443.62Show/hide
Query:  ADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEA--------EGI----GDYPEGIGENGKLGSGSV
        +  D  LA     TR E+L+RRS  +KQL + YR  YWALM+D+K ++R+Y W YG S FK++  ++        EG     GD  EG G+N     G V
Subjt:  ADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEA--------EGI----GDYPEGIGENGKLGSGSV

Query:  TVDEIRKCE----VTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIK
          D+         + GCKAKAMALT+YC  HIL D +Q+LY GCT  IK
Subjt:  TVDEIRKCE----VTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIK

AT3G53860.1 unknown protein3.0e-4345.02Show/hide
Query:  DVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDY-PEGIGENGKLGSGSVTVDEIRKCEVTGCK
        D  LAS    TR E+L+RR+  +KQL + Y+  YWALM+DLK ++R+Y   YG S FK+++ ++       PEG G+ G    G    +      + GCK
Subjt:  DVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDY-PEGIGENGKLGSGSVTVDEIRKCEVTGCK

Query:  AKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPNFHVLLAEYINQIQLKR
        AKAMALT+YC  HIL D +Q+LY GCT  I    +GPLLC KP L STVP  C  H QK +K +A+ L+ AG NVSSTSK  P  HV++A +++ IQ +R
Subjt:  AKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPNFHVLLAEYINQIQLKR

Query:  RAARKAIAVKT
        +   K   +K+
Subjt:  RAARKAIAVKT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAATCAAACTCACCTGGTTCGTTTCAACCTCCTCCAGTTCCCCCACTTCCTATGGTTATCGATGGGGCAGATCATGATGTAGCGCTTGCCTCTTGTGAGTTCTT
TACTCGTCGAGAAGTACTCCAGCGTCGGTCCCGGAGAGTGAAGCAACTTTGTAGAATCTATAGGGCACTTTACTGGGCTTTAATGGACGATCTCAAGCGCAAGTACCGGG
AGTATTGTTGGACATATGGCAAGAGTCCATTTAAGGAGGATGAGAAGGAGGCCGAGGGCATTGGTGATTATCCGGAGGGTATTGGGGAGAATGGAAAGCTAGGATCAGGT
TCTGTGACAGTTGATGAGATTAGAAAGTGTGAGGTCACAGGTTGCAAGGCAAAGGCGATGGCATTGACAGAATACTGTCATGCTCATATCCTCTCGGATAAAAGGCAGAG
GCTCTACAAGGGTTGTACCTTTGCAATCAAGAGTATGCAGTCAGGACCCCTTCTATGTTCAAAGCCTGTTTTAAGATCTACTGTTCCCTGCTATTGCCCTGGTCATCTAC
AAAAAGGCGAAAAATGCTTAGCTAGAGATTTAAGAAAAGCAGGTCTTAACGTCTCCTCTACTAGCAAGCTTCGTCCTAATTTCCATGTATTGTTAGCTGAATACATAAAC
CAGATACAACTTAAGAGGAGGGCGGCGAGAAAGGCAATTGCTGTTAAAACTGAGAGTAACTGA
mRNA sequenceShow/hide mRNA sequence
AGTTTTTATTATCCTCAAAAACTTAACAACCACAATGTCCTCTCAAAAACTAAAAGAGAGGAAATCCTAGCAACCACTCTTTCATCTCCCTTTTAGCACCTCAGTCCTGC
CACCGCCACCGTCGCCTCTCTCCTGTTCAGTGCTGCCGGCGCCGCCTATCACTCATTCGATCAGGAACACTGTTTCGATCTCTCTCCCCATCAACGCTGAAAGAAAGCAG
CTTCTTAGTGCCAGGGCTCTGATATCTGTAGGATCCTCATCTGAATAATGGGTCATGGCAGAATCAAACTCACCTGGTTCGTTTCAACCTCCTCCAGTTCCCCCACTTCC
TATGGTTATCGATGGGGCAGATCATGATGTAGCGCTTGCCTCTTGTGAGTTCTTTACTCGTCGAGAAGTACTCCAGCGTCGGTCCCGGAGAGTGAAGCAACTTTGTAGAA
TCTATAGGGCACTTTACTGGGCTTTAATGGACGATCTCAAGCGCAAGTACCGGGAGTATTGTTGGACATATGGCAAGAGTCCATTTAAGGAGGATGAGAAGGAGGCCGAG
GGCATTGGTGATTATCCGGAGGGTATTGGGGAGAATGGAAAGCTAGGATCAGGTTCTGTGACAGTTGATGAGATTAGAAAGTGTGAGGTCACAGGTTGCAAGGCAAAGGC
GATGGCATTGACAGAATACTGTCATGCTCATATCCTCTCGGATAAAAGGCAGAGGCTCTACAAGGGTTGTACCTTTGCAATCAAGAGTATGCAGTCAGGACCCCTTCTAT
GTTCAAAGCCTGTTTTAAGATCTACTGTTCCCTGCTATTGCCCTGGTCATCTACAAAAAGGCGAAAAATGCTTAGCTAGAGATTTAAGAAAAGCAGGTCTTAACGTCTCC
TCTACTAGCAAGCTTCGTCCTAATTTCCATGTATTGTTAGCTGAATACATAAACCAGATACAACTTAAGAGGAGGGCGGCGAGAAAGGCAATTGCTGTTAAAACTGAGAG
TAACTGAGAAAGTGAAGATTAATTAGGCCCTCAGTTTGAATAAGAATACGTAGTTTGCGGTCGGTCGGTATGTGCAGGGTGTATTCAATCCAACCTCCCAATAGAAGAAT
TTGTTGATAAAATCAATTTCGTGTATGTTATGATCGCTTCTTTCTTCTCTTTGCACATTTGTCGAGTATTATTCGAACAAAAGGTGGATGTGATGTGCATTTGATTGCCG
AATGTAGGGATGTACATTCCTTTGTTTTTGAGAAGGATGTTTTAGTTTTAGCCTCCTTTCCTAACTTGCAGTAGAGTAACGTTGTTGGGGATACCTAATATTAGTTATAC
TTATAATGGTAAAAGAAAGTTATCTAATTGTGCTATTTGATGACGG
Protein sequenceShow/hide protein sequence
MAESNSPGSFQPPPVPPLPMVIDGADHDVALASCEFFTRREVLQRRSRRVKQLCRIYRALYWALMDDLKRKYREYCWTYGKSPFKEDEKEAEGIGDYPEGIGENGKLGSG
SVTVDEIRKCEVTGCKAKAMALTEYCHAHILSDKRQRLYKGCTFAIKSMQSGPLLCSKPVLRSTVPCYCPGHLQKGEKCLARDLRKAGLNVSSTSKLRPNFHVLLAEYIN
QIQLKRRAARKAIAVKTESN