; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0021158 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0021158
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionWIYLD domain-containing protein
Genome locationchr03:24862176..24865881
RNA-Seq ExpressionPI0021158
SyntenyPI0021158
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0034968 - histone lysine methylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031969 - chloroplast membrane (cellular component)
GO:0008237 - metallopeptidase activity (molecular function)
GO:0018024 - histone-lysine N-methyltransferase activity (molecular function)
InterPro domainsIPR018848 - WIYLD domain
IPR043017 - WIYLD domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK24425.1 ubiquitin-binding WIYLD domain protein [Cucumis melo var. makuwa]7.0e-10796.48Show/hide
Query:  RPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVTPSNEAIMTT
        RPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQ TSVAGCSLSAIDVTPSNEAI+TT
Subjt:  RPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVTPSNEAIMTT

Query:  ATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKRWDVEPAES
        ATLPAN+LDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKD+KEEDLVYLTPDHLPEELAKLL+ GALKKRKKRWDVE AES
Subjt:  ATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKRWDVEPAES

XP_004138569.1 uncharacterized protein LOC101218050 isoform X1 [Cucumis sativus]2.1e-11194.71Show/hide
Query:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT
        MAPR+R KKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSY LLID+LLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSA DVT
Subjt:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT

Query:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR
        PSNEAIMTTA LPANDLDTLFPGDESYWNNK SVDDDHFRSTFNQSLPAYTPKIRRRKAYHGW+GKDDKEEDLVYLTPDHLPEELAKLL++GALKKRKKR
Subjt:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR

Query:  WDVEPAES
        WDVEPAE+
Subjt:  WDVEPAES

XP_008441447.1 PREDICTED: uncharacterized protein LOC103485563 isoform X1 [Cucumis melo]9.5e-11296.63Show/hide
Query:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT
        MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQ TSVAGCSLSAIDVT
Subjt:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT

Query:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR
        PSNEAI+TTATLPAN+LDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKD+KEEDLVYLTPDHLPEELAKLL+ GALKKRKKR
Subjt:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR

Query:  WDVEPAES
        WDVE AES
Subjt:  WDVEPAES

XP_008441449.1 PREDICTED: uncharacterized protein LOC103485563 isoform X2 [Cucumis melo]3.7e-10089.42Show/hide
Query:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT
        MAPRVRSKK               FGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQ TSVAGCSLSAIDVT
Subjt:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT

Query:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR
        PSNEAI+TTATLPAN+LDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKD+KEEDLVYLTPDHLPEELAKLL+ GALKKRKKR
Subjt:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR

Query:  WDVEPAES
        WDVE AES
Subjt:  WDVEPAES

XP_031742523.1 uncharacterized protein LOC101218050 isoform X2 [Cucumis sativus]2.7e-10692.31Show/hide
Query:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT
        MAPR+R KKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSY LLID+LLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSA DVT
Subjt:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT

Query:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR
        PSNEAIMTTA LPANDL     GDESYWNNK SVDDDHFRSTFNQSLPAYTPKIRRRKAYHGW+GKDDKEEDLVYLTPDHLPEELAKLL++GALKKRKKR
Subjt:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR

Query:  WDVEPAES
        WDVEPAE+
Subjt:  WDVEPAES

TrEMBL top hitse value%identityAlignment
A0A0A0K7Q2 WIYLD domain-containing protein1.0e-11194.71Show/hide
Query:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT
        MAPR+R KKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSY LLID+LLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSA DVT
Subjt:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT

Query:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR
        PSNEAIMTTA LPANDLDTLFPGDESYWNNK SVDDDHFRSTFNQSLPAYTPKIRRRKAYHGW+GKDDKEEDLVYLTPDHLPEELAKLL++GALKKRKKR
Subjt:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR

Query:  WDVEPAES
        WDVEPAE+
Subjt:  WDVEPAES

A0A1S3B3F8 uncharacterized protein LOC103485563 isoform X14.6e-11296.63Show/hide
Query:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT
        MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQ TSVAGCSLSAIDVT
Subjt:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT

Query:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR
        PSNEAI+TTATLPAN+LDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKD+KEEDLVYLTPDHLPEELAKLL+ GALKKRKKR
Subjt:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR

Query:  WDVEPAES
        WDVE AES
Subjt:  WDVEPAES

A0A1S3B448 uncharacterized protein LOC103485563 isoform X21.8e-10089.42Show/hide
Query:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT
        MAPRVRSKK               FGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQ TSVAGCSLSAIDVT
Subjt:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVT

Query:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR
        PSNEAI+TTATLPAN+LDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKD+KEEDLVYLTPDHLPEELAKLL+ GALKKRKKR
Subjt:  PSNEAIMTTATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKR

Query:  WDVEPAES
        WDVE AES
Subjt:  WDVEPAES

A0A1S3B481 uncharacterized protein LOC103485563 isoform X31.7e-9896.2Show/hide
Query:  FGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVTPSNEAIMTTATLPANDLDTLFPGD
        FGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQ TSVAGCSLSAIDVTPSNEAI+TTATLPAN+LDTLFPGD
Subjt:  FGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVTPSNEAIMTTATLPANDLDTLFPGD

Query:  ESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKRWDVEPAES
        ESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKD+KEEDLVYLTPDHLPEELAKLL+ GALKKRKKRWDVE AES
Subjt:  ESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKRWDVEPAES

A0A5D3DLD6 Ubiquitin-binding WIYLD domain protein3.4e-10796.48Show/hide
Query:  RPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVTPSNEAIMTT
        RPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQ TSVAGCSLSAIDVTPSNEAI+TT
Subjt:  RPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVTPSNEAIMTT

Query:  ATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKRWDVEPAES
        ATLPAN+LDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKD+KEEDLVYLTPDHLPEELAKLL+ GALKKRKKRWDVE AES
Subjt:  ATLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKRWDVEPAES

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G45248.2 Nucleolar histone methyltransferase-related protein4.4e-0641.89Show/hide
Query:  MAPRVRSKKRPNL---RIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTL-LIDTLLEKQNE
        MAP    K R N+   R DAA D MK+FGF   ++  ++K++L VY G+D W  IE+ +Y + LI  L  K+N+
Subjt:  MAPRVRSKKRPNL---RIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTL-LIDTLLEKQNE

AT1G45248.3 Nucleolar histone methyltransferase-related protein4.4e-0641.89Show/hide
Query:  MAPRVRSKKRPNL---RIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTL-LIDTLLEKQNE
        MAP    K R N+   R DAA D MK+FGF   ++  ++K++L VY G+D W  IE+ +Y + LI  L  K+N+
Subjt:  MAPRVRSKKRPNL---RIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTL-LIDTLLEKQNE

AT2G40020.1 Nucleolar histone methyltransferase-related protein2.7e-0824.46Show/hide
Query:  LRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQ-----------------------------NEGAIEVVHDNERVD
        +R DAA D M+ FGF   ++ +++KELL+VY  +D W  IE+ SY  L+   LEKQ                             N+ A+E +H+ E+  
Subjt:  LRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQ-----------------------------NEGAIEVVHDNERVD

Query:  HQYTSVAGCSLSAIDVTPSNEAI--MTTATLPANDLDTLFPGDE---------SYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEED
         Q   +      A +++ ++EA+   + A +   +  + +              +  ++   D +       +      PK +  +      G DD   +
Subjt:  HQYTSVAGCSLSAIDVTPSNEAI--MTTATLPANDLDTLFPGDE---------SYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEED

Query:  LVYLTPDHLPEELAKLL--VSGALKKRKK-RWD
        ++ LTP+ L EEL +LL  V G  +++K+ RWD
Subjt:  LVYLTPDHLPEELAKLL--VSGALKKRKK-RWD

AT2G40020.2 Nucleolar histone methyltransferase-related protein1.3e-1048.57Show/hide
Query:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNE
        MAPR R KK   +R DAA D M+ FGF   ++ +++KELL+VY  +D W  IE+ SY  L+   LEKQ E
Subjt:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNE

AT2G40020.3 Nucleolar histone methyltransferase-related protein4.5e-1126.12Show/hide
Query:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQ-----------------------------NEG
        MAPR R KK   +R DAA D M+ FGF   ++ +++KELL+VY  +D W  IE+ SY  L+   LEKQ                             N+ 
Subjt:  MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQ-----------------------------NEG

Query:  AIEVVHDNERVDHQYTSVAGCSLSAIDVTPSNEAI--MTTATLPANDLDTLFPGDE---------SYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAY
        A+E +H+ E+   Q   +      A +++ ++EA+   + A +   +  + +              +  ++   D +       +      PK +  +  
Subjt:  AIEVVHDNERVDHQYTSVAGCSLSAIDVTPSNEAI--MTTATLPANDLDTLFPGDE---------SYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAY

Query:  HGWIGKDDKEEDLVYLTPDHLPEELAKLL--VSGALKKRKK-RWD
            G DD   +++ LTP+ L EEL +LL  V G  +++K+ RWD
Subjt:  HGWIGKDDKEEDLVYLTPDHLPEELAKLL--VSGALKKRKK-RWD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCTAGAGTGCGAAGCAAAAAGAGGCCTAACTTACGAATTGATGCTGCGCTCGATGCTATGAAGCAGTTTGGATTTCCTCCAAAGTTGGTGCGTGACACGGTCAA
GGAGCTCCTCGATGTATATGGAGGAGACGACGGATGGGTATTCATTGAAGAAGGCTCTTATACTCTCTTGATCGATACCCTTCTCGAGAAGCAGAATGAGGGTGCAATTG
AAGTGGTTCATGATAATGAAAGAGTAGATCATCAGTACACCTCAGTAGCTGGCTGTTCGTTGAGTGCTATCGATGTAACTCCCTCAAATGAAGCTATAATGACCACAGCC
ACATTGCCTGCAAATGACTTAGATACATTATTTCCTGGAGATGAAAGTTATTGGAACAATAAAGCTTCTGTTGACGATGACCATTTTAGGAGTACTTTTAATCAGTCTTT
ACCTGCATATACCCCCAAAATACGAAGGCGAAAAGCTTATCATGGCTGGATTGGTAAGGACGACAAGGAGGAGGATCTCGTGTACTTAACCCCAGATCATTTGCCTGAAG
AGCTTGCCAAGTTACTCGTGTCTGGTGCACTGAAAAAAAGAAAGAAGCGTTGGGATGTGGAACCTGCAGAATCATGA
mRNA sequenceShow/hide mRNA sequence
GTAGCACTGGTCTTTCTTCGACTCATCACACTCTGCCACCGCGACGTCCACATCGGAGAAGCTGAGAGCTTCTGCCTCTCATTTGCTAACATCAGCACTAACTGCGTCGA
ATTCGGAAAAGCAACGGTCGTCGGACTCTCCCTGTTGGAAAGTTTTCTGGTTCAACCATCGGTCCCCATTGTTGTTTTGTTTGCTGATATTTCCATTTTCAGGTAATATC
ATTCAATTACTGAGGTATTTAAACTGACTTATTTAATTGACCCAATAATCAATTTGTTGTTCTTGTGGTGGGGAAATCTTGGCTGGCTTTAAACTGAGTTTTTTGTTGTT
AGGCCATCGATTGTTTTTTTTTGGGGTAGGAAAATCAAGAATTTGGTACAAATGTTTATTTGTTTATGTCTATAATTATATTTGAGAAGCCAGCTACAATTTGAAAAAAA
AGAGAGTAAAATATGGGAGGAGAGAGATAGGGGAAAGGACAGAACGTTTTTTTTTTGTTTTTGTGCGGATGACCTTCATTTTGGATGGTCAATGCCGTTTTCCCTAATGA
GAGCTTAGAGTGTGAGTGATAGAGTGAGAAGTAGTGCGAGTATGAGGGAGGAAGACCCTAGTTTATGAGAGCATTGTTGGATTAGATTAGTTGAACACGAGCAGGAGTGC
AATGGAGGGATGGGGATGGGAGAAGGAAGAGTCCTAACTTGTAAGAGCATTATTGTACAAAAAACTCAAGAACGAAAAATCATAAAAAACCGTATGAGTTTTCACTGCCC
TTCTTCTATTTGTTGAACATGAGGTTTCATCTAAAAAATAATTAGCAATGAGAGGAGTAGCTCATCTATCTTATAAGTAGTGTGAGGTCTCTTGAATTTTCCAATGTAGG
ATTCTCAATGTGCCCCAAGATGGTTCTTCTTTTGGGTTCACTATTCTTGATCGGAACACAATTTCTTTTTATTAGACCAAAAACCCGTTTGGGCTTTATGGGCTCTAATA
CCATGATAACATGGGGTTTCAGCTTAAAACTAATTGGCAATGAGAAGAGTAGCTCATCTATCTTATAAGGAGTGTGAAGAACCTTGAATTTTCCAATGTGGGATCCTCAA
CATCTCCTTCCTTTCTCAATCAGTCAATCCGATAAAAAATCACTCCCCTTTTCTCTTCATATTGATTCTGACGTCTTCCCATTCTGCTTCTCAAGCTTCTTCGACGATAA
TTTGTTGGGTTCTGAGGTCAAGAAATGGCTCCTAGAGTGCGAAGCAAAAAGAGGCCTAACTTACGAATTGATGCTGCGCTCGATGCTATGAAGCAGTTTGGATTTCCTCC
AAAGTTGGTGCGTGACACGGTCAAGGAGCTCCTCGATGTATATGGAGGAGACGACGGATGGGTATTCATTGAAGAAGGCTCTTATACTCTCTTGATCGATACCCTTCTCG
AGAAGCAGAATGAGGGTGCAATTGAAGTGGTTCATGATAATGAAAGAGTAGATCATCAGTACACCTCAGTAGCTGGCTGTTCGTTGAGTGCTATCGATGTAACTCCCTCA
AATGAAGCTATAATGACCACAGCCACATTGCCTGCAAATGACTTAGATACATTATTTCCTGGAGATGAAAGTTATTGGAACAATAAAGCTTCTGTTGACGATGACCATTT
TAGGAGTACTTTTAATCAGTCTTTACCTGCATATACCCCCAAAATACGAAGGCGAAAAGCTTATCATGGCTGGATTGGTAAGGACGACAAGGAGGAGGATCTCGTGTACT
TAACCCCAGATCATTTGCCTGAAGAGCTTGCCAAGTTACTCGTGTCTGGTGCACTGAAAAAAAGAAAGAAGCGTTGGGATGTGGAACCTGCAGAATCATGAGTTTTTTAT
TTGGCTGGTTGGATGTTGATGACTTGAGGGGAGAAGATAGGTGAAAGTGTAAATGGTAGTTTCACCCTCTTTTGTCTGTTAGGGTAACATTTGAAGTAGCTTCCAAGCCT
TTTAGTTGAATTGCCCACAAAAGAAGTGTTTTGTTTTTGGTAGTGATGGATTATTATGAGCATGATGTAAATGTAGCTCACTCATATCTCAATTTTGAAAAATGAATTTA
TAACTCTTTAAACAATTTCCTTCCATGTGATCTTGT
Protein sequenceShow/hide protein sequence
MAPRVRSKKRPNLRIDAALDAMKQFGFPPKLVRDTVKELLDVYGGDDGWVFIEEGSYTLLIDTLLEKQNEGAIEVVHDNERVDHQYTSVAGCSLSAIDVTPSNEAIMTTA
TLPANDLDTLFPGDESYWNNKASVDDDHFRSTFNQSLPAYTPKIRRRKAYHGWIGKDDKEEDLVYLTPDHLPEELAKLLVSGALKKRKKRWDVEPAES