CuGenDBv2

PI0023906 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0023906
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Thiol-disulfide oxidoreductase DCC isoform 1
Genome location: chr02:24087132..24090459
RNA-Seq Expression: PI0023906
Synteny: PI0023906
Gene Ontology terms:
GO:0042246 - tissue regeneration (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0015035 - protein disulfide oxidoreductase activity (molecular function)
InterPro domains:
IPR007263 - DCC1-like thiol-disulfide oxidoreductase family
IPR036249 - Thioredoxin-like superfamily
IPR044691 - Thioredoxin DCC1
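
The genome location in this record uses a `chromosome:start..end` notation. The following is a minimal sketch of how such a string could be split into its parts and the locus length computed. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr02:24087132..24090459")
locus_length = span_length(start, end)
```

Under the inclusive-coordinate assumption, the locus spans 3328 bp on chr02.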


Homology
GenBank top hits (e value, % identity, alignment)
KAA0057890.1 Thiol-disulfide oxidoreductase DCC isoform 1 [Cucumis melo var. makuwa], e value 4.1e-113, identity 94.47%
Query:  MAFGGAAATAMASKFATRITSPIRL-SSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
        MAFGGAAATAMASKF T ITSPIRL SSS+LPKLGLHST+PFSPFRTTNQLGLKYQ+RAISEA+VDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
Subjt:  MAFGGAAATAMASKFATRITSPIRL-SSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN

Query:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL
        MLRERNKQYGTI FVDISSDDYTP+ENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYA+TKYEP GRLADAAYGLWARYRLQLTGRPPL
Subjt:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL

Query:  EDILAARKKNLVNLCAW
        EDILAARKKN VNLCAW
Subjt:  EDILAARKKNLVNLCAW

XP_004138124.1 uncharacterized protein At5g50100, chloroplastic isoform X1 [Cucumis sativus], e value 1.8e-108, identity 93.49%
Query:  MAFGGAAATAMASKFATRITSPIRL-SSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
        MAFGGAAA AMASKFAT IT+PIRL  SS+LPKLG HSTQ FSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
Subjt:  MAFGGAAATAMASKFATRITSPIRL-SSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN

Query:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL
        MLRERNKQYGTIKFVDI SDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFR+LYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL
Subjt:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL

Query:  EDILAARKKNLVNLC
        EDILAARKKN   +C
Subjt:  EDILAARKKNLVNLC

XP_008453124.1 PREDICTED: uncharacterized protein At5g50100, mitochondrial [Cucumis melo], e value 1.0e-108, identity 92.09%
Query:  MAFGGAAATAMASKFATRITSPIRL-SSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
        MAFGGAAATAMASKF T ITSPIRL SSS+LPKLGLHST+PFSPFRTTNQLGLKYQ+RAISEA+VDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
Subjt:  MAFGGAAATAMASKFATRITSPIRL-SSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN

Query:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL
        MLRERNKQYGTI FVDISSDDYTP+ENQGLDYKTVMGRIHAILADGTVVRDVE FRRLYEQVGLGWVYA+TKYEP GRL DAAYGLWARYRLQLTGRPPL
Subjt:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL

Query:  EDILAARKKNLVNLC
        EDILAARKKN   +C
Subjt:  EDILAARKKNLVNLC

XP_038878376.1 uncharacterized protein At5g50100, chloroplastic isoform X1 [Benincasa hispida], e value 2.2e-106, identity 85.83%
Query:  MAFGGAAATAMASKFATRITSPIRLSSSYLPKLGLHST-----QPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCM
        MAFGGAAATAM SKFA RIT+PIRLSSS LPKLGLH T     QPFSPF  TNQ GLKYQIRAISEA VDPVSSNKE GE S QSWKIKMLYDGDCPLCM
Subjt:  MAFGGAAATAMASKFATRITSPIRLSSSYLPKLGLHST-----QPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCM

Query:  REVNMLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTG
        REVNMLRERNKQYGTIKFVDISSDDY+P+ENQGLDYKTVMGRIHAILADGTVVRDVEAFR+LYEQV LGWVYAVTKYEPFGRLADAAY LWARYRLQLTG
Subjt:  REVNMLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTG

Query:  RPPLEDILAARKKNLVNLCAWYELSFCLSPVVHTRYIHHS
        RPPLEDILAARKKN VNL AWYEL F + P   TR I  S
Subjt:  RPPLEDILAARKKNLVNLCAWYELSFCLSPVVHTRYIHHS

XP_038878377.1 uncharacterized protein At5g50100, chloroplastic isoform X2 [Benincasa hispida], e value 9.5e-102, identity 88.13%
Query:  MAFGGAAATAMASKFATRITSPIRLSSSYLPKLGLHST-----QPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCM
        MAFGGAAATAM SKFA RIT+PIRLSSS LPKLGLH T     QPFSPF  TNQ GLKYQIRAISEA VDPVSSNKE GE S QSWKIKMLYDGDCPLCM
Subjt:  MAFGGAAATAMASKFATRITSPIRLSSSYLPKLGLHST-----QPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCM

Query:  REVNMLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTG
        REVNMLRERNKQYGTIKFVDISSDDY+P+ENQGLDYKTVMGRIHAILADGTVVRDVEAFR+LYEQV LGWVYAVTKYEPFGRLADAAY LWARYRLQLTG
Subjt:  REVNMLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTG

Query:  RPPLEDILAARKKNLVNLC
        RPPLEDILAARKKN   +C
Subjt:  RPPLEDILAARKKNLVNLC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LS53 Uncharacterized protein, e value 8.6e-109, identity 93.49%
Query:  MAFGGAAATAMASKFATRITSPIRL-SSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
        MAFGGAAA AMASKFAT IT+PIRL  SS+LPKLG HSTQ FSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
Subjt:  MAFGGAAATAMASKFATRITSPIRL-SSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN

Query:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL
        MLRERNKQYGTIKFVDI SDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFR+LYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL
Subjt:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL

Query:  EDILAARKKNLVNLC
        EDILAARKKN   +C
Subjt:  EDILAARKKNLVNLC

A0A1S3BWL4 uncharacterized protein At5g50100, mitochondrial, e value 5.0e-109, identity 92.09%
Query:  MAFGGAAATAMASKFATRITSPIRL-SSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
        MAFGGAAATAMASKF T ITSPIRL SSS+LPKLGLHST+PFSPFRTTNQLGLKYQ+RAISEA+VDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
Subjt:  MAFGGAAATAMASKFATRITSPIRL-SSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN

Query:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL
        MLRERNKQYGTI FVDISSDDYTP+ENQGLDYKTVMGRIHAILADGTVVRDVE FRRLYEQVGLGWVYA+TKYEP GRL DAAYGLWARYRLQLTGRPPL
Subjt:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL

Query:  EDILAARKKNLVNLC
        EDILAARKKN   +C
Subjt:  EDILAARKKNLVNLC

A0A5A7URW2 Thiol-disulfide oxidoreductase DCC isoform 1, e value 2.0e-113, identity 94.47%
Query:  MAFGGAAATAMASKFATRITSPIRL-SSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
        MAFGGAAATAMASKF T ITSPIRL SSS+LPKLGLHST+PFSPFRTTNQLGLKYQ+RAISEA+VDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
Subjt:  MAFGGAAATAMASKFATRITSPIRL-SSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVN

Query:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL
        MLRERNKQYGTI FVDISSDDYTP+ENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYA+TKYEP GRLADAAYGLWARYRLQLTGRPPL
Subjt:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL

Query:  EDILAARKKNLVNLCAW
        EDILAARKKN VNLCAW
Subjt:  EDILAARKKNLVNLCAW

A0A6J1ED50 uncharacterized protein At5g50100, mitochondrial, e value 6.2e-91, identity 80.18%
Query:  MAFGGAA---ATAMASKFATRITSPIRLSSSYLPKLGLHST-----QPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCP
        MAFG AA   ATAMASKFA RI +PIRLS+S LP L LH T     Q FS FR TNQ  LK+QI AISE+ VDPVSSN+E+GE S +SWKIKMLYDGDCP
Subjt:  MAFGGAA---ATAMASKFATRITSPIRLSSSYLPKLGLHST-----QPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCP

Query:  LCMREVNMLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQ
        LCMREVNMLRERNK+YGTIKFVDISSDDYTP+ENQGLDY+TVMGRIHAILADGTVVRDVE FRRLYEQVGLGWVYAVTKYEP G LADA YGLWA+YRLQ
Subjt:  LCMREVNMLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQ

Query:  LTGRPPLEDILAARKKNLVNLC
        LTGRPPLEDIL ARKKN   +C
Subjt:  LTGRPPLEDILAARKKNLVNLC

A0A6J1IA19 uncharacterized protein At5g50100, chloroplastic, e value 3.3e-92, identity 81.08%
Query:  MAFGGAA---ATAMASKFATRITSPIRLSSSYLPKLGLHST-----QPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCP
        MAFG AA   ATAMASKFA RI +PIRLS+S LPKL LH T     QPFS F  TNQ  LK+QI AISE+ VDPVSSN+E+GE S +SWKIKMLYDGDCP
Subjt:  MAFGGAA---ATAMASKFATRITSPIRLSSSYLPKLGLHST-----QPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCP

Query:  LCMREVNMLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQ
        LCMREVNMLRERNK+YGTIKFVDISSDDYTP+ENQGLDYKTVMGRIHAILADGTVVRDVE FRRLYEQVGLGWVYAVTKYEP G LADA YGLWA+YRLQ
Subjt:  LCMREVNMLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQ

Query:  LTGRPPLEDILAARKKNLVNLC
        LTGRPPLEDIL ARKKN   +C
Subjt:  LTGRPPLEDILAARKKNLVNLC

SwissProt top hits (e value, % identity, alignment)
Q8W485 Uncharacterized protein At5g50100, chloroplastic, e value 4.0e-63, identity 57.67%
Query:  MAFGGAAATAMASKFATRITSPIRLSSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPV-SSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
        MA  GA A A ++ +  R    +R  S +      H   P          G KYQ+RAI     DPV +  K   E   Q+WKIKMLYDGDCPLCMREVN
Subjt:  MAFGGAAATAMASKFATRITSPIRLSSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPV-SSNKENGEGSSQSWKIKMLYDGDCPLCMREVN

Query:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL
        ML ERN+++GTIKFVDISS+DY+P++NQGLDYKTVMG+IHAI +DG VV+ VEAFRRLYE+VGLGWVY +TK+EP G+LAD  Y +WA+YRLQ+TGRP +
Subjt:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL

Query:  EDILAARKKNLVNLC
        E IL ARKK+ V  C
Subjt:  EDILAARKKNLVNLC

Arabidopsis top hits (e value, % identity, alignment)
AT5G50100.1 Putative thiol-disulphide oxidoreductase DCC, e value 2.9e-64, identity 57.67%
Query:  MAFGGAAATAMASKFATRITSPIRLSSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPV-SSNKENGEGSSQSWKIKMLYDGDCPLCMREVN
        MA  GA A A ++ +  R    +R  S +      H   P          G KYQ+RAI     DPV +  K   E   Q+WKIKMLYDGDCPLCMREVN
Subjt:  MAFGGAAATAMASKFATRITSPIRLSSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPV-SSNKENGEGSSQSWKIKMLYDGDCPLCMREVN

Query:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL
        ML ERN+++GTIKFVDISS+DY+P++NQGLDYKTVMG+IHAI +DG VV+ VEAFRRLYE+VGLGWVY +TK+EP G+LAD  Y +WA+YRLQ+TGRP +
Subjt:  MLRERNKQYGTIKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPL

Query:  EDILAARKKNLVNLC
        E IL ARKK+ V  C
Subjt:  EDILAARKKNLVNLC


Sequences
CDS sequence
ATGGCGTTTGGAGGAGCAGCTGCCACTGCCATGGCGTCTAAATTTGCAACAAGAATCACTAGTCCAATTCGACTCTCATCTTCTTACCTTCCCAAGCTCGGCCTCCATAG
CACTCAACCCTTCTCCCCGTTTCGAACCACCAATCAACTCGGATTGAAATACCAAATTCGTGCTATAAGTGAAGCTGTAGTAGATCCTGTTTCTTCAAATAAAGAGAATG
GAGAAGGATCATCCCAAAGCTGGAAGATCAAAATGCTCTATGATGGAGATTGCCCACTCTGTATGCGTGAGGTTAATATGCTAAGAGAGAGGAATAAGCAATATGGTACA
ATCAAGTTTGTTGACATTAGTTCAGATGATTACACGCCACAAGAAAATCAGGGTCTTGACTACAAAACCGTTATGGGAAGAATTCATGCGATACTTGCAGATGGTACTGT
GGTCAGAGATGTTGAAGCCTTTAGAAGACTTTATGAACAAGTTGGCCTTGGATGGGTATATGCTGTTACAAAATACGAACCATTTGGAAGATTAGCCGATGCCGCATATG
GTCTCTGGGCGAGGTATCGTCTTCAGTTGACAGGCCGGCCACCTCTAGAAGATATTCTGGCAGCACGAAAGAAAAACTTGGTAAATTTATGTGCTTGGTATGAACTATCT
TTTTGCCTTTCCCCTGTGGTCCACACGAGATATATCCACCACTCACCAGCTTTAACTTACAAGAATGAGTTTCTCACAAGAAAGATTTTAAAAATTCTTTTTTGGGCTAT
CTTATGGCTTCTAAGTTGGCACAAAAGTACAATAGTAATTTCGACACGTTTTTCTGACTTTCCAATTTGTGTTTTACAAAATATAGAAGCCTATCTATGA
mRNA sequence
GTGGATTATGGGTAGCCACGCGTTTGAGCATGCAAGGAAGAAGCAGTTTACGAATCACAGCCTCAATTTGCTTAAGCCTTCAAATGCAAACCACAAACGAGGAACAAGAA
ACTAGAACATGGCGTTTGGAGGAGCAGCTGCCACTGCCATGGCGTCTAAATTTGCAACAAGAATCACTAGTCCAATTCGACTCTCATCTTCTTACCTTCCCAAGCTCGGC
CTCCATAGCACTCAACCCTTCTCCCCGTTTCGAACCACCAATCAACTCGGATTGAAATACCAAATTCGTGCTATAAGTGAAGCTGTAGTAGATCCTGTTTCTTCAAATAA
AGAGAATGGAGAAGGATCATCCCAAAGCTGGAAGATCAAAATGCTCTATGATGGAGATTGCCCACTCTGTATGCGTGAGGTTAATATGCTAAGAGAGAGGAATAAGCAAT
ATGGTACAATCAAGTTTGTTGACATTAGTTCAGATGATTACACGCCACAAGAAAATCAGGGTCTTGACTACAAAACCGTTATGGGAAGAATTCATGCGATACTTGCAGAT
GGTACTGTGGTCAGAGATGTTGAAGCCTTTAGAAGACTTTATGAACAAGTTGGCCTTGGATGGGTATATGCTGTTACAAAATACGAACCATTTGGAAGATTAGCCGATGC
CGCATATGGTCTCTGGGCGAGGTATCGTCTTCAGTTGACAGGCCGGCCACCTCTAGAAGATATTCTGGCAGCACGAAAGAAAAACTTGGTAAATTTATGTGCTTGGTATG
AACTATCTTTTTGCCTTTCCCCTGTGGTCCACACGAGATATATCCACCACTCACCAGCTTTAACTTACAAGAATGAGTTTCTCACAAGAAAGATTTTAAAAATTCTTTTT
TGGGCTATCTTATGGCTTCTAAGTTGGCACAAAAGTACAATAGTAATTTCGACACGTTTTTCTGACTTTCCAATTTGTGTTTTACAAAATATAGAAGCCTATCTATGAAT
ATACACATACGTGTTGAACTTTCTGTTTTCTGCACAGGATGAAGTATGTAATGACAGCAATGCATGCAAGAGGTAATCCTTCTACACGGGATGATGATTCTCGCTTGGCT
AGTTGAATATCAGAAAGGGATTGGAATAGGACTTGAGCTATTAAACATCGTTAATTTAAAGGCCAGTAGCTCCTCTTACATTCTTGTATAAATGGTTGGCTAATAGGCTT
ATAGCAATACCTTTTTTTCTTCTTCTTTCTTGGGGTAGGAGATAAACCTTTGCCTTCGGATCTACAATTTCATCTTGTTGGTTCCATTGTGTTATGAAATGTTTTTGTCG
AAATATATACATCATTGGGGAATTTATGAGGGGTTGTGATACTTTTTAACTTCTACTATTCTGTGGTTCCTCGAGAAGTGCAGTTTCTAAGTTACACAGAAATTTACAGG
TGATGAAAATGTTTGAAGCTTTTTAGTATGATAAATGGGCAATTTTGCAGAAGAAATGTACTTTTATAGTCATGGCAGAGGCCTTCAAGGACGCTGCTACCCAAAAAGAT
TGTAATTAGTGAAAACAAAGTTATGTTTCAGTTATCGTAGGTGTTAATTTTAGTGTTCTTGAATGCATAAGCTTTATTTTCCA
Protein sequence
MAFGGAAATAMASKFATRITSPIRLSSSYLPKLGLHSTQPFSPFRTTNQLGLKYQIRAISEAVVDPVSSNKENGEGSSQSWKIKMLYDGDCPLCMREVNMLRERNKQYGT
IKFVDISSDDYTPQENQGLDYKTVMGRIHAILADGTVVRDVEAFRRLYEQVGLGWVYAVTKYEPFGRLADAAYGLWARYRLQLTGRPPLEDILAARKKNLVNLCAWYELS
FCLSPVVHTRYIHHSPALTYKNEFLTRKILKILFWAILWLLSWHKSTIVISTRFSDFPICVLQNIEAYL
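
The CDS and protein sequences above can be cross-checked by translating the CDS in frame. The sketch below does this with the standard genetic code, which is assumed to apply here because the record describes a nuclear gene model. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB.

```python
# Build the standard genetic code from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a sequence pasted as
    wrapped lines can be passed directly. Codons with ambiguous bases
    become 'X'; a trailing partial codon is dropped.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First eight codons of the CDS sequence listed in this record.
prefix = translate("ATGGCGTTTGGAGGAGCAGCTGCC")
```

Here `prefix` evaluates to `MAFGGAAA`, which matches the start of the protein sequence listed in this record. Passing the full CDS, with its line breaks, to `translate` is a quick way to confirm that the rest of the two sequences agree as well.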