; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0026772 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0026772
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionDNA repair protein XRCC4
Genome locationchr01:1547615..1551686
RNA-Seq ExpressionIVF0026772
SyntenyIVF0026772
Gene Ontology termsGO:0006303 - double-strand break repair via nonhomologous end joining (biological process)
GO:0006310 - DNA recombination (biological process)
GO:0010165 - response to X-ray (biological process)
GO:0051103 - DNA ligation involved in DNA repair (biological process)
GO:0051351 - positive regulation of ligase activity (biological process)
GO:0005958 - DNA-dependent protein kinase-DNA ligase 4 complex (cellular component)
GO:0032807 - DNA ligase IV complex (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0016874 - ligase activity (molecular function)
InterPro domainsIPR010585 - DNA repair protein XRCC4
IPR014751 - DNA repair protein XRCC4-like, C-terminal
IPR038051 - XRCC4-like, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136882.1 DNA repair protein XRCC4 [Cucumis sativus]3.61e-14893.01Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS
        MDAIRHTCLKLEVPTNAQP+ERSSIFVKGTWNH RFDLSITDG HAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIY FADAGNGYKRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS

Query:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK
        WTFDKEGMKLEWRWKCQPASDNKTTTA ILNFLMDANIRLSEEVVRK QSVERLKAESEKCLAQSEKICDEKVEFETAIYAK       FLNVLNAKKAK
Subjt:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK

Query:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD
        LREYRDQLS+QT TGSKLKQEEYSSDKT+
Subjt:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD

XP_008455175.1 PREDICTED: DNA repair protein XRCC4 [Cucumis melo]1.74e-15596.51Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS
        MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS

Query:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK
        WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAK       FLNVLNAKKAK
Subjt:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK

Query:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD
        LREYRDQLSKQTTTGSKLKQEEYSSDKT+
Subjt:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD

XP_022952201.1 DNA repair protein XRCC4 [Cucurbita moschata]1.03e-13183.41Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS
        MDAIRHTCLKLEVPTNAQPD R SIFVKGTW  HRFDLSITDGL+AWTCHATEDEVRLRA QWDQEPSDYV+LAERYLGFQQP S+YGFAD GNG KRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS

Query:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK
        WTF+KEGMKLEWRWKC+ ASDNKTTTA IL+FLMDANIRLSEEVVRKTQS E+LK ESEKCLAQSE+IC+EKVEFETA+YAK       FLNVLN KKAK
Subjt:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK

Query:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD
        LREYRDQ  KQTTT SKLKQ++  SDKT+
Subjt:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD

XP_022971995.1 DNA repair protein XRCC4 isoform X1 [Cucurbita maxima]1.70e-13082.97Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS
        MDAIRHTCLKLEVPTNAQPD R SIFVKGTW  HRFDLSITDGL+AWTCHATEDEVRLRA QWDQEPSDYV LAERYLGFQQP S+Y FAD GNG KRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS

Query:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK
        WTF+KEGMKLEWRWKC+ ASDNKTTTA IL+FLMDANIRLSEEVVRKTQS E+LK ESEKCLAQSE+IC+EKVEFETA+YAK       FLNVLN KKAK
Subjt:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK

Query:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD
        LREYRDQ  KQTTT SKLKQ++  SDKT+
Subjt:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD

XP_038888651.1 DNA repair protein XRCC4 [Benincasa hispida]3.17e-14288.65Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS
        MDA RHTCLKLEVPTNAQP++RSSIFVKGTWNHH FDLSITDGL+AW+CHATEDEV LRAAQWDQEPSDYVALAERYLGFQQPGS+YGF DAGNG +RLS
Subjt:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS

Query:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK
        WTFDKEGMKLEWRWKC+PASDNKTTTA IL+FLMDANIRLSEEVVRKTQ  ERLK+ESE+CLAQSEKICDEKVEFETAIYAK       FLNVLNAKKAK
Subjt:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK

Query:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD
        LREYRDQLSKQTTTGSKLKQEEYSSDKT+
Subjt:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD

TrEMBL top hitse value%identityAlignment
A0A0A0K2L2 Uncharacterized protein3.0e-11593.01Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS
        MDAIRHTCLKLEVPTNAQP+ERSSIFVKGTWNH RFDLSITDG HAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIY FADAGNGYKRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS

Query:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK
        WTFDKEGMKLEWRWKCQPASDNKTTTA ILNFLMDANIRLSEEVVRK QSVERLKAESEKCLAQSEKICDEKVEFETAIYAK       FLNVLNAKKAK
Subjt:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK

Query:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD
        LREYRDQLS+QT TGSKLKQEEYSSDKT+
Subjt:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD

A0A1S3BZW1 DNA repair protein XRCC48.3e-12196.51Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS
        MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS

Query:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK
        WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAK       FLNVLNAKKAK
Subjt:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK

Query:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD
        LREYRDQLSKQTTTGSKLKQEEYSSDKT+
Subjt:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD

A0A5A7SK69 DNA repair protein XRCC48.3e-12196.51Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS
        MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS

Query:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK
        WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAK       FLNVLNAKKAK
Subjt:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK

Query:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD
        LREYRDQLSKQTTTGSKLKQEEYSSDKT+
Subjt:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD

A0A6J1GJL0 DNA repair protein XRCC41.0e-10283.41Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS
        MDAIRHTCLKLEVPTNAQPD R SIFVKGTW  HRFDLSITDGL+AWTCHATEDEVRLRA QWDQEPSDYV+LAERYLGFQQP S+YGFAD GNG KRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS

Query:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK
        WTF+KEGMKLEWRWKC+ ASDNKTTTA IL+FLMDANIRLSEEVVRKTQS E+LK ESEKCLAQSE+IC+EKVEFETA+YAK       FLNVLN KKAK
Subjt:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK

Query:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD
        LREYRDQ  KQTTT SKLKQ++  SDKT+
Subjt:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD

A0A6J1I7A5 DNA repair protein XRCC4 isoform X18.6e-10282.97Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS
        MDAIRHTCLKLEVPTNAQPD R SIFVKGTW  HRFDLSITDGL+AWTCHATEDEVRLRA QWDQEPSDYV LAERYLGFQQP S+Y FAD GNG KRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLS

Query:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK
        WTF+KEGMKLEWRWKC+ ASDNKTTTA IL+FLMDANIRLSEEVVRKTQS E+LK ESEKCLAQSE+IC+EKVEFETA+YAK       FLNVLN KKAK
Subjt:  WTFDKEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAK

Query:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD
        LREYRDQ  KQTTT SKLKQ++  SDKT+
Subjt:  LREYRDQLSKQTTTGSKLKQEEYSSDKTD

SwissProt top hitse value%identityAlignment
Q682V0 DNA repair protein XRCC49.3e-6959.11Show/hide
Query:  RHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLSWTFD
        +HTCL+LE+ + A P     IFVKGTW++ RFD+S+TDG  +W C+ATE+EV  RAAQWDQ  S+Y+ LAE+YLGFQQP S+Y F+DA  G KRLSWTF+
Subjt:  RHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLSWTFD

Query:  KEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAKLREY
        KEG KLEWRWKC+P+ D+K  T  IL+FLM+ANIRLSEEVV KT+S E++++E+E+CLAQ EK+CDEK EFE+A YAK       FL+VLNAKKAKLR  
Subjt:  KEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAKLREY

Query:  RDQLSKQTTTGSKLKQEEYSSDKTD
        RD+         ++ +EE S+DK +
Subjt:  RDQLSKQTTTGSKLKQEEYSSDKTD

Arabidopsis top hitse value%identityAlignment
AT1G61410.1 DNA double-strand break repair and VJ recombination XRCC47.6e-1854.55Show/hide
Query:  MDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAKLREYRDQ---LSKQTTTGSKLKQEEYSSDKTD
        M+ANIRLSEEVV KT+S E++K+E+E+CLAQ EK+CDEK EFE A YAK       FL+VLNAKKAKLR  RD+   +       S  K E + S ++D
Subjt:  MDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAKLREYRDQ---LSKQTTTGSKLKQEEYSSDKTD

AT3G23100.1 homolog of human DNA ligase iv-binding protein XRCC46.6e-7059.11Show/hide
Query:  RHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLSWTFD
        +HTCL+LE+ + A P     IFVKGTW++ RFD+S+TDG  +W C+ATE+EV  RAAQWDQ  S+Y+ LAE+YLGFQQP S+Y F+DA  G KRLSWTF+
Subjt:  RHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLSWTFD

Query:  KEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAKLREY
        KEG KLEWRWKC+P+ D+K  T  IL+FLM+ANIRLSEEVV KT+S E++++E+E+CLAQ EK+CDEK EFE+A YAK       FL+VLNAKKAKLR  
Subjt:  KEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAKLREY

Query:  RDQLSKQTTTGSKLKQEEYSSDKTD
        RD+         ++ +EE S+DK +
Subjt:  RDQLSKQTTTGSKLKQEEYSSDKTD

AT3G23100.2 homolog of human DNA ligase iv-binding protein XRCC46.6e-7059.11Show/hide
Query:  RHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLSWTFD
        +HTCL+LE+ + A P     IFVKGTW++ RFD+S+TDG  +W C+ATE+EV  RAAQWDQ  S+Y+ LAE+YLGFQQP S+Y F+DA  G KRLSWTF+
Subjt:  RHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLSWTFD

Query:  KEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAKLREY
        KEG KLEWRWKC+P+ D+K  T  IL+FLM+ANIRLSEEVV KT+S E++++E+E+CLAQ EK+CDEK EFE+A YAK       FL+VLNAKKAKLR  
Subjt:  KEGMKLEWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAKLREY

Query:  RDQLSKQTTTGSKLKQEEYSSDKTD
        RD+         ++ +EE S+DK +
Subjt:  RDQLSKQTTTGSKLKQEEYSSDKTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGCAATCAGGCACACATGCCTGAAGCTTGAAGTACCAACCAACGCCCAGCCGGACGAAAGAAGCTCCATCTTCGTCAAAGGCACTTGGAATCATCACCGTTTCGA
TCTCTCCATCACCGACGGCCTCCACGCTTGGACGTGCCATGCGACGGAGGATGAGGTTCGATTGCGCGCTGCACAATGGGACCAAGAACCGTCGGACTATGTGGCATTGG
CCGAGCGCTATTTAGGGTTTCAACAGCCTGGTTCGATCTATGGCTTTGCCGATGCTGGAAATGGGTACAAGAGGCTTTCTTGGACATTTGACAAAGAAGGGATGAAGTTA
GAATGGCGATGGAAGTGTCAACCAGCGTCTGATAATAAGACAACTACAGCAGAAATATTGAACTTTCTTATGGATGCAAACATAAGGCTGAGCGAAGAGGTTGTGAGAAA
AACTCAATCAGTTGAGAGGCTGAAGGCAGAATCTGAGAAGTGTTTAGCTCAGAGTGAGAAGATTTGTGATGAAAAAGTGGAGTTTGAAACTGCAATATATGCAAAGGTAC
GGGAAATTTCATATACTTTTCTGAATGTCTTGAATGCAAAGAAGGCAAAGCTTAGAGAGTACAGAGATCAGCTTTCGAAACAAACCACTACCGGTAGCAAACTGAAACAA
GAAGAGTACTCCTCTGATAAAACCGATCTTTTGACGATGAAAGCGATGCAGAAAAGAACTGACAGAGTCGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGCAATCAGGCACACATGCCTGAAGCTTGAAGTACCAACCAACGCCCAGCCGGACGAAAGAAGCTCCATCTTCGTCAAAGGCACTTGGAATCATCACCGTTTCGA
TCTCTCCATCACCGACGGCCTCCACGCTTGGACGTGCCATGCGACGGAGGATGAGGTTCGATTGCGCGCTGCACAATGGGACCAAGAACCGTCGGACTATGTGGCATTGG
CCGAGCGCTATTTAGGGTTTCAACAGCCTGGTTCGATCTATGGCTTTGCCGATGCTGGAAATGGGTACAAGAGGCTTTCTTGGACATTTGACAAAGAAGGGATGAAGTTA
GAATGGCGATGGAAGTGTCAACCAGCGTCTGATAATAAGACAACTACAGCAGAAATATTGAACTTTCTTATGGATGCAAACATAAGGCTGAGCGAAGAGGTTGTGAGAAA
AACTCAATCAGTTGAGAGGCTGAAGGCAGAATCTGAGAAGTGTTTAGCTCAGAGTGAGAAGATTTGTGATGAAAAAGTGGAGTTTGAAACTGCAATATATGCAAAGGTAC
GGGAAATTTCATATACTTTTCTGAATGTCTTGAATGCAAAGAAGGCAAAGCTTAGAGAGTACAGAGATCAGCTTTCGAAACAAACCACTACCGGTAGCAAACTGAAACAA
GAAGAGTACTCCTCTGATAAAACCGATCTTTTGACGATGAAAGCGATGCAGAAAAGAACTGACAGAGTCGTATGA
Protein sequenceShow/hide protein sequence
MDAIRHTCLKLEVPTNAQPDERSSIFVKGTWNHHRFDLSITDGLHAWTCHATEDEVRLRAAQWDQEPSDYVALAERYLGFQQPGSIYGFADAGNGYKRLSWTFDKEGMKL
EWRWKCQPASDNKTTTAEILNFLMDANIRLSEEVVRKTQSVERLKAESEKCLAQSEKICDEKVEFETAIYAKVREISYTFLNVLNAKKAKLREYRDQLSKQTTTGSKLKQ
EEYSSDKTDLLTMKAMQKRTDRVV