; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg22824 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg22824
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionDNA repair protein XRCC4
Genome locationCarg_Chr19:5265285..5276380
RNA-Seq ExpressionCarg22824
SyntenyCarg22824
Gene Ontology termsGO:0006303 - double-strand break repair via nonhomologous end joining (biological process)
GO:0006310 - DNA recombination (biological process)
GO:0010165 - response to X-ray (biological process)
GO:0051103 - DNA ligation involved in DNA repair (biological process)
GO:0051351 - positive regulation of ligase activity (biological process)
GO:0005958 - DNA-dependent protein kinase-DNA ligase 4 complex (cellular component)
GO:0032807 - DNA ligase IV complex (cellular component)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR010585 - DNA repair protein XRCC4
IPR014751 - DNA repair protein XRCC4-like, C-terminal
IPR038051 - XRCC4-like, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571850.1 DNA repair protein XRCC4, partial [Cucurbita argyrosperma subsp. sororia]1.7e-11994.42Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
        MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS

Query:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ
        WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS            LKVESEKCLAQSERICEEKVEFETALYAK LNVLNTKKAKLREYRDQ
Subjt:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ

Query:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
        FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
Subjt:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN

KAG7011537.1 DNA repair protein XRCC4 [Cucurbita argyrosperma subsp. argyrosperma]1.3e-122100Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
        MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS

Query:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLSLKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQFPKQTTTSSKLK
        WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLSLKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQFPKQTTTSSKLK
Subjt:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLSLKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQFPKQTTTSSKLK

Query:  QDDEYSDKTESFDDDSDAEKN
        QDDEYSDKTESFDDDSDAEKN
Subjt:  QDDEYSDKTESFDDDSDAEKN

XP_022952201.1 DNA repair protein XRCC4 [Cucurbita moschata]1.7e-11994.42Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
        MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS

Query:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ
        WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS            LKVESEKCLAQSERICEEKVEFETALYAK LNVLNTKKAKLREYRDQ
Subjt:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ

Query:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
        FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
Subjt:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN

XP_022971995.1 DNA repair protein XRCC4 isoform X1 [Cucurbita maxima]3.2e-11893.56Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
        MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYV LAERYLGFQQPDSVY FADVGNGDKRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS

Query:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ
        WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS            LKVESEKCLAQSERICEEKVEFETALYAK LNVLNTKKAKLREYRDQ
Subjt:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ

Query:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
        FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
Subjt:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN

XP_023553967.1 DNA repair protein XRCC4 isoform X1 [Cucurbita pepo subsp. pepo]7.2e-11893.13Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
        MDAIRHTCLKL+VPT+AQPDGRVSIFVKGTWY HRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS

Query:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ
        WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS            LKVESEKCLAQSERICEEKVEFETALYAK LNVLNTKKAKLREYRDQ
Subjt:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ

Query:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
        FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
Subjt:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN

TrEMBL top hitse value%identityAlignment
A0A0A0K2L2 Uncharacterized protein1.1e-9879.83Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
        MDAIRHTCLKLEVPTNAQP+ R SIFVKGTW   RFDLSITDG +AWTCHATEDEVRLRA QWDQEPSDYV+LAERYLGFQQP S+Y FAD GNG KRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS

Query:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ
        WTF+KEGMKLEWRWKC+ ASDNKTTTAGIL+FLMDANIRLS            LK ESEKCLAQSE+IC+EKVEFETA+YAK LNVLN KKAKLREYRDQ
Subjt:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ

Query:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
          +QT T SKLKQ++  SDKTESFDD+SDAEKN
Subjt:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN

A0A1S3BZW1 DNA repair protein XRCC46.0e-10281.97Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
        MDAIRHTCLKLEVPTNAQPD R SIFVKGTW  HRFDLSITDGL+AWTCHATEDEVRLRA QWDQEPSDYV+LAERYLGFQQP S+YGFAD GNG KRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS

Query:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ
        WTF+KEGMKLEWRWKC+ ASDNKTTTA IL+FLMDANIRLS            LK ESEKCLAQSE+IC+EKVEFETA+YAK LNVLN KKAKLREYRDQ
Subjt:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ

Query:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
          KQTTT SKLKQ++  SDKTESFDD+SDAEKN
Subjt:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN

A0A5A7SK69 DNA repair protein XRCC46.0e-10281.97Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
        MDAIRHTCLKLEVPTNAQPD R SIFVKGTW  HRFDLSITDGL+AWTCHATEDEVRLRA QWDQEPSDYV+LAERYLGFQQP S+YGFAD GNG KRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS

Query:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ
        WTF+KEGMKLEWRWKC+ ASDNKTTTA IL+FLMDANIRLS            LK ESEKCLAQSE+IC+EKVEFETA+YAK LNVLN KKAKLREYRDQ
Subjt:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ

Query:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
          KQTTT SKLKQ++  SDKTESFDD+SDAEKN
Subjt:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN

A0A6J1GJL0 DNA repair protein XRCC48.3e-12094.42Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
        MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS

Query:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ
        WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS            LKVESEKCLAQSERICEEKVEFETALYAK LNVLNTKKAKLREYRDQ
Subjt:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ

Query:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
        FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
Subjt:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN

A0A6J1I7A5 DNA repair protein XRCC4 isoform X11.6e-11893.56Show/hide
Query:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS
        MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYV LAERYLGFQQPDSVY FADVGNGDKRLS
Subjt:  MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLS

Query:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ
        WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS            LKVESEKCLAQSERICEEKVEFETALYAK LNVLNTKKAKLREYRDQ
Subjt:  WTFNKEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQ

Query:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
        FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN
Subjt:  FPKQTTTSSKLKQDDEYSDKTESFDDDSDAEKN

SwissProt top hitse value%identityAlignment
Q682V0 DNA repair protein XRCC44.0e-6354.78Show/hide
Query:  RHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLSWTFN
        +HTCL+LE+       G   IFVKGTW+  RFD+S+TDG ++W C+ATE+EV  RA QWDQ  S+Y+ LAE+YLGFQQP+SVY F+D   G KRLSWTF 
Subjt:  RHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLSWTFN

Query:  KEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQFPKQ
        KEG KLEWRWKC+ + D+K  T GILDFLM+ANIRLS            ++ E+E+CLAQ E++C+EK EFE+A YAK L+VLN KKAKLR  RD+    
Subjt:  KEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQFPKQ

Query:  TTTSSKLKQDDEYSDKTESFDDD-SDAEKN
           S ++ +++E +DK ESF+   SD EK+
Subjt:  TTTSSKLKQDDEYSDKTESFDDD-SDAEKN

Arabidopsis top hitse value%identityAlignment
AT1G61410.1 DNA double-strand break repair and VJ recombination XRCC41.1e-1045.54Show/hide
Query:  MDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQFPKQTTTSSKLKQDDEYSDKTESFDDD-SDAEK
        M+ANIRLS            +K E+E+CLAQ E++C+EK EFE A YAK L+VLN KKAKLR  RD+       S +  +++E + K ESF+   SD E+
Subjt:  MDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQFPKQTTTSSKLKQDDEYSDKTESFDDD-SDAEK

Query:  N
        +
Subjt:  N

AT3G23100.1 homolog of human DNA ligase iv-binding protein XRCC42.9e-6454.78Show/hide
Query:  RHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLSWTFN
        +HTCL+LE+       G   IFVKGTW+  RFD+S+TDG ++W C+ATE+EV  RA QWDQ  S+Y+ LAE+YLGFQQP+SVY F+D   G KRLSWTF 
Subjt:  RHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLSWTFN

Query:  KEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQFPKQ
        KEG KLEWRWKC+ + D+K  T GILDFLM+ANIRLS            ++ E+E+CLAQ E++C+EK EFE+A YAK L+VLN KKAKLR  RD+    
Subjt:  KEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQFPKQ

Query:  TTTSSKLKQDDEYSDKTESFDDD-SDAEKN
           S ++ +++E +DK ESF+   SD EK+
Subjt:  TTTSSKLKQDDEYSDKTESFDDD-SDAEKN

AT3G23100.2 homolog of human DNA ligase iv-binding protein XRCC42.9e-6454.78Show/hide
Query:  RHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLSWTFN
        +HTCL+LE+       G   IFVKGTW+  RFD+S+TDG ++W C+ATE+EV  RA QWDQ  S+Y+ LAE+YLGFQQP+SVY F+D   G KRLSWTF 
Subjt:  RHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLSWTFN

Query:  KEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQFPKQ
        KEG KLEWRWKC+ + D+K  T GILDFLM+ANIRLS            ++ E+E+CLAQ E++C+EK EFE+A YAK L+VLN KKAKLR  RD+    
Subjt:  KEGMKLEWRWKCRQASDNKTTTAGILDFLMDANIRLS------------LKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQFPKQ

Query:  TTTSSKLKQDDEYSDKTESFDDD-SDAEKN
           S ++ +++E +DK ESF+   SD EK+
Subjt:  TTTSSKLKQDDEYSDKTESFDDD-SDAEKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGCGATCAGACACACATGCCTGAAGCTTGAAGTACCAACCAACGCGCAACCGGACGGAAGAGTCTCCATCTTCGTTAAAGGCACTTGGTATCAGCACCGCTTCGA
TCTCTCCATCACCGACGGCCTCAACGCTTGGACGTGCCATGCGACGGAGGACGAGGTTCGGTTGCGCGCTGAACAATGGGACCAAGAACCGTCGGACTATGTGTCCTTGG
CCGAGCGCTATTTAGGGTTTCAGCAGCCTGATTCGGTCTATGGCTTTGCCGATGTTGGAAATGGGGACAAGAGGCTTTCTTGGACATTTAACAAAGAAGGGATGAAGTTA
GAATGGCGATGGAAATGTCGACAGGCATCGGATAATAAGACTACTACAGCAGGAATATTGGACTTTCTCATGGATGCAAACATAAGGCTGAGCCTGAAAGTGGAGTCTGA
GAAGTGTTTGGCTCAGAGCGAGAGGATTTGTGAAGAAAAGGTGGAGTTTGAAACTGCACTATATGCAAAGATTCTAAATGTCTTGAATACAAAGAAGGCAAAACTTAGAG
AGTACAGAGATCAGTTTCCAAAACAGACCACAACTAGCAGCAAACTTAAACAAGACGACGAGTACTCTGATAAAACCGAATCGTTTGACGATGATAGCGATGCCGAAAAG
AACTGA
mRNA sequenceShow/hide mRNA sequence
TATATATACTTTAATTTGTTGATTTTATTTTTAAATACCCTGCACCCCCTTGACTTTTAGCAACCTTCCTCCATTAGCCTTTTCAGAGACGAAGAAAAGCGCCATGGACG
CGATCAGACACACATGCCTGAAGCTTGAAGTACCAACCAACGCGCAACCGGACGGAAGAGTCTCCATCTTCGTTAAAGGCACTTGGTATCAGCACCGCTTCGATCTCTCC
ATCACCGACGGCCTCAACGCTTGGACGTGCCATGCGACGGAGGACGAGGTTCGGTTGCGCGCTGAACAATGGGACCAAGAACCGTCGGACTATGTGTCCTTGGCCGAGCG
CTATTTAGGGTTTCAGCAGCCTGATTCGGTCTATGGCTTTGCCGATGTTGGAAATGGGGACAAGAGGCTTTCTTGGACATTTAACAAAGAAGGGATGAAGTTAGAATGGC
GATGGAAATGTCGACAGGCATCGGATAATAAGACTACTACAGCAGGAATATTGGACTTTCTCATGGATGCAAACATAAGGCTGAGCCTGAAAGTGGAGTCTGAGAAGTGT
TTGGCTCAGAGCGAGAGGATTTGTGAAGAAAAGGTGGAGTTTGAAACTGCACTATATGCAAAGATTCTAAATGTCTTGAATACAAAGAAGGCAAAACTTAGAGAGTACAG
AGATCAGTTTCCAAAACAGACCACAACTAGCAGCAAACTTAAACAAGACGACGAGTACTCTGATAAAACCGAATCGTTTGACGATGATAGCGATGCCGAAAAGAACTGAC
GAAGAACAACGTTGAGACTTAACGCAAGGAATCTTACATCAAATAAGTTGGATTTGTTAATCCATATTAGTACGTTAAACATTTTCTTTGTTTTTCGAGTATGGGAGAGA
AGATTTGAATCTCCAATATCAAGATAGATATATTTACCGATGAACTAGGTTCATATTGACATTAGGGAACAACGACAAATACTATGTTCATATTGCTCAATTGTATAATA
TGAACTCCAAGGACGATGGATTTCATTTCTTTTTAGATTGTTACAAATTTTGTGGGGACAATAATTTAGAGTTATGGTATTTCATTAGTGTATTTTTAAGA
Protein sequenceShow/hide protein sequence
MDAIRHTCLKLEVPTNAQPDGRVSIFVKGTWYQHRFDLSITDGLNAWTCHATEDEVRLRAEQWDQEPSDYVSLAERYLGFQQPDSVYGFADVGNGDKRLSWTFNKEGMKL
EWRWKCRQASDNKTTTAGILDFLMDANIRLSLKVESEKCLAQSERICEEKVEFETALYAKILNVLNTKKAKLREYRDQFPKQTTTSSKLKQDDEYSDKTESFDDDSDAEK
N