CuGenDBv2

MC00g0018 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC00g0018
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: DNA repair protein XRCC4-like
Genome location: scaffold31:265520..270129
RNA-Seq Expression: MC00g0018
Synteny: MC00g0018
Gene Ontology terms:
GO:0006302 - double-strand break repair (biological process)
GO:0006310 - DNA recombination (biological process)
GO:0140513 - nuclear protein-containing complex (cellular component)
GO:0003677 - DNA binding (molecular function)
InterPro domains:
IPR010585 - DNA repair protein XRCC4
IPR014751 - DNA repair protein XRCC4-like, C-terminal
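The genome location in this record is written as `scaffold:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for such records, but not something this page states), the locus span could be computed as below. The names `Location` and `parse_location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(seqid, start, end)


loc = parse_location("scaffold31:265520..270129")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 4,610 bp, which is longer than the CDS and mRNA sequences listed further down.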


Homology
GenBank top hits (e value, % identity, alignment)
XP_008455175.1 PREDICTED: DNA repair protein XRCC4 [Cucumis melo] (e value: 2.06e-93; % identity: 66.94)
Query:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV
        MDAIRHTCLKLEVP+ AQ +E RS  FVKGTW  HRFDLSITDGL+AWTCH             +TEDEVRLRA QWDQEPSDYVALAER+LGFQQPGS+
Subjt:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV

Query:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV
        YGFADAGNG KR                                           EEVVRKTQS ERLK ESE CLAQSEKI  EKVEFETAIYAKFLNV
Subjt:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV

Query:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN
        LN+KKAKLR YRDQLSKQTT  SKLKQEE YSSDKTE+FDDESDAEKN
Subjt:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN

XP_022141947.1 DNA repair protein XRCC4-like [Momordica charantia] (e value: 1.24e-116; % identity: 77.02)
Query:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV
        MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCH             +TEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV
Subjt:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV

Query:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV
        YGFADAGNGDKR                                           EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV
Subjt:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV

Query:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN
        LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN
Subjt:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN

XP_022952201.1 DNA repair protein XRCC4 [Cucurbita moschata] (e value: 2.38e-92; % identity: 64.92)
Query:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV
        MDAIRHTCLKLEVP+ AQ +   SIF VKGTW++HRFDLSITDGLNAWTCH             +TEDEVRLRAEQWDQEPSDYV+LAER+LGFQQP SV
Subjt:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV

Query:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV
        YGFAD GNGDKR                                           EEVVRKTQSFE+LKVESE CLAQSE+I  EKVEFETA+YAKFLNV
Subjt:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV

Query:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN
        LN+KKAKLR YRDQ  KQTT SSKLKQ++EYS DKTE+FDD+SDAEKN
Subjt:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN

XP_022971997.1 DNA repair protein XRCC4 isoform X2 [Cucurbita maxima] (e value: 5.47e-100; % identity: 78.05)
Query:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV
        MDAIRHTCLKLEVP+ AQ +   SIF VKGTW++HRFDLSITDGLNAWTCH             +TEDEVRLRAEQWDQEPSDYV LAER+LGFQQP SV
Subjt:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV

Query:  YGFADAGNGDKREEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDES
        Y FAD GNGDKREEVVRKTQSFE+LKVESE CLAQSE+I  EKVEFETA+YAKFLNVLN+KKAKLR YRDQ  KQTT SSKLKQ++EYS DKTE+FDD+S
Subjt:  YGFADAGNGDKREEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDES

Query:  DAEKN
        DAEKN
Subjt:  DAEKN

XP_023553968.1 DNA repair protein XRCC4 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 2.71e-100; % identity: 78.05)
Query:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV
        MDAIRHTCLKL+VP+ AQ +   SIF VKGTW+ HRFDLSITDGLNAWTCH             +TEDEVRLRAEQWDQEPSDYV+LAER+LGFQQP SV
Subjt:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV

Query:  YGFADAGNGDKREEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDES
        YGFAD GNGDKREEVVRKTQSFE+LKVESE CLAQSE+I  EKVEFETA+YAKFLNVLN+KKAKLR YRDQ  KQTT SSKLKQ++EYS DKTE+FDD+S
Subjt:  YGFADAGNGDKREEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDES

Query:  DAEKN
        DAEKN
Subjt:  DAEKN

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BZW1 DNA repair protein XRCC4 (e value: 9.95e-94; % identity: 66.94)
Query:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV
        MDAIRHTCLKLEVP+ AQ +E RS  FVKGTW  HRFDLSITDGL+AWTCH             +TEDEVRLRA QWDQEPSDYVALAER+LGFQQPGS+
Subjt:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV

Query:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV
        YGFADAGNG KR                                           EEVVRKTQS ERLK ESE CLAQSEKI  EKVEFETAIYAKFLNV
Subjt:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV

Query:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN
        LN+KKAKLR YRDQLSKQTT  SKLKQEE YSSDKTE+FDDESDAEKN
Subjt:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN

A0A5A7SK69 DNA repair protein XRCC4 (e value: 9.95e-94; % identity: 66.94)
Query:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV
        MDAIRHTCLKLEVP+ AQ +E RS  FVKGTW  HRFDLSITDGL+AWTCH             +TEDEVRLRA QWDQEPSDYVALAER+LGFQQPGS+
Subjt:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV

Query:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV
        YGFADAGNG KR                                           EEVVRKTQS ERLK ESE CLAQSEKI  EKVEFETAIYAKFLNV
Subjt:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV

Query:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN
        LN+KKAKLR YRDQLSKQTT  SKLKQEE YSSDKTE+FDDESDAEKN
Subjt:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN

A0A6J1CL88 DNA repair protein XRCC4-like (e value: 6.00e-117; % identity: 77.02)
Query:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV
        MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCH             +TEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV
Subjt:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV

Query:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV
        YGFADAGNGDKR                                           EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV
Subjt:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV

Query:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN
        LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN
Subjt:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN

A0A6J1GJL0 DNA repair protein XRCC4 (e value: 1.15e-92; % identity: 64.92)
Query:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV
        MDAIRHTCLKLEVP+ AQ +   SIF VKGTW++HRFDLSITDGLNAWTCH             +TEDEVRLRAEQWDQEPSDYV+LAER+LGFQQP SV
Subjt:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV

Query:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV
        YGFAD GNGDKR                                           EEVVRKTQSFE+LKVESE CLAQSE+I  EKVEFETA+YAKFLNV
Subjt:  YGFADAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNV

Query:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN
        LN+KKAKLR YRDQ  KQTT SSKLKQ++EYS DKTE+FDD+SDAEKN
Subjt:  LNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN

A0A6J1I3G3 DNA repair protein XRCC4 isoform X2 (e value: 2.65e-100; % identity: 78.05)
Query:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV
        MDAIRHTCLKLEVP+ AQ +   SIF VKGTW++HRFDLSITDGLNAWTCH             +TEDEVRLRAEQWDQEPSDYV LAER+LGFQQP SV
Subjt:  MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSV

Query:  YGFADAGNGDKREEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDES
        Y FAD GNGDKREEVVRKTQSFE+LKVESE CLAQSE+I  EKVEFETA+YAKFLNVLN+KKAKLR YRDQ  KQTT SSKLKQ++EYS DKTE+FDD+S
Subjt:  YGFADAGNGDKREEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDES

Query:  DAEKN
        DAEKN
Subjt:  DAEKN

SwissProt top hits (e value, % identity, alignment)
Q682V0 DNA repair protein XRCC4 (e value: 7.1e-38; % identity: 42.45)
Query:  RHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSVYGFA
        +HTCL+LE+             FVKGTW   RFD+S+TDG ++W C+             +TE+EV  RA QWDQ  S+Y+ LAE++LGFQQP SVY F+
Subjt:  RHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSVYGFA

Query:  DAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSK
        DA  G KR                                           EEVV KT+SFE+++ E+E CLAQ EK+  EK EFE+A YAKFL+VLN+K
Subjt:  DAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSK

Query:  KAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDD-ESDAEKN
        KAKLRA RD+        S    EEE S+DK E+F+   SD EK+
Subjt:  KAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDD-ESDAEKN

Arabidopsis top hits (e value, % identity, alignment)
AT1G61410.1 DNA double-strand break repair and VJ recombination XRCC4 (e value: 3.0e-15; % identity: 54.64)
Query:  EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETF-----DDESDAEK
        EEVV KT+SFE++K E+E CLAQ EK+  EK EFE A YAKFL+VLN+KKAKLRA RD+        S    EEE S+ K E+F     DDE   E+
Subjt:  EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETF-----DDESDAEK

AT3G23100.1 homolog of human DNA ligase iv-binding protein XRCC4 (e value: 5.0e-39; % identity: 42.45)
Query:  RHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSVYGFA
        +HTCL+LE+             FVKGTW   RFD+S+TDG ++W C+             +TE+EV  RA QWDQ  S+Y+ LAE++LGFQQP SVY F+
Subjt:  RHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSVYGFA

Query:  DAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSK
        DA  G KR                                           EEVV KT+SFE+++ E+E CLAQ EK+  EK EFE+A YAKFL+VLN+K
Subjt:  DAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSK

Query:  KAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDD-ESDAEKN
        KAKLRA RD+        S    EEE S+DK E+F+   SD EK+
Subjt:  KAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDD-ESDAEKN

AT3G23100.2 homolog of human DNA ligase iv-binding protein XRCC4 (e value: 5.0e-39; % identity: 42.45)
Query:  RHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSVYGFA
        +HTCL+LE+             FVKGTW   RFD+S+TDG ++W C+             +TE+EV  RA QWDQ  S+Y+ LAE++LGFQQP SVY F+
Subjt:  RHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSVYGFA

Query:  DAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSK
        DA  G KR                                           EEVV KT+SFE+++ E+E CLAQ EK+  EK EFE+A YAKFL+VLN+K
Subjt:  DAGNGDKR-------------------------------------------EEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSK

Query:  KAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDD-ESDAEKN
        KAKLRA RD+        S    EEE S+DK E+F+   SD EK+
Subjt:  KAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDD-ESDAEKN


Sequences
CDS sequence
ATGGACGCGATCAGGCACACATGCCTGAAGCTTGAAGTGCCAAGTATCGCGCAATCGAACGAAAGCCGCTCCATCTTCTTCGTCAAAGGCACTTGGTTTCGCCACCGCTT
CGATCTCTCCATTACCGACGGCCTCAATGCTTGGACTTGCCATGGTGGTGCCATTTCTTCAAACCACCACTTGTTATCGTTATCGACGGAGGACGAGGTTCGATTGCGCG
CCGAACAATGGGACCAAGAACCCTCGGACTATGTGGCGTTGGCGGAACGGCATTTAGGGTTTCAGCAGCCTGGTTCGGTCTATGGGTTCGCCGACGCTGGAAATGGGGAC
AAGAGGGAAGAGGTTGTCAGAAAAACACAATCGTTTGAGAGGCTGAAAGTTGAATCTGAGAATTGTTTGGCTCAAAGTGAGAAGATTAGTGGCGAAAAGGTGGAGTTCGA
AACTGCAATTTATGCAAAGTTTCTCAATGTCTTGAACTCGAAGAAAGCAAAACTTAGAGCGTACAGAGATCAGCTTTCGAAACAGACTACTGGCAGCAGCAAGCTGAAAC
AAGAAGAGGAGTACTCCTCTGACAAAACCGAAACTTTTGACGATGAAAGCGATGCCGAAAAGAACTAA
mRNA sequence
ATGGACGCGATCAGGCACACATGCCTGAAGCTTGAAGTGCCAAGTATCGCGCAATCGAACGAAAGCCGCTCCATCTTCTTCGTCAAAGGCACTTGGTTTCGCCACCGCTT
CGATCTCTCCATTACCGACGGCCTCAATGCTTGGACTTGCCATGGTGGTGCCATTTCTTCAAACCACCACTTGTTATCGTTATCGACGGAGGACGAGGTTCGATTGCGCG
CCGAACAATGGGACCAAGAACCCTCGGACTATGTGGCGTTGGCGGAACGGCATTTAGGGTTTCAGCAGCCTGGTTCGGTCTATGGGTTCGCCGACGCTGGAAATGGGGAC
AAGAGGGAAGAGGTTGTCAGAAAAACACAATCGTTTGAGAGGCTGAAAGTTGAATCTGAGAATTGTTTGGCTCAAAGTGAGAAGATTAGTGGCGAAAAGGTGGAGTTCGA
AACTGCAATTTATGCAAAGTTTCTCAATGTCTTGAACTCGAAGAAAGCAAAACTTAGAGCGTACAGAGATCAGCTTTCGAAACAGACTACTGGCAGCAGCAAGCTGAAAC
AAGAAGAGGAGTACTCCTCTGACAAAACCGAAACTTTTGACGATGAAAGCGATGCCGAAAAGAACTAACGAAGAATGAAGGAATGTTATATGCAATTAGTTAGGATGACA
CACACACATCTTTGGTAATGTTTCATGGCCAAAATCCCTCTCTTTCTCTCTACTGATTAAAGTTCTAACGGATTTTGTGGATTTTTGTAGTTTTAACTTGTTCTAACGTA
AGCTATTGCTTGGACTGAGTTGTCATTTTATTTTATTTTTTGTGTAGAGCTTTGTTAACCTCTAGGAAACAGTGGGATAAGAGCGCTTTGATAGTCTCACGGAGCAATGC
ACTTATTCGTTCGCATAAAATGAAAGTTTACTATTAAACTGGGATTCAGATTCATATGATTAATCTATTGATAGATTTGTATCTCCGATAAATTATAAGAAATGTTGCGG
TCTACGCTTATACAATCTTCATTGTATTGTTTACAATCTTTCTAAATCTAAAACACAATGAATTTTTGGTGCCACTGATTGTAAGGCAAGTTTGGCAAAGTAATGCTAAG
G
Protein sequence
MDAIRHTCLKLEVPSIAQSNESRSIFFVKGTWFRHRFDLSITDGLNAWTCHGGAISSNHHLLSLSTEDEVRLRAEQWDQEPSDYVALAERHLGFQQPGSVYGFADAGNGD
KREEVVRKTQSFERLKVESENCLAQSEKISGEKVEFETAIYAKFLNVLNSKKAKLRAYRDQLSKQTTGSSKLKQEEEYSSDKTETFDDESDAEKN
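The CDS and protein sequences in this record can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; the `translate` function and the `CODON_TABLE` name are illustrative and not part of the database. Only the first 30 nucleotides and the last 18 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (e.g. line wraps) is removed first; codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt of the CDS listed above.
print(translate("ATGGACGCGATCAGGCACACATGCCTGAAG"))
# Last 18 nt of the CDS listed above, including the TAA stop codon.
print(translate("GATGCCGAAAAGAACTAA"))
```

The two fragments translate to `MDAIRHTCLK` and `DAEKN`, matching the start and end of the protein sequence shown in this record. Passing the full CDS (line wraps included) to `translate` would perform the same check over the whole sequence.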