; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS009457 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS009457
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDNA-repair protein XRCC1
Genome locationscaffold813:1442677..1446889
RNA-Seq ExpressionMS009457
SyntenyMS009457
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006284 - base-excision repair (biological process)
GO:0006303 - double-strand break repair via nonhomologous end joining (biological process)
GO:0003684 - damaged DNA binding (molecular function)
InterPro domainsIPR001357 - BRCT domain
IPR036420 - BRCT domain superfamily
IPR045080 - XRCC1, first (central) BRCT domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575562.1 DNA-repair protein XRCC1, partial [Cucurbita argyrosperma subsp. sororia]1.5e-16080.53Show/hide
Query:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM
        MSNSK N GGG AKRSLPSWMSGKDDGSTSRGKKP SS SG N+V+AEAEE +Q    GE P+SSSLH DF++LLEGVVFVLSGFVNPERSILRSQALEM
Subjt:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM

Query:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ
        GAQY+PDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWIS CYAQ+KL+DIES+LLHAGKPWRRSN   EA QA   + SKKPQKPVE+ S LK +EQ
Subjt:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ

Query:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS
        +I+QSR+  SSRECFSP KLKKWA DDY+KT+SWL+SQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQ+QDINQ+TEEWKFVPQVVEELAK  SKKESIS
Subjt:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS

Query:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKR-PNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKK
        KEEL R A DSK+IYEVELN LL +SP+RKK+ PNI+KELKNG K KE       YDSDDTIEMTEEEID+AFQ VACKK
Subjt:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKR-PNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKK

XP_016902642.1 PREDICTED: DNA-repair protein XRCC1 isoform X2 [Cucumis melo]3.4e-16079.21Show/hide
Query:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM
        MSNSK N GGGSAKRSLPSWMSGKDDGSTSRGKKP SSGS GN+++ EAEE KQ  GNGE P+SSSLHRDFS+LLEGVVFVLSGFVNPERSILRSQALEM
Subjt:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM

Query:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ
        GAQY+PDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWIS CYAQK+L+DIES+LL+AGKPWRRS+LSHEA Q  I S SKKPQK VE+ S  K +E 
Subjt:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ

Query:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS
        +I+QSR +N SR+CFSP KLKKWA DDY+KT+SWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQ+QDINQ+ EEWKFVPQVVEELAK  +KKESIS
Subjt:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS

Query:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT
        K+EL R A DSK IYEVELN LLD SP+RKK+ NI KE K G + KE       YDSDDTIEMTEEEID+AF  V CK +
Subjt:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT

XP_022150690.1 DNA-repair protein XRCC1 [Momordica charantia]1.2e-21099.74Show/hide
Query:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM
        MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM
Subjt:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM

Query:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ
        GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ
Subjt:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ

Query:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS
        DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAK CSKKESIS
Subjt:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS

Query:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT
        KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT
Subjt:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT

XP_022954387.1 DNA-repair protein XRCC1 isoform X2 [Cucurbita moschata]5.9e-16080.26Show/hide
Query:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM
        MSNSK N GGG+AKRSLPSWMSGKD GSTSRGKKP SS SG N+V+AEAEE +Q    GE P+SSSLH DF++LLEGVVFVLSGFVNPERSILRSQALEM
Subjt:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM

Query:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ
        GAQY+PDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWIS CYAQ+KL+DIES+LLHAGKPWRRSN   EA QA   + SKKPQKPVE+ S LK +EQ
Subjt:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ

Query:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS
        +I+QSR+  SSRECFSP KLKKWA DDY+KT+SWL+SQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQ+QDINQ+TEEWKFVPQVVEELAK  SKKESIS
Subjt:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS

Query:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKR-PNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKK
        KEEL R A DSK+IYEVELN LL +SP+RKK+ PNI+KELKNG K KE       YDSDDTIEMTEEEID+AFQ VACKK
Subjt:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKR-PNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKK

XP_038899733.1 DNA-repair protein XRCC1 [Benincasa hispida]6.7e-16480.79Show/hide
Query:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM
        MS+SKTN GGG+AKRSLPSWMSGKDDGS+SRGKKP SSGS GN+V  EA+E KQ  GNGE P+SSSLHRDFS+LLEGVVFVLSGFVNPERSILRSQALEM
Subjt:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM

Query:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ
        GAQY+PDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWIS CYAQK+L+DIES+LL+AGKPWRRSNL+ EA QA + SSSKKPQKPVE+ S LK +EQ
Subjt:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ

Query:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS
        +I+QSR+ NSSRECFSP KLKKWA DDY+KT+SWLESQEEKPDP EIKKIAAEGILTCLQDAIDSLHQ+QDINQ+TEEWKFVPQVVEELAK  +KKESIS
Subjt:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS

Query:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT
        KEEL R A DSK+IYE+ELN LLD+S +RKK+PN++KE KNG K KE       YDSDDTIEMTE+EID+AFQ VACKK+
Subjt:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT

TrEMBL top hitse value%identityAlignment
A0A0A0KAC1 BRCT domain-containing protein4.8e-16078.95Show/hide
Query:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM
        MSNSK N GGGSAKRSLPSWMSGKDDGSTSRGKKP SS S GN+++ EAEE KQ  GNGE P+SSSLH DFS+LLEGVVFVLSGFVNPERSILRSQALEM
Subjt:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM

Query:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ
        GAQY+PDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWIS CYAQK+L+DIES+LL+AGKPWRR +LS EA Q  I S SKKPQ+ VE+TSHLK +E 
Subjt:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ

Query:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS
        +I+QSR +N SR+CFSP KLKKWA DDY+KT+SWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQ+QDINQ+TEEWKFVPQVVEELAK  +KKESIS
Subjt:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS

Query:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT
        KEEL RQA DSK IYEVELN LL++SP+RKK+  + KE KNG + KE       YDSDDTIEMTEEEID+AF  V CK +
Subjt:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT

A0A1S4E338 DNA-repair protein XRCC1 isoform X21.7e-16079.21Show/hide
Query:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM
        MSNSK N GGGSAKRSLPSWMSGKDDGSTSRGKKP SSGS GN+++ EAEE KQ  GNGE P+SSSLHRDFS+LLEGVVFVLSGFVNPERSILRSQALEM
Subjt:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM

Query:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ
        GAQY+PDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWIS CYAQK+L+DIES+LL+AGKPWRRS+LSHEA Q  I S SKKPQK VE+ S  K +E 
Subjt:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ

Query:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS
        +I+QSR +N SR+CFSP KLKKWA DDY+KT+SWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQ+QDINQ+ EEWKFVPQVVEELAK  +KKESIS
Subjt:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS

Query:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT
        K+EL R A DSK IYEVELN LLD SP+RKK+ NI KE K G + KE       YDSDDTIEMTEEEID+AF  V CK +
Subjt:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT

A0A5A7USA9 DNA-repair protein XRCC1 isoform X21.1e-15978.95Show/hide
Query:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM
        MSNSK N GGGSAKRSLPSWMSGKDDGSTSRGKKP SSGS GN+++ EAEE KQ  GNGE P+SSSLHRDFS+LLEGVVFVLSGFVNPERSILRSQALEM
Subjt:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM

Query:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ
        GAQY+PDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWIS CYAQK+L+DIES+LL+AGKPWRRS+LSHEA Q  I S  KKPQK VE+ S  K +E 
Subjt:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ

Query:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS
        +I+QSR +N SR+CFSP KLKKWA DDY+KT+SWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQ+QDINQ+ EEWKFVPQVVEELAK  +KKESIS
Subjt:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS

Query:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT
        K+EL R A DSK IYEVELN LLD SP+RKK+ NI KE K G + KE       YDSDDTIEMTEEEID+AF  V CK +
Subjt:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT

A0A6J1D979 DNA-repair protein XRCC16.0e-21199.74Show/hide
Query:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM
        MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM
Subjt:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM

Query:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ
        GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ
Subjt:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ

Query:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS
        DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAK CSKKESIS
Subjt:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS

Query:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT
        KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT
Subjt:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT

A0A6J1GQV2 DNA-repair protein XRCC1 isoform X22.8e-16080.26Show/hide
Query:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM
        MSNSK N GGG+AKRSLPSWMSGKD GSTSRGKKP SS SG N+V+AEAEE +Q    GE P+SSSLH DF++LLEGVVFVLSGFVNPERSILRSQALEM
Subjt:  MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEM

Query:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ
        GAQY+PDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWIS CYAQ+KL+DIES+LLHAGKPWRRSN   EA QA   + SKKPQKPVE+ S LK +EQ
Subjt:  GAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQ

Query:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS
        +I+QSR+  SSRECFSP KLKKWA DDY+KT+SWL+SQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQ+QDINQ+TEEWKFVPQVVEELAK  SKKESIS
Subjt:  DISQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESIS

Query:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKR-PNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKK
        KEEL R A DSK+IYEVELN LL +SP+RKK+ PNI+KELKNG K KE       YDSDDTIEMTEEEID+AFQ VACKK
Subjt:  KEELRRQAIDSKKIYEVELNFLLDNSPDRKKR-PNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKK

SwissProt top hitse value%identityAlignment
O54935 DNA repair protein XRCC13.6e-1933.33Show/hide
Query:  RDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLL----------
        ++  K+L+GVV VLSGF NP RS LR +ALE+GA+Y+PDW  D T LICAF NTPK+ QV    G IV KEW+  CY  ++ +    +L+          
Subjt:  RDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLL----------

Query:  ------HAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQDISQSRDNNSSR----ECFSPPKLKKWATDDYNKTISWLESQEEKPDPSE
               +G+      LS +  QAK  + +  P  P +R    K+ +    + +DN+ +     E           T+D  + ++    Q + P P E
Subjt:  ------HAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQDISQSRDNNSSR----ECFSPPKLKKWATDDYNKTISWLESQEEKPDPSE

P18887 DNA repair protein XRCC17.3e-2034.5Show/hide
Query:  DFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLL-----------
        +  K+L+GVV VLSGF NP RS LR +ALE+GA+Y+PDW  D T LICAF NTPK+ QV    G IV KEW+  C+  ++ +    +L+           
Subjt:  DFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLL-----------

Query:  --HAG--------KPWRRSNLSHEARQAKIASSSKKPQKPVE--RTSHLKQNEQDI----SQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKP
          H+G         P ++     +  QA   SS +KP  P E    S + Q + DI    S+ +DN +     +  +L++ A    ++     E   E P
Subjt:  --HAG--------KPWRRSNLSHEARQAKIASSSKKPQKPVE--RTSHLKQNEQDI----SQSRDNNSSRECFSPPKLKKWATDDYNKTISWLESQEEKP

Q24JK4 DNA-repair protein XRCC12.6e-9455.59Show/hide
Query:  SAKRSLPSWMSGKD---DGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDW
        S KR+LPSWMS +D     S S  KKP   G        E       S   E    SS   +FSKL+EGVVFVLSGFVNPERS LRSQAL MGA YQPDW
Subjt:  SAKRSLPSWMSGKD---DGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDW

Query:  NSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQDISQSRDN
        N+  TLLICAFPNTPKFRQVE++ GTI+SKEWI+ CYAQKKL+DIE +L+HAGKPWR+S+   +A + K    SKKP+K VE+ +  +      S++R  
Subjt:  NSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQDISQSRDN

Query:  -NSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKF--CSKKE--SISKEE
         N  +E F   ++KKWA DD ++TISWLESQEEKP+P EIK+IAAEG+LTCLQDAIDSL Q QDI  VTE W FVP+VV+EL K    SKKE  + SKEE
Subjt:  -NSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKF--CSKKE--SISKEE

Query:  LRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDE---RVARGYDSDDTIEMTEEEIDLAFQNVA
        + +QA   KKIYE EL                        K  EDE   RVA GYDSD T+EMTEEEI+LA++NV+
Subjt:  LRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDE---RVARGYDSDDTIEMTEEEIDLAFQNVA

Q60596 DNA repair protein XRCC11.9e-2047.57Show/hide
Query:  GNGEDPISSSL-HRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIES
        G G +P  +    ++  K+L+GVV VLSGF NP RS LR +ALE+GA+Y+PDW  D T LICAF NTPK+ QV    G IV KEW+  C+  ++ +    
Subjt:  GNGEDPISSSL-HRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIES

Query:  HLL
        +L+
Subjt:  HLL

Q9ESZ0 DNA repair protein XRCC13.8e-2148.54Show/hide
Query:  GNGEDPISSSL-HRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIES
        G G +P  +    ++  K+L+GVV VLSGF NP RS LR +ALE+GA+Y+PDW  D T LICAF NTPK+ QV    G IV KEW+  CY  ++ +    
Subjt:  GNGEDPISSSL-HRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDWNSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIES

Query:  HLL
        +L+
Subjt:  HLL

Arabidopsis top hitse value%identityAlignment
AT1G80420.1 BRCT domain-containing DNA repair protein1.9e-9555.59Show/hide
Query:  SAKRSLPSWMSGKD---DGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDW
        S KR+LPSWMS +D     S S  KKP   G        E       S   E    SS   +FSKL+EGVVFVLSGFVNPERS LRSQAL MGA YQPDW
Subjt:  SAKRSLPSWMSGKD---DGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDW

Query:  NSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQDISQSRDN
        N+  TLLICAFPNTPKFRQVE++ GTI+SKEWI+ CYAQKKL+DIE +L+HAGKPWR+S+   +A + K    SKKP+K VE+ +  +      S++R  
Subjt:  NSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQDISQSRDN

Query:  -NSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKF--CSKKE--SISKEE
         N  +E F   ++KKWA DD ++TISWLESQEEKP+P EIK+IAAEG+LTCLQDAIDSL Q QDI  VTE W FVP+VV+EL K    SKKE  + SKEE
Subjt:  -NSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKF--CSKKE--SISKEE

Query:  LRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDE---RVARGYDSDDTIEMTEEEIDLAFQNVA
        + +QA   KKIYE EL                        K  EDE   RVA GYDSD T+EMTEEEI+LA++NV+
Subjt:  LRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDE---RVARGYDSDDTIEMTEEEIDLAFQNVA

AT1G80420.2 BRCT domain-containing DNA repair protein1.9e-9555.59Show/hide
Query:  SAKRSLPSWMSGKD---DGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDW
        S KR+LPSWMS +D     S S  KKP   G        E       S   E    SS   +FSKL+EGVVFVLSGFVNPERS LRSQAL MGA YQPDW
Subjt:  SAKRSLPSWMSGKD---DGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDW

Query:  NSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQDISQSRDN
        N+  TLLICAFPNTPKFRQVE++ GTI+SKEWI+ CYAQKKL+DIE +L+HAGKPWR+S+   +A + K    SKKP+K VE+ +  +      S++R  
Subjt:  NSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQDISQSRDN

Query:  -NSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKF--CSKKE--SISKEE
         N  +E F   ++KKWA DD ++TISWLESQEEKP+P EIK+IAAEG+LTCLQDAIDSL Q QDI  VTE W FVP+VV+EL K    SKKE  + SKEE
Subjt:  -NSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKF--CSKKE--SISKEE

Query:  LRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDE---RVARGYDSDDTIEMTEEEIDLAFQNVA
        + +QA   KKIYE EL                        K  EDE   RVA GYDSD T+EMTEEEI+LA++NV+
Subjt:  LRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDE---RVARGYDSDDTIEMTEEEIDLAFQNVA

AT1G80420.3 BRCT domain-containing DNA repair protein1.9e-9555.59Show/hide
Query:  SAKRSLPSWMSGKD---DGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDW
        S KR+LPSWMS +D     S S  KKP   G        E       S   E    SS   +FSKL+EGVVFVLSGFVNPERS LRSQAL MGA YQPDW
Subjt:  SAKRSLPSWMSGKD---DGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDW

Query:  NSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQDISQSRDN
        N+  TLLICAFPNTPKFRQVE++ GTI+SKEWI+ CYAQKKL+DIE +L+HAGKPWR+S+   +A + K    SKKP+K VE+ +  +      S++R  
Subjt:  NSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQDISQSRDN

Query:  -NSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKF--CSKKE--SISKEE
         N  +E F   ++KKWA DD ++TISWLESQEEKP+P EIK+IAAEG+LTCLQDAIDSL Q QDI  VTE W FVP+VV+EL K    SKKE  + SKEE
Subjt:  -NSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKF--CSKKE--SISKEE

Query:  LRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDE---RVARGYDSDDTIEMTEEEIDLAFQNVA
        + +QA   KKIYE EL                        K  EDE   RVA GYDSD T+EMTEEEI+LA++NV+
Subjt:  LRRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDE---RVARGYDSDDTIEMTEEEIDLAFQNVA

AT1G80420.4 BRCT domain-containing DNA repair protein3.4e-8954.4Show/hide
Query:  SAKRSLPSWMSGKD---DGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDW
        S KR+LPSWMS +D     S S  KKP   G        E       S   E    SS   +FSKL+EGVVFVLSGFVNPERS LRSQAL MGA YQPDW
Subjt:  SAKRSLPSWMSGKD---DGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDW

Query:  NSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQDISQSRDN
        N+  TLLICAFPNTPKFRQVE++ GTI+SKEWI+ CYAQKKL+DIE +L+HAGKPWR+S+   +A + K  S           T+ L+      S     
Subjt:  NSDCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQDISQSRDN

Query:  NSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKF--CSKKE--SISKEEL
        N  +E F   ++KKWA DD ++TISWLESQEEKP+P EIK+IAAEG+LTCLQDAIDSL Q QDI  VTE W FVP+VV+EL K    SKKE  + SKEE+
Subjt:  NSSRECFSPPKLKKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKF--CSKKE--SISKEEL

Query:  RRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDE---RVARGYDSDDTIEMTEEEIDLAFQNVA
         +QA   KKIYE EL                        K  EDE   RVA GYDSD T+EMTEEEI+LA++NV+
Subjt:  RRQAIDSKKIYEVELNFLLDNSPDRKKRPNISKELKNGCKEKEDE---RVARGYDSDDTIEMTEEEIDLAFQNVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAATTCGAAGACTAATTTGGGTGGTGGCAGTGCTAAGCGCAGTCTTCCTTCATGGATGAGTGGGAAGGACGATGGGAGTACTTCTCGAGGCAAGAAACCTACCAG
TTCTGGTTCTGGTGGAAATGAAGTGATTGCCGAAGCAGAAGAGGGTAAGCAAGGGAGTGGAAACGGTGAAGACCCAATCAGCAGTTCATTACACCGAGATTTCTCCAAAC
TTCTGGAAGGAGTAGTATTCGTGTTATCAGGGTTTGTTAATCCCGAGCGCAGCATTCTTCGGTCCCAAGCATTAGAAATGGGAGCTCAATATCAACCGGATTGGAACTCG
GATTGCACTTTGTTGATCTGTGCGTTCCCAAATACTCCAAAATTTCGACAAGTTGAATCAGATTGCGGAACAATAGTATCAAAGGAGTGGATTTCAGGTTGTTATGCACA
GAAGAAGCTCATTGACATTGAAAGCCACCTTCTGCATGCTGGAAAACCATGGCGAAGAAGCAACCTTTCACATGAAGCCCGCCAAGCTAAGATTGCATCGTCATCCAAGA
AACCTCAGAAGCCAGTTGAAAGAACTTCACATTTAAAACAAAATGAGCAAGATATATCTCAGAGTAGAGATAATAATTCTTCAAGAGAGTGCTTTTCTCCTCCGAAATTG
AAGAAATGGGCTACTGATGATTACAATAAAACAATTTCCTGGCTGGAGAGTCAAGAAGAAAAGCCAGATCCAAGTGAGATAAAGAAAATAGCAGCAGAAGGAATTTTAAC
TTGTTTACAAGATGCTATAGATTCTCTTCATCAAGATCAGGACATCAATCAAGTGACAGAAGAGTGGAAATTCGTTCCTCAGGTGGTGGAAGAGCTGGCAAAGTTTTGCA
GTAAGAAAGAGTCAATATCAAAGGAGGAACTTCGCAGGCAAGCTATAGATTCTAAAAAGATTTATGAAGTAGAACTTAATTTTCTACTTGACAATTCCCCAGATAGAAAG
AAGAGGCCAAATATCAGCAAAGAACTGAAAAATGGTTGTAAGGAAAAGGAAGATGAGCGTGTCGCACGAGGATATGATAGCGATGACACGATTGAGATGACTGAGGAGGA
GATTGACCTTGCATTCCAGAATGTAGCTTGTAAGAAGACT
mRNA sequenceShow/hide mRNA sequence
ATGTCAAATTCGAAGACTAATTTGGGTGGTGGCAGTGCTAAGCGCAGTCTTCCTTCATGGATGAGTGGGAAGGACGATGGGAGTACTTCTCGAGGCAAGAAACCTACCAG
TTCTGGTTCTGGTGGAAATGAAGTGATTGCCGAAGCAGAAGAGGGTAAGCAAGGGAGTGGAAACGGTGAAGACCCAATCAGCAGTTCATTACACCGAGATTTCTCCAAAC
TTCTGGAAGGAGTAGTATTCGTGTTATCAGGGTTTGTTAATCCCGAGCGCAGCATTCTTCGGTCCCAAGCATTAGAAATGGGAGCTCAATATCAACCGGATTGGAACTCG
GATTGCACTTTGTTGATCTGTGCGTTCCCAAATACTCCAAAATTTCGACAAGTTGAATCAGATTGCGGAACAATAGTATCAAAGGAGTGGATTTCAGGTTGTTATGCACA
GAAGAAGCTCATTGACATTGAAAGCCACCTTCTGCATGCTGGAAAACCATGGCGAAGAAGCAACCTTTCACATGAAGCCCGCCAAGCTAAGATTGCATCGTCATCCAAGA
AACCTCAGAAGCCAGTTGAAAGAACTTCACATTTAAAACAAAATGAGCAAGATATATCTCAGAGTAGAGATAATAATTCTTCAAGAGAGTGCTTTTCTCCTCCGAAATTG
AAGAAATGGGCTACTGATGATTACAATAAAACAATTTCCTGGCTGGAGAGTCAAGAAGAAAAGCCAGATCCAAGTGAGATAAAGAAAATAGCAGCAGAAGGAATTTTAAC
TTGTTTACAAGATGCTATAGATTCTCTTCATCAAGATCAGGACATCAATCAAGTGACAGAAGAGTGGAAATTCGTTCCTCAGGTGGTGGAAGAGCTGGCAAAGTTTTGCA
GTAAGAAAGAGTCAATATCAAAGGAGGAACTTCGCAGGCAAGCTATAGATTCTAAAAAGATTTATGAAGTAGAACTTAATTTTCTACTTGACAATTCCCCAGATAGAAAG
AAGAGGCCAAATATCAGCAAAGAACTGAAAAATGGTTGTAAGGAAAAGGAAGATGAGCGTGTCGCACGAGGATATGATAGCGATGACACGATTGAGATGACTGAGGAGGA
GATTGACCTTGCATTCCAGAATGTAGCTTGTAAGAAGACT
Protein sequenceShow/hide protein sequence
MSNSKTNLGGGSAKRSLPSWMSGKDDGSTSRGKKPTSSGSGGNEVIAEAEEGKQGSGNGEDPISSSLHRDFSKLLEGVVFVLSGFVNPERSILRSQALEMGAQYQPDWNS
DCTLLICAFPNTPKFRQVESDCGTIVSKEWISGCYAQKKLIDIESHLLHAGKPWRRSNLSHEARQAKIASSSKKPQKPVERTSHLKQNEQDISQSRDNNSSRECFSPPKL
KKWATDDYNKTISWLESQEEKPDPSEIKKIAAEGILTCLQDAIDSLHQDQDINQVTEEWKFVPQVVEELAKFCSKKESISKEELRRQAIDSKKIYEVELNFLLDNSPDRK
KRPNISKELKNGCKEKEDERVARGYDSDDTIEMTEEEIDLAFQNVACKKT