; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg039830 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg039830
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Description53EXOc domain-containing protein
Genome locationscaffold10:47037112..47041058
RNA-Seq ExpressionSpg039830
SyntenySpg039830
Gene Ontology termsGO:0006261 - DNA-dependent DNA replication (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR002298 - DNA polymerase A
IPR002421 - 5'-3' exonuclease
IPR020045 - DNA polymerase I-like, H3TH domain
IPR020046 - 5'-3' exonuclease, alpha-helical arch, N-terminal
IPR029060 - PIN-like domain superfamily
IPR036279 - 5'-3' exonuclease, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602717.1 hypothetical protein SDJN03_07950, partial [Cucurbita argyrosperma subsp. sororia]5.6e-15671.88Show/hide
Query:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS
        MAEASAN+G+NIPPFLNS S  SLPSRTLKAAES  TTKA  WRTKPLKLS F AASSR TS+ FKQ +  KL P +EAD+ RKGRVFFLDVNPLCYQGS
Subjt:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI
        RPSLHNFGRWVSIFFEEVS SDPVIA                                                                          
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI

Query:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK
               V DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNS RSYQVIRDALR+CNVPVIKI GHEADDVVATL EQVLQRGFR VIASPDKDFK
Subjt:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK

Query:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK
        QLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHV PGFGRKTA+KLLKKHGSLENLLSAAA+RTVG+PYAQDALTK
Subjt:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK

Query:  HADYLRTNYKVLALRR
        +ADYLRTNYKVLALRR
Subjt:  HADYLRTNYKVLALRR

XP_022152532.1 uncharacterized protein LOC111020233 [Momordica charantia]1.5e-15671.88Show/hide
Query:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS
        MAEA ANIGVNIPPFLNSASR SLPSRTLK  ES LTTK+N+WRTK L+LSAF  AS     + FKQTDG  L+P IEAD+ RKGRVFFLDVNPLCY+GS
Subjt:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI
        RPSLHNFGRW SIFFE+VSHSDPVIA                                                                          
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI

Query:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK
               VFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNSRR YQVIRDALRNCNVPV+K+DGHEADDVVATL +QVLQRGFRVVIASPDKDFK
Subjt:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK

Query:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK
        QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHV PGFGRKTA+KLLKKHGSLENLLSAAAIRTVGRPYAQDALTK
Subjt:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK

Query:  HADYLRTNYKVLALRR
        +ADYLRTNYKVLALRR
Subjt:  HADYLRTNYKVLALRR

XP_022961199.1 uncharacterized protein LOC111461781 [Cucurbita moschata]3.3e-15672.12Show/hide
Query:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS
        MAEASANIG+NIPPFLNS S  SLPSRTLKAAES  TTKA +WRTKPLKLS F AASSR TS+ FKQ +  KL P +EAD+ RKGRVFFLDVNPLCYQGS
Subjt:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI
        RPSLHNFGRWVSIFFEEVS SDPVIA                                                                          
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI

Query:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK
               V DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNS RSYQVIRDALR+CNVPVIKI GHEADDVVATL EQVLQRGFR VIASPDKDFK
Subjt:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK

Query:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK
        QLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHV PGFGRKTA+KLLKKHGSLENLLSAAA+RTVG+PYAQDALTK
Subjt:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK

Query:  HADYLRTNYKVLALRR
        +ADYLRTNYKVLALRR
Subjt:  HADYLRTNYKVLALRR

XP_022990137.1 uncharacterized protein LOC111487120 [Cucurbita maxima]7.9e-15872.84Show/hide
Query:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS
        MAEASANIG+NIPPFLNSAS  SLPSRTLKAAES  TTKA +WRTKPLKLS FAAA SR TS+ FKQ +  KL PR+EAD+ RKGRVFFLDVNPLCYQGS
Subjt:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI
        RPSLHNFGRWVSIFFEEVSHSDPVIA                                                                          
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI

Query:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK
               V DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNS RSYQVIRDALR+C+VPVIKI GHEADDVVATL EQVLQRGFR VIASPDKDFK
Subjt:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK

Query:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK
        QLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHV PGFGRKTA+KLLKKHGSLENLLSAAAIRTVG+PYAQDALTK
Subjt:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK

Query:  HADYLRTNYKVLALRR
        +ADYLRTNYKVLALRR
Subjt:  HADYLRTNYKVLALRR

XP_023542307.1 uncharacterized protein LOC111802238 [Cucurbita pepo subsp. pepo]4.3e-15671.88Show/hide
Query:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS
        MAEASANIG+NIPPF NSAS  SLPSRTLKAAES  TTKA +WRTKPLKLS F AASSR TS+ FKQ +  KL PR+EAD+ RKGRVFFLDVNPLCYQGS
Subjt:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI
        RPSLHNFGRWVSIFFEEVSHSDPVIA                                                                          
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI

Query:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK
               V DGEGGSEHRRLLLPSYK+HRIKFTRQSSSQRFTKGNS RSYQVI+DALR+CNVPVIKI GHEADDVVATL EQVLQRG R VIASPDKDFK
Subjt:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK

Query:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK
        QLISEDVQLVMPLPELNRWSFYTL+HY AQYNCDPCSDLSLRCIMGDEVDGVPGIQHV PGFGRKTA+KLLKKHGSLENLLSAAAIRTVG+PYAQDALTK
Subjt:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK

Query:  HADYLRTNYKVLALRR
        +ADYLRTNYKVLALRR
Subjt:  HADYLRTNYKVLALRR

TrEMBL top hitse value%identityAlignment
A0A0A0LQN4 53EXOc domain-containing protein1.7e-15069.62Show/hide
Query:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTT--KANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQ
        MAEASANIGVN PPFLNS+S   LPSRTLK  E  LT+  K NTWRTKPL L+AF A SSR TSAAF QTD  K +PRIEAD+SR GRVFFLDVNPLCYQ
Subjt:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTT--KANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQ

Query:  GSRPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKR
        GS+PSL NFGRWVSIFFEEVSHSDPVIA                                                                        
Subjt:  GSRPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKR

Query:  EILRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKD
                 VFDGEGGSEHRRLLLPSYKAHRIKFTR  SS+RFTKGN R SYQVIRDALR+CNVPV++++GHEADDV+ATL EQVLQRG RVV+ASPDKD
Subjt:  EILRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKD

Query:  FKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDAL
        FKQLISED+QLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHV PGFGRKTALKLLKKHGSLENLLSAAAIRTVG+PYAQDAL
Subjt:  FKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDAL

Query:  TKHADYLRTNYKVLALRR
        TK+A+YLRTNYKVLALRR
Subjt:  TKHADYLRTNYKVLALRR

A0A1S3B2X7 5'-3' exonuclease3.8e-15069.62Show/hide
Query:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTT--KANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQ
        M EASA IGVN PPFLNS+SR  LPSRT      ALT+  K NTWRTKPLKL+AF   SSR TSAAF QTD  K +PRIEAD+ RKGRVFFLDVNPLCYQ
Subjt:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTT--KANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQ

Query:  GSRPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKR
        G++PSL NFGRWVSIFFEEVSHSDPVIA                                                                        
Subjt:  GSRPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKR

Query:  EILRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKD
                 VFDGEGGSEHRRLLLPSYKAHRIKFTR  SSQRFTKGN R SYQVIRDALR+CNVPV+K+DGHEADDVVATL EQVLQRG RVV+ASPDKD
Subjt:  EILRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKD

Query:  FKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDAL
        FKQLISEDVQLVMPLPELNRWSFYT+RHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHV PGFGRKTALKLLKKHGSLENLLSAAAIRTVG+PYAQDAL
Subjt:  FKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDAL

Query:  TKHADYLRTNYKVLALRR
        TK+A+YLRTNYKVLALRR
Subjt:  TKHADYLRTNYKVLALRR

A0A6J1DGI1 uncharacterized protein LOC1110202337.2e-15771.88Show/hide
Query:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS
        MAEA ANIGVNIPPFLNSASR SLPSRTLK  ES LTTK+N+WRTK L+LSAF  AS     + FKQTDG  L+P IEAD+ RKGRVFFLDVNPLCY+GS
Subjt:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI
        RPSLHNFGRW SIFFE+VSHSDPVIA                                                                          
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI

Query:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK
               VFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNSRR YQVIRDALRNCNVPV+K+DGHEADDVVATL +QVLQRGFRVVIASPDKDFK
Subjt:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK

Query:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK
        QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHV PGFGRKTA+KLLKKHGSLENLLSAAAIRTVGRPYAQDALTK
Subjt:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK

Query:  HADYLRTNYKVLALRR
        +ADYLRTNYKVLALRR
Subjt:  HADYLRTNYKVLALRR

A0A6J1HDC1 uncharacterized protein LOC1114617811.6e-15672.12Show/hide
Query:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS
        MAEASANIG+NIPPFLNS S  SLPSRTLKAAES  TTKA +WRTKPLKLS F AASSR TS+ FKQ +  KL P +EAD+ RKGRVFFLDVNPLCYQGS
Subjt:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI
        RPSLHNFGRWVSIFFEEVS SDPVIA                                                                          
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI

Query:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK
               V DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNS RSYQVIRDALR+CNVPVIKI GHEADDVVATL EQVLQRGFR VIASPDKDFK
Subjt:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK

Query:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK
        QLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHV PGFGRKTA+KLLKKHGSLENLLSAAA+RTVG+PYAQDALTK
Subjt:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK

Query:  HADYLRTNYKVLALRR
        +ADYLRTNYKVLALRR
Subjt:  HADYLRTNYKVLALRR

A0A6J1JHT6 uncharacterized protein LOC1114871203.8e-15872.84Show/hide
Query:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS
        MAEASANIG+NIPPFLNSAS  SLPSRTLKAAES  TTKA +WRTKPLKLS FAAA SR TS+ FKQ +  KL PR+EAD+ RKGRVFFLDVNPLCYQGS
Subjt:  MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGS

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI
        RPSLHNFGRWVSIFFEEVSHSDPVIA                                                                          
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREI

Query:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK
               V DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNS RSYQVIRDALR+C+VPVIKI GHEADDVVATL EQVLQRGFR VIASPDKDFK
Subjt:  LRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFK

Query:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK
        QLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHV PGFGRKTA+KLLKKHGSLENLLSAAAIRTVG+PYAQDALTK
Subjt:  QLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTK

Query:  HADYLRTNYKVLALRR
        +ADYLRTNYKVLALRR
Subjt:  HADYLRTNYKVLALRR

SwissProt top hitse value%identityAlignment
O67550 5'-3' exonuclease3.8e-2240.77Show/hide
Query:  VIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDG
        VI++ L+   +P++++ G+EADDV+A LAE+  Q+GF+V I SPDKD  QL+SE+V ++ P+ +      +T    + ++  +P        ++GD+VD 
Subjt:  VIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDG

Query:  VPGIQHVEPGFGRKTALKLLKKHGSLENLL
        VPGI+    G G KTA+ +LKK+GS+EN+L
Subjt:  VPGIQHVEPGFGRKTALKLLKKHGSLENLL

P52026 DNA polymerase I5.7e-1830.54Show/hide
Query:  YQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEV
        + ++R+ L+   +P  ++D +EADD++ T+A +  + GF V + S D+D  QL S  V + +    +     YT    + +Y   P   + L+ +MGD+ 
Subjt:  YQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEV

Query:  DGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKHADYLRTNYKVLALRR
        D +PG+    PG G KTA+KLLK+ G++EN+L  A+I  +     ++ L ++ D    + ++ A+ R
Subjt:  DGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKHADYLRTNYKVLALRR

Q04957 DNA polymerase I3.0e-1930.27Show/hide
Query:  FTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQY
        F      ++ T       + ++R+ LR   +P  +++ +EADD++ TLA +  Q GF V + S D+D  QL S  V + +    +     YT      +Y
Subjt:  FTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQY

Query:  NCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKHADYLRTNYKVLALRR
           P   + L+ +MGD+ D +PG+    PG G KTA+KLL++ G++EN+L  A+I  +     ++ L +H +    + K+ A+RR
Subjt:  NCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKHADYLRTNYKVLALRR

Q9RLB6 DNA polymerase I4.8e-1731.82Show/hide
Query:  VFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDV
        VFD  GG   R  + P YKA+R         Q            ++RD   N N P+++ +G+EADD++AT A +    G  VVI S DKD  QL+SE++
Subjt:  VFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDV

Query:  QLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSA
        ++  PL    R  + T    + ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ ++
Subjt:  QLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSA

Q9S1G2 DNA polymerase I8.2e-1732.16Show/hide
Query:  RRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELN
        R+ L  +YKA+R     +   Q          + +IR+A R  N+P I+ +G EADD++AT A Q    G  V I S DKD  QL+S +V +   + +  
Subjt:  RRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELN

Query:  RWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLS-AAAIRTVGRPYAQDALTKHADYLRTNYKVLALR
              +   + ++   P   + L+ + GD VD VPGI    PG G KTA +LL+++G L+ LL  A  I+ V R   ++ +  + D  R +  ++ LR
Subjt:  RWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLS-AAAIRTVGRPYAQDALTKHADYLRTNYKVLALR

Arabidopsis top hitse value%identityAlignment
AT1G34380.1 5'-3' exonuclease family protein1.2e-5239.81Show/hide
Query:  RTKPLKLSAFAAASSRFTSAAFKQT----------DGEKLRPRIEADDSRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCG
        +T+P +    +++SS F+S +  +T            + L    E    +  RVFFLDV+PLCY+G++PS   FG W+S+FF +VS +DPVIA       
Subjt:  RTKPLKLSAFAAASSRFTSAAFKQT----------DGEKLRPRIEADDSRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCG

Query:  TGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREILRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFT
                                                                                  V DGE G++ RR LLPSYKAHR    
Subjt:  TGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREILRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFT

Query:  RQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNC
        +  +  R+    S+R +Q + + LR CNVPV++I+GHEADDVVATL EQ +QRG+R VIASPDKDFKQLISE+VQ+V+PL +L RWSFYTL+HY AQYNC
Subjt:  RQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNC

Query:  DPCSDLSLR
        DP SDLS R
Subjt:  DPCSDLSLR

AT1G34380.2 5'-3' exonuclease family protein6.8e-9148.04Show/hide
Query:  RTKPLKLSAFAAASSRFTSAAFKQT----------DGEKLRPRIEADDSRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCG
        +T+P +    +++SS F+S +  +T            + L    E    +  RVFFLDV+PLCY+G++PS   FG W+S+FF +VS +DPVIA       
Subjt:  RTKPLKLSAFAAASSRFTSAAFKQT----------DGEKLRPRIEADDSRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSHSDPVIALPFLCCG

Query:  TGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREILRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFT
                                                                                  V DGE G++ RR LLPSYKAHR    
Subjt:  TGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREILRVPAPQVFDGEGGSEHRRLLLPSYKAHRIKFT

Query:  RQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNC
        +  +  R+    S+R +Q + + LR CNVPV++I+GHEADDVVATL EQ +QRG+R VIASPDKDFKQLISE+VQ+V+PL +L RWSFYTL+HY AQYNC
Subjt:  RQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNC

Query:  DPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKHADYLRTNYKVLALRR
        DP SDLS RCIMGDEVDGVPGIQH+ P FGRKTA+KL++KHGSLE+LLSAAA+RTVGRPYAQ+ALTK+ADYLR NY+VLAL R
Subjt:  DPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKHADYLRTNYKVLALRR

AT3G52050.2 5'-3' exonuclease family protein5.3e-1927.59Show/hide
Query:  GSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPL
        G   R  L P+YK++R            T     +  Q ++ +++  ++ VI++ G EADDV+ TLA + +  GF+V + SPDKDF Q++S  ++L+   
Subjt:  GSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPL

Query:  PELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKHADYLRTNYKVL
        P  +  + + +  +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL   ++  +     +++L   AD    + K+ 
Subjt:  PELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKHADYLRTNYKVL

Query:  ALR
         LR
Subjt:  ALR

AT3G52050.4 5'-3' exonuclease family protein8.1e-2028.04Show/hide
Query:  VFDGEG-----GSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQL
        VFD +G     G   R  L P+YK++R            T     +  Q ++ +++  ++ VI++ G EADDV+ TLA + +  GF+V + SPDKDF Q+
Subjt:  VFDGEG-----GSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQL

Query:  ISEDVQLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKH
        +S  ++L+   P  +  + + +  +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL   ++  +     +++L   
Subjt:  ISEDVQLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKH

Query:  ADYLRTNYKVLALR
        AD    + K+  LR
Subjt:  ADYLRTNYKVLALR

AT3G52050.5 5'-3' exonuclease family protein4.0e-1929.94Show/hide
Query:  VFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDV
        VFD + G   R  L P+YK++R            T     +  Q ++ +++  ++ VI++ G EADDV+ TLA + +  GF+V + SPDKDF Q++S  +
Subjt:  VFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDV

Query:  QLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSA
        +L+   P  +  + + +  +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL +
Subjt:  QLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAGGCGAGCGCCAACATAGGCGTTAATATTCCGCCATTTTTGAACTCTGCATCGCGTGGTTCCTTACCGTCGAGGACTCTGAAAGCCGCCGAATCGGCATTAAC
AACCAAAGCCAATACATGGAGAACGAAGCCATTGAAGCTAAGTGCTTTTGCTGCCGCATCTTCTCGTTTTACTTCTGCAGCTTTTAAACAAACAGATGGTGAGAAGTTGC
GGCCAAGAATTGAAGCTGATGATTCAAGAAAGGGAAGGGTCTTCTTCCTTGACGTAAATCCTCTCTGCTATCAAGGTAGCAGACCTAGTTTGCACAATTTTGGCCGCTGG
GTTTCCATTTTCTTCGAGGAAGTTAGCCACAGTGATCCTGTTATTGCTCTGCCATTTTTGTGCTGTGGTACGGGACAATTTCAATCTACAATTCACCCGAATGAGATAAA
ATTTGTGCAACGGGAATTCATACGGAGCATAGGGCGGCAAGGAAATGATCGAGGAGTTACTTCTTCACCCTCCTTTACGCAAGAAATGAAAGTTCTTGTGGCACGCATGG
GTATGCGTTGTTCTGTGGAGTCTTTGGGGGAGAGGGATAAAAGAGAGATCTTGAGAGTACCGGCTCCGCAGGTTTTTGATGGGGAAGGAGGTAGTGAGCATCGCAGGCTG
TTGTTGCCTTCATATAAAGCACATCGGATCAAATTCACGAGACAATCATCTTCACAAAGATTCACAAAGGGGAATTCTAGAAGGTCATACCAAGTGATAAGAGATGCTCT
CAGAAATTGTAATGTGCCAGTTATAAAGATCGATGGTCATGAAGCAGATGATGTTGTTGCTACACTTGCGGAACAAGTTTTGCAAAGAGGGTTTCGAGTAGTAATAGCCT
CTCCTGATAAAGATTTCAAGCAGTTGATTTCAGAAGATGTCCAACTTGTTATGCCTCTGCCAGAGCTCAATAGATGGTCCTTTTACACTTTAAGGCACTACCTAGCTCAG
TATAACTGTGATCCGTGCTCTGACTTGAGTCTTAGATGCATTATGGGTGATGAGGTAGATGGTGTTCCAGGAATCCAGCATGTTGAGCCTGGATTTGGTCGAAAGACTGC
ATTGAAGCTCTTAAAGAAACATGGCTCTTTGGAGAATCTACTCAGTGCTGCTGCAATAAGAACTGTGGGCAGACCATATGCACAAGATGCTCTTACAAAGCATGCTGATT
ACCTGCGAACGAACTATAAAGTTCTAGCCCTAAGAAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGAGGCGAGCGCCAACATAGGCGTTAATATTCCGCCATTTTTGAACTCTGCATCGCGTGGTTCCTTACCGTCGAGGACTCTGAAAGCCGCCGAATCGGCATTAAC
AACCAAAGCCAATACATGGAGAACGAAGCCATTGAAGCTAAGTGCTTTTGCTGCCGCATCTTCTCGTTTTACTTCTGCAGCTTTTAAACAAACAGATGGTGAGAAGTTGC
GGCCAAGAATTGAAGCTGATGATTCAAGAAAGGGAAGGGTCTTCTTCCTTGACGTAAATCCTCTCTGCTATCAAGGTAGCAGACCTAGTTTGCACAATTTTGGCCGCTGG
GTTTCCATTTTCTTCGAGGAAGTTAGCCACAGTGATCCTGTTATTGCTCTGCCATTTTTGTGCTGTGGTACGGGACAATTTCAATCTACAATTCACCCGAATGAGATAAA
ATTTGTGCAACGGGAATTCATACGGAGCATAGGGCGGCAAGGAAATGATCGAGGAGTTACTTCTTCACCCTCCTTTACGCAAGAAATGAAAGTTCTTGTGGCACGCATGG
GTATGCGTTGTTCTGTGGAGTCTTTGGGGGAGAGGGATAAAAGAGAGATCTTGAGAGTACCGGCTCCGCAGGTTTTTGATGGGGAAGGAGGTAGTGAGCATCGCAGGCTG
TTGTTGCCTTCATATAAAGCACATCGGATCAAATTCACGAGACAATCATCTTCACAAAGATTCACAAAGGGGAATTCTAGAAGGTCATACCAAGTGATAAGAGATGCTCT
CAGAAATTGTAATGTGCCAGTTATAAAGATCGATGGTCATGAAGCAGATGATGTTGTTGCTACACTTGCGGAACAAGTTTTGCAAAGAGGGTTTCGAGTAGTAATAGCCT
CTCCTGATAAAGATTTCAAGCAGTTGATTTCAGAAGATGTCCAACTTGTTATGCCTCTGCCAGAGCTCAATAGATGGTCCTTTTACACTTTAAGGCACTACCTAGCTCAG
TATAACTGTGATCCGTGCTCTGACTTGAGTCTTAGATGCATTATGGGTGATGAGGTAGATGGTGTTCCAGGAATCCAGCATGTTGAGCCTGGATTTGGTCGAAAGACTGC
ATTGAAGCTCTTAAAGAAACATGGCTCTTTGGAGAATCTACTCAGTGCTGCTGCAATAAGAACTGTGGGCAGACCATATGCACAAGATGCTCTTACAAAGCATGCTGATT
ACCTGCGAACGAACTATAAAGTTCTAGCCCTAAGAAGGTGA
Protein sequenceShow/hide protein sequence
MAEASANIGVNIPPFLNSASRGSLPSRTLKAAESALTTKANTWRTKPLKLSAFAAASSRFTSAAFKQTDGEKLRPRIEADDSRKGRVFFLDVNPLCYQGSRPSLHNFGRW
VSIFFEEVSHSDPVIALPFLCCGTGQFQSTIHPNEIKFVQREFIRSIGRQGNDRGVTSSPSFTQEMKVLVARMGMRCSVESLGERDKREILRVPAPQVFDGEGGSEHRRL
LLPSYKAHRIKFTRQSSSQRFTKGNSRRSYQVIRDALRNCNVPVIKIDGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQ
YNCDPCSDLSLRCIMGDEVDGVPGIQHVEPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKHADYLRTNYKVLALRR