; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0004103 (gene) of Chayote v1 genome

Gene IDSed0004103
OrganismSechium edule (Chayote v1)
Description5'-3' exonuclease
Genome locationLG03:4117926..4120860
RNA-Seq ExpressionSed0004103
SyntenySed0004103
Gene Ontology termsGO:0006261 - DNA-dependent DNA replication (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004527 - exonuclease activity (molecular function)
InterPro domainsIPR002298 - DNA polymerase A
IPR002421 - 5'-3' exonuclease
IPR020045 - DNA polymerase I-like, H3TH domain
IPR020046 - 5'-3' exonuclease, alpha-helical arch, N-terminal
IPR029060 - PIN-like domain superfamily
IPR036279 - 5'-3' exonuclease, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152532.1 uncharacterized protein LOC111020233 [Momordica charantia]1.2e-16781.92Show/hide
Query:  MAEASANIGVNLPPLLNSAPRSSFPSRTLKAESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGSR
        MAEA ANIGVN+PP LNSA RSS PSRTLK ES + +K+N+WRTK L+LSA   A+     +  K+T+ G LQP  EA N R GRVFFLDVNPLCY+GSR
Subjt:  MAEASANIGVNLPPLLNSAPRSSFPSRTLKAESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGSR

Query:  PSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQ
        PSL +FGRW SIFF++VSH DPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKG S R YQ+I DALRNCNVPV+K++G EADDVVATLV+Q
Subjt:  PSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLS
        VLQRGFRVVIASPDKDFKQLISEDVQLV+PLPELNRWSFYTL+HY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLL+KHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLS

Query:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND
        AAAIRTVG+PYAQDALTKYADYLRTNYKVLALRRDV+VQFQ+EWLVERDR+NDS I+SK VE+ND
Subjt:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND

XP_022961199.1 uncharacterized protein LOC111461781 [Cucurbita moschata]2.8e-16782.51Show/hide
Query:  MAEASANIGVNLPPLLNSAPRSSFPSRTLK-AESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGS
        MAEASANIG+N+PP LNS   +S PSRTLK AES   +KA +WRTKPLKLS   AA+SRS S+  K+ E+G+L P  EA N R GRVFFLDVNPLCY GS
Subjt:  MAEASANIGVNLPPLLNSAPRSSFPSRTLK-AESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGS

Query:  RPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVE
        RPSL +FGRW+SIFF+EVS  DPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKG S R+YQ+I DALR+CNVPVIKI G EADDVVATLVE
Subjt:  RPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVE

Query:  QVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLL
        QVLQRGFR VIASPDKDFKQLISEDVQLV+PLPELNRWSFYTLKHY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLL+KHGSLENLL
Subjt:  QVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLL

Query:  SAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND
        SAAA+RTVGKPYAQDALTKYADYLRTNYKVLALRRD++VQF++EWLV+RDR+NDSTI+SK VE+ND
Subjt:  SAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND

XP_022990137.1 uncharacterized protein LOC111487120 [Cucurbita maxima]2.3e-16983.88Show/hide
Query:  MAEASANIGVNLPPLLNSAPRSSFPSRTLK-AESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGS
        MAEASANIG+N+PP LNSA  +S PSRTLK AES   +KA +WRTKPLKLS  +AA SRS S+  K+ E+G+L PR EA N R GRVFFLDVNPLCY GS
Subjt:  MAEASANIGVNLPPLLNSAPRSSFPSRTLK-AESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGS

Query:  RPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVE
        RPSL +FGRW+SIFF+EVSH DPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKG S R+YQ+I DALR+C+VPVIKI G EADDVVATLVE
Subjt:  RPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVE

Query:  QVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLL
        QVLQRGFR VIASPDKDFKQLISEDVQLV+PLPELNRWSFYTLKHY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLL+KHGSLENLL
Subjt:  QVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLL

Query:  SAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND
        SAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDV+VQF++EWLVERDR+NDSTI+SK VE+ND
Subjt:  SAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND

XP_023542307.1 uncharacterized protein LOC111802238 [Cucurbita pepo subsp. pepo]3.3e-16883.33Show/hide
Query:  MAEASANIGVNLPPLLNSAPRSSFPSRTLK-AESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGS
        MAEASANIG+N+PP  NSA  +S PSRTLK AES   +KA +WRTKPLKLS   AA+SRS S+  K+ E+G+L PR EA N R GRVFFLDVNPLCY GS
Subjt:  MAEASANIGVNLPPLLNSAPRSSFPSRTLK-AESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGS

Query:  RPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVE
        RPSL +FGRW+SIFF+EVSH DPVIAV DGEGGSEHRRLLLPSYK+HRIKFTRQSSSQRFTKG S R+YQ+I DALR+CNVPVIKI G EADDVVATLVE
Subjt:  RPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVE

Query:  QVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLL
        QVLQRG R VIASPDKDFKQLISEDVQLV+PLPELNRWSFYTLKHY AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLL+KHGSLENLL
Subjt:  QVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLL

Query:  SAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND
        SAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDV+VQF++EWLVERDR+NDSTI+SK VE+ND
Subjt:  SAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND

XP_038884032.1 5'-3' exonuclease [Benincasa hispida]1.1e-16682.61Show/hide
Query:  MAEASANIGV-NLPPLLNSAPRSSFPSRTLKAESAMAS--KANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYD
        MAEASANIGV NLPP LNS+ R+S PSRTLKAE+A+ S  KANTWRTKPLKL+   AA+SR  SAA  +T+ G+ QPR EA N R GRVFFLDVNPLCY 
Subjt:  MAEASANIGV-NLPPLLNSAPRSSFPSRTLKAESAMAS--KANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYD

Query:  GSRPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATL
        G+RPSL +FGRW+SIFF+EVSH DPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ SSQRFTKG S R+YQ+I DALRNCNVPV+K++G+EADDVVATL
Subjt:  GSRPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATL

Query:  VEQVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLEN
        VEQVLQRG RVVIASPDKDFKQLISEDVQLV+PLPELNRWSFYTL+HY+AQY+CDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLL+KHGSLEN
Subjt:  VEQVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLEN

Query:  LLSAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND
        LLSAAAIRTVGKPYAQ ALTKYA+YLRTNYKVLALRRDV+VQFQDEWLVERDR+ND  I+SK VE+ +
Subjt:  LLSAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND

TrEMBL top hitse value%identityAlignment
A0A0A0LQN4 53EXOc domain-containing protein7.4e-16680.65Show/hide
Query:  MAEASANIGVNLPPLLNSAPRSSFPSRTLKAESAMAS--KANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDG
        MAEASANIGVN PP LNS+  +  PSRTLK E  + S  K NTWRTKPL L+ A A +SR  SAA  +T++G+ QPR EA N R GRVFFLDVNPLCY G
Subjt:  MAEASANIGVNLPPLLNSAPRSSFPSRTLKAESAMAS--KANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDG

Query:  SRPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRT-YQMIGDALRNCNVPVIKINGEEADDVVATLV
        S+PSL++FGRW+SIFF+EVSH DPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR  SS+RFTKG  RT YQ+I DALR+CNVPV+++ G EADDV+ATLV
Subjt:  SRPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRT-YQMIGDALRNCNVPVIKINGEEADDVVATLV

Query:  EQVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENL
        EQVLQRG RVV+ASPDKDFKQLISED+QLV+PLPELNRWSFYTL+HY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLL+KHGSLENL
Subjt:  EQVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENL

Query:  LSAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND
        LSAAAIRTVGKPYAQDALTKYA+YLRTNYKVLALRRDV+VQFQDEWLVERDR+NDSTI+SK VE+ND
Subjt:  LSAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND

A0A1S3B2X7 5'-3' exonuclease9.0e-16480.27Show/hide
Query:  MAEASANIGVNLPPLLNSAPRSSFPSRTLKAESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGSR
        M EASA IGVN PP LNS+ R+  PSRT         K NTWRTKPLKL+ A   +SR  SAA  +T++G+ QPR EA N R GRVFFLDVNPLCY G++
Subjt:  MAEASANIGVNLPPLLNSAPRSSFPSRTLKAESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGSR

Query:  PSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRT-YQMIGDALRNCNVPVIKINGEEADDVVATLVEQ
        PSL++FGRW+SIFF+EVSH DPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR  SSQRFTKG  RT YQ+I DALR+CNVPV+K++G EADDVVATLVEQ
Subjt:  PSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRT-YQMIGDALRNCNVPVIKINGEEADDVVATLVEQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLS
        VLQRG RVV+ASPDKDFKQLISEDVQLV+PLPELNRWSFYT++HY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLL+KHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLS

Query:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND
        AAAIRTVGKPYAQDALTKYA+YLRTNYKVLALRRDV+VQFQDEWLVERDR+NDSTI+SK VE+ND
Subjt:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND

A0A6J1DGI1 uncharacterized protein LOC1110202336.0e-16881.92Show/hide
Query:  MAEASANIGVNLPPLLNSAPRSSFPSRTLKAESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGSR
        MAEA ANIGVN+PP LNSA RSS PSRTLK ES + +K+N+WRTK L+LSA   A+     +  K+T+ G LQP  EA N R GRVFFLDVNPLCY+GSR
Subjt:  MAEASANIGVNLPPLLNSAPRSSFPSRTLKAESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGSR

Query:  PSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQ
        PSL +FGRW SIFF++VSH DPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKG S R YQ+I DALRNCNVPV+K++G EADDVVATLV+Q
Subjt:  PSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLS
        VLQRGFRVVIASPDKDFKQLISEDVQLV+PLPELNRWSFYTL+HY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLL+KHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLS

Query:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND
        AAAIRTVG+PYAQDALTKYADYLRTNYKVLALRRDV+VQFQ+EWLVERDR+NDS I+SK VE+ND
Subjt:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND

A0A6J1HDC1 uncharacterized protein LOC1114617811.3e-16782.51Show/hide
Query:  MAEASANIGVNLPPLLNSAPRSSFPSRTLK-AESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGS
        MAEASANIG+N+PP LNS   +S PSRTLK AES   +KA +WRTKPLKLS   AA+SRS S+  K+ E+G+L P  EA N R GRVFFLDVNPLCY GS
Subjt:  MAEASANIGVNLPPLLNSAPRSSFPSRTLK-AESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGS

Query:  RPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVE
        RPSL +FGRW+SIFF+EVS  DPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKG S R+YQ+I DALR+CNVPVIKI G EADDVVATLVE
Subjt:  RPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVE

Query:  QVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLL
        QVLQRGFR VIASPDKDFKQLISEDVQLV+PLPELNRWSFYTLKHY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLL+KHGSLENLL
Subjt:  QVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLL

Query:  SAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND
        SAAA+RTVGKPYAQDALTKYADYLRTNYKVLALRRD++VQF++EWLV+RDR+NDSTI+SK VE+ND
Subjt:  SAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND

A0A6J1JHT6 uncharacterized protein LOC1114871201.1e-16983.88Show/hide
Query:  MAEASANIGVNLPPLLNSAPRSSFPSRTLK-AESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGS
        MAEASANIG+N+PP LNSA  +S PSRTLK AES   +KA +WRTKPLKLS  +AA SRS S+  K+ E+G+L PR EA N R GRVFFLDVNPLCY GS
Subjt:  MAEASANIGVNLPPLLNSAPRSSFPSRTLK-AESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGS

Query:  RPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVE
        RPSL +FGRW+SIFF+EVSH DPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKG S R+YQ+I DALR+C+VPVIKI G EADDVVATLVE
Subjt:  RPSLQSFGRWISIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGIS-RTYQMIGDALRNCNVPVIKINGEEADDVVATLVE

Query:  QVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLL
        QVLQRGFR VIASPDKDFKQLISEDVQLV+PLPELNRWSFYTLKHY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLL+KHGSLENLL
Subjt:  QVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLL

Query:  SAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND
        SAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDV+VQF++EWLVERDR+NDSTI+SK VE+ND
Subjt:  SAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND

SwissProt top hitse value%identityAlignment
O67550 5'-3' exonuclease3.9e-2328.4Show/hide
Query:  FLDVNPLCYDGSRPSLQSFGRWISIFFKEVSHGDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKIN
        F  + PL      P+   +G ++ + F  +    P  ++ VFD    ++ R  +   YK  R K       Q           +I + L+   +P++++ 
Subjt:  FLDVNPLCYDGSRPSLQSFGRWISIFFKEVSHGDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKIN

Query:  GEEADDVVATLVEQVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAI
        G EADDV+A L E+  Q+GF+V I SPDKD  QL+SE+V ++ P+ +      +T +  + ++  +P        ++GD+VD VPGI+    G G KTAI
Subjt:  GEEADDVVATLVEQVLQRGFRVVIASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAI

Query:  KLLQKHGSLENLLSAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDE
         +L+K+GS+EN+L         + + ++      + L  +YK++ L  D++++  +E
Subjt:  KLLQKHGSLENLLSAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRRDVNVQFQDE

P52026 DNA polymerase I2.0e-1926.61Show/hide
Query:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFKQLISE
        ++  FD  G +  R      YK  R         Q+    +S  + ++ + L+   +P  +++  EADD++ T+  +  + GF V + S D+D  QL S 
Subjt:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFKQLISE

Query:  DVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGKPYAQDALTKYADYL
         V + I    +     YT +  V +Y   P   + L+ +MGD+ D +PG+    PG G KTA+KLL++ G++EN+L  A+I  +     ++ L +Y D  
Subjt:  DVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGKPYAQDALTKYADYL

Query:  RTNYKVLALRRDVNVQFQDEWLVERDRKNDSTI
          + ++ A+ RD  V+   + +V +    +  +
Subjt:  RTNYKVLALRRDVNVQFQDEWLVERDRKNDSTI

Q04957 DNA polymerase I1.2e-1929.17Show/hide
Query:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFKQLISE
        ++  FD  G +  R      YK  R         Q+    +S  + ++ + LR   +P  ++   EADD++ TL  +  Q GF V + S D+D  QL S 
Subjt:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFKQLISE

Query:  DVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGKPYAQDALTKYADYL
         V + I    +     YT +    +Y   P   + L+ +MGD+ D +PG+    PG G KTA+KLL++ G++EN+L  A+I  +     ++ L ++ +  
Subjt:  DVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGKPYAQDALTKYADYL

Query:  RTNYKVLALRRDVNVQ
          + K+ A+RRD  V+
Subjt:  RTNYKVLALRRDVNVQ

Q92GB7 DNA polymerase I1.7e-1828.02Show/hide
Query:  SIFFKEVSHGDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVV
        S+  K +S   P  V  VFD  GG   R  + P YKA+R         Q           ++ D   N N P+++ NG EADD++AT   +    G  VV
Subjt:  SIFFKEVSHGDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVV

Query:  IASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGK
        I S DKD  QL++E++++  PL    +  + T    V ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ +  ++  V  
Subjt:  IASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGK

Query:  PYAQDALTKYADYLRTNYKVLALRRDVNVQFQ
           ++ L    +    +++++ L  +V++ FQ
Subjt:  PYAQDALTKYADYLRTNYKVLALRRDVNVQFQ

Q9RLB6 DNA polymerase I5.9e-1928.88Show/hide
Query:  SIFFKEVSHGDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVV
        S+  K +S   P  V  VFD  GG   R  + P YKA+R         Q           ++ D   N N P+++ NG EADD++AT   +    G  VV
Subjt:  SIFFKEVSHGDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVV

Query:  IASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGK
        I S DKD  QL+SE++++  PL    R  + T    V ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ +  ++  V  
Subjt:  IASPDKDFKQLISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGK

Query:  PYAQDALTKYADYLRTNYKVLALRRDVNVQFQ
           ++ L    +    +++++ L  +V++ FQ
Subjt:  PYAQDALTKYADYLRTNYKVLALRRDVNVQFQ

Arabidopsis top hitse value%identityAlignment
AT1G34380.1 5'-3' exonuclease family protein5.6e-6552.92Show/hide
Query:  PRSSFPSRTLKAESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGSRPSLQSFGRWISIFFKEVSH
        P S F   T   +    S+     +     S+ S+  +   +  V+  +   L   +E + ++  RVFFLDV+PLCY+G++PS Q+FG WIS+FF +VS 
Subjt:  PRSSFPSRTLKAESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGSRPSLQSFGRWISIFFKEVSH

Query:  GDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFKQL
         DPVIAV DGE G++ RR LLPSYKAHR    +  +  R++K   R +Q + + LR CNVPV++I G EADDVVATL+EQ +QRG+R VIASPDKDFKQL
Subjt:  GDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFKQL

Query:  ISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLR
        ISE+VQ+VIPL +L RWSFYTLKHY AQYNCDP SDLS R
Subjt:  ISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLR

AT1G34380.2 5'-3' exonuclease family protein5.5e-11360.06Show/hide
Query:  PRSSFPSRTLKAESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGSRPSLQSFGRWISIFFKEVSH
        P S F   T   +    S+     +     S+ S+  +   +  V+  +   L   +E + ++  RVFFLDV+PLCY+G++PS Q+FG WIS+FF +VS 
Subjt:  PRSSFPSRTLKAESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGSRPSLQSFGRWISIFFKEVSH

Query:  GDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFKQL
         DPVIAV DGE G++ RR LLPSYKAHR    +  +  R++K   R +Q + + LR CNVPV++I G EADDVVATL+EQ +QRG+R VIASPDKDFKQL
Subjt:  GDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFKQL

Query:  ISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGKPYAQDALTKYA
        ISE+VQ+VIPL +L RWSFYTLKHY AQYNCDP SDLS RCIMGDEVDGVPGIQH+ P FGRKTA+KL++KHGSLE+LLSAAA+RTVG+PYAQ+ALTKYA
Subjt:  ISEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGKPYAQDALTKYA

Query:  DYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVS
        DYLR NY+VLAL RDV VQ Q+EWL+ERD  NDS ++S
Subjt:  DYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVS

AT3G52050.1 5'-3' exonuclease family protein5.5e-2027.9Show/hide
Query:  GSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFKQLISEDVQLVIPLP
        G   R  L P+YK++     R  +     +G+    Q +  +++  ++ VI++ G EADDV+ TL  + +  GF+V + SPDKDF Q++S  ++L+   P
Subjt:  GSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFKQLISEDVQLVIPLP

Query:  ELNRWSFYTLKHYVAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGKPYAQDALTKYADYLRTNYKVLA
          +  + + ++ +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL +      GK   +++L   AD    + K+  
Subjt:  ELNRWSFYTLKHYVAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGKPYAQDALTKYADYLRTNYKVLA

Query:  LRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDN
        LR D+      +++V  D K+   +  K  EDN
Subjt:  LRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDN

AT3G52050.2 5'-3' exonuclease family protein5.5e-2027.9Show/hide
Query:  GSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFKQLISEDVQLVIPLP
        G   R  L P+YK++     R  +     +G+    Q +  +++  ++ VI++ G EADDV+ TL  + +  GF+V + SPDKDF Q++S  ++L+   P
Subjt:  GSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFKQLISEDVQLVIPLP

Query:  ELNRWSFYTLKHYVAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGKPYAQDALTKYADYLRTNYKVLA
          +  + + ++ +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL +      GK   +++L   AD    + K+  
Subjt:  ELNRWSFYTLKHYVAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGKPYAQDALTKYADYLRTNYKVLA

Query:  LRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDN
        LR D+      +++V  D K+   +  K  EDN
Subjt:  LRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDN

AT3G52050.4 5'-3' exonuclease family protein2.9e-2128.34Show/hide
Query:  VIAVFDGEG-----GSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFK
        V  VFD +G     G   R  L P+YK++     R  +     +G+    Q +  +++  ++ VI++ G EADDV+ TL  + +  GF+V + SPDKDF 
Subjt:  VIAVFDGEG-----GSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFK

Query:  QLISEDVQLVIPLPELNRWSFYTLKHYVAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGKPYAQDALT
        Q++S  ++L+   P  +  + + ++ +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL +      GK   +++L 
Subjt:  QLISEDVQLVIPLPELNRWSFYTLKHYVAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGKPYAQDALT

Query:  KYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDN
          AD    + K+  LR D+      +++V  D K+   +  K  EDN
Subjt:  KYADYLRTNYKVLALRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGGCGAGCGCCAACATCGGCGTTAACCTTCCGCCATTGTTGAACTCTGCTCCGCGTTCGTCCTTCCCGTCAAGAACTCTGAAAGCCGAGTCGGCAATGGCGTC
CAAAGCGAATACGTGGAGAACGAAGCCATTGAAGCTAAGTGCCGCCTCTGCAGCAACTTCTCGTTCTATTTCTGCGGCTGTTAAAGAAACAGAAAATGGACGATTGCAGC
CGAGAGATGAAGCCGTCAATCGAAGAATTGGAAGAGTCTTTTTCCTGGACGTGAATCCTTTATGCTATGATGGCAGTAGACCTAGTTTGCAGAGTTTTGGTCGCTGGATT
TCCATCTTCTTCAAGGAAGTTAGCCATGGTGATCCTGTTATTGCTGTTTTTGATGGGGAAGGAGGTAGTGAGCATCGCAGGTTGTTGTTACCCTCATATAAAGCGCATCG
TATCAAATTCACGAGACAATCATCTTCACAGAGATTTACAAAGGGAATTTCTAGGACATATCAAATGATAGGAGATGCTCTCAGAAACTGTAATGTGCCAGTTATAAAGA
TCAATGGTGAGGAAGCAGATGATGTTGTTGCTACACTTGTGGAACAAGTTTTGCAAAGAGGGTTTCGGGTGGTGATAGCCTCTCCTGATAAAGATTTCAAGCAGTTGATT
TCAGAAGATGTCCAACTCGTGATACCTTTGCCGGAGCTCAACAGATGGTCCTTTTACACCTTAAAACACTACGTCGCTCAGTATAACTGTGATCCGTGCTCTGACTTGAG
TCTTAGATGCATTATGGGTGACGAGGTAGATGGCGTTCCGGGAATTCAGCATGTTGCTCCTGGATTCGGTCGAAAGACTGCAATAAAGCTCTTACAGAAACACGGTTCAT
TGGAGAATCTACTCAGTGCTGCTGCAATAAGAACGGTGGGTAAACCGTATGCGCAAGATGCACTTACGAAGTATGCTGATTACCTGCGAACAAACTATAAAGTTCTAGCC
TTAAGAAGAGACGTTAATGTTCAATTTCAAGACGAGTGGTTGGTTGAAAGAGACAGAAAAAACGATTCGACTATTGTATCCAAGTTAGTAGAAGACAATGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGAGGCGAGCGCCAACATCGGCGTTAACCTTCCGCCATTGTTGAACTCTGCTCCGCGTTCGTCCTTCCCGTCAAGAACTCTGAAAGCCGAGTCGGCAATGGCGTC
CAAAGCGAATACGTGGAGAACGAAGCCATTGAAGCTAAGTGCCGCCTCTGCAGCAACTTCTCGTTCTATTTCTGCGGCTGTTAAAGAAACAGAAAATGGACGATTGCAGC
CGAGAGATGAAGCCGTCAATCGAAGAATTGGAAGAGTCTTTTTCCTGGACGTGAATCCTTTATGCTATGATGGCAGTAGACCTAGTTTGCAGAGTTTTGGTCGCTGGATT
TCCATCTTCTTCAAGGAAGTTAGCCATGGTGATCCTGTTATTGCTGTTTTTGATGGGGAAGGAGGTAGTGAGCATCGCAGGTTGTTGTTACCCTCATATAAAGCGCATCG
TATCAAATTCACGAGACAATCATCTTCACAGAGATTTACAAAGGGAATTTCTAGGACATATCAAATGATAGGAGATGCTCTCAGAAACTGTAATGTGCCAGTTATAAAGA
TCAATGGTGAGGAAGCAGATGATGTTGTTGCTACACTTGTGGAACAAGTTTTGCAAAGAGGGTTTCGGGTGGTGATAGCCTCTCCTGATAAAGATTTCAAGCAGTTGATT
TCAGAAGATGTCCAACTCGTGATACCTTTGCCGGAGCTCAACAGATGGTCCTTTTACACCTTAAAACACTACGTCGCTCAGTATAACTGTGATCCGTGCTCTGACTTGAG
TCTTAGATGCATTATGGGTGACGAGGTAGATGGCGTTCCGGGAATTCAGCATGTTGCTCCTGGATTCGGTCGAAAGACTGCAATAAAGCTCTTACAGAAACACGGTTCAT
TGGAGAATCTACTCAGTGCTGCTGCAATAAGAACGGTGGGTAAACCGTATGCGCAAGATGCACTTACGAAGTATGCTGATTACCTGCGAACAAACTATAAAGTTCTAGCC
TTAAGAAGAGACGTTAATGTTCAATTTCAAGACGAGTGGTTGGTTGAAAGAGACAGAAAAAACGATTCGACTATTGTATCCAAGTTAGTAGAAGACAATGATTGA
Protein sequenceShow/hide protein sequence
MAEASANIGVNLPPLLNSAPRSSFPSRTLKAESAMASKANTWRTKPLKLSAASAATSRSISAAVKETENGRLQPRDEAVNRRIGRVFFLDVNPLCYDGSRPSLQSFGRWI
SIFFKEVSHGDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGISRTYQMIGDALRNCNVPVIKINGEEADDVVATLVEQVLQRGFRVVIASPDKDFKQLI
SEDVQLVIPLPELNRWSFYTLKHYVAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAIKLLQKHGSLENLLSAAAIRTVGKPYAQDALTKYADYLRTNYKVLA
LRRDVNVQFQDEWLVERDRKNDSTIVSKLVEDND