CuGenDBv2

CmaCh04G030110 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh04G030110
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: 5'-3' exonuclease
Genome location: Cma_Chr04:19680249..19688736
RNA-Seq Expression: CmaCh04G030110
Synteny: CmaCh04G030110
Gene Ontology terms:
GO:0006261 - DNA-dependent DNA replication (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004527 - exonuclease activity (molecular function)
InterPro domains:
IPR002298 - DNA polymerase A
IPR002421 - 5'-3' exonuclease
IPR018790 - Protein of unknown function DUF2358
IPR020045 - DNA polymerase I-like, H3TH domain
IPR020046 - 5'-3' exonuclease, alpha-helical arch, N-terminal
IPR029060 - PIN-like domain superfamily
IPR032710 - NTF2-like domain superfamily
IPR036279 - 5'-3' exonuclease, C-terminal domain superfamily
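
The genome location value in the record above appears to follow a `seqid:start..end` pattern. As a minimal sketch, assuming that pattern and assuming the coordinates are 1-based and inclusive (the record does not state this), the value could be split into its parts like so. The names `parse_location` and `span_length` are hypothetical helpers introduced only for this illustration.

```python
import re

# Assumed pattern: <seqid>:<start>..<end>, as in "Cma_Chr04:19680249..19688736".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Cma_Chr04:19680249..19688736")
print(seqid, start, end, span_length(start, end))
```

If the database actually uses 0-based or half-open coordinates, only `span_length` would need to change.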


Homology
GenBank top hits (e value, % identity, alignment)
KAG6602717.1 hypothetical protein SDJN03_07950, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 87.68
Query:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASAN+GLNIPPFLNS SPTSLPSRTLKAAESVRTTKAK WRTKPLKLSVFAA+SRSTSSGFKQAEDGKLPP +EADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVS SDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSC+VPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR-----------------------------------------------------------------A
        AAA+RTVGKPYAQDALTKYADYLRTNYKVLALRR                                                                 +
Subjt:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR-----------------------------------------------------------------A

Query:  SSSSSSSLSLTSVHSPQPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDPLNTFSGIERYKLIFWALRFH
        SSSSSSSL LTS+HSPQPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDPLNTFSGIERYKLIFWALRFH
Subjt:  SSSSSSSLSLTSVHSPQPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDPLNTFSGIERYKLIFWALRFH

Query:  GRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVLDLVTACPASPNPTFLWGTEE
        GRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVLDLVTACPASPNPTF WGTEE
Subjt:  GRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVLDLVTACPASPNPTFLWGTEE

Query:  LHCSSWVELYQAVRRSVGGEGYLITQDGFLTCS
        LHCSSWVELYQAVRRSVGGEGYLITQDGFLTCS
Subjt:  LHCSSWVELYQAVRRSVGGEGYLITQDGFLTCS

KAG7033404.1 hypothetical protein SDJN02_07460 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.1e-182 | % identity: 97.01
Query:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASANIGLNIPPFLNS SPTSLPSRTLKAAESVRTTKAK WRTKPLKLSVF A+SRSTSSGFKQAEDGKL P +EADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVS SDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSC+VPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAA+RTVGKPYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR

XP_022961199.1 uncharacterized protein LOC111461781 [Cucurbita moschata] | e value: 6.0e-185 | % identity: 98.2
Query:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASANIGLNIPPFLNS SPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAA+SRSTSSGFKQAEDGKLPP MEADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVS SDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSC+VPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAA+RTVGKPYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR

XP_022990137.1 uncharacterized protein LOC111487120 [Cucurbita maxima] | e value: 4.5e-188 | % identity: 100
Query:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR

XP_023542307.1 uncharacterized protein LOC111802238 [Cucurbita pepo subsp. pepo] | e value: 7.9e-185 | % identity: 97.9
Query:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASANIGLNIPPF NSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAA+SRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYK+HRIKFTRQSSSQRFTKGNSGRSYQVI+DALRSC+VPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRG RAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHY AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LQN4 53EXOc domain-containing protein | e value: 9.5e-160 | % identity: 84.52
Query:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAK--SWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQG
        MAEASANIG+N PPFLNS+SPT LPSRTLK  E   T+K K  +WRTKPL L+ FA +SR TS+ F Q +DGK  PR+EADN R GRVFFLDVNPLCYQG
Subjt:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAK--SWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQG

Query:  SRPSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLV
        S+PSL NFGRWVSIFFEEVSHSDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTR  SS+RFTKGN   SYQVIRDALRSC+VPV++++GHEADDV+ATLV
Subjt:  SRPSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLV

Query:  EQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENL
        EQVLQRG R V+ASPDKDFKQLISED+QLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENL
Subjt:  EQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENL

Query:  LSAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR
        LSAAAIRTVGKPYAQDALTKYA+YLRTNYKVLALRR
Subjt:  LSAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR

A0A1S3B2X7 5'-3' exonuclease | e value: 1.4e-158 | % identity: 83.83
Query:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR
        M EASA IG+N PPFLNS+S T LPSRT   ++     K  +WRTKPLKL+ F  +SR TS+ F Q +DGK  PR+EADNPRKGRVFFLDVNPLCYQG++
Subjt:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ
        PSL NFGRWVSIFFEEVSHSDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTR  SSQRFTKGN   SYQVIRDALRSC+VPV+K+ GHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRG R V+ASPDKDFKQLISEDVQLVMPLPELNRWSFYT++HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAAIRTVGKPYAQDALTKYA+YLRTNYKVLALRR
Subjt:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR

A0A6J1DGI1 uncharacterized protein LOC111020233 | e value: 2.4e-163 | % identity: 87.43
Query:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR
        MAEA ANIG+NIPPFLNSAS +SLPSRTLK  ES  TTK+ SWRTK L+LS F  AS ST   FKQ + G L P +EADNPRKGRVFFLDVNPLCY+GSR
Subjt:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRW SIFFE+VSHSDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+C+VPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAAIRTVG+PYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR

A0A6J1HDC1 uncharacterized protein LOC111461781 | e value: 2.9e-185 | % identity: 98.2
Query:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASANIGLNIPPFLNS SPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAA+SRSTSSGFKQAEDGKLPP MEADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVS SDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSC+VPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAA+RTVGKPYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR

A0A6J1JHT6 uncharacterized protein LOC111487120 | e value: 2.2e-188 | % identity: 100
Query:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR

SwissProt top hits (e value, % identity, alignment)
O67550 5'-3' exonuclease | e value: 4.4e-21 | % identity: 39.23
Query:  VIRDALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDG
        VI++ L+   +P++++ G+EADDV+A L E+  Q+GF+  I SPDKD  QL+SE+V ++ P+ +      +T +  + ++  +P        ++GD+VD 
Subjt:  VIRDALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDG

Query:  VPGIQHVAPGFGRKTAMKLLKKHGSLENLL
        VPGI+    G G KTA+ +LKK+GS+EN+L
Subjt:  VPGIQHVAPGFGRKTAMKLLKKHGSLENLL

P52026 DNA polymerase I | e value: 4.6e-18 | % identity: 24.43
Query:  KGRVFFLDVNPLCYQG--SRPSLHNFGRWVSIFFEEVSHSDPVIA---VIDGEGGSEHRRLLLPSYKAHRIKFTRQS-----SSQRFTKGNSGRSYQVIR
        K ++  +D N + Y+   + P LHN         ++  H++ V     +++     E    +L ++ A +  F  ++       ++ T       + ++R
Subjt:  KGRVFFLDVNPLCYQG--SRPSLHNFGRWVSIFFEEVSHSDPVIA---VIDGEGGSEHRRLLLPSYKAHRIKFTRQS-----SSQRFTKGNSGRSYQVIR

Query:  DALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPG
        + L++  +P  ++  +EADD++ T+  +  + GF   + S D+D  QL S  V + +    +     YT +  + +Y   P   + L+ +MGD+ D +PG
Subjt:  DALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPG

Query:  IQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR
        +    PG G KTA+KLLK+ G++EN+L  A+I  +     ++ L +Y D    + ++ A+ R
Subjt:  IQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR

P52028 DNA polymerase I, thermostable | e value: 1.6e-15 | % identity: 29.69
Query:  KGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRF------------TKGNSGRSYQV
        KGRV  +D + L Y+           + ++     S  +PV AV  G   S  + L    YKA  + F  ++ S R             T  +  R   +
Subjt:  KGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRF------------TKGNSGRSYQV

Query:  IRDALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGV
        I++ +       +++ G+EADDV+ATL ++  + G+   I + D+D  QL+S+ V ++ P   L      T +    +Y   P   +  R ++GD  D +
Subjt:  IRDALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGV

Query:  PGIQHVAPGFGRKTAMKLLKKHGSLENLL
        PG++    G G KTA+KLLK+ GSLENLL
Subjt:  PGIQHVAPGFGRKTAMKLLKKHGSLENLL

Q9RLB6 DNA polymerase I | e value: 9.5e-16 | % identity: 28.65
Query:  IAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISE
        +AV+   GG   R  + P YKA+R         Q            ++RD   + + P+++  G+EADD++AT   +    G   VI S DKD  QL+SE
Subjt:  IAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISE

Query:  DVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSA
        ++++  PL    R  + T    + ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ ++
Subjt:  DVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSA

Q9S1G2 DNA polymerase I | e value: 5.6e-16 | % identity: 31.84
Query:  IAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISE
        +AVI        R+ L  +YKA+R     +   Q          + +IR+A R+ ++P I+ +G EADD++AT   Q    G    I S DKD  QL+S 
Subjt:  IAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISE

Query:  DVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAA
        +V +   + +        +   + ++   P   + L+ + GD VD VPGI    PG G KTA +LL+++G L+ LL  A
Subjt:  DVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAA

Arabidopsis top hits (e value, % identity, alignment)
AT1G16320.1 Uncharacterized conserved protein (DUF2358) | e value: 7.1e-91 | % identity: 63.78
Query:  NYKVLALRRASSSSS----------SSLSLTSVHSPQPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDP
        N K +  +  SSSSS          S+LSL S+ SP   L+  Q+ +  +  D  ++  RD+FY+NLG+AVRTLREDLPL+F RDLNYDIYRDDITF+DP
Subjt:  NYKVLALRRASSSSS----------SSLSLTSVHSPQPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDP

Query:  LNTFSGIERYKLIFWALRFHGRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVL
        +NTF+G++ YK+IFWALRFHG+ILFR+I +E++R+WQPSEN+ILIRWNLKGVPRVPWEA+GEFQGTSR+K+DRNGKIYEHKVDNLAFNFPQ LKPAASVL
Subjt:  LNTFSGIERYKLIFWALRFHGRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVL

Query:  DLVTACPA-SPNPTFLWGTEELHCSSWVELYQAVRRSVGGEGYLITQDGFLTCS
        DLVTA PA SPNPTF +   + + SSWV+ YQAVR ++  E   +T D  +TCS
Subjt:  DLVTACPA-SPNPTFLWGTEELHCSSWVELYQAVRRSVGGEGYLITQDGFLTCS

AT1G34380.1 5'-3' exonuclease family protein | e value: 1.6e-63 | % identity: 52.7
Query:  PTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQA-EDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSH
        P SL S + K+ +  + ++ K   +     S  ++      +G  Q  +   L    E    +  RVFFLDV+PLCY+G++PS   FG W+S+FF +VS 
Subjt:  PTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQA-EDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSH

Query:  SDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQ
        +DPVIAVIDGE G++ RR LLPSYKAHR    +  +  R++K    R +Q + + LR C+VPV++I+GHEADDVVATL+EQ +QRG+RAVIASPDKDFKQ
Subjt:  SDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQ

Query:  LISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLR
        LISE+VQ+V+PL +L RWSFYTLKHY AQYNCDP SDLS R
Subjt:  LISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLR

AT1G34380.2 5'-3' exonuclease family protein | e value: 3.1e-102 | % identity: 60
Query:  PTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQA-EDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSH
        P SL S + K+ +  + ++ K   +     S  ++      +G  Q  +   L    E    +  RVFFLDV+PLCY+G++PS   FG W+S+FF +VS 
Subjt:  PTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQA-EDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSH

Query:  SDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQ
        +DPVIAVIDGE G++ RR LLPSYKAHR    +  +  R++K    R +Q + + LR C+VPV++I+GHEADDVVATL+EQ +QRG+RAVIASPDKDFKQ
Subjt:  SDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQ

Query:  LISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGKPYAQDALTKY
        LISE+VQ+V+PL +L RWSFYTLKHY AQYNCDP SDLS RCIMGDEVDGVPGIQH+ P FGRKTAMKL++KHGSLE+LLSAAA+RTVG+PYAQ+ALTKY
Subjt:  LISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGKPYAQDALTKY

Query:  ADYLRTNYKVLALRR
        ADYLR NY+VLAL R
Subjt:  ADYLRTNYKVLALRR

AT1G79510.1 Uncharacterized conserved protein (DUF2358) | e value: 3.4e-93 | % identity: 65.32
Query:  ALRRASSSSS---------SSLSLTSVHSPQPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDPLNTFSG
        AL  +SSSS+         S LSL SV +P P ++G Q+ +  +  D      +DDFY+NLGLAVRTLREDLPL+F +DLNYDIYRDDIT +DP+NTFSG
Subjt:  ALRRASSSSS---------SSLSLTSVHSPQPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDPLNTFSG

Query:  IERYKLIFWALRFHGRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVLDLVTAC
        I+ YKLIFWALRFHG+ILFR+I +E++R+WQPSEN+ILIRWNLKGVPRVPWEA+GEFQGTSR+K+DRNGKIYEHKVDNLAFNFP  LKPA SVLD+VTAC
Subjt:  IERYKLIFWALRFHGRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVLDLVTAC

Query:  PASPNPTFLWGTEELHCSSWVELYQAVRRSVG-GEGYLITQDGFLTCS
        PASPNPTF++G  + + SSW+E Y+AV+R++   E  ++ QD F+ CS
Subjt:  PASPNPTFLWGTEELHCSSWVELYQAVRRSVG-GEGYLITQDGFLTCS

AT1G79510.2 Uncharacterized conserved protein (DUF2358) | e value: 3.4e-93 | % identity: 65.32
Query:  ALRRASSSSS---------SSLSLTSVHSPQPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDPLNTFSG
        AL  +SSSS+         S LSL SV +P P ++G Q+ +  +  D      +DDFY+NLGLAVRTLREDLPL+F +DLNYDIYRDDIT +DP+NTFSG
Subjt:  ALRRASSSSS---------SSLSLTSVHSPQPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDPLNTFSG

Query:  IERYKLIFWALRFHGRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVLDLVTAC
        I+ YKLIFWALRFHG+ILFR+I +E++R+WQPSEN+ILIRWNLKGVPRVPWEA+GEFQGTSR+K+DRNGKIYEHKVDNLAFNFP  LKPA SVLD+VTAC
Subjt:  IERYKLIFWALRFHGRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVLDLVTAC

Query:  PASPNPTFLWGTEELHCSSWVELYQAVRRSVG-GEGYLITQDGFLTCS
        PASPNPTF++G  + + SSW+E Y+AV+R++   E  ++ QD F+ CS
Subjt:  PASPNPTFLWGTEELHCSSWVELYQAVRRSVG-GEGYLITQDGFLTCS
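
In the alignment blocks above, the unlabeled middle line sits between each Query and Subjt row. By the usual BLAST convention it shows a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The following is a minimal sketch of how identities and positives could be tallied for one block under that assumption. The function name `midline_stats` is hypothetical, and the midline is padded to the query width on the assumption that trailing spaces were lost when the page was exported.

```python
def midline_stats(query, midline):
    """Count identities and positives in one alignment block.

    Assumes BLAST-style midline symbols: a letter marks an identical
    residue, '+' a conservative substitution, a space anything else.
    Returns (identities, positives, aligned_length).
    """
    # Restore trailing spaces that may have been stripped from the midline.
    padded = midline.ljust(len(query))
    identities = sum(1 for ch in padded if ch.isalpha())
    positives = identities + padded.count("+")
    return identities, positives, len(query)


# Last block of one of the GenBank hits above.
query = "AAAIRTVGKPYAQDALTKYADYLRTNYKVLALRR"
midline = "AAA+RTVGKPYAQDALTKYADYLRTNYKVLALRR"
identities, positives, length = midline_stats(query, midline)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```

The reported % identity for a hit covers the whole alignment, so the counts from every block of that hit would need to be summed before comparing against it.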


Sequences
CDS sequence
ATGGCTGAGGCGAGCGCCAACATAGGCCTTAATATTCCGCCATTTTTGAACTCTGCTTCGCCTACTTCCTTACCGTCGAGGACTCTGAAAGCCGCCGAGTCGGTAAGAAC
AACCAAAGCCAAATCATGGAGAACCAAGCCATTGAAGCTGAGTGTCTTCGCGGCAGCTTCTCGTTCTACTTCTTCGGGTTTTAAGCAAGCAGAAGATGGAAAATTGCCGC
CAAGAATGGAAGCTGATAATCCAAGAAAGGGAAGGGTCTTTTTCCTGGACGTAAATCCTCTATGTTATCAAGGCAGCAGACCTAGTTTGCACAATTTTGGTCGCTGGGTT
TCCATCTTCTTCGAGGAAGTTAGCCATAGCGATCCTGTTATTGCCGTTATTGATGGGGAAGGAGGTAGTGAGCATCGCAGGCTGTTGTTACCCTCATATAAAGCACATCG
GATCAAATTCACTAGACAATCATCTTCACAAAGATTTACAAAGGGAAATTCTGGAAGGTCATATCAAGTGATAAGAGATGCTCTCAGAAGCTGTGATGTGCCCGTTATAA
AGATCAAAGGTCACGAAGCAGATGACGTTGTCGCTACACTTGTGGAACAAGTTTTGCAACGAGGGTTCCGGGCAGTAATAGCCTCTCCTGATAAAGATTTCAAGCAGTTG
ATTTCAGAAGATGTCCAACTCGTGATGCCTTTGCCAGAGCTCAACAGATGGTCCTTTTACACCTTAAAACACTACCTTGCTCAGTATAACTGTGATCCGTGCTCTGACTT
GAGTCTTAGATGCATAATGGGTGATGAGGTAGATGGCGTTCCGGGAATCCAGCACGTTGCTCCTGGATTTGGTCGAAAGACTGCAATGAAGCTCTTAAAGAAACATGGTT
CTTTGGAGAATCTACTCAGTGCTGCTGCAATAAGAACTGTGGGCAAACCATATGCACAAGATGCACTTACAAAGTATGCCGATTACCTCCGAACGAACTACAAAGTTCTA
GCCTTAAGAAGGGCTTCTTCTTCTTCTTCTTCCTCTCTCTCCCTCACCAGCGTCCACTCCCCTCAGCCTCTTCTCCAGGGCCCTCAACTCGCCTCTCCTTCTGCTTTCCC
CGATTCCCGCTCCGACGACCGCAGAGACGACTTTTATGTCAATCTCGGCCTCGCTGTCAGGACGCTTCGCGAGGACCTTCCTCTCATCTTCGCCAGAGACCTCAATTACG
ACATTTACAGGGACGACATAACGTTTATTGATCCTCTCAACACGTTCAGTGGGATTGAGAGGTACAAATTGATATTCTGGGCGTTGAGATTTCATGGTAGAATTCTGTTC
CGGGAGATCGGGATTGAGGTGTACCGGATTTGGCAGCCTTCTGAAAACGTTATACTGATTCGGTGGAATTTGAAGGGCGTTCCTAGGGTTCCATGGGAGGCCAGAGGTGA
GTTTCAGGGCACTTCGCGGTTTAAGGTGGATCGAAATGGGAAAATTTACGAACACAAAGTGGACAATTTAGCATTCAATTTCCCTCAGTCACTGAAACCGGCTGCGTCGG
TGTTGGATTTGGTTACTGCGTGCCCTGCTAGCCCCAATCCCACGTTCTTGTGGGGAACGGAGGAATTGCATTGCTCTTCCTGGGTGGAGCTTTATCAGGCAGTAAGGAGA
AGTGTGGGTGGAGAAGGCTATTTGATTACACAAGATGGATTCCTCACATGTTCATAG
mRNA sequence
TTCAGATTAAATCAAGTTATCGACTTTCGTTCATATTTCCAATTTTCCGCATCGTTTTCTGTTTCCTGATAGGACCCGTTCTAGAAATGGCTGAGGCGAGCGCCAACATA
GGCCTTAATATTCCGCCATTTTTGAACTCTGCTTCGCCTACTTCCTTACCGTCGAGGACTCTGAAAGCCGCCGAGTCGGTAAGAACAACCAAAGCCAAATCATGGAGAAC
CAAGCCATTGAAGCTGAGTGTCTTCGCGGCAGCTTCTCGTTCTACTTCTTCGGGTTTTAAGCAAGCAGAAGATGGAAAATTGCCGCCAAGAATGGAAGCTGATAATCCAA
GAAAGGGAAGGGTCTTTTTCCTGGACGTAAATCCTCTATGTTATCAAGGCAGCAGACCTAGTTTGCACAATTTTGGTCGCTGGGTTTCCATCTTCTTCGAGGAAGTTAGC
CATAGCGATCCTGTTATTGCCGTTATTGATGGGGAAGGAGGTAGTGAGCATCGCAGGCTGTTGTTACCCTCATATAAAGCACATCGGATCAAATTCACTAGACAATCATC
TTCACAAAGATTTACAAAGGGAAATTCTGGAAGGTCATATCAAGTGATAAGAGATGCTCTCAGAAGCTGTGATGTGCCCGTTATAAAGATCAAAGGTCACGAAGCAGATG
ACGTTGTCGCTACACTTGTGGAACAAGTTTTGCAACGAGGGTTCCGGGCAGTAATAGCCTCTCCTGATAAAGATTTCAAGCAGTTGATTTCAGAAGATGTCCAACTCGTG
ATGCCTTTGCCAGAGCTCAACAGATGGTCCTTTTACACCTTAAAACACTACCTTGCTCAGTATAACTGTGATCCGTGCTCTGACTTGAGTCTTAGATGCATAATGGGTGA
TGAGGTAGATGGCGTTCCGGGAATCCAGCACGTTGCTCCTGGATTTGGTCGAAAGACTGCAATGAAGCTCTTAAAGAAACATGGTTCTTTGGAGAATCTACTCAGTGCTG
CTGCAATAAGAACTGTGGGCAAACCATATGCACAAGATGCACTTACAAAGTATGCCGATTACCTCCGAACGAACTACAAAGTTCTAGCCTTAAGAAGGGCTTCTTCTTCT
TCTTCTTCCTCTCTCTCCCTCACCAGCGTCCACTCCCCTCAGCCTCTTCTCCAGGGCCCTCAACTCGCCTCTCCTTCTGCTTTCCCCGATTCCCGCTCCGACGACCGCAG
AGACGACTTTTATGTCAATCTCGGCCTCGCTGTCAGGACGCTTCGCGAGGACCTTCCTCTCATCTTCGCCAGAGACCTCAATTACGACATTTACAGGGACGACATAACGT
TTATTGATCCTCTCAACACGTTCAGTGGGATTGAGAGGTACAAATTGATATTCTGGGCGTTGAGATTTCATGGTAGAATTCTGTTCCGGGAGATCGGGATTGAGGTGTAC
CGGATTTGGCAGCCTTCTGAAAACGTTATACTGATTCGGTGGAATTTGAAGGGCGTTCCTAGGGTTCCATGGGAGGCCAGAGGTGAGTTTCAGGGCACTTCGCGGTTTAA
GGTGGATCGAAATGGGAAAATTTACGAACACAAAGTGGACAATTTAGCATTCAATTTCCCTCAGTCACTGAAACCGGCTGCGTCGGTGTTGGATTTGGTTACTGCGTGCC
CTGCTAGCCCCAATCCCACGTTCTTGTGGGGAACGGAGGAATTGCATTGCTCTTCCTGGGTGGAGCTTTATCAGGCAGTAAGGAGAAGTGTGGGTGGAGAAGGCTATTTG
ATTACACAAGATGGATTCCTCACATGTTCATAGAGTATTTTATAAATATAGATTCTTCTAAGCCATTTTGTTTTGTATTTTCACAATTATCGATAGCAAGCCACTAGAGT
GAAAATGAAACAGCATTTGAGTGGAAATAGAAATGACGAAGTCCAGAATGTCTTACTGTCGTGTCTCTGTTCTGTCCACAATATTATGAAGCAAATGAAAACATTGGAAG
TCTTCTTTATCATAGATATCCAAGGAATCTCTGGGTTCATAATTATGTGTATAGGGATGAGATTATGTGATTTGCAACAAGTAGAGTTCTTTTTATCGCAATTCTTGAGT
ATCAGGAGCTAAAGGTAAAATGCTATCAGAGTTGAATTACAGGACAAAATTCTTTTACCTCTTGAAAAGCAGAGCCCCTGAATCTATCTATTTTAAGAAAACGGTTGACA
ATTATGCTGAATTCCTGGGAATGAGGGGTTTGATAGGGGAAAGAGATCTAATTTCAGGTGCAGAACTGGGAAAGAGAGTATTCAACTTGTGCATTAGGATATCTGATCTT
GATGGAATCTCATTCAGAAGGAACTGTTGGACGATTGGTTTTTTGAACCACTCCATCACAGAATCTCCTGTTGCCCTCAGAACCACCTTACTAACCTCAAATGGGGATTC
CACAGAGCTCAACATTTGAGCGATCACAAGCTTCAAAGTAGGTCCAGTAACCAATTTTATGCGCCTTGGAACTATACCATAATACAACATTCGCTTCCTGAACCGATGGA
GTGTGTGCACGACTGCTATAAGGGCAGCTCCAACTGACAAGTTCCTAACATCGAGTGACCAGGCGATGTGTTGATCAAAGACTATGGCCTTTGGAAAGAGCTTATTCTCA
TAAGCGACCTTCCAAATGCATCGAGCTCGATTTATTTGTCCCATTAGAATGAGATAGTTGAGGAGGACATTCACAAAGTATCTTGCACTGCTCTCTTCCAGCTCATAGTC
GATGCTCTGAAAAAACTCCCTCACAAAAGATAAAACTGGTTGTTTCCTTTGTTCTGGTCCAGTGAACAGTCCACTCAAGAAAGAGTGTGCCTTATGTCTGGTAGCACTGA
GGATGGACATCAAATATCTCTCATTATGCTCTTCTTGACACCTTACAAGATGGGCCAGGATGGATGTGTAAAGTATGAGATCAACTTTTGCAGCAGAGTCTACATAGGAT
TCTAAGAGAGGCTTAGCTGACTCGTACATTCCTTTCTTCATACATGACTCAAACAATTGCCTGAGGATGAAGTTATTAGTTCTTATTCCAGACGAACCCATGAACTGAAG
CCACCTCAAAGCAGCATCGGTATGACCTTCCTTGATATACACCATCAATACATCACTTGCACTCACATCAACAGAGAATCCCATGGCCTTCATTTCAAGTAAAACTTTGG
CAGCAATGTCAATTAGCTTCTTATTAGCCAGGAGCGTCAATAAAGCAGTGTATGTACTTAGCCCGAGTCTCAAGCCTGCATTAGTCATAGAGTTGTAGAGTTTCATGGCA
GGATCTACTTGTCTCGATGCTGCGTGCATTTCCAAGAGACAGCAGTAAGTAGATGGAGTGGGAAGAAATCCAGCTTTCTCCATTTCGGTGAAGATAGACATAGCAACCTC
GAGTTTCCCTGATTTAGCATGTGACTCAACAACTATAGAGTACAAACCAAAGTTCGGCCTAAAACCTGCCCTTTTCATCTCATCCCAAAGCTTGAGAGCAGTATCCAACT
TCCCAGCCTTCACATGTGACTCAACTAAAGAAACGAACATAGATGCAGGTGGTCTGAGCTCTAGCAGCTGCATTTCCATGTAAATCTTCATCGATGTGTCGAGCCTCCCA
GCTTTCCCCATAGAATCCACAAGGGTTGTAAAAACATTTAGGCCAGGACGAAAATTCCTCTCTTTCATCTCTTGAAAGAGCTTCATTGCAGCATCCAGACGACCTGATTT
TGCCAAGCTTGGTATCATCAACTCAAAGGTAGATGCATCCAAAGAACACCCTGCTCCTCCCATGCTCTCGTATATCTCGAATGCCTTGTAAGGCAGACCCTTGTTTAAGA
ACAAGGTTATAAGCGAATTGTATGTTTGGGTATCAACTTTGAAACCTGAATCATGAATCTTCTTGAAACAACAGAAAGACACTTCCAATTTCTCAGCTTTAGCCAAATAC
TGAATCACACGATTATACGCACTGAATGACACAGTCCCATCATTGCTTAAATCACGAACAATCTCATCAAACAGCAACTGAATGGCATCAAAATCTCTTTTCTGATTCAA
CCCATCAAACAACAACCCATAGCATTCATCGTTAGCTGAATACCAGGACTGCCTCTTAGCCCAACGAAACAAACTCAAAGAAGACTCAGCATCATTGATAATCTTCAATG
CCTGAGTGATGTGTGTCATATTAGGAACAAATTGGAGTTTTTCAAGCTGGGATTCCAACTCCGGTCCCCATTTCCACCTCCATACGATCTCAACTATCTTAGCAACAGCC
GAGGCATTCAGAAAAGGCTTTTTGAGTCCGCCCACCATTACATGGTCATCCAGACCTGGTTCAACTGACCGAACGCCTTTACCGGAGAAAATCACACTTCCAGACTCATC
TAGATACTCAATATTCTCAGTCCACTCGCTAGACCCATTGCCACTGCTCTTTTCAGAAGAATAAGATCTGATAAAACTACGATTATGGAAAACATACGCCTTTTCGGATA
ACTCAGAGTTTCTGGAATTTAAAACAGATGGCGCTTCGCGACATGAAATGTGGGATTTAAACCATCTTCCCTGGAAGAGGAGGGAAGTCGGCAGAATAAATCGACGCTTG
CGTAGGGAATAGGACCCGAGAAGCAGTTGTACAGCACGAAAAGAAGGCATACTGGAATTGGATTGTTCCCTTTCCTCGAGGGTCAAATTGAATGAAGAACTAGAATAAGA
GAAATTTGATCACGTACCGGTATGCTTGTTGCGCGCTGAGCTGCGAGAATCTGCCGGTTTACTTGCAGGCATTTTTGCGAACAGGTTGTGGGCGTCTGAAGGGGGGGCGG
CAACGAAGTCGTGTTTGGAATTGTGGGTTCAAAAGATTGGGTCGATTCCAATGCCTAAAGCCGGCTGAAGTGAACACTGAAGGAGAAACTCTGGTCTGCTTCTGCTTCTT
CTTCTTCTTCTCATAAAGACTGGACTGGAAGGACCAATAAGTTGCGGGTCTAAATCGGCTGATAGTGTTCCCTTCCGCCTCCTTTTGACAAGCAACCGAGTTTGGACTTG
CCAAAATTCACAACTCTGTAGCCCAACTTTATAGTGGAGTCTCGGCTACATATATGGTTTGGTCCTTTATGTATGCCCAGCCCATTATATTTTCAGTTTCGT
Protein sequence
MAEASANIGLNIPPFLNSASPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAAASRSTSSGFKQAEDGKLPPRMEADNPRKGRVFFLDVNPLCYQGSRPSLHNFGRWV
SIFFEEVSHSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCDVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQL
ISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGKPYAQDALTKYADYLRTNYKVL
ALRRASSSSSSSLSLTSVHSPQPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDPLNTFSGIERYKLIFWALRFHGRILF
REIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVLDLVTACPASPNPTFLWGTEELHCSSWVELYQAVRR
SVGGEGYLITQDGFLTCS
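
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly. `translate` is a hypothetical helper written for this illustration, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS up to (not including) the first stop codon."""
    # Drop line breaks and other whitespace, then normalise case.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS above.
print(translate("ATGGCTGAGGCGAGCGCCAACATAGGCCTT"))
```

Passing the full CDS text, line breaks included, and comparing the result with the protein sequence would confirm that the two fields agree.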