CuGenDBv2

CmoCh04G031320 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G031320
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: 5'-3' exonuclease
Genome location: Cmo_Chr04:21715124..21720817
RNA-Seq Expression: CmoCh04G031320
Synteny: CmoCh04G031320
Gene Ontology terms:
    GO:0006261 - DNA-dependent DNA replication (biological process)
    GO:0006302 - double-strand break repair (biological process)
    GO:0071897 - DNA biosynthetic process (biological process)
    GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
    GO:0003677 - DNA binding (molecular function)
    GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
    GO:0004527 - exonuclease activity (molecular function)
InterPro domains:
    IPR002298 - DNA polymerase A
    IPR002421 - 5'-3' exonuclease
    IPR018790 - Protein of unknown function DUF2358
    IPR020045 - DNA polymerase I-like, H3TH domain
    IPR020046 - 5'-3' exonuclease, alpha-helical arch, N-terminal
    IPR029060 - PIN-like domain superfamily
    IPR032710 - NTF2-like domain superfamily
    IPR036279 - 5'-3' exonuclease, C-terminal domain superfamily
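
The genome location in this record is written as chromosome:start..end. As a minimal sketch, the string can be split into its parts and the genomic span computed. The helper name `parse_location` is hypothetical and not part of CuGenDB, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

# Hypothetical helper (not a CuGenDB API): parses "chrom:start..end" strings.
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location):
    """Return (chromosome, start, end, span) for a 'chrom:start..end' string."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    # Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
    return match.group("chrom"), start, end, end - start + 1


chrom, start, end, span = parse_location("Cmo_Chr04:21715124..21720817")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the gene spans 5694 bp on Cmo_Chr04, which is longer than the mRNA listed under Sequences; the difference would be consistent with introns, although the page itself does not list the exon structure.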


Homology
GenBank top hits (e value, % identity, alignment)
KAG6602717.1 hypothetical protein SDJN03_07950, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | identity: 88.94%
Query:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASAN+GLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAK WRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPT+EADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRRA---------------------------------------------------------------SS
        AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR                                                                +S
Subjt:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRRA---------------------------------------------------------------SS

Query:  SSSSSSSLPLTSLHSPRPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDPLNTFSGIERYKLIFWALRFH
        SSSSSSSLPLTSLHSP+PLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDPLNTFSGIERYKLIFWALRFH
Subjt:  SSSSSSSLPLTSLHSPRPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDPLNTFSGIERYKLIFWALRFH

Query:  GRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVLDLVTACPASPNPTFLWGTEE
        GRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVLDLVTACPASPNPTF WGTEE
Subjt:  GRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVLDLVTACPASPNPTFLWGTEE

Query:  LHCSSWVELYQAVRRSVGGEGYLITQDGFLTCS
        LHCSSWVELYQAVRRSVGGEGYLITQDGFLTCS
Subjt:  LHCSSWVELYQAVRRSVGGEGYLITQDGFLTCS

KAG7033404.1 hypothetical protein SDJN02_07460 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.7e-185 | identity: 98.8%
Query:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAK WRTKPLKLSVF ASSRSTSSGFKQAEDGKL PT+EADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR

XP_022961199.1 uncharacterized protein LOC111461781 [Cucurbita moschata] | e value: 7.6e-188 | identity: 100%
Query:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR

XP_022990137.1 uncharacterized protein LOC111487120 [Cucurbita maxima] | e value: 6.1e-185 | identity: 98.2%
Query:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASANIGLNIPPFLNS SPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAA+SRSTSSGFKQAEDGKLPP MEADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVS SDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSC+VPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAA+RTVGKPYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR

XP_023542307.1 uncharacterized protein LOC111802238 [Cucurbita pepo subsp. pepo] | e value: 1.5e-183 | identity: 97.31%
Query:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASANIGLNIPPF NS SPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPP MEADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVS SDPVIAVIDGEGGSEHRRLLLPSYK+HRIKFTRQSSSQRFTKGNSGRSYQVI+DALRSCNVPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRG RAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHY AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAA+RTVGKPYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LQN4 53EXOc domain-containing protein | e value: 6.2e-159 | identity: 84.23%
Query:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAK--SWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQG
        MAEASANIG+N PPFLNS+SPT LPSRTLK  E   T+K K  +WRTKPL L+ FA SSR TS+ F Q +DGK  P +EADN R GRVFFLDVNPLCYQG
Subjt:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAK--SWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQG

Query:  SRPSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLV
        S+PSL NFGRWVSIFFEEVS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTR  SS+RFTKGN   SYQVIRDALRSCNVPV++++GHEADDV+ATLV
Subjt:  SRPSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLV

Query:  EQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENL
        EQVLQRG R V+ASPDKDFKQLISED+QLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENL
Subjt:  EQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENL

Query:  LSAAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR
        LSAAA+RTVGKPYAQDALTKYA+YLRTNYKVLALRR
Subjt:  LSAAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR

A0A1S3B2X7 5'-3' exonuclease | e value: 1.2e-157 | identity: 83.53%
Query:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR
        M EASA IG+N PPFLNS+S T LPSRT   ++     K  +WRTKPLKL+ F  SSR TS+ F Q +DGK  P +EADNPRKGRVFFLDVNPLCYQG++
Subjt:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ
        PSL NFGRWVSIFFEEVS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTR  SSQRFTKGN   SYQVIRDALRSCNVPV+K+ GHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRG R V+ASPDKDFKQLISEDVQLVMPLPELNRWSFYT++HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAA+RTVGKPYAQDALTKYA+YLRTNYKVLALRR
Subjt:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR

A0A6J1DGI1 uncharacterized protein LOC111020233 | e value: 3.5e-162 | identity: 86.53%
Query:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR
        MAEA ANIG+NIPPFLNS S +SLPSRTLK  ES  TTK+ SWRTK L+LS F  +S ST   FKQ + G L P +EADNPRKGRVFFLDVNPLCY+GSR
Subjt:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRW SIFFE+VS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+CNVPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAA+RTVG+PYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR

A0A6J1HDC1 uncharacterized protein LOC111461781 | e value: 3.7e-188 | identity: 100%
Query:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR

A0A6J1JHT6 uncharacterized protein LOC111487120 | e value: 2.9e-185 | identity: 98.2%
Query:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR
        MAEASANIGLNIPPFLNS SPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAA+SRSTSSGFKQAEDGKLPP MEADNPRKGRVFFLDVNPLCYQGSR
Subjt:  MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ
        PSLHNFGRWVSIFFEEVS SDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSC+VPVIKIKGHEADDVVATLVEQ
Subjt:  PSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQ

Query:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR
        AAA+RTVGKPYAQDALTKYADYLRTNYKVLALRR
Subjt:  AAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR

SwissProt top hits (e value, % identity, alignment)
O67550 5'-3' exonuclease | e value: 3.4e-21 | identity: 39.23%
Query:  VIRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDG
        VI++ L+   +P++++ G+EADDV+A L E+  Q+GF+  I SPDKD  QL+SE+V ++ P+ +      +T +  + ++  +P        ++GD+VD 
Subjt:  VIRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDG

Query:  VPGIQHVAPGFGRKTAMKLLKKHGSLENLL
        VPGI+    G G KTA+ +LKK+GS+EN+L
Subjt:  VPGIQHVAPGFGRKTAMKLLKKHGSLENLL

P52026 DNA polymerase I | e value: 2.1e-18 | identity: 25.38%
Query:  KGRVFFLDVNPLCYQG--SRPSLHN---------FGRWVSIFFEEVSRSDPV-IAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQV
        K ++  +D N + Y+   + P LHN         +G +  +  + ++   P  I V    G +  R      YK  R +   + S Q          + +
Subjt:  KGRVFFLDVNPLCYQG--SRPSLHN---------FGRWVSIFFEEVSRSDPV-IAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQV

Query:  IRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGV
        +R+ L++  +P  ++  +EADD++ T+  +  + GF   + S D+D  QL S  V + +    +     YT +  + +Y   P   + L+ +MGD+ D +
Subjt:  IRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGV

Query:  PGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR
        PG+    PG G KTA+KLLK+ G++EN+L  A++  +     ++ L +Y D    + ++ A+ R
Subjt:  PGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAVRTVGKPYAQDALTKYADYLRTNYKVLALRR

P52028 DNA polymerase I, thermostable | e value: 3.3e-16 | identity: 30.13%
Query:  KGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRF------------TKGNSGRSYQV
        KGRV  +D + L Y+           + ++     SR +PV AV  G   S  + L    YKA  + F  ++ S R             T  +  R   +
Subjt:  KGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRF------------TKGNSGRSYQV

Query:  IRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGV
        I++ +       +++ G+EADDV+ATL ++  + G+   I + D+D  QL+S+ V ++ P   L      T +    +Y   P   +  R ++GD  D +
Subjt:  IRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGV

Query:  PGIQHVAPGFGRKTAMKLLKKHGSLENLL
        PG++    G G KTA+KLLK+ GSLENLL
Subjt:  PGIQHVAPGFGRKTAMKLLKKHGSLENLL

Q9RLB6 DNA polymerase I | e value: 2.5e-16 | identity: 29.21%
Query:  IAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISE
        +AV+   GG   R  + P YKA+R         Q            ++RD   + N P+++  G+EADD++AT   +    G   VI S DKD  QL+SE
Subjt:  IAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISE

Query:  DVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSA
        ++++  PL    R  + T    + ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ ++
Subjt:  DVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSA

Q9S1G2 DNA polymerase I | e value: 1.5e-16 | identity: 32.4%
Query:  IAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISE
        +AVI        R+ L  +YKA+R     +   Q          + +IR+A R+ N+P I+ +G EADD++AT   Q    G    I S DKD  QL+S 
Subjt:  IAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPDKDFKQLISE

Query:  DVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAA
        +V +   + +        +   + ++   P   + L+ + GD VD VPGI    PG G KTA +LL+++G L+ LL  A
Subjt:  DVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAA

Arabidopsis top hits (e value, % identity, alignment)
AT1G16320.1 Uncharacterized conserved protein (DUF2358) | e value: 2.1e-90 | identity: 63.78%
Query:  NYKVLALRRASSSSSSS--------SSLPLTSLHSPRPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDP
        N K +  +  SSSSS S        S+L L S+ SP   L+  Q+ +  +  D  ++  RD+FY+NLG+AVRTLREDLPL+F RDLNYDIYRDDITF+DP
Subjt:  NYKVLALRRASSSSSSS--------SSLPLTSLHSPRPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDP

Query:  LNTFSGIERYKLIFWALRFHGRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVL
        +NTF+G++ YK+IFWALRFHG+ILFR+I +E++R+WQPSEN+ILIRWNLKGVPRVPWEA+GEFQGTSR+K+DRNGKIYEHKVDNLAFNFPQ LKPAASVL
Subjt:  LNTFSGIERYKLIFWALRFHGRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVL

Query:  DLVTACPA-SPNPTFLWGTEELHCSSWVELYQAVRRSVGGEGYLITQDGFLTCS
        DLVTA PA SPNPTF +   + + SSWV+ YQAVR ++  E   +T D  +TCS
Subjt:  DLVTACPA-SPNPTFLWGTEELHCSSWVELYQAVRRSVGGEGYLITQDGFLTCS

AT1G34380.1 5'-3' exonuclease family protein | e value: 1.5e-64 | identity: 54.07%
Query:  PTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQA------EDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFF
        P SL S + K+ +     K +  RTK +  S  + SS S+   F +       +   L    E    +  RVFFLDV+PLCY+G++PS   FG W+S+FF
Subjt:  PTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQA------EDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFF

Query:  EEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPD
         +VS +DPVIAVIDGE G++ RR LLPSYKAHR    +  +  R++K    R +Q + + LR CNVPV++I+GHEADDVVATL+EQ +QRG+RAVIASPD
Subjt:  EEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPD

Query:  KDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLR
        KDFKQLISE+VQ+V+PL +L RWSFYTLKHY AQYNCDP SDLS R
Subjt:  KDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLR

AT1G34380.2 5'-3' exonuclease family protein | e value: 1.6e-103 | identity: 61.25%
Query:  PTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQA------EDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFF
        P SL S + K+ +     K +  RTK +  S  + SS S+   F +       +   L    E    +  RVFFLDV+PLCY+G++PS   FG W+S+FF
Subjt:  PTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQA------EDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFF

Query:  EEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPD
         +VS +DPVIAVIDGE G++ RR LLPSYKAHR    +  +  R++K    R +Q + + LR CNVPV++I+GHEADDVVATL+EQ +QRG+RAVIASPD
Subjt:  EEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVIASPD

Query:  KDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAVRTVGKPYAQD
        KDFKQLISE+VQ+V+PL +L RWSFYTLKHY AQYNCDP SDLS RCIMGDEVDGVPGIQH+ P FGRKTAMKL++KHGSLE+LLSAAAVRTVG+PYAQ+
Subjt:  KDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAVRTVGKPYAQD

Query:  ALTKYADYLRTNYKVLALRR
        ALTKYADYLR NY+VLAL R
Subjt:  ALTKYADYLRTNYKVLALRR

AT1G79510.1 Uncharacterized conserved protein (DUF2358) | e value: 9.9e-93 | identity: 63.78%
Query:  KVLALRRASSSSSSSSS----------LPLTSLHSPRPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDP
        K ++ +  SSSSSSS+S          L L S+ +P P ++G Q+ +  +  D      +DDFY+NLGLAVRTLREDLPL+F +DLNYDIYRDDIT +DP
Subjt:  KVLALRRASSSSSSSSS----------LPLTSLHSPRPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDP

Query:  LNTFSGIERYKLIFWALRFHGRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVL
        +NTFSGI+ YKLIFWALRFHG+ILFR+I +E++R+WQPSEN+ILIRWNLKGVPRVPWEA+GEFQGTSR+K+DRNGKIYEHKVDNLAFNFP  LKPA SVL
Subjt:  LNTFSGIERYKLIFWALRFHGRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVL

Query:  DLVTACPASPNPTFLWGTEELHCSSWVELYQAVRRSVG-GEGYLITQDGFLTCS
        D+VTACPASPNPTF++G  + + SSW+E Y+AV+R++   E  ++ QD F+ CS
Subjt:  DLVTACPASPNPTFLWGTEELHCSSWVELYQAVRRSVG-GEGYLITQDGFLTCS

AT1G79510.2 Uncharacterized conserved protein (DUF2358) | e value: 9.9e-93 | identity: 63.78%
Query:  KVLALRRASSSSSSSSS----------LPLTSLHSPRPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDP
        K ++ +  SSSSSSS+S          L L S+ +P P ++G Q+ +  +  D      +DDFY+NLGLAVRTLREDLPL+F +DLNYDIYRDDIT +DP
Subjt:  KVLALRRASSSSSSSSS----------LPLTSLHSPRPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDP

Query:  LNTFSGIERYKLIFWALRFHGRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVL
        +NTFSGI+ YKLIFWALRFHG+ILFR+I +E++R+WQPSEN+ILIRWNLKGVPRVPWEA+GEFQGTSR+K+DRNGKIYEHKVDNLAFNFP  LKPA SVL
Subjt:  LNTFSGIERYKLIFWALRFHGRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVL

Query:  DLVTACPASPNPTFLWGTEELHCSSWVELYQAVRRSVG-GEGYLITQDGFLTCS
        D+VTACPASPNPTF++G  + + SSW+E Y+AV+R++   E  ++ QD F+ CS
Subjt:  DLVTACPASPNPTFLWGTEELHCSSWVELYQAVRRSVG-GEGYLITQDGFLTCS


Sequences
CDS sequence
ATGGCTGAGGCGAGCGCCAACATAGGCCTTAATATTCCGCCATTTTTGAACTCTACTTCGCCTACTTCCTTACCGTCGAGGACTCTGAAAGCCGCCGAGTCGGTA
AGAACAACCAAAGCCAAATCATGGAGAACCAAGCCATTGAAGCTGAGTGTCTTCGCGGCATCTTCTCGTTCTACTTCTTCGGGTTTTAAGCAAGCAGAAGACGGA
AAATTGCCACCAACAATGGAAGCTGATAATCCAAGAAAGGGAAGGGTCTTTTTCCTGGACGTAAATCCTCTCTGTTATCAAGGCAGCAGACCTAGTTTGCACAAT
TTTGGTCGCTGGGTTTCCATCTTCTTCGAGGAAGTTAGCCGTAGTGATCCTGTTATTGCCGTTATTGATGGGGAAGGAGGTAGTGAGCATCGCAGGCTGTTGCTA
CCCTCATATAAAGCACATCGGATAAAATTCACTAGACAATCATCTTCACAAAGATTTACAAAGGGAAATTCTGGAAGGTCATATCAAGTGATAAGAGATGCTCTC
AGAAGCTGTAATGTGCCCGTTATAAAGATCAAAGGTCACGAAGCAGATGACGTTGTCGCTACACTTGTGGAACAAGTTTTGCAGAGAGGGTTCCGGGCAGTAATA
GCCTCTCCTGATAAAGATTTCAAGCAGTTGATTTCAGAAGATGTCCAACTCGTGATGCCTTTGCCAGAGCTCAACAGATGGTCCTTTTACACCTTAAAGCACTAC
CTTGCTCAGTATAACTGTGATCCGTGCTCTGACTTGAGTCTTAGATGCATAATGGGTGACGAGGTAGATGGCGTTCCGGGAATCCAGCACGTTGCTCCTGGATTT
GGTCGAAAGACTGCAATGAAGCTCTTAAAGAAACATGGTTCTTTGGAGAATCTACTCAGTGCTGCTGCAGTAAGAACTGTGGGCAAACCATATGCACAAGATGCA
CTTACAAAGTATGCCGATTACCTCCGAACGAACTACAAAGTTCTAGCCTTAAGAAGGGCTTCTTCTTCTTCTTCTTCTTCTTCCTCTCTCCCCCTCACCAGCCTC
CACTCCCCGCGGCCTCTTCTCCAGGGCCCTCAACTCGCCTCTCCTTCTGCTTTCCCCGATTCCCGCTCCGACGACCGAAGAGACGACTTCTATGTCAATCTCGGC
CTCGCTGTCAGGACGCTTCGCGAGGACCTTCCTCTCATCTTCGCCAGAGACCTCAATTACGACATTTACAGGGACGACATAACGTTTATTGATCCTCTCAACACG
TTCAGTGGGATTGAGAGGTACAAATTGATATTCTGGGCGTTGAGATTTCATGGTAGAATTCTGTTCAGGGAGATCGGGATTGAGGTGTACAGGATTTGGCAGCCT
TCTGAAAACGTTATCTTGATTCGGTGGAATTTGAAGGGCGTTCCTAGGGTTCCATGGGAGGCCAGAGGTGAGTTTCAGGGCACTTCGCGGTTTAAGGTGGATCGA
AATGGGAAAATTTACGAACACAAAGTGGACAATTTAGCATTCAATTTCCCTCAGTCACTGAAACCGGCTGCGTCGGTGTTGGATTTGGTTACTGCGTGCCCTGCT
AGCCCCAATCCCACGTTCTTGTGGGGAACGGAGGAATTGCATTGCTCTTCCTGGGTGGAGCTTTATCAGGCAGTAAGAAGAAGTGTGGGTGGAGAAGGCTATTTG
ATTACACAAGATGGATTTCTCACATGTTCATAG
mRNA sequence
TTCAGATTAAATGAAGTTATCGACTTTCGTTCATATTTCCAATTTTCCGCATCGTTTTTTCGTTTCCTGATAGGACCCGTTCTAGAAATGGCTGAGGCGAGCGCC
AACATAGGCCTTAATATTCCGCCATTTTTGAACTCTACTTCGCCTACTTCCTTACCGTCGAGGACTCTGAAAGCCGCCGAGTCGGTAAGAACAACCAAAGCCAAA
TCATGGAGAACCAAGCCATTGAAGCTGAGTGTCTTCGCGGCATCTTCTCGTTCTACTTCTTCGGGTTTTAAGCAAGCAGAAGACGGAAAATTGCCACCAACAATG
GAAGCTGATAATCCAAGAAAGGGAAGGGTCTTTTTCCTGGACGTAAATCCTCTCTGTTATCAAGGCAGCAGACCTAGTTTGCACAATTTTGGTCGCTGGGTTTCC
ATCTTCTTCGAGGAAGTTAGCCGTAGTGATCCTGTTATTGCCGTTATTGATGGGGAAGGAGGTAGTGAGCATCGCAGGCTGTTGCTACCCTCATATAAAGCACAT
CGGATAAAATTCACTAGACAATCATCTTCACAAAGATTTACAAAGGGAAATTCTGGAAGGTCATATCAAGTGATAAGAGATGCTCTCAGAAGCTGTAATGTGCCC
GTTATAAAGATCAAAGGTCACGAAGCAGATGACGTTGTCGCTACACTTGTGGAACAAGTTTTGCAGAGAGGGTTCCGGGCAGTAATAGCCTCTCCTGATAAAGAT
TTCAAGCAGTTGATTTCAGAAGATGTCCAACTCGTGATGCCTTTGCCAGAGCTCAACAGATGGTCCTTTTACACCTTAAAGCACTACCTTGCTCAGTATAACTGT
GATCCGTGCTCTGACTTGAGTCTTAGATGCATAATGGGTGACGAGGTAGATGGCGTTCCGGGAATCCAGCACGTTGCTCCTGGATTTGGTCGAAAGACTGCAATG
AAGCTCTTAAAGAAACATGGTTCTTTGGAGAATCTACTCAGTGCTGCTGCAGTAAGAACTGTGGGCAAACCATATGCACAAGATGCACTTACAAAGTATGCCGAT
TACCTCCGAACGAACTACAAAGTTCTAGCCTTAAGAAGGGCTTCTTCTTCTTCTTCTTCTTCTTCCTCTCTCCCCCTCACCAGCCTCCACTCCCCGCGGCCTCTT
CTCCAGGGCCCTCAACTCGCCTCTCCTTCTGCTTTCCCCGATTCCCGCTCCGACGACCGAAGAGACGACTTCTATGTCAATCTCGGCCTCGCTGTCAGGACGCTT
CGCGAGGACCTTCCTCTCATCTTCGCCAGAGACCTCAATTACGACATTTACAGGGACGACATAACGTTTATTGATCCTCTCAACACGTTCAGTGGGATTGAGAGG
TACAAATTGATATTCTGGGCGTTGAGATTTCATGGTAGAATTCTGTTCAGGGAGATCGGGATTGAGGTGTACAGGATTTGGCAGCCTTCTGAAAACGTTATCTTG
ATTCGGTGGAATTTGAAGGGCGTTCCTAGGGTTCCATGGGAGGCCAGAGGTGAGTTTCAGGGCACTTCGCGGTTTAAGGTGGATCGAAATGGGAAAATTTACGAA
CACAAAGTGGACAATTTAGCATTCAATTTCCCTCAGTCACTGAAACCGGCTGCGTCGGTGTTGGATTTGGTTACTGCGTGCCCTGCTAGCCCCAATCCCACGTTC
TTGTGGGGAACGGAGGAATTGCATTGCTCTTCCTGGGTGGAGCTTTATCAGGCAGTAAGAAGAAGTGTGGGTGGAGAAGGCTATTTGATTACACAAGATGGATTT
CTCACATGTTCATAG
Protein sequence
MAEASANIGLNIPPFLNSTSPTSLPSRTLKAAESVRTTKAKSWRTKPLKLSVFAASSRSTSSGFKQAEDGKLPPTMEADNPRKGRVFFLDVNPLCYQGSRPSLHN
FGRWVSIFFEEVSRSDPVIAVIDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRFTKGNSGRSYQVIRDALRSCNVPVIKIKGHEADDVVATLVEQVLQRGFRAVI
ASPDKDFKQLISEDVQLVMPLPELNRWSFYTLKHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAVRTVGKPYAQDA
LTKYADYLRTNYKVLALRRASSSSSSSSSLPLTSLHSPRPLLQGPQLASPSAFPDSRSDDRRDDFYVNLGLAVRTLREDLPLIFARDLNYDIYRDDITFIDPLNT
FSGIERYKLIFWALRFHGRILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEARGEFQGTSRFKVDRNGKIYEHKVDNLAFNFPQSLKPAASVLDLVTACPA
SPNPTFLWGTEELHCSSWVELYQAVRRSVGGEGYLITQDGFLTCS