; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G21320 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G21320
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRibonuclease H
Genome locationChr1:16872579..16876683
RNA-Seq ExpressionCSPI01G21320
SyntenyCSPI01G21320
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]0.0e+0067.56Show/hide
Query:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN
        MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE  E GEGETSCH+ITIIE S+  THE+DA++ PQSLEDGGQSTVDELKEVN+GTIEEP  TFISASLSN
Subjt:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN

Query:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE
        EE DKYMSLLTEYRDIFAWSYKEM  LDPKVAVHHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE GFIREV YPTWI NIVPVRKKNGQLRVCV+
Subjt:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE

Query:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------
        F DLNNACPK DFPLPI EIMVDATTG+ ALSFMDGSSGYN   + LSDEEM AFRTPKGIYCYKVMPFGLKNVGATYQ  + KVFDDML +        
Subjt:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------

Query:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------
                                                 KFLGFIVRHRGIEIDQSK+  IQKM +PKSLHDLRSLQGRLAYI+RFISN+A       
Subjt:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------

Query:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------
                                      VL APVPDKPLILYIAAQ+ SLGALL QEE KGKER+LYYLSRTLI AEVNYSPIE              
Subjt:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------

Query:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG
                  + K+ PIKYVL RPIIS RLAKW VLLQQYDIV I QK  +   +  ++         K C  +           EPWTM+F GA  RSG
Subjt:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG

Query:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE
        AGA IVLI+PEKHMLPYSFA  ELCSNNV EYQALIIGLQ+ALEI VSFIE+YGDSKLIIN LSLQYDVKHE+LK YF+YAR+LMEKF ++MLEHVPRVE
Subjt:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE

Query:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF
        NKRAD + NLATAL M DD                     EVN  T +LIDEEDWRQPIIEYLEH KLPKDSRHK E+RRRA  FIYYKGT YRRSL+G 
Subjt:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF

Query:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS
        FLRCLGKE+ VKALKEVHA VCGAHQS PKLQ +LRRM YYW KMIQDSIDYVKKCE CQYHANFIHQP EPLHP VASW FEAW LDLV  IT KS
Subjt:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS

XP_031737039.1 uncharacterized protein LOC116402129 [Cucumis sativus]0.0e+0067.45Show/hide
Query:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN
        MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE  E GEGETSCH+ITIIE S+  THE+DA++ PQSLEDGGQSTVDELKEVN+GTIEEP  TFISASLSN
Subjt:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN

Query:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE
        EE DKYMSLLTEYRDIFAWSYKEM  LDPKVAVHHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE GFIREV YPTWI NIVPVRKKNGQLRVCV+
Subjt:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE

Query:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------
        F DLNNACPK DFPLPI EIMVDATTG+ ALSFMDGSSGYN   + LSDEEM AFRTPKGIYCYKVMPFGLKNVGATYQ  + KVFDDML +        
Subjt:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------

Query:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------
                                                 KFLGFIVRHRGIEIDQSK+  IQKM +PKSLHDLRSLQGRLAYI+RFISN+A       
Subjt:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------

Query:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------
                                      VL APVPDKPLILYIAAQ+ SLGALL QEE KGKER+LYYLSRTLI AEVNYSPIE              
Subjt:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------

Query:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG
                  + K+ PIKYVL RPII+ RLAKW VLLQQYDIV I QK  +   +  ++         K C  +           EPWTM+F GA  RSG
Subjt:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG

Query:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE
        AGA IVLI+PEKHMLPYSFA  ELCSNNV EYQALIIGLQ+ALEI VSFIE+YGDSKLIIN LSLQYDVKHE+LK YF+YAR+LMEKF ++MLEHVPRVE
Subjt:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE

Query:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF
        NKRAD + NLATAL M DD                     EVN  T +LIDEEDWRQPIIEYLEH KLPKDSRHK E+RRRA  FIYYKGT YRRSL+G 
Subjt:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF

Query:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS
        FLRCLGKE+ VKALKEVHA VCGAHQS PKLQ +LRRM YYW KMIQDSIDYVKKCE CQYHANFIHQP EPLHP VASW FEAW LDLV  IT KS
Subjt:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS

XP_031737045.1 uncharacterized protein LOC116402134 [Cucumis sativus]0.0e+0067.22Show/hide
Query:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN
        MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE  E GE ETSCH+ITIIE S+  THE+DA++ PQSLEDGGQSTVDELKEVN+GTIEEP  TFISASLSN
Subjt:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN

Query:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE
        EE DKYMSLLTEYRDIFAWSYKEM  LDPKVAVHHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE GFIREV YPTWI NIVPVRKKNGQLRVCV+
Subjt:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE

Query:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------
        F DLNNACPK DFPLPI EIMVDATTG+ ALSFMDGSSGYN   + LSDEEM AFRTPKGIYCYKVMPFGLKN GATYQ  + KVFDDML +        
Subjt:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------

Query:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------
                                                 KFLGFIVRHRGIEIDQSK+  IQKM +PKSLHDLRSLQGRLAYI+RFISN+A       
Subjt:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------

Query:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------
                                      VL APVPDKPLILYIAAQ+ SLGALL QEE KGKER+LYYLSRTLI AEVNYSPIE              
Subjt:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------

Query:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG
                  + K+ PIKYVL RPII+ RLAKW VLLQQYDIV I QK  +   +  ++         K C  +           EPWTM+F GA  RSG
Subjt:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG

Query:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE
        AGA IVLI+PEKHMLPYSFA  ELCSNNV EYQALIIGLQ+ALEI VSFIE+YGDSKLIIN LSLQYDVKHE+LK YF+YAR+LMEKF ++MLEHVPRVE
Subjt:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE

Query:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF
        NKRAD + NLATAL M DD                     EVN  T +LIDEEDWRQPIIEYLEH KLPKDSRHK E+RRRA  FIYYKGT YRRSL+G 
Subjt:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF

Query:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS
        FLRCLGKE+ VKALKEVHA VCGAHQS PKLQ +LRRM YYW KMIQDSIDYVKKCE CQYHANFIHQP EPLHP VASW FEAW LDLV  IT KS
Subjt:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]0.0e+0067.22Show/hide
Query:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN
        MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE  E GE ETSCH+ITIIE S+  THE+DA++ PQSLEDGGQSTVDELKEVN+GTIEEP  TFISASLSN
Subjt:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN

Query:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE
        EE DKYMSLLTEYRDIFAWSYKEM  LDPKVAVHHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE GFIREV YPTWI NIVPVRKKNGQLRVCV+
Subjt:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE

Query:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------
        F DLNNACPK DFPLPI EIMVDATTG+ ALSFMDGSSGYN   + LSDEEM AFRTPKGIYCYKVMPFGLKN GATYQ  + KVFDDML +        
Subjt:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------

Query:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------
                                                 KFLGFIVRHRGIEIDQSK+  IQKM +PKSLHDLRSLQGRLAYI+RFISN+A       
Subjt:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------

Query:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------
                                      VL APVPDKPLILYIAAQ+ SLGALL QEE KGKER+LYYLSRTLI AEVNYSPIE              
Subjt:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------

Query:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG
                  + K+ PIKYVL RPII+ RLAKW VLLQQYDIV I QK  +   +  ++         K C  +           EPWTM+F GA  RSG
Subjt:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG

Query:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE
        AGA IVLI+PEKHMLPYSFA  ELCSNNV EYQALIIGLQ+ALEI VSFIE+YGDSKLIIN LSLQYDVKHE+LK YF+YAR+LMEKF ++MLEHVPRVE
Subjt:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE

Query:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF
        NKRAD + NLATAL M DD                     EVN  T +LIDEEDWRQPIIEYLEH KLPKDSRHK E+RRRA  FIYYKGT YRRSL+G 
Subjt:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF

Query:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS
        FLRCLGKE+ VKALKEVHA VCGAHQS PKLQ +LRRM YYW KMIQDSIDYVKKCE CQYHANFIHQP EPLHP VASW FEAW LDLV  IT KS
Subjt:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]0.0e+0067.22Show/hide
Query:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN
        MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE  E GE ETSCH+ITIIE S+  THE+DA++ PQSLEDGGQSTVDELKEVN+GTIEEP  TFISASLSN
Subjt:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN

Query:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE
        EE DKYMSLLTEYRDIFAWSYKEM  LDPKVAVHHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE GFIREV YPTWI NIVPVRKKNGQLRVCV+
Subjt:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE

Query:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------
        F DLNNACPK DFPLPI EIMVDATTG+ ALSFMDGSSGYN   + LSDEEM AFRTPKGIYCYKVMPFGLKN GATYQ  + KVFDDML +        
Subjt:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------

Query:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------
                                                 KFLGFIVRHRGIEIDQSK+  IQKM +PKSLHDLRSLQGRLAYI+RFISN+A       
Subjt:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------

Query:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------
                                      VL APVPDKPLILYIAAQ+ SLGALL QEE KGKER+LYYLSRTLI AEVNYSPIE              
Subjt:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------

Query:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG
                  + K+ PIKYVL RPII+ RLAKW VLLQQYDIV I QK  +   +  ++         K C  +           EPWTM+F GA  RSG
Subjt:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG

Query:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE
        AGA IVLI+PEKHMLPYSFA  ELCSNNV EYQALIIGLQ+ALEI VSFIE+YGDSKLIIN LSLQYDVKHE+LK YF+YAR+LMEKF ++MLEHVPRVE
Subjt:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE

Query:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF
        NKRAD + NLATAL M DD                     EVN  T +LIDEEDWRQPIIEYLEH KLPKDSRHK E+RRRA  FIYYKGT YRRSL+G 
Subjt:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF

Query:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS
        FLRCLGKE+ VKALKEVHA VCGAHQS PKLQ +LRRM YYW KMIQDSIDYVKKCE CQYHANFIHQP EPLHP VASW FEAW LDLV  IT KS
Subjt:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS

TrEMBL top hitse value%identityAlignment
A0A5A7SPV8 Ribonuclease H1.8e-27960.42Show/hide
Query:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN
        MKRKTFVTLNTSQ                      GEGE SCH+ITI+E  + ET E+DA+D PQSLEDGGQS VDELKE+N+                 
Subjt:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN

Query:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE
                   EY+DIFAWSYKEM  LDPKVAVHHLAIKPGYR IKQAQR F+  LIP+I+VEVNKLIE GFIREV YPTWI NIVPVRKKNGQLRVCV+
Subjt:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE

Query:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------
        F DLNNACPK DFPLPI EIMVDATTG+  LSFMDGSSGYN   + LSDEEM AFRTPKGIYCYKV+PFGLKN GATYQ  + KVFDDML K        
Subjt:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------

Query:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------
                                                 KFLGFIVRHRGIEIDQSK+  IQKMP+PKSLHDLRSLQGRLAYI+RFISN+A       
Subjt:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------

Query:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------
                                      VL AP+P +PLILYIAAQ+ SLGALL QE+EKGKE ALYYLSRTL+ AEVNYSPIE              
Subjt:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------

Query:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTCQTMR----FSSR--EPWTMFFYGATLRSG
                  + K  PIKYVL RPIIS  LAKW V+LQQYDIV I QKT +   + +++         K C+ +     F ++  EPWTM+F GA  RSG
Subjt:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTCQTMR----FSSR--EPWTMFFYGATLRSG

Query:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE
        AGA IVLI+ E+HMLPYSF   ELC NNV EYQALIIGLQMALEI VSFI+IYGDSKLIIN LSLQYDVKHE+LK YF+YAR+LME+F S+ML+HVPR E
Subjt:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE

Query:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF
        NKRAD + NLATALMM D+                     E N T  HLI+EEDW QPIIEYLEH KLPKDSRHKTEVRRRA  FIYYKGT YRR L+G 
Subjt:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF

Query:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS
        FLRCLG++E +KAL+E HA VCGAHQS  KLQ +LRRM YYW KM+QDS+DY KKCEACQYHANFIHQP EPLHP VASW FEAW LDLV  IT KS
Subjt:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS

A0A5A7SPY4 RNase H domain-containing protein4.6e-29164.5Show/hide
Query:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN
        MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE+ + GEGE S H+ITI+E  + ET E+D +DVPQSLEDGGQS VDELKEVN+GTIEEP  TFISASLS+
Subjt:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN

Query:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE
        EEE KYMSLLTEY+DIFAWSYKEM  LDPKVA+HHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE  FIR+V YPTWI NI+PV+KKNGQLRVCV 
Subjt:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE

Query:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------
        FCDLNNACPK +FPL I EI+VDATTG+  LSFMDGSSGYN   + LSDEEM AFRTPKGIYCYKVMPFGLKN+ ATYQ  + KVF DML K        
Subjt:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------

Query:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPVP
                                                 KF+ FIVRHRGIEIDQSK+  IQKMP+PKSLHDLRSLQGRLAYI+R             
Subjt:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPVP

Query:  DKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-----------------------IGKSIPIKYVLFRPIISRRLAKWVVLL
                     SLGALL QE+EKGKE ALYYLSRTL+ AEVNYSPIE                        + K+ PIKYVL RPIISRRLAKW V++
Subjt:  DKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-----------------------IGKSIPIKYVLFRPIISRRLAKWVVLL

Query:  QQYDIVNILQKT------TQFLR---IGSYVKTCQTMRFSS------REPWTMFFYGATLRSGAGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALII
        QQYDIV I QK         FL    I S +K C+ +           EPWTM+F GA   SGAG  IVLI+PEKHMLPYS A  ELCSNNV EYQALII
Subjt:  QQYDIVNILQKT------TQFLR---IGSYVKTCQTMRFSS------REPWTMFFYGATLRSGAGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALII

Query:  GLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDD------------------
        GLQM LEI VSFIEIYGDSKLIIN LSLQ DVKHE+LK YF+YAR+LME+F S+MLEHVPR ENKRAD + NLAT LMM ++                  
Subjt:  GLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDD------------------

Query:  ---EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGFFLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRR
           + N TT HLIDEEDW QPIIEYLEHRKL KDS HKTEVRRR   FIYYKGT YRRSL+G FLRCLGKEE +KAL+E HA VCGAHQS PKLQ +LRR
Subjt:  ---EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGFFLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRR

Query:  MSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAW
        M YYW KM+QDS+DY KKCEACQYHANFIHQP EPLHP +ASW F+AW
Subjt:  MSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAW

A0A5A7TZU9 Ribonuclease H3.1e-27156.52Show/hide
Query:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN
        MKRK FV++NT +GSL VKRH+V+ T P+  EPE       C+++TI E S  +  E+DA+  P SLEDGGQST+DELKEVN+GT EEP  TFIS  LS+
Subjt:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN

Query:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE
         +E++Y++LL  Y+D+FAWSYKEM  LDPKVAVH LAIKP +R +KQAQR F+  LI +IE EVNKLIE GFIREV YPTWI NIVPVRKKNGQLRVCV+
Subjt:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE

Query:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------
        F DLNNACPK DFPLPI+EIM+DAT G+ ALSFMDGSSGYN   + L DEE  AFRTPKGIYCYKVMPFGLKN GATYQ  + ++FDDML K        
Subjt:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------

Query:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------
                                                 KFLGFIVRHRGIE+D SK+  IQKMP PK+LH+LR LQGRLAYI+RFISN+A       
Subjt:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------

Query:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------
                                      VL AP   KPLILYIAAQ+ SLGALL QE +KGKE ALYYLSRTL  AE+NYSPIE              
Subjt:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------

Query:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTC------QTMRFSSREPWTMFFYGATLRSG
                  + K+ P+KY+L RP+IS RLAKW ++LQQYDIV I QK  +   +  ++         K C      + +   S EPW MFF GA  RSG
Subjt:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTC------QTMRFSSREPWTMFFYGATLRSG

Query:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE
        AG  IV I+PEKHMLPYSF   ELCSNNV EYQA IIGLQMA E  +  IEI+GDSKLIIN LS QY+VKH++LK YFSYAR+LM++F S++LEH+PR E
Subjt:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE

Query:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF
        NK+AD + NLATAL + +D                     E +  + + IDEEDWRQPII+YLEH KLP D RH+ E+RRRA RFIYYK T YRRS +G 
Subjt:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF

Query:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS
         LRCLGKEE  KAL+E H+ +CGAHQS PKLQ +L+RM YYW  MI DS+ + K CEACQ+HANFIHQP EPLHP +ASW FEAW LDLV  IT KS
Subjt:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS

A0A5D3D1E5 Ribonuclease H3.1e-27156.52Show/hide
Query:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN
        MKRK FV++NT +GSL VKRH+V+ T P+  EPE       C+++TI E S  +  E+DA+  P SLEDGGQST+DELKEVN+GT EEP  TFIS  LS+
Subjt:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN

Query:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE
         +E++Y++LL  Y+D+FAWSYKEM  LDPKVAVH LAIKP +R +KQAQR F+  LI +IE EVNKLIE GFIREV YPTWI NIVPVRKKNGQLRVCV+
Subjt:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE

Query:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------
        F DLNNACPK DFPLPI+EIM+DAT G+ ALSFMDGSSGYN   + L DEE  AFRTPKGIYCYKVMPFGLKN GATYQ  + ++FDDML K        
Subjt:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------

Query:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------
                                                 KFLGFIVRHRGIE+D SK+  IQKMP PK+LH+LR LQGRLAYI+RFISN+A       
Subjt:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA-------

Query:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------
                                      VL AP   KPLILYIAAQ+ SLGALL QE +KGKE ALYYLSRTL  AE+NYSPIE              
Subjt:  ------------------------------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-------------

Query:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTC------QTMRFSSREPWTMFFYGATLRSG
                  + K+ P+KY+L RP+IS RLAKW ++LQQYDIV I QK  +   +  ++         K C      + +   S EPW MFF GA  RSG
Subjt:  ----------IGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYV---------KTC------QTMRFSSREPWTMFFYGATLRSG

Query:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE
        AG  IV I+PEKHMLPYSF   ELCSNNV EYQA IIGLQMA E  +  IEI+GDSKLIIN LS QY+VKH++LK YFSYAR+LM++F S++LEH+PR E
Subjt:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE

Query:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF
        NK+AD + NLATAL + +D                     E +  + + IDEEDWRQPII+YLEH KLP D RH+ E+RRRA RFIYYK T YRRS +G 
Subjt:  NKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGF

Query:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS
         LRCLGKEE  KAL+E H+ +CGAHQS PKLQ +L+RM YYW  MI DS+ + K CEACQ+HANFIHQP EPLHP +ASW FEAW LDLV  IT KS
Subjt:  FLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKS

A0A5D3DB95 RNase H domain-containing protein4.6e-29164.5Show/hide
Query:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN
        MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE+ + GEGE S H+ITI+E  + ET E+D +DVPQSLEDGGQS VDELKEVN+GTIEEP  TFISASLS+
Subjt:  MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSN

Query:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE
        EEE KYMSLLTEY+DIFAWSYKEM  LDPKVA+HHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE  FIR+V YPTWI NI+PV+KKNGQLRVCV 
Subjt:  EEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVE

Query:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------
        FCDLNNACPK +FPL I EI+VDATTG+  LSFMDGSSGYN   + LSDEEM AFRTPKGIYCYKVMPFGLKN+ ATYQ  + KVF DML K        
Subjt:  FCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK--------

Query:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPVP
                                                 KF+ FIVRHRGIEIDQSK+  IQKMP+PKSLHDLRSLQGRLAYI+R             
Subjt:  -----------------------------------------KFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPVP

Query:  DKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-----------------------IGKSIPIKYVLFRPIISRRLAKWVVLL
                     SLGALL QE+EKGKE ALYYLSRTL+ AEVNYSPIE                        + K+ PIKYVL RPIISRRLAKW V++
Subjt:  DKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIEN-----------------------IGKSIPIKYVLFRPIISRRLAKWVVLL

Query:  QQYDIVNILQKT------TQFLR---IGSYVKTCQTMRFSS------REPWTMFFYGATLRSGAGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALII
        QQYDIV I QK         FL    I S +K C+ +           EPWTM+F GA   SGAG  IVLI+PEKHMLPYS A  ELCSNNV EYQALII
Subjt:  QQYDIVNILQKT------TQFLR---IGSYVKTCQTMRFSS------REPWTMFFYGATLRSGAGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALII

Query:  GLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDD------------------
        GLQM LEI VSFIEIYGDSKLIIN LSLQ DVKHE+LK YF+YAR+LME+F S+MLEHVPR ENKRAD + NLAT LMM ++                  
Subjt:  GLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDD------------------

Query:  ---EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGFFLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRR
           + N TT HLIDEEDW QPIIEYLEHRKL KDS HKTEVRRR   FIYYKGT YRRSL+G FLRCLGKEE +KAL+E HA VCGAHQS PKLQ +LRR
Subjt:  ---EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGFFLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRR

Query:  MSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAW
        M YYW KM+QDS+DY KKCEACQYHANFIHQP EPLHP +ASW F+AW
Subjt:  MSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAW

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein8.8e-1325Show/hide
Query:  EVNKLIEEGFIREVNYPTWIENIVPVR---KKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYNL---TLSDEEMIAFRTPK
        E+N+ ++ G IRE    +   N  PV    KK G LR+ V++  LN       +PLP+IE ++    G    + +D  S Y+L      DE  +AFR P+
Subjt:  EVNKLIEEGFIREVNYPTWIENIVPVR---KKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYNL---TLSDEEMIAFRTPK

Query:  GIYCYKVMPFGLKNVGATYQCVVHKV------------FDDML-------------------------------------QKKFLGFIVRHRGIEIDQSK
        G++ Y VMP+G+    A +Q  ++ +             DD+L                                     Q KF+G+ +  +G    Q  
Subjt:  GIYCYKVMPFGLKNVGATYQCVVHKV------------FDDML-------------------------------------QKKFLGFIVRHRGIEIDQSK

Query:  L--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPV
        +  + +  +PK+  +LR   G + Y+++FI   + L  P+
Subjt:  L--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPV

P0CT35 Transposon Tf2-2 polyprotein8.8e-1325Show/hide
Query:  EVNKLIEEGFIREVNYPTWIENIVPVR---KKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYNL---TLSDEEMIAFRTPK
        E+N+ ++ G IRE    +   N  PV    KK G LR+ V++  LN       +PLP+IE ++    G    + +D  S Y+L      DE  +AFR P+
Subjt:  EVNKLIEEGFIREVNYPTWIENIVPVR---KKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYNL---TLSDEEMIAFRTPK

Query:  GIYCYKVMPFGLKNVGATYQCVVHKV------------FDDML-------------------------------------QKKFLGFIVRHRGIEIDQSK
        G++ Y VMP+G+    A +Q  ++ +             DD+L                                     Q KF+G+ +  +G    Q  
Subjt:  GIYCYKVMPFGLKNVGATYQCVVHKV------------FDDML-------------------------------------QKKFLGFIVRHRGIEIDQSK

Query:  L--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPV
        +  + +  +PK+  +LR   G + Y+++FI   + L  P+
Subjt:  L--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPV

P0CT41 Transposon Tf2-12 polyprotein8.8e-1325Show/hide
Query:  EVNKLIEEGFIREVNYPTWIENIVPVR---KKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYNL---TLSDEEMIAFRTPK
        E+N+ ++ G IRE    +   N  PV    KK G LR+ V++  LN       +PLP+IE ++    G    + +D  S Y+L      DE  +AFR P+
Subjt:  EVNKLIEEGFIREVNYPTWIENIVPVR---KKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYNL---TLSDEEMIAFRTPK

Query:  GIYCYKVMPFGLKNVGATYQCVVHKV------------FDDML-------------------------------------QKKFLGFIVRHRGIEIDQSK
        G++ Y VMP+G+    A +Q  ++ +             DD+L                                     Q KF+G+ +  +G    Q  
Subjt:  GIYCYKVMPFGLKNVGATYQCVVHKV------------FDDML-------------------------------------QKKFLGFIVRHRGIEIDQSK

Query:  L--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPV
        +  + +  +PK+  +LR   G + Y+++FI   + L  P+
Subjt:  L--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPV

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.6e-1426.89Show/hide
Query:  HHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSF
        H + IKPG R  +    H        I   V KL++  FI     P     +V V KK+G  R+CV++  LN A     FPLP I+ ++      +  + 
Subjt:  HHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSF

Query:  MDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVF----------DDML---------------------------------
        +D  SGY+   +   D    AF TP G Y Y VMPFGL N  +T+   +   F          DD+L                                 
Subjt:  MDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVF----------DDML---------------------------------

Query:  ----QKKFLGFIVRHRGIEIDQSK--LIQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPV
            + +FLG+ +  + I   Q K   I+  P PK++   +   G + Y +RFI N + +  P+
Subjt:  ----QKKFLGFIVRHRGIEIDQSK--LIQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPV

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.6e-1426.89Show/hide
Query:  HHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSF
        H + IKPG R  +    H        I   V KL++  FI     P     +V V KK+G  R+CV++  LN A     FPLP I+ ++      +  + 
Subjt:  HHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSF

Query:  MDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVF----------DDML---------------------------------
        +D  SGY+   +   D    AF TP G Y Y VMPFGL N  +T+   +   F          DD+L                                 
Subjt:  MDGSSGYN---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVF----------DDML---------------------------------

Query:  ----QKKFLGFIVRHRGIEIDQSK--LIQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPV
            + +FLG+ +  + I   Q K   I+  P PK++   +   G + Y +RFI N + +  P+
Subjt:  ----QKKFLGFIVRHRGIEIDQSK--LIQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPV

Arabidopsis top hitse value%identityAlignment
AT1G24090.1 RNase H family protein6.1e-0940.66Show/hide
Query:  LCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATAL
        + +NN  EY ALI+GL+ A+E     I++ GDSKL+   +  Q+ V HE L      A+ L  K VS  + HV R  N  AD   NLA  L
Subjt:  LCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATAL

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.6e-0932.46Show/hide
Query:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE
        AGA  VL A +  +L Y    V   +NNV EY+AL++GL+ AL+     + + GDS L+   +   +   H  +      A++LM  F +  ++H+ R +
Subjt:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE

Query:  NKRADTMTNLATAL
        N  AD   N A  L
Subjt:  NKRADTMTNLATAL

AT3G01410.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.6e-0932.46Show/hide
Query:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE
        AGA  VL A +  +L Y    V   +NNV EY+AL++GL+ AL+     + + GDS L+   +   +   H  +      A++LM  F +  ++H+ R +
Subjt:  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVE

Query:  NKRADTMTNLATAL
        N  AD   N A  L
Subjt:  NKRADTMTNLATAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAGAAAAACCTTTGTTACTCTCAATACAAGTCAAGGTTCGTTGAATGTAAAAAGACATAATGTTATAGTAACGAATCCTAAAAAAGAAGAGCCAGAACACGGGGA
AGGTGAAACTTCATGTCATTATATCACCATTATTGAGGGATCAAAGGCTGAAACTCATGAAGATGACGCAAAAGATGTCCCACAGAGTTTAGAGGATGGTGGCCAATCTA
CTGTAGATGAGCTAAAAGAGGTGAACGTTGGTACAATAGAGGAACCATGCTCAACTTTCATTAGTGCATCCCTCTCTAACGAGGAGGAGGATAAATACATGAGTTTGCTC
ACCGAATACAGAGACATCTTTGCTTGGTCGTACAAGGAGATGTCAGAACTTGATCCAAAAGTAGCAGTCCATCATCTTGCAATTAAACCAGGGTATCGATCGATTAAACA
AGCACAACGACATTTTCAAGTGTATCTTATTCCTCGTATCGAGGTGGAAGTCAACAAGTTGATTGAAGAAGGGTTCATTCGCGAGGTCAATTATCCAACGTGGATAGAAA
ACATTGTCCCTGTTAGAAAAAAGAACGGGCAACTTCGCGTTTGTGTAGAATTTTGCGACCTGAATAATGCATGCCCTAAAGGTGATTTTCCCTTGCCTATCATAGAAATC
ATGGTTGATGCAACTACTGGATACAGAGCATTGTCCTTTATGGATGGGTCGTCTGGATATAATCTAACCCTCTCAGATGAAGAAATGATAGCTTTTAGGACTCCAAAGGG
AATATACTGTTACAAGGTGATGCCCTTTGGATTGAAAAATGTCGGTGCTACTTACCAATGTGTCGTGCATAAAGTGTTTGATGATATGTTGCAAAAAAAGTTTCTTGGCT
TCATTGTAAGGCACCGAGGGATCGAAATAGACCAATCCAAGTTGATTCAGAAGATGCCAAAACCTAAGAGTTTGCATGACCTAAGAAGTCTCCAAGGACGATTGGCTTAC
ATCCAAAGGTTCATCTCTAACATGGCCGTGTTGAGAGCTCCAGTACCTGACAAACCATTAATATTGTACATTGCTGCACAAGATGGGTCCTTAGGAGCATTACTAGTACA
AGAGGAGGAAAAGGGAAAGGAGCGTGCTCTCTACTATTTAAGCAGAACATTAATTAGGGCTGAAGTTAACTATTCTCCCATCGAGAACATTGGCAAAAGTATACCTATAA
AATATGTTCTGTTCAGGCCAATTATCTCTAGACGCTTAGCCAAATGGGTGGTTCTACTCCAACAATATGACATTGTCAATATTCTCCAAAAGACCACCCAATTCCTTCGG
ATTGGAAGTTATGTGAAGACCTGTCAGACGATGAGGTTTTCTTCACGCGAACCTTGGACTATGTTTTTTTACGGTGCAACATTAAGAAGTGGTGCAGGGGCTGACATCGT
TCTCATTGCTCCTGAGAAGCATATGTTGCCTTATAGCTTTGCATTTGTTGAACTGTGTTCAAATAATGTGGTTGAATATCAGGCTTTGATAATTGGCCTTCAAATGGCAT
TGGAAATCGAAGTATCATTCATAGAAATTTATGGTGATTCAAAGTTGATAATCAATCATCTTTCGCTTCAGTATGATGTGAAACATGAAAACTTGAAGACATACTTCTCT
TATGCTCGAAAATTGATGGAAAAGTTCGTTAGTATGATGCTAGAACACGTCCCTAGAGTAGAAAATAAGAGGGCAGACACAATGACGAATTTAGCCACTGCCTTAATGAT
GCTAGATGATGAAGTGAACGCGACGACATTCCATTTGATTGATGAAGAAGATTGGCGTCAACCCATCATAGAGTATCTTGAACATAGAAAGCTTCCAAAGGATTCCCGTC
ATAAAACTGAGGTACGAAGAAGAGCTACACGCTTCATTTATTACAAGGGAACCTTTTATCGCCGTTCTCTTCAAGGGTTCTTCCTTCGATGCCTTGGAAAGGAAGAGTTA
GTAAAAGCTCTAAAGGAAGTACATGCAGATGTTTGTGGAGCACATCAATCGAGACCAAAACTTCAATGCAAACTAAGAAGAATGAGCTACTATTGGCGTAAGATGATCCA
AGACTCAATAGACTATGTAAAGAAGTGTGAAGCTTGTCAATACCATGCAAACTTCATACACCAACCTCTAGAGCCTCTTCATCCAATTGTGGCTTCTTGGTCGTTTGAGG
CTTGGAGACTCGATCTGGTTGACTCTATTACACAAAAATCACCACTTTCAAAAGGTTTGATGTCTTCTTACCAATTCAAACCGAGACCACCACTAGCACTCAATTACTAT
CCTCTGGACAAAGAACCGAGTTGTGGGACCCAATTTGGTGTACAAAGTAATGGAGATAAAAGGGGATTGGAAGGTTCAACCTATAAACTTAATCCCTCTCGGGCCAATGA
GAGGGTGGGGCCCCTTGTTCAAGACTTGGATTCAGTGCTTAAGAGAGCAACCTATCTACTAACCCTAAAGCGGGTAGGAGTGAATTCCATCTTGTACCCTATGTTCCCAG
CTATCCACCCGATCTTACCCCTGAAATGGGAGGCTTATTGGGCCAACGCTGATGAGCTGCCCTCACCTATGCAGATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAGAAAAACCTTTGTTACTCTCAATACAAGTCAAGGTTCGTTGAATGTAAAAAGACATAATGTTATAGTAACGAATCCTAAAAAAGAAGAGCCAGAACACGGGGA
AGGTGAAACTTCATGTCATTATATCACCATTATTGAGGGATCAAAGGCTGAAACTCATGAAGATGACGCAAAAGATGTCCCACAGAGTTTAGAGGATGGTGGCCAATCTA
CTGTAGATGAGCTAAAAGAGGTGAACGTTGGTACAATAGAGGAACCATGCTCAACTTTCATTAGTGCATCCCTCTCTAACGAGGAGGAGGATAAATACATGAGTTTGCTC
ACCGAATACAGAGACATCTTTGCTTGGTCGTACAAGGAGATGTCAGAACTTGATCCAAAAGTAGCAGTCCATCATCTTGCAATTAAACCAGGGTATCGATCGATTAAACA
AGCACAACGACATTTTCAAGTGTATCTTATTCCTCGTATCGAGGTGGAAGTCAACAAGTTGATTGAAGAAGGGTTCATTCGCGAGGTCAATTATCCAACGTGGATAGAAA
ACATTGTCCCTGTTAGAAAAAAGAACGGGCAACTTCGCGTTTGTGTAGAATTTTGCGACCTGAATAATGCATGCCCTAAAGGTGATTTTCCCTTGCCTATCATAGAAATC
ATGGTTGATGCAACTACTGGATACAGAGCATTGTCCTTTATGGATGGGTCGTCTGGATATAATCTAACCCTCTCAGATGAAGAAATGATAGCTTTTAGGACTCCAAAGGG
AATATACTGTTACAAGGTGATGCCCTTTGGATTGAAAAATGTCGGTGCTACTTACCAATGTGTCGTGCATAAAGTGTTTGATGATATGTTGCAAAAAAAGTTTCTTGGCT
TCATTGTAAGGCACCGAGGGATCGAAATAGACCAATCCAAGTTGATTCAGAAGATGCCAAAACCTAAGAGTTTGCATGACCTAAGAAGTCTCCAAGGACGATTGGCTTAC
ATCCAAAGGTTCATCTCTAACATGGCCGTGTTGAGAGCTCCAGTACCTGACAAACCATTAATATTGTACATTGCTGCACAAGATGGGTCCTTAGGAGCATTACTAGTACA
AGAGGAGGAAAAGGGAAAGGAGCGTGCTCTCTACTATTTAAGCAGAACATTAATTAGGGCTGAAGTTAACTATTCTCCCATCGAGAACATTGGCAAAAGTATACCTATAA
AATATGTTCTGTTCAGGCCAATTATCTCTAGACGCTTAGCCAAATGGGTGGTTCTACTCCAACAATATGACATTGTCAATATTCTCCAAAAGACCACCCAATTCCTTCGG
ATTGGAAGTTATGTGAAGACCTGTCAGACGATGAGGTTTTCTTCACGCGAACCTTGGACTATGTTTTTTTACGGTGCAACATTAAGAAGTGGTGCAGGGGCTGACATCGT
TCTCATTGCTCCTGAGAAGCATATGTTGCCTTATAGCTTTGCATTTGTTGAACTGTGTTCAAATAATGTGGTTGAATATCAGGCTTTGATAATTGGCCTTCAAATGGCAT
TGGAAATCGAAGTATCATTCATAGAAATTTATGGTGATTCAAAGTTGATAATCAATCATCTTTCGCTTCAGTATGATGTGAAACATGAAAACTTGAAGACATACTTCTCT
TATGCTCGAAAATTGATGGAAAAGTTCGTTAGTATGATGCTAGAACACGTCCCTAGAGTAGAAAATAAGAGGGCAGACACAATGACGAATTTAGCCACTGCCTTAATGAT
GCTAGATGATGAAGTGAACGCGACGACATTCCATTTGATTGATGAAGAAGATTGGCGTCAACCCATCATAGAGTATCTTGAACATAGAAAGCTTCCAAAGGATTCCCGTC
ATAAAACTGAGGTACGAAGAAGAGCTACACGCTTCATTTATTACAAGGGAACCTTTTATCGCCGTTCTCTTCAAGGGTTCTTCCTTCGATGCCTTGGAAAGGAAGAGTTA
GTAAAAGCTCTAAAGGAAGTACATGCAGATGTTTGTGGAGCACATCAATCGAGACCAAAACTTCAATGCAAACTAAGAAGAATGAGCTACTATTGGCGTAAGATGATCCA
AGACTCAATAGACTATGTAAAGAAGTGTGAAGCTTGTCAATACCATGCAAACTTCATACACCAACCTCTAGAGCCTCTTCATCCAATTGTGGCTTCTTGGTCGTTTGAGG
CTTGGAGACTCGATCTGGTTGACTCTATTACACAAAAATCACCACTTTCAAAAGGTTTGATGTCTTCTTACCAATTCAAACCGAGACCACCACTAGCACTCAATTACTAT
CCTCTGGACAAAGAACCGAGTTGTGGGACCCAATTTGGTGTACAAAGTAATGGAGATAAAAGGGGATTGGAAGGTTCAACCTATAAACTTAATCCCTCTCGGGCCAATGA
GAGGGTGGGGCCCCTTGTTCAAGACTTGGATTCAGTGCTTAAGAGAGCAACCTATCTACTAACCCTAAAGCGGGTAGGAGTGAATTCCATCTTGTACCCTATGTTCCCAG
CTATCCACCCGATCTTACCCCTGAAATGGGAGGCTTATTGGGCCAACGCTGATGAGCTGCCCTCACCTATGCAGATCTAA
Protein sequenceShow/hide protein sequence
MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSNEEEDKYMSLL
TEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEI
MVDATTGYRALSFMDGSSGYNLTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQKKFLGFIVRHRGIEIDQSKLIQKMPKPKSLHDLRSLQGRLAY
IQRFISNMAVLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIENIGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLR
IGSYVKTCQTMRFSSREPWTMFFYGATLRSGAGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFS
YARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDDEVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGFFLRCLGKEEL
VKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKSPLSKGLMSSYQFKPRPPLALNYY
PLDKEPSCGTQFGVQSNGDKRGLEGSTYKLNPSRANERVGPLVQDLDSVLKRATYLLTLKRVGVNSILYPMFPAIHPILPLKWEAYWANADELPSPMQI