; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g25560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g25560
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr2:18221715..18226769
RNA-Seq ExpressionMoc02g25560
SyntenyMoc02g25560
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]2.8e-28367.58Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ
        K  +L + AQERS GALL Q+  KGKE +LYYLSR L GAE+NYSPIEKMCL+LFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIIS RLAKWA+LLQ
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ

Query:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG
        QYDIVY+PQKA+KGQ +  FLA+HPIPSDWKL + LP +E+F+ E+  PWTMYFDGAAR+ GAGAG+V IS +KHMLPYSF  +ELCSNNV EYQALIIG
Subjt:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG

Query:  LQIALEIGVTYKQIYGDSKLIINQLLLE--------------------------------AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---
        LQIALEIGV++ ++YGDSKLIINQL L+                                 ENKRA ALANLATALT+ +D  LNIPLCQRWIIPP+   
Subjt:  LQIALEIGVTYKQIYGDSKLIINQLLLE--------------------------------AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---

Query:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI
                                                                             E+S+KAL+E H+GV  AH+SGPKL  QL+R+
Subjt:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI

Query:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF
        GYYWP M+QDS+ Y K+CE  Q+HANFIHQ  EPLHPT+ASWPFEAWGLDLVGPITPKSSAGHSYILA TDYF RWAEA++L+EAKKENVADFIRTH+I+
Subjt:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF

Query:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA
        RYGI H I+TDNG+QFS  ++ +LCEKF FKQY SSMYNAAANGL EAFNKTLCNLLKK+VSKSKRDWQE+I EALWAYRTT+RTPTG TPYSLVYGV+A
Subjt:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA

Query:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK
        +LPLEREIPSLRMAVQEGLT EDN +LRLQELEALDE+RL+AQQALECYQARM K F+K V+PRSFQVG+LVLAVRR IITTRHT NK TPKW+GPY++K
Subjt:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK

Query:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA
        EVYTNG YKIVD+DGLRIGPINGKFLKK+YA
Subjt:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA

XP_031737372.1 uncharacterized protein LOC116402244 [Cucumis sativus]6.1e-28367.44Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ
        K  +L + AQERS GALL Q+  KGKE +LYYLSR L GAE+NYSPIEKMCL+LFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPII+ RLAKWA+LLQ
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ

Query:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG
        QYDIVY+PQKA+KGQ +  FLA+HPIPSDWKL + LP +E+F+ E+  PWTMYFDGAAR+ GAGAG+V IS +KHMLPYSF  +ELCSNNV EYQALIIG
Subjt:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG

Query:  LQIALEIGVTYKQIYGDSKLIINQLLLE--------------------------------AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---
        LQIALEIGV++ ++YGDSKLIINQL L+                                 ENKRA ALANLATALT+ +D  LNIPLCQRWIIPP+   
Subjt:  LQIALEIGVTYKQIYGDSKLIINQLLLE--------------------------------AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---

Query:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI
                                                                             E+S+KAL+E H+GV  AH+SGPKL  QL+R+
Subjt:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI

Query:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF
        GYYWP M+QDS+ Y K+CE  Q+HANFIHQ  EPLHPT+ASWPFEAWGLDLVGPITPKSSAGHSYILA TDYF RWAEA++L+EAKKENVADFIRTH+I+
Subjt:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF

Query:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA
        RYGI H I+TDNG+QFS  ++ +LCEKF FKQY SSMYNAAANGL EAFNKTLCNLLKK+VSKSKRDWQE+I EALWAYRTT+RTPTG TPYSLVYGV+A
Subjt:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA

Query:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK
        +LPLEREIPSLRMAVQEGLT EDN +LRLQELEALDE+RL+AQQALECYQARM K F+K V+PRSFQVG+LVLAVRR IITTRHT NK TPKW+GPY++K
Subjt:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK

Query:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA
        EVYTNG YKIVD+DGLRIGPINGKFLKK+YA
Subjt:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]6.1e-28367.44Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ
        K  +L + AQERS GALL Q+  KGKE +LYYLSR L GAE+NYSPIEKMCL+LFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPII+ RLAKWA+LLQ
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ

Query:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG
        QYDIVY+PQKA+KGQ +  FLA+HPIPSDWKL + LP +E+F+ E+  PWTMYFDGAAR+ GAGAG+V IS +KHMLPYSF  +ELCSNNV EYQALIIG
Subjt:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG

Query:  LQIALEIGVTYKQIYGDSKLIINQLLLE--------------------------------AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---
        LQIALEIGV++ ++YGDSKLIINQL L+                                 ENKRA ALANLATALT+ +D  LNIPLCQRWIIPP+   
Subjt:  LQIALEIGVTYKQIYGDSKLIINQLLLE--------------------------------AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---

Query:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI
                                                                             E+S+KAL+E H+GV  AH+SGPKL  QL+R+
Subjt:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI

Query:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF
        GYYWP M+QDS+ Y K+CE  Q+HANFIHQ  EPLHPT+ASWPFEAWGLDLVGPITPKSSAGHSYILA TDYF RWAEA++L+EAKKENVADFIRTH+I+
Subjt:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF

Query:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA
        RYGI H I+TDNG+QFS  ++ +LCEKF FKQY SSMYNAAANGL EAFNKTLCNLLKK+VSKSKRDWQE+I EALWAYRTT+RTPTG TPYSLVYGV+A
Subjt:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA

Query:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK
        +LPLEREIPSLRMAVQEGLT EDN +LRLQELEALDE+RL+AQQALECYQARM K F+K V+PRSFQVG+LVLAVRR IITTRHT NK TPKW+GPY++K
Subjt:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK

Query:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA
        EVYTNG YKIVD+DGLRIGPINGKFLKK+YA
Subjt:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]6.1e-28367.44Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ
        K  +L + AQERS GALL Q+  KGKE +LYYLSR L GAE+NYSPIEKMCL+LFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPII+ RLAKWA+LLQ
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ

Query:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG
        QYDIVY+PQKA+KGQ +  FLA+HPIPSDWKL + LP +E+F+ E+  PWTMYFDGAAR+ GAGAG+V IS +KHMLPYSF  +ELCSNNV EYQALIIG
Subjt:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG

Query:  LQIALEIGVTYKQIYGDSKLIINQLLLE--------------------------------AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---
        LQIALEIGV++ ++YGDSKLIINQL L+                                 ENKRA ALANLATALT+ +D  LNIPLCQRWIIPP+   
Subjt:  LQIALEIGVTYKQIYGDSKLIINQLLLE--------------------------------AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---

Query:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI
                                                                             E+S+KAL+E H+GV  AH+SGPKL  QL+R+
Subjt:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI

Query:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF
        GYYWP M+QDS+ Y K+CE  Q+HANFIHQ  EPLHPT+ASWPFEAWGLDLVGPITPKSSAGHSYILA TDYF RWAEA++L+EAKKENVADFIRTH+I+
Subjt:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF

Query:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA
        RYGI H I+TDNG+QFS  ++ +LCEKF FKQY SSMYNAAANGL EAFNKTLCNLLKK+VSKSKRDWQE+I EALWAYRTT+RTPTG TPYSLVYGV+A
Subjt:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA

Query:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK
        +LPLEREIPSLRMAVQEGLT EDN +LRLQELEALDE+RL+AQQALECYQARM K F+K V+PRSFQVG+LVLAVRR IITTRHT NK TPKW+GPY++K
Subjt:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK

Query:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA
        EVYTNG YKIVD+DGLRIGPINGKFLKK+YA
Subjt:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA

XP_031742888.1 uncharacterized protein LOC116404510 [Cucumis sativus]2.8e-28367.58Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ
        K  +L + AQERS GALL Q+  KGKE +LYYLSR L GAE+NYSPIEKMCL+LFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIIS RLAKWA+LLQ
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ

Query:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG
        QYDIVY+PQKA+KGQ +  FLA+HPIPSDWKL + LP +E+F+ E+  PWTMYFDGAAR+ GAGAG+V IS +KHMLPYSF  +ELCSNNV EYQALIIG
Subjt:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG

Query:  LQIALEIGVTYKQIYGDSKLIINQLLLE--------------------------------AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---
        LQIALEIGV++ ++YGDSKLIINQL L+                                 ENKRA ALANLATALT+ +D  LNIPLCQRWIIPP+   
Subjt:  LQIALEIGVTYKQIYGDSKLIINQLLLE--------------------------------AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---

Query:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI
                                                                             E+S+KAL+E H+GV  AH+SGPKL  QL+R+
Subjt:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI

Query:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF
        GYYWP M+QDS+ Y K+CE  Q+HANFIHQ  EPLHPT+ASWPFEAWGLDLVGPITPKSSAGHSYILA TDYF RWAEA++L+EAKKENVADFIRTH+I+
Subjt:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF

Query:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA
        RYGI H I+TDNG+QFS  ++ +LCEKF FKQY SSMYNAAANGL EAFNKTLCNLLKK+VSKSKRDWQE+I EALWAYRTT+RTPTG TPYSLVYGV+A
Subjt:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA

Query:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK
        +LPLEREIPSLRMAVQEGLT EDN +LRLQELEALDE+RL+AQQALECYQARM K F+K V+PRSFQVG+LVLAVRR IITTRHT NK TPKW+GPY++K
Subjt:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK

Query:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA
        EVYTNG YKIVD+DGLRIGPINGKFLKK+YA
Subjt:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA

TrEMBL top hitse value%identityAlignment
A0A5A7TZU9 Ribonuclease H4.6e-27665.8Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ
        K  +L + AQE S GALL Q+ +KGKE  LYYLSR LTGAELNYSPIEKMCL+LFFAIDKLRHYMQAFT+HLVAKADP+KY+LSRP+IS RLAKWAI+LQ
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ

Query:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG
        QYDIVY+PQKAVKGQ +  FLA+HP+PS+WKL + LP EE+ ++E   PW M+FDGAAR+ GAG G+VFIS +KHMLPYSFT  ELCSNNV EYQA IIG
Subjt:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG

Query:  LQIALEIGVTYKQIYGDSKLIINQL----------------------------LLE----AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---
        LQ+A E G+   +I+GDSKLIINQL                            +LE    +ENK+A ALANLATALT+SED  +NI LCQ+WI+P I   
Subjt:  LQIALEIGVTYKQIYGDSKLIINQL----------------------------LLE----AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---

Query:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI
                                                                             EES KALEEAHSG+  AH+SGPKL  QLKR+
Subjt:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI

Query:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF
        GYYWPTM+ DSM +AK CEA QFHANFIHQ  EPLHPTIASWPFEAWGLDLVGPITPKS+AGHSYILAGTDYF +WAEAV L+EAKKEN+ +F++TH+I+
Subjt:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF

Query:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA
        RYGI H I+TDNGRQF+  L+ +LCEKF FKQ+ SSMYNAAANGL EAFNKTLC+LLKKVVSK+KRDWQE+I EALWAYRTT+RTPTG TPYSLVYGV+A
Subjt:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA

Query:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK
        +LPLEREIPSLRMA+QEGLT EDNA+LRL+ELEALDE+RL+AQQALECYQARM K F+K+VRPRSFQVG+LVLAVRR IITTRHT NK TPKW+GPY++K
Subjt:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK

Query:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA
        EV+TNG YKI+D+DGLRIGPING+FLKK+YA
Subjt:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA

A0A5A7UID6 Ribonuclease H5.8e-27165.12Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ
        K  +L + AQE S GALL Q+ +KGKE  LYYLSR LTGAELNYSPIEKMCL+LFFAIDKLRHYMQ FT+HLVAKADP+KY+LSRP+IS RLAKWAI+LQ
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ

Query:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG
        QYDIVY+PQKAVKGQ +  FLA+HP+PS+WKL + LP EE+ ++E    W M+FDG AR+ GAG G+VFIS +KHMLPYSFT  ELCSNNV EYQA IIG
Subjt:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG

Query:  LQIALEIGVTYKQIYGDSKLIINQL----------------------------LLE----AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---
        LQ+A E G+   +I+GDSKLIINQL                            +LE    +ENK+A ALANLAT LT+SED  +NI LCQ+WI+P I   
Subjt:  LQIALEIGVTYKQIYGDSKLIINQL----------------------------LLE----AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---

Query:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI
                                                                             EES KALEEAHSG+  AH+SGPKL  QLKR+
Subjt:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI

Query:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF
        GYYWPTM+ DSM +AK CEA QFHANFIHQ  EPLHPTIASWPFEAWGLDLV PITPKS+AGHSYILAGTDYF +WAEAV L+EAKKEN+ +F+RTH+I+
Subjt:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF

Query:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA
        RYGI H I+TDNGRQF+  L+ +LCEKF FKQY SSMYNAAANGL EAFNKTLC+LLKKVVSK+KRDWQE+I EALWAYRTT+RTPTG TPYSLVYGV+A
Subjt:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA

Query:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK
        +LPLEREIPSLRMA+Q+GLT EDNA+LRLQELEALDE+RL+AQQALECYQARM K F+K+VRPRSFQVG+LVLAVRR IITTRH  NK TPKW+GPY++K
Subjt:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK

Query:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA
        EV+TNG YKI+D+DGLRIGPING+FLKK+ A
Subjt:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA

A0A5D3BTY1 Ribonuclease H1.6e-27365.53Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ
        K  +L + AQE S GALL Q+ +KGKE  LYYLSR LTGAELNYSPIEKMCL+LFFAIDKLRHYMQAFT+HLVAKADP+KY+LSRP+IS RLAKWAI+LQ
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ

Query:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG
        QYDIVY+PQKAVKGQ +  FLA+HP+PS+WKL + LP EE+ ++E   PW M+FDGAAR+ GAG G+VFIS +KHMLPYSFT  ELCSNNV EYQA IIG
Subjt:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG

Query:  LQIALEIGVTYKQIYGDSKLIINQL----------------------------LLE----AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---
        LQ+A E G+   +I+GDSKLIINQL                            +LE    +ENK+A ALANLATALT+SED  +NI LCQ+WI+P I   
Subjt:  LQIALEIGVTYKQIYGDSKLIINQL----------------------------LLE----AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---

Query:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI
                                                                             EES KALEEAHSG+  AH+SGPKL  QLKR+
Subjt:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI

Query:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF
        GYYWPTM+ DSM +AK CEA QFHANFIHQ  EPLHPTIASWPFE WGLDLVGPITPKSSAGHSYILA TDYF RWAEAV L+EAKKEN+ +F++T++I+
Subjt:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF

Query:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA
        RYGI H I+TDNGRQF+  L+ +LCEKF FKQY SSMYNAAANGL EAFNKTLC+LLKKVVSK+KRDWQE+I EALWAYRTT+RTPTG TPYSLVYGV+A
Subjt:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA

Query:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK
        +LPLE+EIPSLRM++QEGLT +DNA+L LQELEALDE+RL+AQQALECYQARM K F+K+VRPRSFQVG+LVLAVRR IITTRHT NK TPKW+GPY++K
Subjt:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK

Query:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA
        EV+TNG YKI+D+DGLRIGPINGKFLKK+YA
Subjt:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA

A0A5D3D1E5 Ribonuclease H2.7e-27665.94Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ
        K  +L + AQE S GALL Q+ +KGKE  LYYLSR LTGAELNYSPIEKMCL+LFFAIDKLRHYMQAFT+HLVAKADP+KY+LSRP+IS RLAKWAI+LQ
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ

Query:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG
        QYDIVY+PQKAVKGQ +  FLA+HP+PS+WKL + LP EE+ ++E   PW M+FDGAAR+ GAG G+VFIS +KHMLPYSFT  ELCSNNV EYQA IIG
Subjt:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG

Query:  LQIALEIGVTYKQIYGDSKLIINQL----------------------------LLE----AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---
        LQ+A E G+   +I+GDSKLIINQL                            +LE    +ENK+A ALANLATALT+SED  +NI LCQ+WI+P I   
Subjt:  LQIALEIGVTYKQIYGDSKLIINQL----------------------------LLE----AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---

Query:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI
                                                                             EES KALEEAHSG+  AH+SGPKL  QLKR+
Subjt:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI

Query:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF
        GYYWPTM+ DSM +AK CEA QFHANFIHQ  EPLHPTIASWPFEAWGLDLVGPITPKS+AGHSYILAGTDYF +WAEAV L+EAKKEN+ +F++TH+I+
Subjt:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF

Query:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA
        RYGI H I+TDNGRQF+  L+ +LCEKF FKQ+ SSMYNAAANGL EAFNKTLC+LLKKVVSK+KRDWQE+I EALWAYRTT+RTPTG TPYSLVYGV+A
Subjt:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA

Query:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK
        +LPLEREIPSLRMA+QEGLT EDNA+LRLQELEALDE+RL+AQQALECYQARM K F+K+VRPRSFQVG+LVLAVRR IITTRHT NK TPKW+GPY++K
Subjt:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK

Query:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA
        EV+TNG YKI+D+DGL+IGPINGKFLKK+YA
Subjt:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA

A0A6J1DC95 Ribonuclease H8.4e-27867.03Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ
        K  +L +T QERS GALL Q+REKGKE   YYLSR L GAELNYS IEKMCLSLFF +DKLRHY+QAFTVHL AKADP+KY+LSRPIIS RLAKWAILLQ
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQ

Query:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG
        QYDIVYVPQKA+KGQ +  FLANHP+PSDWKL E LP EE+FY+E+  PWTMYFDGA R+ G G GVVF+S +KHMLPYSFT  ELC NN  EYQALIIG
Subjt:  QYDIVYVPQKAVKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIG

Query:  LQIALEIGVTYKQIYGDSKLIINQLLLE--------------------------------AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---
        LQIA+EIG+TY +I GDSKLIINQLLLE                                +ENK+A ALANLA A  +S DE L+I LCQ+W+ PPI   
Subjt:  LQIALEIGVTYKQIYGDSKLIINQLLLE--------------------------------AENKRAYALANLATALTISEDEVLNIPLCQRWIIPPI---

Query:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI
                                                                             EES KALEEAHSGV +A++SGPKL+ +LKR+
Subjt:  ---------------------------------------------------------------------EESIKALEEAHSGVYEAHRSGPKLHLQLKRI

Query:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF
        GYYWPTMVQDSM+YAK+CEA Q HANF+HQ  EPLHPT+ASWPFE+WGLDLVGPITPKSSAGHSYILA TDYF RWAEA+ALKEAKKEN+ +F+RTH+IF
Subjt:  GYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIF

Query:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA
        RYGI   I+TDNGRQFS  L+ +LCEKF FKQY SSMYNAAANGL EAFNKTLCNLLKKVVSK+KRDWQ++I E LWAYRTT RTPT  TPYSLVYGV+A
Subjt:  RYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDA

Query:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK
        +LPLEREIPSLRMAV EGLT EDNA+LRLQELEALDE+RLDAQQ LECYQAR+ K FNK VRPRSFQVG++VL VRR IITTRHTRNK TPKW+GPY+IK
Subjt:  MLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRPRSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIK

Query:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA
        EVYTNG Y IVDK GLRIGPINGKFLK++YA
Subjt:  EVYTNGTYKIVDKDGLRIGPINGKFLKKYYA

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein6.0e-2323.94Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLS-RPIISR---------
        K  LL   A + + GA+L QK +  K Y + Y S  ++ A+LNYS  +K  L++  ++   RHY++       +  +P K +   R +I R         
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLS-RPIISR---------

Query:  -RLAKWAILLQ--QYDIVYVPQKA--VKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNE
         RLA+W + LQ   ++I Y P  A  +   +        PIP D   SE      +  I IT                                    ++
Subjt:  -RLAKWAILLQ--QYDIVYVPQKA--VKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNE

Query:  LCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQLLLEAENKRAYALANLATALTI-SEDEVL--NIPLCQRWIIPPIEESIKALEEAHSGVYEA
          +  V EY                      D+KL+    LL  E+KR      L   L I S+D++L  N     R II    E  K +   H G+   
Subjt:  LCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQLLLEAENKRAYALANLATALTI-SEDEVL--NIPLCQRWIIPPIEESIKALEEAHSGVYEA

Query:  HRSGPKLHLQLKRIGYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIAS-WPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAV-ALKE
             +L   +    + W  + +   +Y + C   Q + +  H+   PL P   S  P+E+  +D +  + P+SS G++ +    D F + A  V   K 
Subjt:  HRSGPKLHLQLKRIGYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIAS-WPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAV-ALKE

Query:  AKKENVADFIRTHLIFRYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYR
           E  A      +I  +G    I+ DN   F+         K+ F    S  Y    +G  E  N+T+  LL+ V S     W + I+    +Y     
Subjt:  AKKENVADFIRTHLIFRYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYR

Query:  TPTGATPYSLVYGVD-AMLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRP-RSFQVGELVLAVRRSIITT
        + T  TP+ +V+    A+ PL  E+PS      E      N+Q  +Q  + + E        L     +M K F+ K++    FQ G+LV+ V+R+    
Subjt:  TPTGATPYSLVYGVD-AMLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRP-RSFQVGELVLAVRRSIITT

Query:  RHTRNKCTPKWEGPYVIKEVYTNGTYKIVDKDGLR
         H  NK  P + GP+ + +      Y++   D ++
Subjt:  RHTRNKCTPKWEGPYVIKEVYTNGTYKIVDKDGLR

P0CT35 Transposon Tf2-2 polyprotein6.0e-2323.94Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLS-RPIISR---------
        K  LL   A + + GA+L QK +  K Y + Y S  ++ A+LNYS  +K  L++  ++   RHY++       +  +P K +   R +I R         
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLS-RPIISR---------

Query:  -RLAKWAILLQ--QYDIVYVPQKA--VKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNE
         RLA+W + LQ   ++I Y P  A  +   +        PIP D   SE      +  I IT                                    ++
Subjt:  -RLAKWAILLQ--QYDIVYVPQKA--VKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNE

Query:  LCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQLLLEAENKRAYALANLATALTI-SEDEVL--NIPLCQRWIIPPIEESIKALEEAHSGVYEA
          +  V EY                      D+KL+    LL  E+KR      L   L I S+D++L  N     R II    E  K +   H G+   
Subjt:  LCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQLLLEAENKRAYALANLATALTI-SEDEVL--NIPLCQRWIIPPIEESIKALEEAHSGVYEA

Query:  HRSGPKLHLQLKRIGYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIAS-WPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAV-ALKE
             +L   +    + W  + +   +Y + C   Q + +  H+   PL P   S  P+E+  +D +  + P+SS G++ +    D F + A  V   K 
Subjt:  HRSGPKLHLQLKRIGYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIAS-WPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAV-ALKE

Query:  AKKENVADFIRTHLIFRYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYR
           E  A      +I  +G    I+ DN   F+         K+ F    S  Y    +G  E  N+T+  LL+ V S     W + I+    +Y     
Subjt:  AKKENVADFIRTHLIFRYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYR

Query:  TPTGATPYSLVYGVD-AMLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRP-RSFQVGELVLAVRRSIITT
        + T  TP+ +V+    A+ PL  E+PS      E      N+Q  +Q  + + E        L     +M K F+ K++    FQ G+LV+ V+R+    
Subjt:  TPTGATPYSLVYGVD-AMLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRP-RSFQVGELVLAVRRSIITT

Query:  RHTRNKCTPKWEGPYVIKEVYTNGTYKIVDKDGLR
         H  NK  P + GP+ + +      Y++   D ++
Subjt:  RHTRNKCTPKWEGPYVIKEVYTNGTYKIVDKDGLR

P0CT36 Transposon Tf2-3 polyprotein6.0e-2323.94Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLS-RPIISR---------
        K  LL   A + + GA+L QK +  K Y + Y S  ++ A+LNYS  +K  L++  ++   RHY++       +  +P K +   R +I R         
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLS-RPIISR---------

Query:  -RLAKWAILLQ--QYDIVYVPQKA--VKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNE
         RLA+W + LQ   ++I Y P  A  +   +        PIP D   SE      +  I IT                                    ++
Subjt:  -RLAKWAILLQ--QYDIVYVPQKA--VKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNE

Query:  LCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQLLLEAENKRAYALANLATALTI-SEDEVL--NIPLCQRWIIPPIEESIKALEEAHSGVYEA
          +  V EY                      D+KL+    LL  E+KR      L   L I S+D++L  N     R II    E  K +   H G+   
Subjt:  LCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQLLLEAENKRAYALANLATALTI-SEDEVL--NIPLCQRWIIPPIEESIKALEEAHSGVYEA

Query:  HRSGPKLHLQLKRIGYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIAS-WPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAV-ALKE
             +L   +    + W  + +   +Y + C   Q + +  H+   PL P   S  P+E+  +D +  + P+SS G++ +    D F + A  V   K 
Subjt:  HRSGPKLHLQLKRIGYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIAS-WPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAV-ALKE

Query:  AKKENVADFIRTHLIFRYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYR
           E  A      +I  +G    I+ DN   F+         K+ F    S  Y    +G  E  N+T+  LL+ V S     W + I+    +Y     
Subjt:  AKKENVADFIRTHLIFRYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYR

Query:  TPTGATPYSLVYGVD-AMLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRP-RSFQVGELVLAVRRSIITT
        + T  TP+ +V+    A+ PL  E+PS      E      N+Q  +Q  + + E        L     +M K F+ K++    FQ G+LV+ V+R+    
Subjt:  TPTGATPYSLVYGVD-AMLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRP-RSFQVGELVLAVRRSIITT

Query:  RHTRNKCTPKWEGPYVIKEVYTNGTYKIVDKDGLR
         H  NK  P + GP+ + +      Y++   D ++
Subjt:  RHTRNKCTPKWEGPYVIKEVYTNGTYKIVDKDGLR

P0CT41 Transposon Tf2-12 polyprotein6.0e-2323.94Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLS-RPIISR---------
        K  LL   A + + GA+L QK +  K Y + Y S  ++ A+LNYS  +K  L++  ++   RHY++       +  +P K +   R +I R         
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLS-RPIISR---------

Query:  -RLAKWAILLQ--QYDIVYVPQKA--VKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNE
         RLA+W + LQ   ++I Y P  A  +   +        PIP D   SE      +  I IT                                    ++
Subjt:  -RLAKWAILLQ--QYDIVYVPQKA--VKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNE

Query:  LCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQLLLEAENKRAYALANLATALTI-SEDEVL--NIPLCQRWIIPPIEESIKALEEAHSGVYEA
          +  V EY                      D+KL+    LL  E+KR      L   L I S+D++L  N     R II    E  K +   H G+   
Subjt:  LCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQLLLEAENKRAYALANLATALTI-SEDEVL--NIPLCQRWIIPPIEESIKALEEAHSGVYEA

Query:  HRSGPKLHLQLKRIGYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIAS-WPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAV-ALKE
             +L   +    + W  + +   +Y + C   Q + +  H+   PL P   S  P+E+  +D +  + P+SS G++ +    D F + A  V   K 
Subjt:  HRSGPKLHLQLKRIGYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIAS-WPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAV-ALKE

Query:  AKKENVADFIRTHLIFRYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYR
           E  A      +I  +G    I+ DN   F+         K+ F    S  Y    +G  E  N+T+  LL+ V S     W + I+    +Y     
Subjt:  AKKENVADFIRTHLIFRYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYR

Query:  TPTGATPYSLVYGVD-AMLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRP-RSFQVGELVLAVRRSIITT
        + T  TP+ +V+    A+ PL  E+PS      E      N+Q  +Q  + + E        L     +M K F+ K++    FQ G+LV+ V+R+    
Subjt:  TPTGATPYSLVYGVD-AMLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRP-RSFQVGELVLAVRRSIITT

Query:  RHTRNKCTPKWEGPYVIKEVYTNGTYKIVDKDGLR
         H  NK  P + GP+ + +      Y++   D ++
Subjt:  RHTRNKCTPKWEGPYVIKEVYTNGTYKIVDKDGLR

Q9UR07 Transposon Tf2-11 polyprotein6.0e-2323.94Show/hide
Query:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLS-RPIISR---------
        K  LL   A + + GA+L QK +  K Y + Y S  ++ A+LNYS  +K  L++  ++   RHY++       +  +P K +   R +I R         
Subjt:  KLCLLPLTAQERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLS-RPIISR---------

Query:  -RLAKWAILLQ--QYDIVYVPQKA--VKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNE
         RLA+W + LQ   ++I Y P  A  +   +        PIP D   SE      +  I IT                                    ++
Subjt:  -RLAKWAILLQ--QYDIVYVPQKA--VKGQVVPYFLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNE

Query:  LCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQLLLEAENKRAYALANLATALTI-SEDEVL--NIPLCQRWIIPPIEESIKALEEAHSGVYEA
          +  V EY                      D+KL+    LL  E+KR      L   L I S+D++L  N     R II    E  K +   H G+   
Subjt:  LCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQLLLEAENKRAYALANLATALTI-SEDEVL--NIPLCQRWIIPPIEESIKALEEAHSGVYEA

Query:  HRSGPKLHLQLKRIGYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIAS-WPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAV-ALKE
             +L   +    + W  + +   +Y + C   Q + +  H+   PL P   S  P+E+  +D +  + P+SS G++ +    D F + A  V   K 
Subjt:  HRSGPKLHLQLKRIGYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIAS-WPFEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAV-ALKE

Query:  AKKENVADFIRTHLIFRYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYR
           E  A      +I  +G    I+ DN   F+         K+ F    S  Y    +G  E  N+T+  LL+ V S     W + I+    +Y     
Subjt:  AKKENVADFIRTHLIFRYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTLCNLLKKVVSKSKRDWQERINEALWAYRTTYR

Query:  TPTGATPYSLVYGVD-AMLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRP-RSFQVGELVLAVRRSIITT
        + T  TP+ +V+    A+ PL  E+PS      E      N+Q  +Q  + + E        L     +M K F+ K++    FQ G+LV+ V+R+    
Subjt:  TPTGATPYSLVYGVD-AMLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRP-RSFQVGELVLAVRRSIITT

Query:  RHTRNKCTPKWEGPYVIKEVYTNGTYKIVDKDGLR
         H  NK  P + GP+ + +      Y++   D ++
Subjt:  RHTRNKCTPKWEGPYVIKEVYTNGTYKIVDKDGLR

Arabidopsis top hitse value%identityAlignment
AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.8e-0437.66Show/hide
Query:  TMYFDGAAR--KIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQL
        T+ FDGA++     AGAG V  +    +L Y        +NNV EY+AL++GL+ AL+ G     + GDS L+  Q+
Subjt:  TMYFDGAAR--KIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQL

AT3G01410.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.8e-0437.66Show/hide
Query:  TMYFDGAAR--KIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQL
        T+ FDGA++     AGAG V  +    +L Y        +NNV EY+AL++GL+ AL+ G     + GDS L+  Q+
Subjt:  TMYFDGAAR--KIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCCCAGTGCCAGCGCCCTGCCGCACATCGTCAGCACCCATGCCGCACATCACCAGCGCCCATGCCACCCCGCCCGCTCAACAGCGCCCCAGCCATGCCGCCCAGT
GCCAGCGCCCTGCCGCACATCGTCAGCACCCATGCCGCACATCACCAGCGCCCCTGCCACCCAACATCAGCGCCTGCCCCAGCGCCCAGTAATGCAAGGCGAAGGCGATG
TCAGTGGTGTCGGTGACAAGCTCAACGGGGGTCAGTTGGGGCCCCGGGCCACGACCCTAGAAGTCGGGGTTTCGTCGATAGTTTGGACGAGCGGCTTTAGAGGGGAGGCA
TGGAGTCAGCTCTCCACCACCCCTGGCCCCTGTGGCTTAAGTGAGCTACCCCATAGCACATGCGTGTGTCATGGCGTGAGCGAGCGTTGCAAGACCAAGGTCTTGGGCGT
CTTCTTTCGAGAATGGGAATCTCAAGGACAGTGTCGAACGGCACGGGTGTGCCGGCATTGCACCTCGTCCCATGGGCTAGAGATGGATGGATGTCCGAAAGACCCCGATC
ATAAATACAAGTCCATGGGTCGGTTCTCGGTATGCGTGCCCAAGAGTTCAGCGAAGTCCGATGACATCCCGATGCTTGGGGTGGCATCCTACAGTACCGGTGGGCACCCG
AAATCATCCCGGCAGTACCCAGTAAGGTCGTTTAGCTCGTTATTGGCTAGTTTCTCTGCAGGAATGGTCTTAGAGGCGTCTGATGGGTCCGATAAGCTAGTCAGAACTCA
TCCCGTTGAGTCGAGGAAGAAAGAACCTCGTCTAAGGTTCGACGAAGCAACAAAGGGCGCGAAGAGTAGATTGGGGCTAGAGTTCCCTGTAAGGCTGGCCATGGTCAAGC
TTAGATCATGGCACCGCGCGGTTGAAAAATCCTCGAGCATAGTGAGTAATCTGTCTGCTCAAGGTAACGACTACAAAGGTTTGAAGTTATGTCTCCTCCCTCTAACTGCA
CAAGAAAGGTCGTTTGGAGCATTACTCGTGCAAAAAAGAGAGAAAGGGAAAGAGTACACCCTCTACTATCTAAGTAGAAATCTAACCGGGGCTGAGCTAAACTATTCACC
CATTGAGAAGATGTGTCTGTCACTTTTCTTCGCCATCGACAAATTGAGGCATTACATGCAAGCATTCACAGTACACTTGGTGGCAAAAGCAGATCCAATCAAATATGTCT
TATCTAGACCAATTATCTCAAGGCGATTGGCTAAATGGGCAATCTTACTGCAACAGTACGACATTGTCTATGTTCCGCAAAAGGCAGTGAAGGGACAAGTTGTCCCATAT
TTTCTAGCAAACCATCCAATTCCGTCAGATTGGAAATTATCTGAAAGGTTGCCTCAGGAGGAGATTTTCTATATTGAAATCACTATGCCCTGGACAATGTATTTTGATGG
TGCAGCTCGAAAGATTGGCGCAGGGGCAGGTGTAGTCTTCATCTCACTCAAGAAACACATGTTGCCATACAGCTTCACGTTTAACGAGTTATGTTCGAATAATGTTGTTG
AATATCAAGCACTCATTATTGGACTTCAGATAGCCCTAGAAATTGGTGTGACATATAAACAAATCTATGGAGACTCGAAATTGATAATCAATCAACTACTGCTTGAAGCG
GAAAACAAGAGGGCATACGCATTGGCAAATTTAGCCACAGCTCTGACAATTTCTGAGGATGAAGTTTTGAATATCCCACTTTGTCAAAGATGGATCATACCACCAATTGA
AGAATCAATAAAAGCATTAGAGGAAGCACATTCAGGTGTGTACGAGGCTCATCGATCCGGACCAAAGCTCCACCTTCAACTCAAAAGAATTGGCTACTATTGGCCTACAA
TGGTTCAAGACTCGATGCAATATGCAAAGAGATGTGAAGCTTATCAGTTTCACGCTAATTTCATACATCAGTCTCTAGAGCCGCTCCATCCAACTATAGCTTCCTGGCCT
TTTGAAGCTTGGGGACTCGATCTTGTCGGTCCTATAACTCCCAAATCATCAGCGGGACATTCTTATATTCTCGCAGGAACTGATTATTTTTTAAGATGGGCCGAGGCTGT
TGCACTAAAAGAAGCTAAGAAAGAAAATGTAGCTGATTTTATCCGCACTCATCTCATCTTCCGGTATGGCATCCTGCATCACATTATGACAGATAATGGGAGACAATTTT
CAAAGGGCTTAATATACCAACTGTGTGAAAAGTTTGGCTTCAAGCAATATAATTCGTCAATGTACAACGCTGCAGCAAATGGGTTGGTAGAAGCGTTTAATAAAACGTTA
TGTAACTTATTGAAGAAAGTAGTCTCCAAGTCGAAACGAGATTGGCAAGAAAGGATCAATGAAGCATTATGGGCGTATCGAACCACCTATCGCACCCCGACCGGTGCTAC
ACCTTATTCTCTTGTCTATGGAGTAGATGCTATGCTCCCCCTAGAAAGAGAAATTCCATCTTTGCGAATGGCGGTGCAAGAAGGGCTAACAGTGGAAGATAATGCTCAGC
TACGTCTTCAAGAGTTGGAAGCGTTGGATGAAAGAAGATTGGATGCTCAGCAAGCTTTAGAATGCTACCAAGCACGAATGTTAAAAGTCTTCAACAAAAAAGTGCGACCT
CGATCATTTCAAGTTGGTGAGCTCGTACTTGCAGTTCGAAGATCGATTATTACAACTCGACATACAAGGAACAAGTGCACTCCAAAATGGGAAGGACCCTACGTTATAAA
AGAAGTCTACACAAATGGAACATACAAGATCGTGGACAAAGACGGGTTAAGAATTGGCCCAATAAATGGTAAATTCCTGAAGAAATACTATGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCGCCCAGTGCCAGCGCCCTGCCGCACATCGTCAGCACCCATGCCGCACATCACCAGCGCCCATGCCACCCCGCCCGCTCAACAGCGCCCCAGCCATGCCGCCCAGT
GCCAGCGCCCTGCCGCACATCGTCAGCACCCATGCCGCACATCACCAGCGCCCCTGCCACCCAACATCAGCGCCTGCCCCAGCGCCCAGTAATGCAAGGCGAAGGCGATG
TCAGTGGTGTCGGTGACAAGCTCAACGGGGGTCAGTTGGGGCCCCGGGCCACGACCCTAGAAGTCGGGGTTTCGTCGATAGTTTGGACGAGCGGCTTTAGAGGGGAGGCA
TGGAGTCAGCTCTCCACCACCCCTGGCCCCTGTGGCTTAAGTGAGCTACCCCATAGCACATGCGTGTGTCATGGCGTGAGCGAGCGTTGCAAGACCAAGGTCTTGGGCGT
CTTCTTTCGAGAATGGGAATCTCAAGGACAGTGTCGAACGGCACGGGTGTGCCGGCATTGCACCTCGTCCCATGGGCTAGAGATGGATGGATGTCCGAAAGACCCCGATC
ATAAATACAAGTCCATGGGTCGGTTCTCGGTATGCGTGCCCAAGAGTTCAGCGAAGTCCGATGACATCCCGATGCTTGGGGTGGCATCCTACAGTACCGGTGGGCACCCG
AAATCATCCCGGCAGTACCCAGTAAGGTCGTTTAGCTCGTTATTGGCTAGTTTCTCTGCAGGAATGGTCTTAGAGGCGTCTGATGGGTCCGATAAGCTAGTCAGAACTCA
TCCCGTTGAGTCGAGGAAGAAAGAACCTCGTCTAAGGTTCGACGAAGCAACAAAGGGCGCGAAGAGTAGATTGGGGCTAGAGTTCCCTGTAAGGCTGGCCATGGTCAAGC
TTAGATCATGGCACCGCGCGGTTGAAAAATCCTCGAGCATAGTGAGTAATCTGTCTGCTCAAGGTAACGACTACAAAGGTTTGAAGTTATGTCTCCTCCCTCTAACTGCA
CAAGAAAGGTCGTTTGGAGCATTACTCGTGCAAAAAAGAGAGAAAGGGAAAGAGTACACCCTCTACTATCTAAGTAGAAATCTAACCGGGGCTGAGCTAAACTATTCACC
CATTGAGAAGATGTGTCTGTCACTTTTCTTCGCCATCGACAAATTGAGGCATTACATGCAAGCATTCACAGTACACTTGGTGGCAAAAGCAGATCCAATCAAATATGTCT
TATCTAGACCAATTATCTCAAGGCGATTGGCTAAATGGGCAATCTTACTGCAACAGTACGACATTGTCTATGTTCCGCAAAAGGCAGTGAAGGGACAAGTTGTCCCATAT
TTTCTAGCAAACCATCCAATTCCGTCAGATTGGAAATTATCTGAAAGGTTGCCTCAGGAGGAGATTTTCTATATTGAAATCACTATGCCCTGGACAATGTATTTTGATGG
TGCAGCTCGAAAGATTGGCGCAGGGGCAGGTGTAGTCTTCATCTCACTCAAGAAACACATGTTGCCATACAGCTTCACGTTTAACGAGTTATGTTCGAATAATGTTGTTG
AATATCAAGCACTCATTATTGGACTTCAGATAGCCCTAGAAATTGGTGTGACATATAAACAAATCTATGGAGACTCGAAATTGATAATCAATCAACTACTGCTTGAAGCG
GAAAACAAGAGGGCATACGCATTGGCAAATTTAGCCACAGCTCTGACAATTTCTGAGGATGAAGTTTTGAATATCCCACTTTGTCAAAGATGGATCATACCACCAATTGA
AGAATCAATAAAAGCATTAGAGGAAGCACATTCAGGTGTGTACGAGGCTCATCGATCCGGACCAAAGCTCCACCTTCAACTCAAAAGAATTGGCTACTATTGGCCTACAA
TGGTTCAAGACTCGATGCAATATGCAAAGAGATGTGAAGCTTATCAGTTTCACGCTAATTTCATACATCAGTCTCTAGAGCCGCTCCATCCAACTATAGCTTCCTGGCCT
TTTGAAGCTTGGGGACTCGATCTTGTCGGTCCTATAACTCCCAAATCATCAGCGGGACATTCTTATATTCTCGCAGGAACTGATTATTTTTTAAGATGGGCCGAGGCTGT
TGCACTAAAAGAAGCTAAGAAAGAAAATGTAGCTGATTTTATCCGCACTCATCTCATCTTCCGGTATGGCATCCTGCATCACATTATGACAGATAATGGGAGACAATTTT
CAAAGGGCTTAATATACCAACTGTGTGAAAAGTTTGGCTTCAAGCAATATAATTCGTCAATGTACAACGCTGCAGCAAATGGGTTGGTAGAAGCGTTTAATAAAACGTTA
TGTAACTTATTGAAGAAAGTAGTCTCCAAGTCGAAACGAGATTGGCAAGAAAGGATCAATGAAGCATTATGGGCGTATCGAACCACCTATCGCACCCCGACCGGTGCTAC
ACCTTATTCTCTTGTCTATGGAGTAGATGCTATGCTCCCCCTAGAAAGAGAAATTCCATCTTTGCGAATGGCGGTGCAAGAAGGGCTAACAGTGGAAGATAATGCTCAGC
TACGTCTTCAAGAGTTGGAAGCGTTGGATGAAAGAAGATTGGATGCTCAGCAAGCTTTAGAATGCTACCAAGCACGAATGTTAAAAGTCTTCAACAAAAAAGTGCGACCT
CGATCATTTCAAGTTGGTGAGCTCGTACTTGCAGTTCGAAGATCGATTATTACAACTCGACATACAAGGAACAAGTGCACTCCAAAATGGGAAGGACCCTACGTTATAAA
AGAAGTCTACACAAATGGAACATACAAGATCGTGGACAAAGACGGGTTAAGAATTGGCCCAATAAATGGTAAATTCCTGAAGAAATACTATGCTTGA
Protein sequenceShow/hide protein sequence
MPPSASALPHIVSTHAAHHQRPCHPARSTAPQPCRPVPAPCRTSSAPMPHITSAPATQHQRLPQRPVMQGEGDVSGVGDKLNGGQLGPRATTLEVGVSSIVWTSGFRGEA
WSQLSTTPGPCGLSELPHSTCVCHGVSERCKTKVLGVFFREWESQGQCRTARVCRHCTSSHGLEMDGCPKDPDHKYKSMGRFSVCVPKSSAKSDDIPMLGVASYSTGGHP
KSSRQYPVRSFSSLLASFSAGMVLEASDGSDKLVRTHPVESRKKEPRLRFDEATKGAKSRLGLEFPVRLAMVKLRSWHRAVEKSSSIVSNLSAQGNDYKGLKLCLLPLTA
QERSFGALLVQKREKGKEYTLYYLSRNLTGAELNYSPIEKMCLSLFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISRRLAKWAILLQQYDIVYVPQKAVKGQVVPY
FLANHPIPSDWKLSERLPQEEIFYIEITMPWTMYFDGAARKIGAGAGVVFISLKKHMLPYSFTFNELCSNNVVEYQALIIGLQIALEIGVTYKQIYGDSKLIINQLLLEA
ENKRAYALANLATALTISEDEVLNIPLCQRWIIPPIEESIKALEEAHSGVYEAHRSGPKLHLQLKRIGYYWPTMVQDSMQYAKRCEAYQFHANFIHQSLEPLHPTIASWP
FEAWGLDLVGPITPKSSAGHSYILAGTDYFLRWAEAVALKEAKKENVADFIRTHLIFRYGILHHIMTDNGRQFSKGLIYQLCEKFGFKQYNSSMYNAAANGLVEAFNKTL
CNLLKKVVSKSKRDWQERINEALWAYRTTYRTPTGATPYSLVYGVDAMLPLEREIPSLRMAVQEGLTVEDNAQLRLQELEALDERRLDAQQALECYQARMLKVFNKKVRP
RSFQVGELVLAVRRSIITTRHTRNKCTPKWEGPYVIKEVYTNGTYKIVDKDGLRIGPINGKFLKKYYA