; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G19270 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G19270
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRibonuclease H
Genome locationChr1:14740531..14742865
RNA-Seq ExpressionCSPI01G19270
SyntenyCSPI01G19270
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031752.1 uncharacterized protein E6C27_scaffold506G00150 [Cucumis melo var. makuwa]0.0e+0083.21Show/hide
Query:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR
        ++  K  +F+WDQSCQNAFDSIK YLLNPPVLSAP  GKPLILYIAAQE SLGALLAQENDK KE            ++LNYSPIEKMCLALFFAIDKLR
Subjt:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR

Query:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG
        HYMQAFTIHLVAKADP+KYILSRP+ISGRLAKWAIILQQYDIVYI QKAVKGQALADFLADH VPS+WKLC+DL DEEVLFVESM+ WIMFFDGA+R++G
Subjt:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG

Query:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE
        AGVGIVF SPEKHMLPYSFTL ELCSNNVAEYQA II LQMASEF             +INQLSYQYE+KHQDLKPYF+YARRLMDRFD IILEHIPRSE
Subjt:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE

Query:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL
        NKKADALANLAT LT+ ED+P+NISL QKWI+P I+SQ+EE D              PIIDYL+HGKLPT+ RHRAEIRRR ARFIYY DTLYRRSYEGL
Subjt:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL

Query:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG
        LL+CLGKEESTKALEEAHSGICGAHQSGPKLQ+QLKRMGYYWPT+IHDSM++AKHCE CQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPIT KS+ G
Subjt:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG

Query:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS
        HSY+L GTDYFS+WAE VPLREAKKENIVNFV+ HIIYRYGIPHRI+TDNGRQFAN+LMDKLCEK NFKQ+KS MYNAAANGLAEAFNKTLC+LLKKVVS
Subjt:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS

Query:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDK
        KTKRDWQEKIGE LWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRM IQEGLTTEDN +LRL+ELEAL+EKRLEAQQALECYQARMSKAFDK
Subjt:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDK

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]0.0e+0083.71Show/hide
Query:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR
        ++  K  +F+WDQSCQNAFDSIK YLLNPPVLSAP  GKPLILYIAAQE SLGALLAQENDK KE            ++LNYSPIEKMCLALFFAIDKLR
Subjt:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR

Query:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG
        HYMQAFTIHLVAKADP+KYILSRP+ISGRLAKWAIILQQYDIVYI QKAVKGQALADFLADH VPS+WKLC+DL DEEVLFVESM+ WIMFFDGA+R++G
Subjt:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG

Query:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE
        AGVGIVF SPEKHMLPYSFTL ELCSNNVAEYQA II LQMASEF             +INQLSYQYE+KHQDLKPYF+YARRLMDRFD IILEHIPRSE
Subjt:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE

Query:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL
        NKKADALANLAT LT+ ED+P+NISL QKWI+P I+SQ+EE D              PIIDYL+HGKLPT+ RHRAEIRRR ARFIYY DTLYRRSYEGL
Subjt:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL

Query:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG
        LL+CLGKEESTKALEEAHSGICGAHQSGPKLQ+QLKRMGYYWPT+IHDSM++AKHCE CQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPIT KS+ G
Subjt:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG

Query:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS
        HSY+L GTDYFS+WAE VPLREAKKENIVNFV+ HIIYRYGIPHRI+TDNGRQFAN+LMDKLCEK NFKQ+KS MYNAAANGLAEAFNKTLC+LLKKVVS
Subjt:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS

Query:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR
        KTKRDWQEKIGE LWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRM IQEGLTTEDN +LRL+ELEAL+EKRLEAQQALECYQARMSKAFDK VR
Subjt:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR

Query:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN
        PRSFQV +LVLA+RRPIITTRHTGNKFTPKWDGPYIVKEVFTN
Subjt:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN

KAA0053465.1 uncharacterized protein E6C27_scaffold190G00130 [Cucumis melo var. makuwa]0.0e+0083.04Show/hide
Query:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR
        ++  K  +F+WDQSCQNAFDSIK YLLNPPVLSAP  GKPLILYIAAQE SLGALLAQENDK KE            ++LNYSPIEKMCLALFFAIDKLR
Subjt:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR

Query:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG
        HYMQ FTIHLVAKADP+KYILSRP+ISGRLAKWAIILQQYDIVYI QKAVKGQALADFLADH VPS+WKLC+DL DEEVLFVESM+ WIMFFDG +R++G
Subjt:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG

Query:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE
        AGVGIVF SPEKHMLPYSFTL ELCSNNVAEYQA II LQMASEF             +INQLSYQY +KHQDLKPYF+YARRLMDRFD IILEHIPRSE
Subjt:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE

Query:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL
        NKKADALANLATTLT+ ED+P+NI L QKWI+P I+SQ+EE D              PIIDYL+HGKLPT+ RHRAEIRRR ARFIYY DTLYRRSYEGL
Subjt:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL

Query:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG
        LL+CLGKEES KALEEAHSGICGAHQSGPKLQ+QLKRMGYYWPT+IHDSM++AK+CE CQFHANFIHQPPEPLHPTIASWPFEAWGLDLV PIT KS+ G
Subjt:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG

Query:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS
        HSY+L GTDYFS+WAE VPLREAKKENIVNFVR HIIYRYGIPHRI+TDNGRQFAN+LMDKLCEK NFKQYKS MYNAAANGLAEAFNKTLC+LLKKVVS
Subjt:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS

Query:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR
        KTKRDWQEKIGE LWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRM IQ+GLTTEDN +LRLQELEAL+EKRLEAQQALECYQARMSKAFDK VR
Subjt:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR

Query:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN
        PRSFQV +LVLA+RRPIITTRH GNKFTPKWDGPYIVKEVFTN
Subjt:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN

TYK02262.1 uncharacterized protein E5676_scaffold18G00630 [Cucumis melo var. makuwa]0.0e+0083.31Show/hide
Query:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR
        ++  K  +F+WDQSCQNAFDSIK YLLNPPVLSAP  GKPLILYIAAQE SLGALLAQENDK KE            ++LNYSPIEKMCLALFFAIDKLR
Subjt:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR

Query:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG
        HYMQAFTIHLVAKADP+KYILSRP+ISGRLAKWAIILQQYDIVYI QKAVKGQALADFLADH VPS+WKLC+DL DEEVLFVESM+ WIMFFDGA+R++G
Subjt:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG

Query:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE
        AGVGIVF SPEKHMLPYSFTL ELCSNNVAEYQA II LQMASEF             +INQLSYQYE+KHQDLKPYF+YARRLMDRFD IILEHIPRSE
Subjt:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE

Query:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL
        NKKADALANLAT LT+ ED+P+NISL QKWI+P I+SQ+EE D              PIIDYL+HGKLPT+ RHRAEIRRR ARFIYY DTLYRRSYEGL
Subjt:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL

Query:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG
        LL+CLGKEESTKALEEAHSGICGAHQSGPKLQ+QLKRMGYYWPT+IHDSM++AK+CE CQFHANFIHQPPEPLHPTIASWPFE WGLDLVGPIT KSS G
Subjt:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG

Query:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS
        HSY+L  TDYFSRWAE VPLREAKKENIVNFV+ +IIYRYGIPHRI+TDNGRQFAN+LMDKLCEK NFKQYKS MYNAAANGLAEAFNKTLC+LLKKVVS
Subjt:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS

Query:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR
        KTKRDWQEKIGE LWAYRTTHRTPTGVTPYSLVYGVEAVLPLE+EIPSLRM+IQEGLTT+DN +L LQELEAL+EKRLEAQQALECYQARMSKAFDK VR
Subjt:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR

Query:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN
        PRSFQV +LVLA+RRPIITTRHTGNKFTPKWDGPYIVKEVFTN
Subjt:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN

TYK18071.1 uncharacterized protein E5676_scaffold306G004020 [Cucumis melo var. makuwa]0.0e+0083.85Show/hide
Query:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR
        ++  K  +F+WDQSCQNAFDSIK YLLNPPVLSAP  GKPLILYIAAQE SLGALLAQENDK KE            ++LNYSPIEKMCLALFFAIDKLR
Subjt:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR

Query:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG
        HYMQAFTIHLVAKADP+KYILSRP+ISGRLAKWAIILQQYDIVYI QKAVKGQALADFLADH VPS+WKLC+DL DEEVLFVESM+ WIMFFDGA+R++G
Subjt:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG

Query:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE
        AGVGIVF SPEKHMLPYSFTL ELCSNNVAEYQA II LQMASEF             +INQLSYQYE+KHQDLKPYF+YARRLMDRFD IILEHIPRSE
Subjt:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE

Query:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL
        NKKADALANLAT LT+ ED+P+NISL QKWI+P I+SQ+EE D              PIIDYL+HGKLPT+ RHRAEIRRR ARFIYY DTLYRRSYEGL
Subjt:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL

Query:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG
        LL+CLGKEESTKALEEAHSGICGAHQSGPKLQ+QLKRMGYYWPT+IHDSM++AKHCE CQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPIT KS+ G
Subjt:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG

Query:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS
        HSY+L GTDYFS+WAE VPLREAKKENIVNFV+ HIIYRYGIPHRI+TDNGRQFAN+LMDKLCEK NFKQ+KS MYNAAANGLAEAFNKTLC+LLKKVVS
Subjt:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS

Query:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR
        KTKRDWQEKIGE LWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRM IQEGLTTEDN +LRLQELEAL+EKRLEAQQALECYQARMSKAFDK VR
Subjt:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR

Query:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN
        PRSFQV +LVLA+RRPIITTRHTGNKFTPKWDGPYIVKEVFTN
Subjt:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN

TrEMBL top hitse value%identityAlignment
A0A5A7SKZ3 Ribonuclease H0.0e+0083.21Show/hide
Query:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR
        ++  K  +F+WDQSCQNAFDSIK YLLNPPVLSAP  GKPLILYIAAQE SLGALLAQENDK KE            ++LNYSPIEKMCLALFFAIDKLR
Subjt:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR

Query:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG
        HYMQAFTIHLVAKADP+KYILSRP+ISGRLAKWAIILQQYDIVYI QKAVKGQALADFLADH VPS+WKLC+DL DEEVLFVESM+ WIMFFDGA+R++G
Subjt:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG

Query:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE
        AGVGIVF SPEKHMLPYSFTL ELCSNNVAEYQA II LQMASEF             +INQLSYQYE+KHQDLKPYF+YARRLMDRFD IILEHIPRSE
Subjt:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE

Query:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL
        NKKADALANLAT LT+ ED+P+NISL QKWI+P I+SQ+EE D              PIIDYL+HGKLPT+ RHRAEIRRR ARFIYY DTLYRRSYEGL
Subjt:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL

Query:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG
        LL+CLGKEESTKALEEAHSGICGAHQSGPKLQ+QLKRMGYYWPT+IHDSM++AKHCE CQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPIT KS+ G
Subjt:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG

Query:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS
        HSY+L GTDYFS+WAE VPLREAKKENIVNFV+ HIIYRYGIPHRI+TDNGRQFAN+LMDKLCEK NFKQ+KS MYNAAANGLAEAFNKTLC+LLKKVVS
Subjt:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS

Query:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDK
        KTKRDWQEKIGE LWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRM IQEGLTTEDN +LRL+ELEAL+EKRLEAQQALECYQARMSKAFDK
Subjt:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDK

A0A5A7TZU9 Ribonuclease H0.0e+0083.71Show/hide
Query:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR
        ++  K  +F+WDQSCQNAFDSIK YLLNPPVLSAP  GKPLILYIAAQE SLGALLAQENDK KE            ++LNYSPIEKMCLALFFAIDKLR
Subjt:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR

Query:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG
        HYMQAFTIHLVAKADP+KYILSRP+ISGRLAKWAIILQQYDIVYI QKAVKGQALADFLADH VPS+WKLC+DL DEEVLFVESM+ WIMFFDGA+R++G
Subjt:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG

Query:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE
        AGVGIVF SPEKHMLPYSFTL ELCSNNVAEYQA II LQMASEF             +INQLSYQYE+KHQDLKPYF+YARRLMDRFD IILEHIPRSE
Subjt:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE

Query:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL
        NKKADALANLAT LT+ ED+P+NISL QKWI+P I+SQ+EE D              PIIDYL+HGKLPT+ RHRAEIRRR ARFIYY DTLYRRSYEGL
Subjt:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL

Query:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG
        LL+CLGKEESTKALEEAHSGICGAHQSGPKLQ+QLKRMGYYWPT+IHDSM++AKHCE CQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPIT KS+ G
Subjt:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG

Query:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS
        HSY+L GTDYFS+WAE VPLREAKKENIVNFV+ HIIYRYGIPHRI+TDNGRQFAN+LMDKLCEK NFKQ+KS MYNAAANGLAEAFNKTLC+LLKKVVS
Subjt:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS

Query:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR
        KTKRDWQEKIGE LWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRM IQEGLTTEDN +LRL+ELEAL+EKRLEAQQALECYQARMSKAFDK VR
Subjt:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR

Query:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN
        PRSFQV +LVLA+RRPIITTRHTGNKFTPKWDGPYIVKEVFTN
Subjt:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN

A0A5A7UID6 Ribonuclease H0.0e+0083.04Show/hide
Query:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR
        ++  K  +F+WDQSCQNAFDSIK YLLNPPVLSAP  GKPLILYIAAQE SLGALLAQENDK KE            ++LNYSPIEKMCLALFFAIDKLR
Subjt:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR

Query:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG
        HYMQ FTIHLVAKADP+KYILSRP+ISGRLAKWAIILQQYDIVYI QKAVKGQALADFLADH VPS+WKLC+DL DEEVLFVESM+ WIMFFDG +R++G
Subjt:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG

Query:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE
        AGVGIVF SPEKHMLPYSFTL ELCSNNVAEYQA II LQMASEF             +INQLSYQY +KHQDLKPYF+YARRLMDRFD IILEHIPRSE
Subjt:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE

Query:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL
        NKKADALANLATTLT+ ED+P+NI L QKWI+P I+SQ+EE D              PIIDYL+HGKLPT+ RHRAEIRRR ARFIYY DTLYRRSYEGL
Subjt:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL

Query:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG
        LL+CLGKEES KALEEAHSGICGAHQSGPKLQ+QLKRMGYYWPT+IHDSM++AK+CE CQFHANFIHQPPEPLHPTIASWPFEAWGLDLV PIT KS+ G
Subjt:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG

Query:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS
        HSY+L GTDYFS+WAE VPLREAKKENIVNFVR HIIYRYGIPHRI+TDNGRQFAN+LMDKLCEK NFKQYKS MYNAAANGLAEAFNKTLC+LLKKVVS
Subjt:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS

Query:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR
        KTKRDWQEKIGE LWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRM IQ+GLTTEDN +LRLQELEAL+EKRLEAQQALECYQARMSKAFDK VR
Subjt:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR

Query:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN
        PRSFQV +LVLA+RRPIITTRH GNKFTPKWDGPYIVKEVFTN
Subjt:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN

A0A5D3BTY1 Ribonuclease H0.0e+0083.31Show/hide
Query:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR
        ++  K  +F+WDQSCQNAFDSIK YLLNPPVLSAP  GKPLILYIAAQE SLGALLAQENDK KE            ++LNYSPIEKMCLALFFAIDKLR
Subjt:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR

Query:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG
        HYMQAFTIHLVAKADP+KYILSRP+ISGRLAKWAIILQQYDIVYI QKAVKGQALADFLADH VPS+WKLC+DL DEEVLFVESM+ WIMFFDGA+R++G
Subjt:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG

Query:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE
        AGVGIVF SPEKHMLPYSFTL ELCSNNVAEYQA II LQMASEF             +INQLSYQYE+KHQDLKPYF+YARRLMDRFD IILEHIPRSE
Subjt:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE

Query:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL
        NKKADALANLAT LT+ ED+P+NISL QKWI+P I+SQ+EE D              PIIDYL+HGKLPT+ RHRAEIRRR ARFIYY DTLYRRSYEGL
Subjt:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL

Query:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG
        LL+CLGKEESTKALEEAHSGICGAHQSGPKLQ+QLKRMGYYWPT+IHDSM++AK+CE CQFHANFIHQPPEPLHPTIASWPFE WGLDLVGPIT KSS G
Subjt:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG

Query:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS
        HSY+L  TDYFSRWAE VPLREAKKENIVNFV+ +IIYRYGIPHRI+TDNGRQFAN+LMDKLCEK NFKQYKS MYNAAANGLAEAFNKTLC+LLKKVVS
Subjt:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS

Query:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR
        KTKRDWQEKIGE LWAYRTTHRTPTGVTPYSLVYGVEAVLPLE+EIPSLRM+IQEGLTT+DN +L LQELEAL+EKRLEAQQALECYQARMSKAFDK VR
Subjt:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR

Query:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN
        PRSFQV +LVLA+RRPIITTRHTGNKFTPKWDGPYIVKEVFTN
Subjt:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN

A0A5D3D1E5 Ribonuclease H0.0e+0083.85Show/hide
Query:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR
        ++  K  +F+WDQSCQNAFDSIK YLLNPPVLSAP  GKPLILYIAAQE SLGALLAQENDK KE            ++LNYSPIEKMCLALFFAIDKLR
Subjt:  KIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKE------------SKLNYSPIEKMCLALFFAIDKLR

Query:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG
        HYMQAFTIHLVAKADP+KYILSRP+ISGRLAKWAIILQQYDIVYI QKAVKGQALADFLADH VPS+WKLC+DL DEEVLFVESM+ WIMFFDGA+R++G
Subjt:  HYMQAFTIHLVAKADPIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTG

Query:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE
        AGVGIVF SPEKHMLPYSFTL ELCSNNVAEYQA II LQMASEF             +INQLSYQYE+KHQDLKPYF+YARRLMDRFD IILEHIPRSE
Subjt:  AGVGIVFTSPEKHMLPYSFTLSELCSNNVAEYQALIIDLQMASEF-------------VINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSE

Query:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL
        NKKADALANLAT LT+ ED+P+NISL QKWI+P I+SQ+EE D              PIIDYL+HGKLPT+ RHRAEIRRR ARFIYY DTLYRRSYEGL
Subjt:  NKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETD--------------PIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGL

Query:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG
        LL+CLGKEESTKALEEAHSGICGAHQSGPKLQ+QLKRMGYYWPT+IHDSM++AKHCE CQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPIT KS+ G
Subjt:  LLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTG

Query:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS
        HSY+L GTDYFS+WAE VPLREAKKENIVNFV+ HIIYRYGIPHRI+TDNGRQFAN+LMDKLCEK NFKQ+KS MYNAAANGLAEAFNKTLC+LLKKVVS
Subjt:  HSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVS

Query:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR
        KTKRDWQEKIGE LWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRM IQEGLTTEDN +LRLQELEAL+EKRLEAQQALECYQARMSKAFDK VR
Subjt:  KTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVR

Query:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN
        PRSFQV +LVLA+RRPIITTRHTGNKFTPKWDGPYIVKEVFTN
Subjt:  PRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN

SwissProt top hitse value%identityAlignment
A4FUB7 Gypsy retrotransposon integrase-like protein 12.2e-2223.96Show/hide
Query:  LPTELRHRAEIRRRTARFIYYNDTLYRRSYEGLLLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIH
        LP+E   R+ IRR   +F++                    +E  K L E H    GAH  G      L    YYW ++ +D   +   C+ CQ   N + 
Subjt:  LPTELRHRAEIRRRTARFIYYNDTLYRRSYEGLLLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIH

Query:  QPPEPLHPTIASWPFEAWGLDLVGPITSKSSTGHSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLN
          P+  H      P+    +DL+GP  + S+  H Y ++ TD F++W  ++PL +     I   +  +I + YG P +I+ D   +F + +  +LCE   
Subjt:  QPPEPLHPTIASWPFEAWGLDLVGPITSKSSTGHSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLN

Query:  FKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVSKTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKL--
         KQ      +   N  AE+   T+   L K       DW + +  V +A+  TH  PT  TPY  ++     +P   +I  +     +G  T    K+  
Subjt:  FKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVSKTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKL--

Query:  RLQELEALNEKRLEAQQALE---CYQARMSKAFDKHVRPRSFQVDELVLAIRRPIITTRHT---GNKFTPKWDGPYIVKEVFTN
         ++E + + E +  +   +E   C++   SK     V+ +  Q +   L +   ++  R       +F  +W GP ++  +  N
Subjt:  RLQELEALNEKRLEAQQALE---CYQARMSKAFDKHVRPRSFQVDELVLAIRRPIITTRHT---GNKFTPKWDGPYIVKEVFTN

P03360 Gag-Pol polyprotein (Fragment)3.9e-1927.63Show/hide
Query:  PFEAWGLDLVGPITSKSSTGHSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAA
        P E W +D    IT+K   G+ Y+LV  D FS W E  P +    + ++  +   II R+G+P +I +DNG  F   +  +LCE LN        Y   +
Subjt:  PFEAWGLDLVGPITSKSSTGHSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAA

Query:  NGLAEAFNKTLCNLLKKVVSKTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEA
        +G  E  N+TL   + K+  +T  DW   + + L   R T     G++P+ ++YG++  +     +P +       +T +      L+ L+AL   R  A
Subjt:  NGLAEAFNKTLCNLLKKVVSKTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEA

Query:  QQALECYQARMSKAFDKHVRPRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIV
        +  L   + ++ +   +  R   FQ  +LV          +H   +  P+WDGPY V
Subjt:  QQALECYQARMSKAFDKHVRPRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIV

P10394 Retrovirus-related Pol polyprotein from transposon 4121.7e-2521.1Show/hide
Query:  IPKIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDK-----------ESKLNYSPIEKMCLALFFAIDKL
        I ++ +K   F W   CQ AF  +K+ L+NP +L  P   K   +   A + + GA+L Q ++  +           + + N S  E+   A+ +AI   
Subjt:  IPKIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDK-----------ESKLNYSPIEKMCLALFFAIDKL

Query:  RHYMQAFTIHLVAKAD--PIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASR
        R Y+  +  H   K D  P+ Y+ S    S +L +  + L++Y+    + + +KG+   + +AD L     K  +D++   +      +S         +
Subjt:  RHYMQAFTIHLVAKAD--PIKYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASR

Query:  KTGAGVGIVFTSPEKHMLPYSFTLSELCSN-NVAEYQALIIDLQMASEFVINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSENKKADALAN
        K+ AG         K  L       E+ S  NV E    +I      + V  QL+        D    F + ++++ R+D           +   + + +
Subjt:  KTGAGVGIVFTSPEKHMLPYSFTLSELCSN-NVAEYQALIIDLQMASEFVINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSENKKADALAN

Query:  LATTLTILEDVPVNISLSQKWIIPLIK-SQHEETDPIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGLL---LQCLGKEESTKALEEAHS
        L   L  LE       +SQ  + P  K  +H   D                          +F    + + +     LL    Q   ++E    L   H 
Subjt:  LATTLTILEDVPVNISLSQKWIIPLIK-SQHEETDPIIDYLKHGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGLL---LQCLGKEESTKALEEAHS

Query:  G-ICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTGHSYVLVGTDYFSRWAEVV
          I G H    K   ++KR  YYW  +      Y + C++CQ      H              F+   +D +GP+  KS  G+ Y +      +++   +
Subjt:  G-ICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLHPTIASWPFEAWGLDLVGPITSKSSTGHSYVLVGTDYFSRWAEVV

Query:  PLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVSKTKRDWQEKIGEVLWAYR
        P+     + +   + +  I +YG     +TD G ++ NS++  LC+ L  K   S  ++    G+ E  ++TL   ++  +S  K DW   +   ++ + 
Subjt:  PLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKVVSKTKRDWQEKIGEVLWAYR

Query:  TTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVRPRSFQVDELVLAIRRPII
        TT        PY LV+G  + LP  +    L  +I+     +D  K     LE    +   A++ LE ++ +  + +D        +V ++ L +   ++
Subjt:  TTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKHVRPRSFQVDELVLAIRRPII

Query:  TTRHTGNKFTPKWDGPYIVKEVFTN
             G+K   K+ GPY ++ +  N
Subjt:  TTRHTGNKFTPKWDGPYIVKEVFTN

Q2F7J0 Gag-Pol polyprotein2.0e-1526.78Show/hide
Query:  GHSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKV-
        G+ Y+LV  D FS W E  P +    + +   + + I  R+G+P  + +DNG  FA+ +   + + L         Y   ++G  E  N+T+   L K+ 
Subjt:  GHSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKV-

Query:  VSKTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKH
        ++   RDW   +   L+  R T   P G+TPY ++YG    L +    P +       LT   +++  LQ L+A+       Q+  +   A      D+ 
Subjt:  VSKTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKH

Query:  VRPRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIV
        V P  F+V + V          RH      P+W GPY V
Subjt:  VRPRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIV

Q2F7J3 Gag-Pol polyprotein1.2e-1526.78Show/hide
Query:  GHSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKV-
        G+ Y+LV  D FS W E  P +    + +   + + I  R+G+P  + +DNG  FA+ +   + + L         Y   ++G  E  N+T+   L K+ 
Subjt:  GHSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLAEAFNKTLCNLLKKV-

Query:  VSKTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKH
        ++   RDW   +   L+  R T   P G+TPY ++YG    L +    P +       LT   +++  LQ L+A+       Q+  +   A      D+ 
Subjt:  VSKTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKAFDKH

Query:  VRPRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIV
        V P  F+V + V          RH      P+W GPY V
Subjt:  VRPRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAACTATTCCAAAGATTGATGAGAAAGGATGCATCTTTAATTGGGATCAGTCATGCCAAAATGCATTTGATAGCATAAAGAATTATTTGCTCAATCCTCCGGTTTT
GAGTGCACCGGTAGCTGGAAAGCCATTGATATTATATATTGCAGCTCAAGAAGGTTCGCTTGGAGCATTACTTGCACAAGAAAATGACAAGGACAAGGAATCTAAATTGA
ATTATTCTCCAATCGAGAAAATGTGTCTCGCTCTTTTCTTTGCAATAGATAAGCTAAGACATTATATGCAAGCTTTCACTATACATTTAGTGGCAAAAGCTGATCCTATT
AAGTATATCTTATCAAGGCCAATTATCTCGGGACGTCTTGCTAAATGGGCAATTATACTTCAACAATATGATATTGTATATATTTCCCAAAAAGCAGTAAAAGGTCAAGC
ATTGGCAGATTTCTTGGCTGACCATCTAGTTCCATCAGATTGGAAATTATGTGAAGACTTATCGGATGAGGAAGTTTTGTTTGTTGAAAGCATGAAATCTTGGATCATGT
TCTTTGATGGTGCATCACGAAAAACTGGAGCTGGTGTCGGCATTGTCTTTACCTCTCCAGAGAAACACATGTTACCATATAGCTTCACACTTAGTGAATTATGTTCGAAT
AATGTGGCAGAGTACCAGGCCCTTATCATTGACTTACAAATGGCTTCAGAATTTGTTATAAATCAACTCTCCTATCAGTATGAGATCAAACATCAAGATCTGAAGCCGTA
CTTCACTTATGCTAGGAGATTGATGGACAGATTTGACGGCATAATATTGGAACATATACCAAGATCAGAAAATAAGAAAGCCGATGCCTTAGCAAATTTGGCCACGACTT
TAACAATTTTAGAAGATGTGCCAGTAAATATTTCTCTTAGCCAAAAGTGGATTATTCCCTTAATCAAAAGCCAACACGAAGAAACCGATCCCATCATAGACTATCTGAAG
CATGGAAAACTTCCCACCGAGCTTCGACATCGAGCCGAGATACGAAGAAGGACTGCACGATTTATTTATTACAACGACACACTTTATCGACGCTCGTATGAGGGTCTTCT
TCTGCAGTGCTTGGGAAAAGAGGAATCAACAAAGGCTCTAGAAGAAGCACATTCAGGTATATGTGGTGCTCACCAGTCTGGTCCAAAACTTCAGCATCAGTTGAAAAGAA
TGGGTTACTATTGGCCCACTATCATCCACGACTCAATGTATTATGCAAAACATTGTGAAGAGTGTCAATTCCATGCAAATTTTATACATCAACCACCAGAGCCTCTTCAT
CCAACAATAGCTTCATGGCCTTTTGAAGCTTGGGGACTTGACTTGGTTGGACCGATCACGTCGAAGTCATCGACTGGTCATTCTTACGTTCTTGTGGGAACCGATTATTT
TTCTAGATGGGCTGAAGTTGTACCATTAAGAGAAGCAAAGAAGGAAAACATCGTAAATTTCGTTCGAAAACACATCATTTACCGATATGGTATTCCTCATCGCATCATGA
CTGATAATGGAAGACAATTTGCTAACAGTCTAATGGATAAGTTGTGCGAGAAGCTTAACTTCAAACAGTACAAGTCTTTTATGTACAATGCTGCAGCAAATGGACTGGCA
GAAGCTTTCAACAAAACTCTATGTAATCTTCTGAAGAAGGTGGTCTCCAAGACAAAAAGAGATTGGCAAGAAAAGATAGGAGAAGTATTATGGGCCTATCGAACTACCCA
TCGTACTCCTACTGGTGTTACACCTTATTCTTTAGTTTACGGAGTAGAAGCGGTACTGCCGCTAGAGAGAGAAATTCCATCATTGAGAATGACAATTCAAGAAGGGCTAA
CTACTGAAGACAACGTTAAACTACGCCTTCAAGAGTTAGAAGCACTTAATGAAAAGAGACTAGAAGCTCAACAAGCACTCGAATGTTATCAAGCGCGAATGTCCAAAGCT
TTTGACAAACATGTAAGGCCCCGATCATTTCAGGTTGATGAGTTAGTGCTTGCAATAAGAAGACCTATTATCACGACGAGACATACGGGGAATAAGTTTACACCTAAATG
GGATGGACCCTACATCGTCAAAGAAGTTTTCACAAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAACTATTCCAAAGATTGATGAGAAAGGATGCATCTTTAATTGGGATCAGTCATGCCAAAATGCATTTGATAGCATAAAGAATTATTTGCTCAATCCTCCGGTTTT
GAGTGCACCGGTAGCTGGAAAGCCATTGATATTATATATTGCAGCTCAAGAAGGTTCGCTTGGAGCATTACTTGCACAAGAAAATGACAAGGACAAGGAATCTAAATTGA
ATTATTCTCCAATCGAGAAAATGTGTCTCGCTCTTTTCTTTGCAATAGATAAGCTAAGACATTATATGCAAGCTTTCACTATACATTTAGTGGCAAAAGCTGATCCTATT
AAGTATATCTTATCAAGGCCAATTATCTCGGGACGTCTTGCTAAATGGGCAATTATACTTCAACAATATGATATTGTATATATTTCCCAAAAAGCAGTAAAAGGTCAAGC
ATTGGCAGATTTCTTGGCTGACCATCTAGTTCCATCAGATTGGAAATTATGTGAAGACTTATCGGATGAGGAAGTTTTGTTTGTTGAAAGCATGAAATCTTGGATCATGT
TCTTTGATGGTGCATCACGAAAAACTGGAGCTGGTGTCGGCATTGTCTTTACCTCTCCAGAGAAACACATGTTACCATATAGCTTCACACTTAGTGAATTATGTTCGAAT
AATGTGGCAGAGTACCAGGCCCTTATCATTGACTTACAAATGGCTTCAGAATTTGTTATAAATCAACTCTCCTATCAGTATGAGATCAAACATCAAGATCTGAAGCCGTA
CTTCACTTATGCTAGGAGATTGATGGACAGATTTGACGGCATAATATTGGAACATATACCAAGATCAGAAAATAAGAAAGCCGATGCCTTAGCAAATTTGGCCACGACTT
TAACAATTTTAGAAGATGTGCCAGTAAATATTTCTCTTAGCCAAAAGTGGATTATTCCCTTAATCAAAAGCCAACACGAAGAAACCGATCCCATCATAGACTATCTGAAG
CATGGAAAACTTCCCACCGAGCTTCGACATCGAGCCGAGATACGAAGAAGGACTGCACGATTTATTTATTACAACGACACACTTTATCGACGCTCGTATGAGGGTCTTCT
TCTGCAGTGCTTGGGAAAAGAGGAATCAACAAAGGCTCTAGAAGAAGCACATTCAGGTATATGTGGTGCTCACCAGTCTGGTCCAAAACTTCAGCATCAGTTGAAAAGAA
TGGGTTACTATTGGCCCACTATCATCCACGACTCAATGTATTATGCAAAACATTGTGAAGAGTGTCAATTCCATGCAAATTTTATACATCAACCACCAGAGCCTCTTCAT
CCAACAATAGCTTCATGGCCTTTTGAAGCTTGGGGACTTGACTTGGTTGGACCGATCACGTCGAAGTCATCGACTGGTCATTCTTACGTTCTTGTGGGAACCGATTATTT
TTCTAGATGGGCTGAAGTTGTACCATTAAGAGAAGCAAAGAAGGAAAACATCGTAAATTTCGTTCGAAAACACATCATTTACCGATATGGTATTCCTCATCGCATCATGA
CTGATAATGGAAGACAATTTGCTAACAGTCTAATGGATAAGTTGTGCGAGAAGCTTAACTTCAAACAGTACAAGTCTTTTATGTACAATGCTGCAGCAAATGGACTGGCA
GAAGCTTTCAACAAAACTCTATGTAATCTTCTGAAGAAGGTGGTCTCCAAGACAAAAAGAGATTGGCAAGAAAAGATAGGAGAAGTATTATGGGCCTATCGAACTACCCA
TCGTACTCCTACTGGTGTTACACCTTATTCTTTAGTTTACGGAGTAGAAGCGGTACTGCCGCTAGAGAGAGAAATTCCATCATTGAGAATGACAATTCAAGAAGGGCTAA
CTACTGAAGACAACGTTAAACTACGCCTTCAAGAGTTAGAAGCACTTAATGAAAAGAGACTAGAAGCTCAACAAGCACTCGAATGTTATCAAGCGCGAATGTCCAAAGCT
TTTGACAAACATGTAAGGCCCCGATCATTTCAGGTTGATGAGTTAGTGCTTGCAATAAGAAGACCTATTATCACGACGAGACATACGGGGAATAAGTTTACACCTAAATG
GGATGGACCCTACATCGTCAAAGAAGTTTTCACAAATTGAGCATACAAAATCATTGATCAAGACGGATTACGAATTGGCCCAATCAACGGCAAATTCCTCAAGAAGTTTT
ATGCTTAATTTAGTTTTA
Protein sequenceShow/hide protein sequence
MSTIPKIDEKGCIFNWDQSCQNAFDSIKNYLLNPPVLSAPVAGKPLILYIAAQEGSLGALLAQENDKDKESKLNYSPIEKMCLALFFAIDKLRHYMQAFTIHLVAKADPI
KYILSRPIISGRLAKWAIILQQYDIVYISQKAVKGQALADFLADHLVPSDWKLCEDLSDEEVLFVESMKSWIMFFDGASRKTGAGVGIVFTSPEKHMLPYSFTLSELCSN
NVAEYQALIIDLQMASEFVINQLSYQYEIKHQDLKPYFTYARRLMDRFDGIILEHIPRSENKKADALANLATTLTILEDVPVNISLSQKWIIPLIKSQHEETDPIIDYLK
HGKLPTELRHRAEIRRRTARFIYYNDTLYRRSYEGLLLQCLGKEESTKALEEAHSGICGAHQSGPKLQHQLKRMGYYWPTIIHDSMYYAKHCEECQFHANFIHQPPEPLH
PTIASWPFEAWGLDLVGPITSKSSTGHSYVLVGTDYFSRWAEVVPLREAKKENIVNFVRKHIIYRYGIPHRIMTDNGRQFANSLMDKLCEKLNFKQYKSFMYNAAANGLA
EAFNKTLCNLLKKVVSKTKRDWQEKIGEVLWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMTIQEGLTTEDNVKLRLQELEALNEKRLEAQQALECYQARMSKA
FDKHVRPRSFQVDELVLAIRRPIITTRHTGNKFTPKWDGPYIVKEVFTN