CuGenDBv2

Pay0012059 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0012059
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Ribonuclease H
Genome location: chr09:14767549..14769102
RNA-Seq Expression: Pay0012059
Synteny: Pay0012059
Gene Ontology terms:
GO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
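
The genome location above is given as a chromosome name followed by a start..end pair. As a minimal sketch, the string can be split into its parts and the genomic span computed. The function names `parse_location` and `span_length` are illustrative, and the 1-based inclusive coordinate convention is an assumption that this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr09:14767549..14769102")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1554 bp on chr09.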


Homology
GenBank top hits (e value, % identity, alignment)
KAA0031752.1 uncharacterized protein E6C27_scaffold506G00150 [Cucumis melo var. makuwa] (e value: 9.9e-225; % identity: 79.3)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM
        M FGLKNA ATYQRAMQ+IFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRL+KYQLRMNPLKCAFGV SGKFLGFIV HRGIEVDHSKIDAIQKM
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM

Query:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------
        PS            RLAYIRRFISNLAGRC PFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLS PA GKPLIL                       
Subjt:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------

Query:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
                                                                 PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
Subjt:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS

Query:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ
        NWKLCDDLPDEEVLFVESMEPWIMFF+GAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGI CIEIFGDSKLIINQLSYQ
Subjt:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ

Query:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG
        YEVKHQDLKPYFSYARRLMDRFDSI+LEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEA VISVYAIDEEDWRQPII+YLEHG
Subjt:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG

Query:  KLPTDPRHRAEIRRRAA
        KLPTDPRHRAEIRRRAA
Subjt:  KLPTDPRHRAEIRRRAA

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa] (e value: 1.3e-224; % identity: 79.11)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM
        M FGLKNA ATYQRAMQ+IFDDMLHKH+ECYVDDLVVKSKKKCDHLKDLKLVLDRL+KYQLRMNPLKCAFGV SGKFLGFIV HRGIEVDHSKIDAIQKM
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM

Query:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------
        PS            RLAYIRRFISNLAGRC PFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLS PA GKPLIL                       
Subjt:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------

Query:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
                                                                 PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
Subjt:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS

Query:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ
        NWKLCDDLPDEEVLFVESMEPWIMFF+GAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGI CIEIFGDSKLIINQLSYQ
Subjt:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ

Query:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG
        YEVKHQDLKPYFSYARRLMDRFDSI+LEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEA VISVYAIDEEDWRQPII+YLEHG
Subjt:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG

Query:  KLPTDPRHRAEIRRRAA
        KLPTDPRHRAEIRRRAA
Subjt:  KLPTDPRHRAEIRRRAA

TYK02262.1 uncharacterized protein E5676_scaffold18G00630 [Cucumis melo var. makuwa] (e value: 9.9e-225; % identity: 79.3)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM
        M FGLKNA ATYQRAMQ+IFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRL+KYQLRMNPLKCAFGV SGKFLGFIV HRGIEVDHSKIDAIQKM
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM

Query:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------
        PS            RLAYIRRFISNLAGRC PFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLS PA GKPLIL                       
Subjt:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------

Query:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
                                                                 PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
Subjt:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS

Query:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ
        NWKLCDDLPDEEVLFVESMEPWIMFF+GAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGI CIEIFGDSKLIINQLSYQ
Subjt:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ

Query:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG
        YEVKHQDLKPYFSYARRLMDRFDSI+LEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEA VISVYAIDEEDWRQPII+YLEHG
Subjt:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG

Query:  KLPTDPRHRAEIRRRAA
        KLPTDPRHRAEIRRRAA
Subjt:  KLPTDPRHRAEIRRRAA

TYK16275.1 uncharacterized protein E5676_scaffold21G00440 [Cucumis melo var. makuwa] (e value: 7.6e-225; % identity: 79.11)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM
        M FGLKNA ATYQRAMQ+IFDDMLHKH+ECYVDDLVVKSKKKCDHLKDLKLVLDRL+KYQLRMNPLKCAFGV SGKFLGFIV HRGIEVDHSKIDAIQKM
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM

Query:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------
        PS            RLAYIRRFISNLAGRC PFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLS PA GKPLIL                       
Subjt:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------

Query:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
                                                                 PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
Subjt:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS

Query:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ
        NWKLCDDLPDEEVLFVESMEPWIMFF+GAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGI CIEIFGDSKLIINQLSYQ
Subjt:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ

Query:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG
        YEVKHQDLKPYFSYARRLMDRFDS +LEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPII+YLEHG
Subjt:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG

Query:  KLPTDPRHRAEIRRRAA
        KLPTDPRHRAEIRRRAA
Subjt:  KLPTDPRHRAEIRRRAA

TYK18071.1 uncharacterized protein E5676_scaffold306G004020 [Cucumis melo var. makuwa] (e value: 1.3e-224; % identity: 79.11)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM
        M FGLKNA ATYQRAMQ+IFDDMLHKH+ECYVDDLVVKSKKKCDHLKDLKLVLDRL+KYQLRMNPLKCAFGV SGKFLGFIV HRGIEVDHSKIDAIQKM
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM

Query:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------
        PS            RLAYIRRFISNLAGRC PFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLS PA GKPLIL                       
Subjt:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------

Query:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
                                                                 PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
Subjt:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS

Query:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ
        NWKLCDDLPDEEVLFVESMEPWIMFF+GAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGI CIEIFGDSKLIINQLSYQ
Subjt:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ

Query:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG
        YEVKHQDLKPYFSYARRLMDRFDSI+LEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEA VISVYAIDEEDWRQPII+YLEHG
Subjt:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG

Query:  KLPTDPRHRAEIRRRAA
        KLPTDPRHRAEIRRRAA
Subjt:  KLPTDPRHRAEIRRRAA

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SKZ3 Ribonuclease H (e value: 4.8e-225; % identity: 79.3)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM
        M FGLKNA ATYQRAMQ+IFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRL+KYQLRMNPLKCAFGV SGKFLGFIV HRGIEVDHSKIDAIQKM
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM

Query:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------
        PS            RLAYIRRFISNLAGRC PFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLS PA GKPLIL                       
Subjt:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------

Query:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
                                                                 PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
Subjt:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS

Query:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ
        NWKLCDDLPDEEVLFVESMEPWIMFF+GAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGI CIEIFGDSKLIINQLSYQ
Subjt:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ

Query:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG
        YEVKHQDLKPYFSYARRLMDRFDSI+LEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEA VISVYAIDEEDWRQPII+YLEHG
Subjt:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG

Query:  KLPTDPRHRAEIRRRAA
        KLPTDPRHRAEIRRRAA
Subjt:  KLPTDPRHRAEIRRRAA

A0A5A7TZU9 Ribonuclease H (e value: 6.3e-225; % identity: 79.11)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM
        M FGLKNA ATYQRAMQ+IFDDMLHKH+ECYVDDLVVKSKKKCDHLKDLKLVLDRL+KYQLRMNPLKCAFGV SGKFLGFIV HRGIEVDHSKIDAIQKM
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM

Query:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------
        PS            RLAYIRRFISNLAGRC PFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLS PA GKPLIL                       
Subjt:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------

Query:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
                                                                 PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
Subjt:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS

Query:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ
        NWKLCDDLPDEEVLFVESMEPWIMFF+GAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGI CIEIFGDSKLIINQLSYQ
Subjt:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ

Query:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG
        YEVKHQDLKPYFSYARRLMDRFDSI+LEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEA VISVYAIDEEDWRQPII+YLEHG
Subjt:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG

Query:  KLPTDPRHRAEIRRRAA
        KLPTDPRHRAEIRRRAA
Subjt:  KLPTDPRHRAEIRRRAA

A0A5D3BTY1 Ribonuclease H (e value: 4.8e-225; % identity: 79.3)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM
        M FGLKNA ATYQRAMQ+IFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRL+KYQLRMNPLKCAFGV SGKFLGFIV HRGIEVDHSKIDAIQKM
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM

Query:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------
        PS            RLAYIRRFISNLAGRC PFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLS PA GKPLIL                       
Subjt:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------

Query:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
                                                                 PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
Subjt:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS

Query:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ
        NWKLCDDLPDEEVLFVESMEPWIMFF+GAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGI CIEIFGDSKLIINQLSYQ
Subjt:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ

Query:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG
        YEVKHQDLKPYFSYARRLMDRFDSI+LEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEA VISVYAIDEEDWRQPII+YLEHG
Subjt:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG

Query:  KLPTDPRHRAEIRRRAA
        KLPTDPRHRAEIRRRAA
Subjt:  KLPTDPRHRAEIRRRAA

A0A5D3CXS1 Uncharacterized protein (e value: 3.7e-225; % identity: 79.11)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM
        M FGLKNA ATYQRAMQ+IFDDMLHKH+ECYVDDLVVKSKKKCDHLKDLKLVLDRL+KYQLRMNPLKCAFGV SGKFLGFIV HRGIEVDHSKIDAIQKM
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM

Query:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------
        PS            RLAYIRRFISNLAGRC PFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLS PA GKPLIL                       
Subjt:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------

Query:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
                                                                 PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
Subjt:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS

Query:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ
        NWKLCDDLPDEEVLFVESMEPWIMFF+GAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGI CIEIFGDSKLIINQLSYQ
Subjt:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ

Query:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG
        YEVKHQDLKPYFSYARRLMDRFDS +LEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPII+YLEHG
Subjt:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG

Query:  KLPTDPRHRAEIRRRAA
        KLPTDPRHRAEIRRRAA
Subjt:  KLPTDPRHRAEIRRRAA

A0A5D3D1E5 Ribonuclease H (e value: 6.3e-225; % identity: 79.11)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM
        M FGLKNA ATYQRAMQ+IFDDMLHKH+ECYVDDLVVKSKKKCDHLKDLKLVLDRL+KYQLRMNPLKCAFGV SGKFLGFIV HRGIEVDHSKIDAIQKM
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM

Query:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------
        PS            RLAYIRRFISNLAGRC PFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLS PA GKPLIL                       
Subjt:  PS------------RLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLIL-----------------------

Query:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
                                                                 PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS
Subjt:  ---------------------------------------------------------PVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPS

Query:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ
        NWKLCDDLPDEEVLFVESMEPWIMFF+GAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGI CIEIFGDSKLIINQLSYQ
Subjt:  NWKLCDDLPDEEVLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQ

Query:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG
        YEVKHQDLKPYFSYARRLMDRFDSI+LEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEA VISVYAIDEEDWRQPII+YLEHG
Subjt:  YEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHG

Query:  KLPTDPRHRAEIRRRAA
        KLPTDPRHRAEIRRRAA
Subjt:  KLPTDPRHRAEIRRRAA

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 4.8e-20; % identity: 33.68)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM
        M FGLKNA AT+QR M  I   +L+KH   Y+DD++V S    +HL+ L LV ++L K  L++   KC F      FLG ++T  GI+ +  KI+AIQK 
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM

Query:  P------------SRLAYIRRFISNLAGRCLPFQRLMRKDAVFD-WDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLILPVISGRLAKWAIILQ
        P                Y R+FI N A    P  + ++K+   D  +    +AF  +K  +    +L VP   K   L   +  +A  A++ Q
Subjt:  P------------SRLAYIRRFISNLAGRCLPFQRLMRKDAVFD-WDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLILPVISGRLAKWAIILQ

P0CT34 Transposon Tf2-1 polyprotein (e value: 9.0e-19; % identity: 28.21)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAI--Q
        M +G+  A A +Q  +  I  +    H+ CY+DD+++ SK + +H+K +K VL +LK   L +N  KC F     KF+G+ ++ +G       ID +   
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAI--Q

Query:  KMPSR----------LAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLILPVISGRLAKWAIILQQYD
        K P            + Y+R+FI   +    P   L++KD  + W  +   A ++IK+ L++P VL      K ++L   +  +A  A++ Q++D
Subjt:  KMPSR----------LAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLILPVISGRLAKWAIILQQYD

P0CT35 Transposon Tf2-2 polyprotein (e value: 9.0e-19; % identity: 28.21)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAI--Q
        M +G+  A A +Q  +  I  +    H+ CY+DD+++ SK + +H+K +K VL +LK   L +N  KC F     KF+G+ ++ +G       ID +   
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAI--Q

Query:  KMPSR----------LAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLILPVISGRLAKWAIILQQYD
        K P            + Y+R+FI   +    P   L++KD  + W  +   A ++IK+ L++P VL      K ++L   +  +A  A++ Q++D
Subjt:  KMPSR----------LAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLILPVISGRLAKWAIILQQYD

P0CT41 Transposon Tf2-12 polyprotein (e value: 9.0e-19; % identity: 28.21)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAI--Q
        M +G+  A A +Q  +  I  +    H+ CY+DD+++ SK + +H+K +K VL +LK   L +N  KC F     KF+G+ ++ +G       ID +   
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAI--Q

Query:  KMPSR----------LAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLILPVISGRLAKWAIILQQYD
        K P            + Y+R+FI   +    P   L++KD  + W  +   A ++IK+ L++P VL      K ++L   +  +A  A++ Q++D
Subjt:  KMPSR----------LAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLILPVISGRLAKWAIILQQYD

P10394 Retrovirus-related Pol polyprotein from transposon 412 (e value: 9.6e-21; % identity: 30.99)
Query:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM
        + FGLK A  ++QR M   F  +       Y+DDL+V    +   LK+L  V  + ++Y L+++P KC+F +    FLG   T +GI  D  K D IQ  
Subjt:  MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKM

Query:  P------------SRLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLILPVISGRLAKWAIILQ-----QYD
        P            +   Y RRFI N A       RL +K+  F+W   CQ AF  +K  L+NP++L  P   K   +   + + A  A++ Q     Q  
Subjt:  P------------SRLAYIRRFISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLILPVISGRLAKWAIILQ-----QYD

Query:  IVYIPQKAVKGQA
        + Y  +   KG++
Subjt:  IVYIPQKAVKGQA

Arabidopsis top hits (e value, % identity, alignment)
AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 6.4e-12; % identity: 32)
Query:  FNGAARRS--GAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQYEVKHQDLKPYFSYARRLMDRFD
        F+GA++ +   AG G V  + +  +L Y        +NNVAEY+A ++GL+ A + G   + + GDS L+  Q+   ++  H  +      A+ LM+ F 
Subjt:  FNGAARRS--GAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQYEVKHQDLKPYFSYARRLMDRFD

Query:  SIMLEHIPRSENKKADALANLATAL
        +  ++HI R +N +AD  AN A  L
Subjt:  SIMLEHIPRSENKKADALANLATAL

AT3G01410.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 6.4e-12; % identity: 32)
Query:  FNGAARRS--GAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQYEVKHQDLKPYFSYARRLMDRFD
        F+GA++ +   AG G V  + +  +L Y        +NNVAEY+A ++GL+ A + G   + + GDS L+  Q+   ++  H  +      A+ LM+ F 
Subjt:  FNGAARRS--GAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQYEVKHQDLKPYFSYARRLMDRFD

Query:  SIMLEHIPRSENKKADALANLATAL
        +  ++HI R +N +AD  AN A  L
Subjt:  SIMLEHIPRSENKKADALANLATAL

AT5G51080.1 RNase H family protein (e value: 8.9e-14; % identity: 31.74)
Query:  ALADFLADHPVPSNWKLCDDLPDEEVLFVESMEPWIMFFNGAAR-RSGAGVGIVFISPEKHMLPYSFTLG-ELCSNNVAEYQAFIIGLQMASEFGINCIE
        AL   L    +PS     + L + E     S E  I+ F+GA++   G       +  E   L +    G  + +NN AEY   I+GL+ A E G   I+
Subjt:  ALADFLADHPVPSNWKLCDDLPDEEVLFVESMEPWIMFFNGAAR-RSGAGVGIVFISPEKHMLPYSFTLG-ELCSNNVAEYQAFIIGLQMASEFGINCIE

Query:  IFGDSKLIINQLSYQYEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSE
        +  DSKL+  Q+  Q++V H+ L      A++L D+  S  + H+ RS N  AD  AN+A  L+  E
Subjt:  IFGDSKLIINQLSYQYEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSE

AT5G51080.2 RNase H family protein (e value: 8.9e-14; % identity: 31.74)
Query:  ALADFLADHPVPSNWKLCDDLPDEEVLFVESMEPWIMFFNGAAR-RSGAGVGIVFISPEKHMLPYSFTLG-ELCSNNVAEYQAFIIGLQMASEFGINCIE
        AL   L    +PS     + L + E     S E  I+ F+GA++   G       +  E   L +    G  + +NN AEY   I+GL+ A E G   I+
Subjt:  ALADFLADHPVPSNWKLCDDLPDEEVLFVESMEPWIMFFNGAAR-RSGAGVGIVFISPEKHMLPYSFTLG-ELCSNNVAEYQAFIIGLQMASEFGINCIE

Query:  IFGDSKLIINQLSYQYEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSE
        +  DSKL+  Q+  Q++V H+ L      A++L D+  S  + H+ RS N  AD  AN+A  L+  E
Subjt:  IFGDSKLIINQLSYQYEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSE

AT5G51080.3 RNase H family protein (e value: 8.9e-14; % identity: 31.74)
Query:  ALADFLADHPVPSNWKLCDDLPDEEVLFVESMEPWIMFFNGAAR-RSGAGVGIVFISPEKHMLPYSFTLG-ELCSNNVAEYQAFIIGLQMASEFGINCIE
        AL   L    +PS     + L + E     S E  I+ F+GA++   G       +  E   L +    G  + +NN AEY   I+GL+ A E G   I+
Subjt:  ALADFLADHPVPSNWKLCDDLPDEEVLFVESMEPWIMFFNGAAR-RSGAGVGIVFISPEKHMLPYSFTLG-ELCSNNVAEYQAFIIGLQMASEFGINCIE

Query:  IFGDSKLIINQLSYQYEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSE
        +  DSKL+  Q+  Q++V H+ L      A++L D+  S  + H+ RS N  AD  AN+A  L+  E
Subjt:  IFGDSKLIINQLSYQYEVKHQDLKPYFSYARRLMDRFDSIMLEHIPRSENKKADALANLATALTVSE


Sequences
CDS sequence
ATGTCTTTCGGATTGAAAAATGCAAGTGCTACATACCAGCGCGCTATGCAAAAGATCTTTGATGATATGCTGCATAAACACATTGAATGTTATGTTGACGATCTCGTAGT
CAAGTCAAAGAAGAAATGTGATCATTTGAAAGACCTGAAGCTGGTACTTGATCGCCTAAAAAAATATCAACTAAGAATGAACCCTCTCAAGTGTGCATTTGGTGTAATTT
CAGGGAAGTTTTTGGGATTTATAGTGACACATCGCGGCATCGAAGTTGATCACTCAAAAATTGATGCTATCCAAAAGATGCCAAGTCGTTTGGCTTACATTAGAAGGTTT
ATATCTAATCTTGCAGGTCGATGTCTACCATTCCAGAGACTAATGAGGAAGGATGCAGTCTTTGATTGGGACCAGTCATGCCAAAATGCATTTGATAGCATAAAGAAGTA
TCTGCTCAACCCTTCGGTCTTAAGTGTGCCTGCAGCTGGAAAACCATTAATATTGCCAGTCATCTCGGGACGCCTCGCGAAGTGGGCTATTATACTCCAACAATATGATA
TTGTATATATCCCCCAAAAAGCAGTGAAGGGCCAAGCATTGGCAGATTTCCTGGCTGATCATCCAGTTCCATCAAATTGGAAGTTATGTGACGACTTACCTGATGAGGAA
GTATTGTTTGTTGAAAGCATGGAGCCTTGGATCATGTTCTTTAATGGTGCGGCACGAAGAAGTGGAGCTGGTGTTGGCATTGTCTTCATTTCTCCTGAGAAACATATGTT
GCCATATAGCTTCACACTCGGTGAATTGTGTTCAAATAACGTTGCCGAGTACCAAGCCTTTATTATCGGCCTCCAAATGGCTTCAGAATTTGGGATAAACTGCATAGAAA
TATTCGGCGATTCGAAGTTAATCATAAATCAGCTCTCTTATCAGTACGAGGTGAAGCATCAAGACTTGAAGCCTTACTTTAGTTATGCTAGAAGATTGATGGACAGATTC
GACAGCATAATGTTGGAGCATATACCGAGATCAGAAAATAAGAAAGCTGATGCACTTGCAAATTTGGCCACTGCTTTAACAGTCTCGGAAGATATACCAATAAACATTTC
CCTTTGCCAAAAATGGATTGTGCCTTCAATTGAAAGTCAATATGAAGAAGCTGGTGTGATATCTGTATATGCAATTGATGAAGAAGATTGGCGCCAACCCATTATAAACT
ATTTGGAGCATGGAAAACTTCCCACCGATCCTCGACATAGAGCTGAAATACGTAGAAGAGCTGCGTGA
mRNA sequence
ATGTCTTTCGGATTGAAAAATGCAAGTGCTACATACCAGCGCGCTATGCAAAAGATCTTTGATGATATGCTGCATAAACACATTGAATGTTATGTTGACGATCTCGTAGT
CAAGTCAAAGAAGAAATGTGATCATTTGAAAGACCTGAAGCTGGTACTTGATCGCCTAAAAAAATATCAACTAAGAATGAACCCTCTCAAGTGTGCATTTGGTGTAATTT
CAGGGAAGTTTTTGGGATTTATAGTGACACATCGCGGCATCGAAGTTGATCACTCAAAAATTGATGCTATCCAAAAGATGCCAAGTCGTTTGGCTTACATTAGAAGGTTT
ATATCTAATCTTGCAGGTCGATGTCTACCATTCCAGAGACTAATGAGGAAGGATGCAGTCTTTGATTGGGACCAGTCATGCCAAAATGCATTTGATAGCATAAAGAAGTA
TCTGCTCAACCCTTCGGTCTTAAGTGTGCCTGCAGCTGGAAAACCATTAATATTGCCAGTCATCTCGGGACGCCTCGCGAAGTGGGCTATTATACTCCAACAATATGATA
TTGTATATATCCCCCAAAAAGCAGTGAAGGGCCAAGCATTGGCAGATTTCCTGGCTGATCATCCAGTTCCATCAAATTGGAAGTTATGTGACGACTTACCTGATGAGGAA
GTATTGTTTGTTGAAAGCATGGAGCCTTGGATCATGTTCTTTAATGGTGCGGCACGAAGAAGTGGAGCTGGTGTTGGCATTGTCTTCATTTCTCCTGAGAAACATATGTT
GCCATATAGCTTCACACTCGGTGAATTGTGTTCAAATAACGTTGCCGAGTACCAAGCCTTTATTATCGGCCTCCAAATGGCTTCAGAATTTGGGATAAACTGCATAGAAA
TATTCGGCGATTCGAAGTTAATCATAAATCAGCTCTCTTATCAGTACGAGGTGAAGCATCAAGACTTGAAGCCTTACTTTAGTTATGCTAGAAGATTGATGGACAGATTC
GACAGCATAATGTTGGAGCATATACCGAGATCAGAAAATAAGAAAGCTGATGCACTTGCAAATTTGGCCACTGCTTTAACAGTCTCGGAAGATATACCAATAAACATTTC
CCTTTGCCAAAAATGGATTGTGCCTTCAATTGAAAGTCAATATGAAGAAGCTGGTGTGATATCTGTATATGCAATTGATGAAGAAGATTGGCGCCAACCCATTATAAACT
ATTTGGAGCATGGAAAACTTCCCACCGATCCTCGACATAGAGCTGAAATACGTAGAAGAGCTGCGTGA
Protein sequence
MSFGLKNASATYQRAMQKIFDDMLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLKKYQLRMNPLKCAFGVISGKFLGFIVTHRGIEVDHSKIDAIQKMPSRLAYIRRF
ISNLAGRCLPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSVPAAGKPLILPVISGRLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKLCDDLPDEE
VLFVESMEPWIMFFNGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGINCIEIFGDSKLIINQLSYQYEVKHQDLKPYFSYARRLMDRF
DSIMLEHIPRSENKKADALANLATALTVSEDIPINISLCQKWIVPSIESQYEEAGVISVYAIDEEDWRQPIINYLEHGKLPTDPRHRAEIRRRAA
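
The protein sequence above should correspond to a translation of the CDS. The following is a minimal sketch of how that correspondence could be checked in plain Python. It assumes the standard genetic code (the record does not say which translation table was used), and the names `CODON_TABLE` and `translate` are illustrative rather than part of any database tool.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
# ASSUMPTION: the record uses the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace and dropping one trailing stop."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First eight codons and last three codons of the CDS listed above.
print(translate("ATGTCTTTCGGATTGAAAAATGCA"))
print(translate("GCTGCGTGA"))
```

The two fragments translate to the start (MSFGLKNA) and end (AA) of the listed protein. Passing the full CDS, joined across its lines, to `translate` would allow the whole protein sequence to be compared in the same way.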