; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0022637 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0022637
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRibonuclease H
Genome locationchr02:14960554..14962021
RNA-Seq ExpressionPay0022637
SyntenyPay0022637
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031752.1 uncharacterized protein E6C27_scaffold506G00150 [Cucumis melo var. makuwa]2.0e-23492.43Show/hide
Query:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
        MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRH  IEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
Subjt:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF

Query:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
        ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLSAPA  K LILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
Subjt:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM

Query:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW
        CLALF AIDKLRHYMQAFTIHLVAK +PVKYILSR VISG LAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL               EPW
Subjt:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW

Query:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF
        IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQ FIIGLQM  EFGIKCIEIFGDSKLIIN LSYQYEVKHQDLKPYFSYARRLMD+F
Subjt:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF

Query:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES
        DSIIL+HI RSENKKADALANLATALTVS+DIPINISLCQKWIVPSIES
Subjt:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]2.6e-23492.2Show/hide
Query:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
        MLHKH+ECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRH  IEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
Subjt:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF

Query:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
        ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLSAPA  K LILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
Subjt:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM

Query:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW
        CLALF AIDKLRHYMQAFTIHLVAK +PVKYILSR VISG LAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL               EPW
Subjt:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW

Query:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF
        IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQ FIIGLQM  EFGIKCIEIFGDSKLIIN LSYQYEVKHQDLKPYFSYARRLMD+F
Subjt:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF

Query:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES
        DSIIL+HI RSENKKADALANLATALTVS+DIPINISLCQKWIVPSIES
Subjt:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES

TYK02262.1 uncharacterized protein E5676_scaffold18G00630 [Cucumis melo var. makuwa]2.0e-23492.43Show/hide
Query:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
        MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRH  IEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
Subjt:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF

Query:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
        ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLSAPA  K LILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
Subjt:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM

Query:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW
        CLALF AIDKLRHYMQAFTIHLVAK +PVKYILSR VISG LAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL               EPW
Subjt:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW

Query:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF
        IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQ FIIGLQM  EFGIKCIEIFGDSKLIIN LSYQYEVKHQDLKPYFSYARRLMD+F
Subjt:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF

Query:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES
        DSIIL+HI RSENKKADALANLATALTVS+DIPINISLCQKWIVPSIES
Subjt:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES

TYK16275.1 uncharacterized protein E5676_scaffold21G00440 [Cucumis melo var. makuwa]1.0e-23391.98Show/hide
Query:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
        MLHKH+ECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRH  IEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
Subjt:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF

Query:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
        ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLSAPA  K LILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
Subjt:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM

Query:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW
        CLALF AIDKLRHYMQAFTIHLVAK +PVKYILSR VISG LAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL               EPW
Subjt:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW

Query:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF
        IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQ FIIGLQM  EFGIKCIEIFGDSKLIIN LSYQYEVKHQDLKPYFSYARRLMD+F
Subjt:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF

Query:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES
        DS IL+HI RSENKKADALANLATALTVS+DIPINISLCQKWIVPSIES
Subjt:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES

TYK18071.1 uncharacterized protein E5676_scaffold306G004020 [Cucumis melo var. makuwa]2.6e-23492.2Show/hide
Query:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
        MLHKH+ECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRH  IEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
Subjt:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF

Query:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
        ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLSAPA  K LILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
Subjt:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM

Query:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW
        CLALF AIDKLRHYMQAFTIHLVAK +PVKYILSR VISG LAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL               EPW
Subjt:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW

Query:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF
        IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQ FIIGLQM  EFGIKCIEIFGDSKLIIN LSYQYEVKHQDLKPYFSYARRLMD+F
Subjt:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF

Query:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES
        DSIIL+HI RSENKKADALANLATALTVS+DIPINISLCQKWIVPSIES
Subjt:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES

TrEMBL top hitse value%identityAlignment
A0A5A7SKZ3 Ribonuclease H9.8e-23592.43Show/hide
Query:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
        MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRH  IEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
Subjt:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF

Query:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
        ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLSAPA  K LILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
Subjt:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM

Query:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW
        CLALF AIDKLRHYMQAFTIHLVAK +PVKYILSR VISG LAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL               EPW
Subjt:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW

Query:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF
        IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQ FIIGLQM  EFGIKCIEIFGDSKLIIN LSYQYEVKHQDLKPYFSYARRLMD+F
Subjt:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF

Query:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES
        DSIIL+HI RSENKKADALANLATALTVS+DIPINISLCQKWIVPSIES
Subjt:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES

A0A5A7TZU9 Ribonuclease H1.3e-23492.2Show/hide
Query:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
        MLHKH+ECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRH  IEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
Subjt:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF

Query:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
        ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLSAPA  K LILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
Subjt:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM

Query:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW
        CLALF AIDKLRHYMQAFTIHLVAK +PVKYILSR VISG LAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL               EPW
Subjt:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW

Query:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF
        IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQ FIIGLQM  EFGIKCIEIFGDSKLIIN LSYQYEVKHQDLKPYFSYARRLMD+F
Subjt:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF

Query:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES
        DSIIL+HI RSENKKADALANLATALTVS+DIPINISLCQKWIVPSIES
Subjt:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES

A0A5D3BTY1 Ribonuclease H9.8e-23592.43Show/hide
Query:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
        MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRH  IEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
Subjt:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF

Query:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
        ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLSAPA  K LILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
Subjt:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM

Query:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW
        CLALF AIDKLRHYMQAFTIHLVAK +PVKYILSR VISG LAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL               EPW
Subjt:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW

Query:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF
        IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQ FIIGLQM  EFGIKCIEIFGDSKLIIN LSYQYEVKHQDLKPYFSYARRLMD+F
Subjt:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF

Query:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES
        DSIIL+HI RSENKKADALANLATALTVS+DIPINISLCQKWIVPSIES
Subjt:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES

A0A5D3CXS1 Uncharacterized protein4.9e-23491.98Show/hide
Query:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
        MLHKH+ECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRH  IEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
Subjt:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF

Query:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
        ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLSAPA  K LILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
Subjt:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM

Query:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW
        CLALF AIDKLRHYMQAFTIHLVAK +PVKYILSR VISG LAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL               EPW
Subjt:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW

Query:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF
        IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQ FIIGLQM  EFGIKCIEIFGDSKLIIN LSYQYEVKHQDLKPYFSYARRLMD+F
Subjt:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF

Query:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES
        DS IL+HI RSENKKADALANLATALTVS+DIPINISLCQKWIVPSIES
Subjt:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES

A0A5D3D1E5 Ribonuclease H1.3e-23492.2Show/hide
Query:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
        MLHKH+ECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRH  IEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF
Subjt:  MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRF

Query:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
        ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNP VLSAPA  K LILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM
Subjt:  ISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKM

Query:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW
        CLALF AIDKLRHYMQAFTIHLVAK +PVKYILSR VISG LAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL               EPW
Subjt:  CLALFLAIDKLRHYMQAFTIHLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKL--------------SEPW

Query:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF
        IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQ FIIGLQM  EFGIKCIEIFGDSKLIIN LSYQYEVKHQDLKPYFSYARRLMD+F
Subjt:  IMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKF

Query:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES
        DSIIL+HI RSENKKADALANLATALTVS+DIPINISLCQKWIVPSIES
Subjt:  DSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIES

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.9e-3131.44Show/hide
Query:  HIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRFISNL
        H+ CY+DD+++ SK + +H+K +K VL +L+   L +N  KC F  +  KF+G+ +           ID + +   PKN  ELR+  G + Y+R+FI   
Subjt:  HIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRFISNL

Query:  AGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKMCLAL
        +    P   L++KD  + W  +   A ++IK+ L++P VL      K ++L   A + ++GA+L+Q++D  K   + Y S  ++ A+LNYS  +K  LA+
Subjt:  AGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKMCLAL

Query:  FLAIDKLRHYMQA----FTIHLVAKTNPVKYILSRSVISG-HLAKWAIILQ--QYDIVYIPQKA
          ++   RHY+++    F I L    N +  I + S      LA+W + LQ   ++I Y P  A
Subjt:  FLAIDKLRHYMQA----FTIHLVAKTNPVKYILSRSVISG-HLAKWAIILQ--QYDIVYIPQKA

P0CT35 Transposon Tf2-2 polyprotein3.9e-3131.44Show/hide
Query:  HIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRFISNL
        H+ CY+DD+++ SK + +H+K +K VL +L+   L +N  KC F  +  KF+G+ +           ID + +   PKN  ELR+  G + Y+R+FI   
Subjt:  HIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRFISNL

Query:  AGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKMCLAL
        +    P   L++KD  + W  +   A ++IK+ L++P VL      K ++L   A + ++GA+L+Q++D  K   + Y S  ++ A+LNYS  +K  LA+
Subjt:  AGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKMCLAL

Query:  FLAIDKLRHYMQA----FTIHLVAKTNPVKYILSRSVISG-HLAKWAIILQ--QYDIVYIPQKA
          ++   RHY+++    F I L    N +  I + S      LA+W + LQ   ++I Y P  A
Subjt:  FLAIDKLRHYMQA----FTIHLVAKTNPVKYILSRSVISG-HLAKWAIILQ--QYDIVYIPQKA

P0CT36 Transposon Tf2-3 polyprotein3.9e-3131.44Show/hide
Query:  HIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRFISNL
        H+ CY+DD+++ SK + +H+K +K VL +L+   L +N  KC F  +  KF+G+ +           ID + +   PKN  ELR+  G + Y+R+FI   
Subjt:  HIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRFISNL

Query:  AGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKMCLAL
        +    P   L++KD  + W  +   A ++IK+ L++P VL      K ++L   A + ++GA+L+Q++D  K   + Y S  ++ A+LNYS  +K  LA+
Subjt:  AGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKMCLAL

Query:  FLAIDKLRHYMQA----FTIHLVAKTNPVKYILSRSVISG-HLAKWAIILQ--QYDIVYIPQKA
          ++   RHY+++    F I L    N +  I + S      LA+W + LQ   ++I Y P  A
Subjt:  FLAIDKLRHYMQA----FTIHLVAKTNPVKYILSRSVISG-HLAKWAIILQ--QYDIVYIPQKA

P0CT37 Transposon Tf2-4 polyprotein3.9e-3131.44Show/hide
Query:  HIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRFISNL
        H+ CY+DD+++ SK + +H+K +K VL +L+   L +N  KC F  +  KF+G+ +           ID + +   PKN  ELR+  G + Y+R+FI   
Subjt:  HIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRFISNL

Query:  AGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKMCLAL
        +    P   L++KD  + W  +   A ++IK+ L++P VL      K ++L   A + ++GA+L+Q++D  K   + Y S  ++ A+LNYS  +K  LA+
Subjt:  AGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKMCLAL

Query:  FLAIDKLRHYMQA----FTIHLVAKTNPVKYILSRSVISG-HLAKWAIILQ--QYDIVYIPQKA
          ++   RHY+++    F I L    N +  I + S      LA+W + LQ   ++I Y P  A
Subjt:  FLAIDKLRHYMQA----FTIHLVAKTNPVKYILSRSVISG-HLAKWAIILQ--QYDIVYIPQKA

P0CT41 Transposon Tf2-12 polyprotein3.9e-3131.44Show/hide
Query:  HIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRFISNL
        H+ CY+DD+++ SK + +H+K +K VL +L+   L +N  KC F  +  KF+G+ +           ID + +   PKN  ELR+  G + Y+R+FI   
Subjt:  HIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRFISNL

Query:  AGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKMCLAL
        +    P   L++KD  + W  +   A ++IK+ L++P VL      K ++L   A + ++GA+L+Q++D  K   + Y S  ++ A+LNYS  +K  LA+
Subjt:  AGRCQPFQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKMCLAL

Query:  FLAIDKLRHYMQA----FTIHLVAKTNPVKYILSRSVISG-HLAKWAIILQ--QYDIVYIPQKA
          ++   RHY+++    F I L    N +  I + S      LA+W + LQ   ++I Y P  A
Subjt:  FLAIDKLRHYMQA----FTIHLVAKTNPVKYILSRSVISG-HLAKWAIILQ--QYDIVYIPQKA

Arabidopsis top hitse value%identityAlignment
AT1G24090.1 RNase H family protein1.1e-1234.06Show/hide
Query:  PSNWKLSEPWIMFFDGAAR-RSGAGVGIVFISPEKHMLPYSFTLG-ELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKP
        PS +   E   + FDGA++   G       +  E   L      G  + +NN AEY   I+GL+  +E G K I++ GDSKL+   +  Q++V H+ L  
Subjt:  PSNWKLSEPWIMFFDGAAR-RSGAGVGIVFISPEKHMLPYSFTLG-ELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKP

Query:  YFSYARRLMDKFDSIILKHILRSENKKADALANLATAL
            A+ L +K  S  + H+LR+ N  AD  ANLA  L
Subjt:  YFSYARRLMDKFDSIILKHILRSENKKADALANLATAL

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.4e-1232.8Show/hide
Query:  FDGAARRS--GAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKFD
        FDGA++ +   AG G V  + +  +L Y        +NNVAEY+  ++GL+  L+ G K + + GDS L+   +   ++  H  +      A+ LM+ F 
Subjt:  FDGAARRS--GAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKFD

Query:  SIILKHILRSENKKADALANLATAL
        +  +KHI R +N +AD  AN A  L
Subjt:  SIILKHILRSENKKADALANLATAL

AT3G01410.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.4e-1232.8Show/hide
Query:  FDGAARRS--GAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKFD
        FDGA++ +   AG G V  + +  +L Y        +NNVAEY+  ++GL+  L+ G K + + GDS L+   +   ++  H  +      A+ LM+ F 
Subjt:  FDGAARRS--GAGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKFD

Query:  SIILKHILRSENKKADALANLATAL
        +  +KHI R +N +AD  AN A  L
Subjt:  SIILKHILRSENKKADALANLATAL

AT5G51080.1 RNase H family protein1.4e-1234.09Show/hide
Query:  EPWIMFFDGAAR-RSGAGVGIVFISPEKHMLPYSFTLG-ELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARR
        E  I+ FDGA++   G       +  E   L +    G  + +NN AEY   I+GL+  +E G   I++  DSKL+   +  Q++V H+ L      A++
Subjt:  EPWIMFFDGAAR-RSGAGVGIVFISPEKHMLPYSFTLG-ELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARR

Query:  LMDKFDSIILKHILRSENKKADALANLATALT
        L DK  S  + H+LRS N  AD  AN+A  L+
Subjt:  LMDKFDSIILKHILRSENKKADALANLATALT

AT5G51080.2 RNase H family protein1.4e-1234.09Show/hide
Query:  EPWIMFFDGAAR-RSGAGVGIVFISPEKHMLPYSFTLG-ELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARR
        E  I+ FDGA++   G       +  E   L +    G  + +NN AEY   I+GL+  +E G   I++  DSKL+   +  Q++V H+ L      A++
Subjt:  EPWIMFFDGAAR-RSGAGVGIVFISPEKHMLPYSFTLG-ELCSNNVAEYQPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARR

Query:  LMDKFDSIILKHILRSENKKADALANLATALT
        L DK  S  + H+LRS N  AD  AN+A  L+
Subjt:  LMDKFDSIILKHILRSENKKADALANLATALT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGCATAAACACATTGAATGTTATGTTGACGATCTCGTAGTCAAGTCCAAGAAGAAATGCGATCACTTGAAAGACCTGAAGCTGGTACTTGATCGCCTCAGAAAATA
TCAACTAAGAATGAACCCTCTCAAGTGTGCATTTGGTGTAACTTCAGGGAAGTTTTTGGGATTTATAGTGAGACATCACGACATCGAAGTTGATCACTCAAAAATTGATG
CTATCCAGAAGATGCCAAGTCCGAAGAACCTGCACGAATTGAGACGATTGCAAGGTCGTTTGGCTTACATTAGAAGGTTTATATCTAATCTTGCAGGTCGATGTCAACCA
TTCCAGAGACTAATGAGGAAGGATGCAGTCTTTGATTGGGACCAGTCATGCCAAAATGCATTTGATAGCATAAAAAAGTATCTGCTCAACCCTTCGGTCTTAAGTGCGCC
TGCAGCTGAAAAATCATTAATATTGTATATTGCGGCTCAAGAGACTTCGCTCGGGGCATTACTTGCACAAGAAAATGATAAGGGTAAGGAATGTGCACTCTATTATCTAA
GTAGAACTCTGACAGGAGCTGAATTAAATTATTCTCCAATTGAGAAAATGTGTCTCGCCCTCTTCCTTGCAATAGATAAATTGAGACATTATATGCAAGCCTTCACTATA
CACTTGGTGGCAAAAACTAATCCTGTCAAATATATCTTATCAAGGTCAGTCATCTCGGGACACCTCGCGAAGTGGGCTATTATACTTCAACAATATGATATTGTATATAT
CCCCCAAAAAGCAGTGAAGGGCCAAGCATTGGCAGATTTCCTGGCTGATCATCCAGTTCCATCAAATTGGAAGTTATCGGAGCCCTGGATCATGTTCTTTGATGGTGCGG
CACGAAGAAGTGGAGCTGGTGTTGGCATTGTCTTCATTTCTCCTGAGAAACATATGTTGCCATATAGCTTCACACTCGGTGAATTGTGTTCAAATAATGTTGCCGAGTAC
CAACCCTTTATTATCGGCCTCCAAATGACTTTAGAATTTGGGATAAAGTGCATAGAAATATTTGGGGATTCGAAGTTAATCATAAATCCGCTCTCTTATCAGTACGAGGT
AAAGCATCAAGACTTGAAGCCTTACTTTAGTTATGCTAGAAGATTGATGGACAAATTTGACAGCATAATATTAAAGCATATACTGAGATCAGAAAATAAGAAAGCTGATG
CACTTGCAAATTTGGCCACTGCTTTAACAGTCTCAAAAGATATACCAATAAACATTTCCCTTTGCCAAAAATGGATTGTGCCTTCAATTGAAAGTCTTCACATCTACAAG
GTTTGCAGCAAAAGAAAAAGGGGGCAAAGGGGGCATAAATCAAAGCCTCCACTTGAAGTTCTTAAATTCTTCTCGTGCAGCTTCCATGCTCTGGCGAACCGTAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTGCATAAACACATTGAATGTTATGTTGACGATCTCGTAGTCAAGTCCAAGAAGAAATGCGATCACTTGAAAGACCTGAAGCTGGTACTTGATCGCCTCAGAAAATA
TCAACTAAGAATGAACCCTCTCAAGTGTGCATTTGGTGTAACTTCAGGGAAGTTTTTGGGATTTATAGTGAGACATCACGACATCGAAGTTGATCACTCAAAAATTGATG
CTATCCAGAAGATGCCAAGTCCGAAGAACCTGCACGAATTGAGACGATTGCAAGGTCGTTTGGCTTACATTAGAAGGTTTATATCTAATCTTGCAGGTCGATGTCAACCA
TTCCAGAGACTAATGAGGAAGGATGCAGTCTTTGATTGGGACCAGTCATGCCAAAATGCATTTGATAGCATAAAAAAGTATCTGCTCAACCCTTCGGTCTTAAGTGCGCC
TGCAGCTGAAAAATCATTAATATTGTATATTGCGGCTCAAGAGACTTCGCTCGGGGCATTACTTGCACAAGAAAATGATAAGGGTAAGGAATGTGCACTCTATTATCTAA
GTAGAACTCTGACAGGAGCTGAATTAAATTATTCTCCAATTGAGAAAATGTGTCTCGCCCTCTTCCTTGCAATAGATAAATTGAGACATTATATGCAAGCCTTCACTATA
CACTTGGTGGCAAAAACTAATCCTGTCAAATATATCTTATCAAGGTCAGTCATCTCGGGACACCTCGCGAAGTGGGCTATTATACTTCAACAATATGATATTGTATATAT
CCCCCAAAAAGCAGTGAAGGGCCAAGCATTGGCAGATTTCCTGGCTGATCATCCAGTTCCATCAAATTGGAAGTTATCGGAGCCCTGGATCATGTTCTTTGATGGTGCGG
CACGAAGAAGTGGAGCTGGTGTTGGCATTGTCTTCATTTCTCCTGAGAAACATATGTTGCCATATAGCTTCACACTCGGTGAATTGTGTTCAAATAATGTTGCCGAGTAC
CAACCCTTTATTATCGGCCTCCAAATGACTTTAGAATTTGGGATAAAGTGCATAGAAATATTTGGGGATTCGAAGTTAATCATAAATCCGCTCTCTTATCAGTACGAGGT
AAAGCATCAAGACTTGAAGCCTTACTTTAGTTATGCTAGAAGATTGATGGACAAATTTGACAGCATAATATTAAAGCATATACTGAGATCAGAAAATAAGAAAGCTGATG
CACTTGCAAATTTGGCCACTGCTTTAACAGTCTCAAAAGATATACCAATAAACATTTCCCTTTGCCAAAAATGGATTGTGCCTTCAATTGAAAGTCTTCACATCTACAAG
GTTTGCAGCAAAAGAAAAAGGGGGCAAAGGGGGCATAAATCAAAGCCTCCACTTGAAGTTCTTAAATTCTTCTCGTGCAGCTTCCATGCTCTGGCGAACCGTAGCTAG
Protein sequenceShow/hide protein sequence
MLHKHIECYVDDLVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHHDIEVDHSKIDAIQKMPSPKNLHELRRLQGRLAYIRRFISNLAGRCQP
FQRLMRKDAVFDWDQSCQNAFDSIKKYLLNPSVLSAPAAEKSLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAELNYSPIEKMCLALFLAIDKLRHYMQAFTI
HLVAKTNPVKYILSRSVISGHLAKWAIILQQYDIVYIPQKAVKGQALADFLADHPVPSNWKLSEPWIMFFDGAARRSGAGVGIVFISPEKHMLPYSFTLGELCSNNVAEY
QPFIIGLQMTLEFGIKCIEIFGDSKLIINPLSYQYEVKHQDLKPYFSYARRLMDKFDSIILKHILRSENKKADALANLATALTVSKDIPINISLCQKWIVPSIESLHIYK
VCSKRKRGQRGHKSKPPLEVLKFFSCSFHALANRS