; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022533 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022533
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:31911664..31919070
RNA-Seq ExpressionLag0022533
SyntenyLag0022533
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR027850 - Protein of unknown function DUF4504
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032583.1 reverse transcriptase [Cucumis melo var. makuwa]5.8e-16854.18Show/hide
Query:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT
        MK VNSAALPI+G+ KR  ++LG WSG  DF++VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG   + V T ++QP+G+KMISA+QLKKGL+ DEPT
Subjt:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT

Query:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ
        FMAIP+   ENS   VP+EI RVL+KYRDVMPDSLPK+LPPRR IDHEIEL+PGA PPAKNAYRMA                       K +   P+LFQ
Subjt:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ

Query:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE
        +KKDG+L+L                                      LRSGYYQVRI EGDEPKTT VTRYG+FEFLVMPFGLTNAP TFCTLMNQVFHE
Subjt:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE

Query:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------
        YLDKFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVKREKCSF+Q+RINFLG VIE GRI                                   
Subjt:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------

Query:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV
                                 EC+ AFDGLK+ +MEGP+LGI DVTKPFEVET+ASDYALGGVLL++GHPIAYESRKLN AE+R+  SEKEMLAVV
Subjt:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV

Query:  HCLRAWRQYLLGAN-------------------RSPQSE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKD
        HCLRAWRQYLLG++                    S Q+             EH  G                 +  +A+LQ S + GS+RD++RE L+KD
Subjt:  HCLRAWRQYLLGAN-------------------RSPQSE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKD

Query:  MTAQAIVKLVEEGKT-NLWAGE
          AQ ++ L + GKT   W  E
Subjt:  MTAQAIVKLVEEGKT-NLWAGE

KAA0036830.1 reverse transcriptase [Cucumis melo var. makuwa]5.8e-16854.5Show/hide
Query:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT
        MK VNSAALPI+G+ KR  ++LG WSG  DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG   + V T ++QP+G+KMISA+QLKKGL+ DEPT
Subjt:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT

Query:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ
        FMAIP+   ENS   VP+EI RVL+KYRDVMPDSLPK+LPPRR IDHEIEL+PGA PPAKNAYRMA                       K     P+LFQ
Subjt:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ

Query:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE
        +KKDG+L+L                                      LRSGYYQVRI EGDEPKTT VTRYG+FEFLVMPFGLTNAP TFCTLMNQVFHE
Subjt:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE

Query:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------
        YLDKFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVKREKCSF+Q+RINFLG VIE GRI                                   
Subjt:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------

Query:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV
                                 EC+ AFDGLK+ +MEGP+LGI DVTKPFEVET+ASDYALGGVLL++GHPIAYESRKLN AE+R+  SEKEMLAVV
Subjt:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV

Query:  HCLRAWRQYLLGAN-------------------RSPQSE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKD
        HCLRAWRQYLLG++                    S Q+             EH  G                 +  +A+LQ S + GSIRD++RE L+KD
Subjt:  HCLRAWRQYLLGAN-------------------RSPQSE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKD

Query:  MTAQAIVKLVEEGKT-NLWAGE
          AQ ++ L + GKT   W  E
Subjt:  MTAQAIVKLVEEGKT-NLWAGE

TYK07954.1 reverse transcriptase [Cucumis melo var. makuwa]1.5e-16854.66Show/hide
Query:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT
        MK VNSAALPI+G+ KR  ++LG WSG  DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG   + V T ++QP+G+KMISA+QLKKGL+ DEPT
Subjt:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT

Query:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ
        FMAIP+   ENS   VP+EI RVL+KYRDVMPDSLPK+LPPRR IDHEIEL+PGA PPAKNAYRMA                       K     P+LFQ
Subjt:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ

Query:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE
        KKKDG+L+L                                      LRSGYYQVRI EGDEPKTT VTRYG+FEFLVMPFGLTNAP TFCTLMNQVFHE
Subjt:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE

Query:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------
        YLDKFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVKREKCSF+Q+RINFLG VIE GRI                                   
Subjt:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------

Query:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV
                                 EC+AAFDGLK+ +MEGP+LGI DVTKPFEVET+ASDYALGGVLL++GHPIAYESRKLN AE+R+  SEKEMLAVV
Subjt:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV

Query:  HCLRAWRQYLLGAN-------------------RSPQSE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKD
        HCLRAWRQYLLG++                    S Q+             EH  G                 +  +A+LQ S + GS+RD++RE L+KD
Subjt:  HCLRAWRQYLLGAN-------------------RSPQSE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKD

Query:  MTAQAIVKLVEEGKT-NLWAGE
          AQ ++ L + GKT   W  E
Subjt:  MTAQAIVKLVEEGKT-NLWAGE

TYK07955.1 reverse transcriptase [Cucumis melo var. makuwa]1.5e-16854.17Show/hide
Query:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT
        MK VNSAALPI+G+ KR  ++LG WSG  DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG   + V T ++QP+G+KMISA+QLKKGL+ DEPT
Subjt:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT

Query:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ
        FMAIP+   ENS   VP+EI RVL+KYRDVMPDSLPK+LPPRR IDHEIEL+PGA PPAKNAYRMA                       K     P+LFQ
Subjt:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ

Query:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE
        +KKDG+L+L                                      LRSGYYQVRI EGDEPKTT VTRYG+FEFLVMPFGLTNAP TFCTLMNQVFHE
Subjt:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE

Query:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------
        YLDKFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVKREKCSF+Q+RINFLG VIE GRI                                   
Subjt:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------

Query:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV
                                 EC+AAFDGLK+ MMEGP+LGI +VTKPFEVET+ASDYALGGVLL++GHPIAYESRKLN AE+R+  SEKEMLAVV
Subjt:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV

Query:  HCLRAWRQYLLG-------------------------------------------------ANRSPQSEEHTCGPVHAIAYLQASRLRGSIRDSVREHLK
        HCLRAWRQYLLG                                                 A+   + +EHT   +  +A+ + S + GS+RD++RE L+
Subjt:  HCLRAWRQYLLG-------------------------------------------------ANRSPQSEEHTCGPVHAIAYLQASRLRGSIRDSVREHLK

Query:  KDMTAQAIVKLVEEGKT-NLWAGE
        KD  AQ ++ LV+EGKT   W  E
Subjt:  KDMTAQAIVKLVEEGKT-NLWAGE

TYK08407.1 reverse transcriptase [Cucumis melo var. makuwa]1.1e-16957.44Show/hide
Query:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT
        MK VNSAALPI+G+ KR T++LG WSG  DFV+VK+DDFDVVLGMEFLLEH+VI MPLAKCLVITGS  + V   ++QP+G+KMISA+QLKKGL+ DEPT
Subjt:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT

Query:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ
        FMAIP+   ENS   VP+EI RVL+KYRDVMPDSLPK+LPPRR IDHEIEL+PGA PPAKNAYRMA                       K      +LFQ
Subjt:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ

Query:  KKKDGNLQLYLRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKN
        KKKDGN    LRSGYYQVRI EGDEPKTT VTRYG+FEFLVMPFGLTNAP TFCTLMNQVFHEYLDKFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+N
Subjt:  KKKDGNLQLYLRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKN

Query:  QLYVKREKCSFSQQRINFLGQVIEAGRI------------------------------------------------------------ECEAAFDGLKRT
        QLYVKREKCSF+Q+RINFLG VIE GRI                                                            EC+ AFDGLK+ 
Subjt:  QLYVKREKCSFSQQRINFLGQVIEAGRI------------------------------------------------------------ECEAAFDGLKRT

Query:  MMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYLLGAN-------------------RSPQ
        +MEGP+LGI DVTKPFEVET+ASDYALGGVLL++GHPIAYESRKLN AE+R+  SEKEMLAVVHCLRAWRQYLLG++                    S Q
Subjt:  MMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYLLGAN-------------------RSPQ

Query:  SE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKDMTAQAIVKLVEEGKT-NLWAGE
        +             EH  G                 +  +A+LQ S + GS+RD++RE L+KD  AQ ++ L + GKT   W  E
Subjt:  SE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKDMTAQAIVKLVEEGKT-NLWAGE

TrEMBL top hitse value%identityAlignment
A0A5A7SQC7 Reverse transcriptase2.8e-16854.18Show/hide
Query:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT
        MK VNSAALPI+G+ KR  ++LG WSG  DF++VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG   + V T ++QP+G+KMISA+QLKKGL+ DEPT
Subjt:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT

Query:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ
        FMAIP+   ENS   VP+EI RVL+KYRDVMPDSLPK+LPPRR IDHEIEL+PGA PPAKNAYRMA                       K +   P+LFQ
Subjt:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ

Query:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE
        +KKDG+L+L                                      LRSGYYQVRI EGDEPKTT VTRYG+FEFLVMPFGLTNAP TFCTLMNQVFHE
Subjt:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE

Query:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------
        YLDKFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVKREKCSF+Q+RINFLG VIE GRI                                   
Subjt:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------

Query:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV
                                 EC+ AFDGLK+ +MEGP+LGI DVTKPFEVET+ASDYALGGVLL++GHPIAYESRKLN AE+R+  SEKEMLAVV
Subjt:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV

Query:  HCLRAWRQYLLGAN-------------------RSPQSE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKD
        HCLRAWRQYLLG++                    S Q+             EH  G                 +  +A+LQ S + GS+RD++RE L+KD
Subjt:  HCLRAWRQYLLGAN-------------------RSPQSE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKD

Query:  MTAQAIVKLVEEGKT-NLWAGE
          AQ ++ L + GKT   W  E
Subjt:  MTAQAIVKLVEEGKT-NLWAGE

A0A5A7T0E2 Reverse transcriptase2.8e-16854.5Show/hide
Query:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT
        MK VNSAALPI+G+ KR  ++LG WSG  DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG   + V T ++QP+G+KMISA+QLKKGL+ DEPT
Subjt:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT

Query:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ
        FMAIP+   ENS   VP+EI RVL+KYRDVMPDSLPK+LPPRR IDHEIEL+PGA PPAKNAYRMA                       K     P+LFQ
Subjt:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ

Query:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE
        +KKDG+L+L                                      LRSGYYQVRI EGDEPKTT VTRYG+FEFLVMPFGLTNAP TFCTLMNQVFHE
Subjt:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE

Query:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------
        YLDKFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVKREKCSF+Q+RINFLG VIE GRI                                   
Subjt:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------

Query:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV
                                 EC+ AFDGLK+ +MEGP+LGI DVTKPFEVET+ASDYALGGVLL++GHPIAYESRKLN AE+R+  SEKEMLAVV
Subjt:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV

Query:  HCLRAWRQYLLGAN-------------------RSPQSE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKD
        HCLRAWRQYLLG++                    S Q+             EH  G                 +  +A+LQ S + GSIRD++RE L+KD
Subjt:  HCLRAWRQYLLGAN-------------------RSPQSE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKD

Query:  MTAQAIVKLVEEGKT-NLWAGE
          AQ ++ L + GKT   W  E
Subjt:  MTAQAIVKLVEEGKT-NLWAGE

A0A5D3C8W7 Reverse transcriptase5.1e-17057.44Show/hide
Query:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT
        MK VNSAALPI+G+ KR T++LG WSG  DFV+VK+DDFDVVLGMEFLLEH+VI MPLAKCLVITGS  + V   ++QP+G+KMISA+QLKKGL+ DEPT
Subjt:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT

Query:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ
        FMAIP+   ENS   VP+EI RVL+KYRDVMPDSLPK+LPPRR IDHEIEL+PGA PPAKNAYRMA                       K      +LFQ
Subjt:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ

Query:  KKKDGNLQLYLRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKN
        KKKDGN    LRSGYYQVRI EGDEPKTT VTRYG+FEFLVMPFGLTNAP TFCTLMNQVFHEYLDKFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+N
Subjt:  KKKDGNLQLYLRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKN

Query:  QLYVKREKCSFSQQRINFLGQVIEAGRI------------------------------------------------------------ECEAAFDGLKRT
        QLYVKREKCSF+Q+RINFLG VIE GRI                                                            EC+ AFDGLK+ 
Subjt:  QLYVKREKCSFSQQRINFLGQVIEAGRI------------------------------------------------------------ECEAAFDGLKRT

Query:  MMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYLLGAN-------------------RSPQ
        +MEGP+LGI DVTKPFEVET+ASDYALGGVLL++GHPIAYESRKLN AE+R+  SEKEMLAVVHCLRAWRQYLLG++                    S Q
Subjt:  MMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYLLGAN-------------------RSPQ

Query:  SE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKDMTAQAIVKLVEEGKT-NLWAGE
        +             EH  G                 +  +A+LQ S + GS+RD++RE L+KD  AQ ++ L + GKT   W  E
Subjt:  SE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKDMTAQAIVKLVEEGKT-NLWAGE

A0A5D3C8Z6 Reverse transcriptase7.4e-16954.17Show/hide
Query:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT
        MK VNSAALPI+G+ KR  ++LG WSG  DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG   + V T ++QP+G+KMISA+QLKKGL+ DEPT
Subjt:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT

Query:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ
        FMAIP+   ENS   VP+EI RVL+KYRDVMPDSLPK+LPPRR IDHEIEL+PGA PPAKNAYRMA                       K     P+LFQ
Subjt:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ

Query:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE
        +KKDG+L+L                                      LRSGYYQVRI EGDEPKTT VTRYG+FEFLVMPFGLTNAP TFCTLMNQVFHE
Subjt:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE

Query:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------
        YLDKFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVKREKCSF+Q+RINFLG VIE GRI                                   
Subjt:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------

Query:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV
                                 EC+AAFDGLK+ MMEGP+LGI +VTKPFEVET+ASDYALGGVLL++GHPIAYESRKLN AE+R+  SEKEMLAVV
Subjt:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV

Query:  HCLRAWRQYLLG-------------------------------------------------ANRSPQSEEHTCGPVHAIAYLQASRLRGSIRDSVREHLK
        HCLRAWRQYLLG                                                 A+   + +EHT   +  +A+ + S + GS+RD++RE L+
Subjt:  HCLRAWRQYLLG-------------------------------------------------ANRSPQSEEHTCGPVHAIAYLQASRLRGSIRDSVREHLK

Query:  KDMTAQAIVKLVEEGKT-NLWAGE
        KD  AQ ++ LV+EGKT   W  E
Subjt:  KDMTAQAIVKLVEEGKT-NLWAGE

A0A5D3C9P8 Reverse transcriptase7.4e-16954.66Show/hide
Query:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT
        MK VNSAALPI+G+ KR  ++LG WSG  DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG   + V T ++QP+G+KMISA+QLKKGL+ DEPT
Subjt:  MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPT

Query:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ
        FMAIP+   ENS   VP+EI RVL+KYRDVMPDSLPK+LPPRR IDHEIEL+PGA PPAKNAYRMA                       K     P+LFQ
Subjt:  FMAIPMTEDENSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMA----------------------RKGTTRGPMLFQ

Query:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE
        KKKDG+L+L                                      LRSGYYQVRI EGDEPKTT VTRYG+FEFLVMPFGLTNAP TFCTLMNQVFHE
Subjt:  KKKDGNLQLY-------------------------------------LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHE

Query:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------
        YLDKFVVVYLDDIVVYS TMEEH+ HLQ VF+KLK+NQLYVKREKCSF+Q+RINFLG VIE GRI                                   
Subjt:  YLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRI-----------------------------------

Query:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV
                                 EC+AAFDGLK+ +MEGP+LGI DVTKPFEVET+ASDYALGGVLL++GHPIAYESRKLN AE+R+  SEKEMLAVV
Subjt:  -------------------------ECEAAFDGLKRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVV

Query:  HCLRAWRQYLLGAN-------------------RSPQSE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKD
        HCLRAWRQYLLG++                    S Q+             EH  G                 +  +A+LQ S + GS+RD++RE L+KD
Subjt:  HCLRAWRQYLLGAN-------------------RSPQSE------------EHTCG----------------PVHAIAYLQASRLRGSIRDSVREHLKKD

Query:  MTAQAIVKLVEEGKT-NLWAGE
          AQ ++ L + GKT   W  E
Subjt:  MTAQAIVKLVEEGKT-NLWAGE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.66.5e-3735.34Show/hide
Query:  LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCS
        L  G++Q+ +      KT   T++G +E+L MPFGL NAP TF   MN +    L+K  +VYLDDI+V+S +++EH   L LVFEKL K  L ++ +KC 
Subjt:  LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCS

Query:  FSQQRINFLGQV-------------------------------------------------------------IEAGRIECEAAFDGLKRTMMEGPILGI
        F +Q   FLG V                                                             I+    E ++AF  LK  + E PIL +
Subjt:  FSQQRINFLGQV-------------------------------------------------------------IEAGRIECEAAFDGLKRTMMEGPILGI

Query:  VDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYLLG
         D TK F + T+ASD ALG VL +DGHP++Y SR LN+ E  ++  EKE+LA+V   + +R YLLG
Subjt:  VDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYLLG

P0CT34 Transposon Tf2-1 polyprotein1.2e-2728.41Show/hide
Query:  QLYLRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKRE
        +L L+S Y+ +R+ +GDE K       G FE+LVMP+G++ AP  F   +N +  E  +  VV Y+DDI+++S +  EH  H++ V +KLK   L + + 
Subjt:  QLYLRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKRE

Query:  KCSFSQQRINFLG-QVIEAGRIECE-----------------------------------------------------------AAFDGLKRTMMEGPIL
        KC F Q ++ F+G  + E G   C+                                                            A + +K+ ++  P+L
Subjt:  KCSFSQQRINFLG-QVIEAGRIECE-----------------------------------------------------------AAFDGLKRTMMEGPIL

Query:  GIVDVTKPFEVETEASDYALGGVLLKDG-----HPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYL
           D +K   +ET+ASD A+G VL +       +P+ Y S K++ A+  ++ S+KEMLA++  L+ WR YL
Subjt:  GIVDVTKPFEVETEASDYALGGVLLKDG-----HPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYL

P0CT41 Transposon Tf2-12 polyprotein1.2e-2728.41Show/hide
Query:  QLYLRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKRE
        +L L+S Y+ +R+ +GDE K       G FE+LVMP+G++ AP  F   +N +  E  +  VV Y+DDI+++S +  EH  H++ V +KLK   L + + 
Subjt:  QLYLRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKRE

Query:  KCSFSQQRINFLG-QVIEAGRIECE-----------------------------------------------------------AAFDGLKRTMMEGPIL
        KC F Q ++ F+G  + E G   C+                                                            A + +K+ ++  P+L
Subjt:  KCSFSQQRINFLG-QVIEAGRIECE-----------------------------------------------------------AAFDGLKRTMMEGPIL

Query:  GIVDVTKPFEVETEASDYALGGVLLKDG-----HPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYL
           D +K   +ET+ASD A+G VL +       +P+ Y S K++ A+  ++ S+KEMLA++  L+ WR YL
Subjt:  GIVDVTKPFEVETEASDYALGGVLLKDG-----HPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYL

P20825 Retrovirus-related Pol polyprotein from transposon 2971.3e-3433.46Show/hide
Query:  LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCS
        L  G++Q+ + E    KT   T+ G +E+L MPFGL NAP TF   MN +    L+K  +VYLDDI+++S ++ EH   +QLVF KL    L ++ +KC 
Subjt:  LRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCS

Query:  FSQQRINFLGQV-------------------------------------------------------------IEAGRIECEAAFDGLKRTMMEGPILGI
        F ++  NFLG +                                                             I+  ++E   AF+ LK  ++  PIL +
Subjt:  FSQQRINFLGQV-------------------------------------------------------------IEAGRIECEAAFDGLKRTMMEGPILGI

Query:  VDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYLLG
         D  K F + T+AS+ ALG VL ++GHPI++ SR LND E  ++A EKE+LA+V   + +R YLLG
Subjt:  VDVTKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYLLG

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus2.2e-2930.14Show/hide
Query:  LYLRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREK
        L L SG++Q+ + E D PKT   T  G +EFL +PFGL NAP  F  +++ +  E++ K   VY+DDI+V+S   + H  +L+LV   L K  L V  EK
Subjt:  LYLRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVMPFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREK

Query:  CSFSQQRINFLGQVIEAGRIECE-----------------------------------------------------------------------AAFDGL
          F   ++ FLG ++ A  I+ +                                                                        +F+ L
Subjt:  CSFSQQRINFLGQVIEAGRIECE-----------------------------------------------------------------------AAFDGL

Query:  KRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKD----GHPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYLLGANRSPQSEEH
        K  +    IL     TKPF + T+AS++A+G VL +D      PIAY SR LN  E+ +A  EKEMLA++  L   R YL GA       +H
Subjt:  KRTMMEGPILGIVDVTKPFEVETEASDYALGGVLLKD----GHPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYLLGANRSPQSEEH

Arabidopsis top hitse value%identityAlignment
AT3G59490.1 unknown protein1.2e-3543.98Show/hide
Query:  VLALCTGMRPVVMIDYGGKMPELQQRLCALLKLIQ--------------------------------------------------KMLIDAEKSPLGLQL
        VLALCTGMRPVVMIDYGGKMPELQ RL +LL+LI+                                                  KM+  +++S LG+QL
Subjt:  VLALCTGMRPVVMIDYGGKMPELQQRLCALLKLIQ--------------------------------------------------KMLIDAEKSPLGLQL

Query:  KSIQKLFSSLFSQDKMENDPSPSLGESCTTNTGSSFHGISSQSSIIDLSNFLDHTKITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFKLSVS
        +SIQKLFSS F  D    D +  L E+ ++ T          S  IDLS  L  TK+T+PTLNGWLL YP+VYLF  +HI EA YNLS K L +FK+ V 
Subjt:  KSIQKLFSSLFSQDKMENDPSPSLGESCTTNTGSSFHGISSQSSIIDLSNFLDHTKITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFKLSVS

Query:  RKGASTIGSQLEELLS
        R G +   S LEEL S
Subjt:  RKGASTIGSQLEELLS

AT3G59490.2 unknown protein5.1e-4546.19Show/hide
Query:  VLALCTGMRPVVMIDYGGKMPELQQRLCALLKLIQ--------------------------------------------------KMLIDAEKSPLGLQL
        VLALCTGMRPVVMIDYGGKMPELQ RL +LL+LI+                                                  KM+  +++S LG+QL
Subjt:  VLALCTGMRPVVMIDYGGKMPELQQRLCALLKLIQ--------------------------------------------------KMLIDAEKSPLGLQL

Query:  KSIQKLFSSLFSQDKMENDPSPSLGESCTTNTGSSFHGISSQSSIIDLSNFLDHTKITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFKLSVS
        +SIQKLFSS F  D    D +  L E+ ++ T          S  IDLS  L  TK+T+PTLNGWLL YP+VYLF  +HI EA YNLS K L +FK+ V 
Subjt:  KSIQKLFSSLFSQDKMENDPSPSLGESCTTNTGSSFHGISSQSSIIDLSNFLDHTKITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFKLSVS

Query:  RKGASTIGSQLEELLSFTVPYELSMGGAKEAWAEAF
        R G +   S LEEL SF+VPY+LSM G+KE WAE F
Subjt:  RKGASTIGSQLEELLSFTVPYELSMGGAKEAWAEAF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGTGGTGAACTCAGCTGCCCTACCTATCATGGGGATTGCTAAGAGAGCGACAGTTAAGTTGGGGGCTTGGAGTGGTCAGACAGATTTCGTGATTGTAAAAATGGA
TGATTTTGACGTGGTGTTGGGAATGGAATTCCTTCTCGAACACAAAGTCATCCCCATGCCCTTGGCAAAATGCTTGGTGATCACAGGATCCAACCTCACGGCCGTGACGA
CTAGTATCAAACAACCGAATGGGGTGAAAATGATATCAGCCCTCCAGTTAAAGAAAGGCCTTGCACACGACGAACCCACATTTATGGCCATCCCAATGACAGAAGATGAA
AACTCGGAAGCGCCAGTCCCTAGAGAAATCCAACGAGTGTTGGACAAGTACCGTGATGTGATGCCCGACAGTTTGCCCAAGACCCTACCTCCACGTAGGGGGATAGACCA
TGAGATTGAACTGCTGCCAGGGGCGGGACCGCCCGCTAAAAATGCTTACCGCATGGCCCGCAAAGGCACCACACGGGGCCCAATGTTATTCCAAAAGAAGAAAGATGGAA
ATCTGCAGTTATATCTCAGGTCGGGATATTACCAAGTGCGCATCGTCGAAGGAGATGAACCAAAGACAACGTCTGTCACAAGGTATGGATCCTTTGAGTTCTTAGTCATG
CCATTCGGTCTAACGAATGCTCCGACTACCTTTTGCACACTAATGAACCAAGTGTTTCACGAGTATCTCGATAAATTTGTGGTGGTTTATCTCGATGATATAGTGGTCTA
TAGCCCGACCATGGAGGAACATCAACTTCACTTGCAGTTGGTTTTTGAGAAATTAAAGAAAAATCAACTGTATGTGAAGAGAGAGAAATGCTCTTTTTCACAACAGCGCA
TCAATTTCCTCGGACAGGTGATTGAAGCCGGCCGTATTGAATGTGAAGCTGCATTCGACGGCTTAAAGCGCACCATGATGGAAGGTCCGATACTCGGAATTGTCGATGTT
ACCAAACCTTTCGAAGTAGAGACCGAAGCGTCAGACTACGCGTTGGGCGGTGTTCTTCTCAAGGACGGACACCCCATCGCGTATGAAAGTCGGAAGTTGAATGATGCGGA
AAAGAGGTTTGCAGCCTCAGAGAAAGAGATGTTGGCAGTAGTTCACTGCTTGAGAGCTTGGAGACAGTATCTCCTCGGGGCCAACAGAAGCCCTCAGTCGGAAGAGCACA
CATGCGGCCCTGTGCATGCTATAGCTTATTTACAAGCAAGCCGATTGAGGGGATCGATTCGAGACTCTGTCAGGGAACATTTAAAGAAAGACATGACAGCTCAAGCAATA
GTGAAGTTGGTCGAAGAAGGAAAGACCAACTTATGGGCTGGAGAATCCTTCTCTGATCCTCATCATTATCAAAGTACAGTGGTCAAGAGGAATTTCCGCTATGTTCGTGG
TACTATTTCATATGGGCATTTACTTACTAGAAGATTTGCTGATGCTGATTGGGCTTTGGACTCTGACCACAGTAAATCAACCTCTGGTTTTTGTATATTCTTTTTGGAGA
GTTACATCTCGACACATCTCAGATTTCTACTCTTTGGTGCAAACCCCATTCTTCATTCACGAACTAAATACGTTGAGCTAGACATTTACTCTGTTCGAGATTTAGTGCTC
CAAAAGAAGATTAACATTCATCATCTTCCAGCTACCGCACAGACAGCTGATATACTTGCAAAACCACTGTTAGCTACCTCTTTTCTGCCTCTATGGTCCAAGCTCACTGT
CTTTAATTCTCAAGACATTGGCTTGAAGAGGGTATTAAGTGAGCTCACTGATATATCAGTGTATGGTTATGAATATGTCTTAGCGTTATGTACAGGAATGAGACCGGTTG
TGATGATAGACTATGGTGGAAAGATGCCTGAACTGCAACAGCGGCTTTGTGCACTTCTAAAACTAATTCAAAAGATGCTAATAGATGCTGAGAAAAGCCCACTGGGTTTG
CAGCTTAAATCAATTCAAAAGTTGTTCTCATCTTTATTTTCTCAAGATAAAATGGAAAATGACCCATCACCATCTCTCGGGGAAAGCTGCACAACCAATACTGGATCGTC
TTTTCATGGGATTAGTTCTCAGTCTTCTATCATTGATCTCAGTAACTTTTTGGATCATACTAAGATCACTCTGCCAACTTTAAATGGATGGCTTCTTGGCTATCCAATTG
TATACCTATTTGACAAGGAGCACATCTCTGAGGCTACTTACAATCTTTCTGCCAAGCCTCTTCACATCTTCAAATTATCAGTAAGCAGGAAGGGAGCCTCCACAATAGGA
TCTCAGCTGGAAGAGCTTCTAAGTTTCACGGTGCCTTATGAACTGAGCATGGGAGGGGCGAAGGAGGCATGGGCGGAGGCATTTTTCGCAAATCCCCACCCCATGTCGCT
TCAACTTGTTGCATTCGTCGATGTTCCTTCATTGAGAGGTGTTCAGTCGATTTTTCTCTCTTGTTTCTTGGTTCTAATTAGGGCGTTCGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGTGGTGAACTCAGCTGCCCTACCTATCATGGGGATTGCTAAGAGAGCGACAGTTAAGTTGGGGGCTTGGAGTGGTCAGACAGATTTCGTGATTGTAAAAATGGA
TGATTTTGACGTGGTGTTGGGAATGGAATTCCTTCTCGAACACAAAGTCATCCCCATGCCCTTGGCAAAATGCTTGGTGATCACAGGATCCAACCTCACGGCCGTGACGA
CTAGTATCAAACAACCGAATGGGGTGAAAATGATATCAGCCCTCCAGTTAAAGAAAGGCCTTGCACACGACGAACCCACATTTATGGCCATCCCAATGACAGAAGATGAA
AACTCGGAAGCGCCAGTCCCTAGAGAAATCCAACGAGTGTTGGACAAGTACCGTGATGTGATGCCCGACAGTTTGCCCAAGACCCTACCTCCACGTAGGGGGATAGACCA
TGAGATTGAACTGCTGCCAGGGGCGGGACCGCCCGCTAAAAATGCTTACCGCATGGCCCGCAAAGGCACCACACGGGGCCCAATGTTATTCCAAAAGAAGAAAGATGGAA
ATCTGCAGTTATATCTCAGGTCGGGATATTACCAAGTGCGCATCGTCGAAGGAGATGAACCAAAGACAACGTCTGTCACAAGGTATGGATCCTTTGAGTTCTTAGTCATG
CCATTCGGTCTAACGAATGCTCCGACTACCTTTTGCACACTAATGAACCAAGTGTTTCACGAGTATCTCGATAAATTTGTGGTGGTTTATCTCGATGATATAGTGGTCTA
TAGCCCGACCATGGAGGAACATCAACTTCACTTGCAGTTGGTTTTTGAGAAATTAAAGAAAAATCAACTGTATGTGAAGAGAGAGAAATGCTCTTTTTCACAACAGCGCA
TCAATTTCCTCGGACAGGTGATTGAAGCCGGCCGTATTGAATGTGAAGCTGCATTCGACGGCTTAAAGCGCACCATGATGGAAGGTCCGATACTCGGAATTGTCGATGTT
ACCAAACCTTTCGAAGTAGAGACCGAAGCGTCAGACTACGCGTTGGGCGGTGTTCTTCTCAAGGACGGACACCCCATCGCGTATGAAAGTCGGAAGTTGAATGATGCGGA
AAAGAGGTTTGCAGCCTCAGAGAAAGAGATGTTGGCAGTAGTTCACTGCTTGAGAGCTTGGAGACAGTATCTCCTCGGGGCCAACAGAAGCCCTCAGTCGGAAGAGCACA
CATGCGGCCCTGTGCATGCTATAGCTTATTTACAAGCAAGCCGATTGAGGGGATCGATTCGAGACTCTGTCAGGGAACATTTAAAGAAAGACATGACAGCTCAAGCAATA
GTGAAGTTGGTCGAAGAAGGAAAGACCAACTTATGGGCTGGAGAATCCTTCTCTGATCCTCATCATTATCAAAGTACAGTGGTCAAGAGGAATTTCCGCTATGTTCGTGG
TACTATTTCATATGGGCATTTACTTACTAGAAGATTTGCTGATGCTGATTGGGCTTTGGACTCTGACCACAGTAAATCAACCTCTGGTTTTTGTATATTCTTTTTGGAGA
GTTACATCTCGACACATCTCAGATTTCTACTCTTTGGTGCAAACCCCATTCTTCATTCACGAACTAAATACGTTGAGCTAGACATTTACTCTGTTCGAGATTTAGTGCTC
CAAAAGAAGATTAACATTCATCATCTTCCAGCTACCGCACAGACAGCTGATATACTTGCAAAACCACTGTTAGCTACCTCTTTTCTGCCTCTATGGTCCAAGCTCACTGT
CTTTAATTCTCAAGACATTGGCTTGAAGAGGGTATTAAGTGAGCTCACTGATATATCAGTGTATGGTTATGAATATGTCTTAGCGTTATGTACAGGAATGAGACCGGTTG
TGATGATAGACTATGGTGGAAAGATGCCTGAACTGCAACAGCGGCTTTGTGCACTTCTAAAACTAATTCAAAAGATGCTAATAGATGCTGAGAAAAGCCCACTGGGTTTG
CAGCTTAAATCAATTCAAAAGTTGTTCTCATCTTTATTTTCTCAAGATAAAATGGAAAATGACCCATCACCATCTCTCGGGGAAAGCTGCACAACCAATACTGGATCGTC
TTTTCATGGGATTAGTTCTCAGTCTTCTATCATTGATCTCAGTAACTTTTTGGATCATACTAAGATCACTCTGCCAACTTTAAATGGATGGCTTCTTGGCTATCCAATTG
TATACCTATTTGACAAGGAGCACATCTCTGAGGCTACTTACAATCTTTCTGCCAAGCCTCTTCACATCTTCAAATTATCAGTAAGCAGGAAGGGAGCCTCCACAATAGGA
TCTCAGCTGGAAGAGCTTCTAAGTTTCACGGTGCCTTATGAACTGAGCATGGGAGGGGCGAAGGAGGCATGGGCGGAGGCATTTTTCGCAAATCCCCACCCCATGTCGCT
TCAACTTGTTGCATTCGTCGATGTTCCTTCATTGAGAGGTGTTCAGTCGATTTTTCTCTCTTGTTTCTTGGTTCTAATTAGGGCGTTCGATTGA
Protein sequenceShow/hide protein sequence
MKVVNSAALPIMGIAKRATVKLGAWSGQTDFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGSNLTAVTTSIKQPNGVKMISALQLKKGLAHDEPTFMAIPMTEDE
NSEAPVPREIQRVLDKYRDVMPDSLPKTLPPRRGIDHEIELLPGAGPPAKNAYRMARKGTTRGPMLFQKKKDGNLQLYLRSGYYQVRIVEGDEPKTTSVTRYGSFEFLVM
PFGLTNAPTTFCTLMNQVFHEYLDKFVVVYLDDIVVYSPTMEEHQLHLQLVFEKLKKNQLYVKREKCSFSQQRINFLGQVIEAGRIECEAAFDGLKRTMMEGPILGIVDV
TKPFEVETEASDYALGGVLLKDGHPIAYESRKLNDAEKRFAASEKEMLAVVHCLRAWRQYLLGANRSPQSEEHTCGPVHAIAYLQASRLRGSIRDSVREHLKKDMTAQAI
VKLVEEGKTNLWAGESFSDPHHYQSTVVKRNFRYVRGTISYGHLLTRRFADADWALDSDHSKSTSGFCIFFLESYISTHLRFLLFGANPILHSRTKYVELDIYSVRDLVL
QKKINIHHLPATAQTADILAKPLLATSFLPLWSKLTVFNSQDIGLKRVLSELTDISVYGYEYVLALCTGMRPVVMIDYGGKMPELQQRLCALLKLIQKMLIDAEKSPLGL
QLKSIQKLFSSLFSQDKMENDPSPSLGESCTTNTGSSFHGISSQSSIIDLSNFLDHTKITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFKLSVSRKGASTIG
SQLEELLSFTVPYELSMGGAKEAWAEAFFANPHPMSLQLVAFVDVPSLRGVQSIFLSCFLVLIRAFD