CuGenDBv2

Lag0025706 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025706
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr10:18192759..18194785
RNA-Seq Expression: Lag0025706
Synteny: Lag0025706
Gene Ontology terms:
GO:0050789 - regulation of biological process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera] | e value: 1.1e-102 | % identity: 40.28
Query:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMT-----
        K A+  +++ +I  LE+++G  +++   I+EEI++++  LYT      + ++G++W PI  +S   LE+PF+EEEIF+A+  M+  KA GPD  T     
Subjt:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMT-----

Query:  ------------------------------------------------------------AKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAV
                                                                    AK LA R++ VL +TI   Q AFV GRQILD +L+ANE V
Subjt:  ------------------------------------------------------------AKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAV

Query:  EEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIE
        +E R + +EG++ K+D EKAYD VSW+FLD +L +KGFG  WR W+RGCL + +F++++NG  +  + ASRGLRQGDPLSPFLFTIV D LS+ +    E
Subjt:  EEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIE

Query:  RNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKR
        RN+L+GF VG+NR  VS LQFADDT+ F  + EE M     +L      +GL +NL K+NI GINL+   ++  A  + C+   +PI YLG PLGGN K 
Subjt:  RNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKR

Query:  IAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGW
          FWDP++++   +LD W+   LS GGR+TL QS L  +  Y  SL K P ++  K+E++ RDF+WSG       +LVNW+    P   G LG+
Subjt:  IAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGW

CAN74312.1 hypothetical protein VITISV_037520 [Vitis vinifera] | e value: 1.4e-103 | % identity: 43.52
Query:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMT-----
        + A+  +S+ YI +L S+ G  L++   I EEIV F+ NLY++ +   + I+G++W PI E+S   L+ PFSEE +  AV  +  +KA GPD  T     
Subjt:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMT-----

Query:  -----------------------AKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKG
                               AK L+ RL+ VL +TI   Q AFV GRQILD +L+ANE V+E R + +EG++ K+D EKAYD V W FLD +L  KG
Subjt:  -----------------------AKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKG

Query:  FGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMD
        F   WR W+RGCL +++F+I++NG  +  + ASRGLRQGDPLSPFLFTIV D LS+ +    E  I +GF VG++R  VS+LQFADDT+ F     +++ 
Subjt:  FGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMD

Query:  KWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILN
            IL      +GL +NL K+ I GIN   E ++  A+ + CRV  +P++YLG PLGGN K I FWDP+V++   +LD W+   LS+GGR+TL QS L+
Subjt:  KWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILN

Query:  SILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGWDPLPTAMWRSLLNG
         I  Y  SL K P +I  K+EK+ RDF+WSG       +L+ WE  + P   G LG+    T+M  S L G
Subjt:  SILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGWDPLPTAMWRSLLNG

RVW19920.1 Biotin synthase, mitochondrial [Vitis vinifera] | e value: 3.0e-103 | % identity: 43.47
Query:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMTAK---
        + A+  +S+ YI +L S+ G  L++   I EEIV F+ NLY++ +   + I+G++W PI E+S   L+ PFSEEE+  AV  +  +KA GPD  T     
Subjt:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMTAK---

Query:  ---------------------FLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHT
                              L+ RL+ VL +TI   Q AFV GRQILD +L+ANE V+E R + +EG++ K+D EKAYD V W FLD +L  KGF   
Subjt:  ---------------------FLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHT

Query:  WRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWE
        WR W+RGCL +++F+I++NG  +  + ASRGLRQGDPLSPFLFT+V D LS+ +    E  I +GF VG++R  VS+LQFADDT+ F     +++     
Subjt:  WRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWE

Query:  ILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILI
        IL      +GL +NL K+ I GIN   E ++  A+ + CRV  +P++YLG PLGGN K I FWDP+V++   +LD W+   LS+GGR+TL QS L+ I  
Subjt:  ILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILI

Query:  YHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGWDPLPTAMWRSLLNG
        Y  SL K P +I  K+EK+ RDF+WSG       +L+ WE  + P   G LG+    T+M  S L G
Subjt:  YHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGWDPLPTAMWRSLLNG

RVW99790.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | e value: 2.6e-102 | % identity: 39.88
Query:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMT-----
        K A+  +++ +I  LE++ G  L++   I+EEI++++  LY       + ++G++W PID +S + LE+PF+EEEI++A+  M+  KA GPD  T     
Subjt:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMT-----

Query:  ------------------------------------------------------------AKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAV
                                                                    AK LA RL+ VL +TI   Q AFV GRQILD +L+ANE V
Subjt:  ------------------------------------------------------------AKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAV

Query:  EEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIE
        +E R   +EG++ K+D EKAYD VSW+FLD +L +KGF   WR W+RGCL + ++++++NG  +  + ASRGLRQGDPLSPFLFTIV D LS+ +    E
Subjt:  EEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIE

Query:  RNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKR
        RN+L+GF VG+NR  VS LQFADDT+ F    EE +     +L      +GL +NL K+NI GINL+   ++  A+ + C+   +PI YLG PLGGN K 
Subjt:  RNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKR

Query:  IAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGW
          FWDP++++   +LDVW+   LS GGR+TL QS L  +  Y  SL K P ++  K+E++ R+F+WSG       +LVNW+    P   G LG+
Subjt:  IAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGW

RVX14115.1 Protein SWEETIE [Vitis vinifera] | e value: 6.5e-106 | % identity: 42.4
Query:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMT-----
        K A+  +++ +I  LE+++G  +++   I+EEI++++  LYT      + ++G++W PI  +S   LE+PF+EEEIF+A+  M+  KA GPD  T     
Subjt:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMT-----

Query:  ---------------------------------AKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWE
                                         AK LA R++ VL +TI   Q AFV GRQILD +L+ANE V+E R +++EG++ K+D EKAYD VSW+
Subjt:  ---------------------------------AKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWE

Query:  FLDTILALKGFGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMI
        FLD +L +KGFG  WR W+RGCL + +F++++NG  +  + ASRGLRQGDPLSPFLFTIV D LS+ +    E+N+L+GF VG+NR  VS LQFADDT+ 
Subjt:  FLDTILALKGFGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMI

Query:  FCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGG
        F  + EE M     +L      +GL +NL K+NI GINL+   ++  A  + C+   +PI YLG PLGGN K   FWDP++++   +LD W+   LS GG
Subjt:  FCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGG

Query:  RVTLAQSILNSILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGW
        R+TL QS L  +  Y  SL K P ++  K+E++ RDF+WSG       +LVNW+    P   G LG+
Subjt:  RVTLAQSILNSILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGW

TrEMBL top hits (e value, % identity, alignment)
A0A438C9K9 Biotin synthase, mitochondrial | e value: 1.5e-103 | % identity: 43.47
Query:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMTAK---
        + A+  +S+ YI +L S+ G  L++   I EEIV F+ NLY++ +   + I+G++W PI E+S   L+ PFSEEE+  AV  +  +KA GPD  T     
Subjt:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMTAK---

Query:  ---------------------FLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHT
                              L+ RL+ VL +TI   Q AFV GRQILD +L+ANE V+E R + +EG++ K+D EKAYD V W FLD +L  KGF   
Subjt:  ---------------------FLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHT

Query:  WRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWE
        WR W+RGCL +++F+I++NG  +  + ASRGLRQGDPLSPFLFT+V D LS+ +    E  I +GF VG++R  VS+LQFADDT+ F     +++     
Subjt:  WRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWE

Query:  ILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILI
        IL      +GL +NL K+ I GIN   E ++  A+ + CRV  +P++YLG PLGGN K I FWDP+V++   +LD W+   LS+GGR+TL QS L+ I  
Subjt:  ILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILI

Query:  YHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGWDPLPTAMWRSLLNG
        Y  SL K P +I  K+EK+ RDF+WSG       +L+ WE  + P   G LG+    T+M  S L G
Subjt:  YHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGWDPLPTAMWRSLLNG

A0A438JYU3 Protein SWEETIE | e value: 3.1e-106 | % identity: 42.4
Query:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMT-----
        K A+  +++ +I  LE+++G  +++   I+EEI++++  LYT      + ++G++W PI  +S   LE+PF+EEEIF+A+  M+  KA GPD  T     
Subjt:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMT-----

Query:  ---------------------------------AKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWE
                                         AK LA R++ VL +TI   Q AFV GRQILD +L+ANE V+E R +++EG++ K+D EKAYD VSW+
Subjt:  ---------------------------------AKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWE

Query:  FLDTILALKGFGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMI
        FLD +L +KGFG  WR W+RGCL + +F++++NG  +  + ASRGLRQGDPLSPFLFTIV D LS+ +    E+N+L+GF VG+NR  VS LQFADDT+ 
Subjt:  FLDTILALKGFGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMI

Query:  FCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGG
        F  + EE M     +L      +GL +NL K+NI GINL+   ++  A  + C+   +PI YLG PLGGN K   FWDP++++   +LD W+   LS GG
Subjt:  FCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGG

Query:  RVTLAQSILNSILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGW
        R+TL QS L  +  Y  SL K P ++  K+E++ RDF+WSG       +LVNW+    P   G LG+
Subjt:  RVTLAQSILNSILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGW

A0A803P465 Uncharacterized protein | e value: 6.6e-104 | % identity: 39.03
Query:  QRLNARATASRRDVLQRVFQILSLIKRRSNATWKGLDVGAVSGAEKCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVN
        Q L     A +++  Q VF+    +  +S   W               +A KS+  I+ +E +DG FL  K EI +EI+ F+S+LYT +      I+G++
Subjt:  QRLNARATASRRDVLQRVFQILSLIKRRSNATWKGLDVGAVSGAEKCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVN

Query:  WEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPD---------------------------------------------------------------
        W  I + S   LE PF E E+  AV   E  KA GPD                                                               
Subjt:  WEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPD---------------------------------------------------------------

Query:  --VMTAKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNF
           + AK L+ RL+ VL +TI E Q+AFV GRQILD +L+ANE VE+YR   + GL+ K+D EKAYD+V WEF+D +L  KGFG  WR WI+GC+ +T+F
Subjt:  --VMTAKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNF

Query:  SIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMN
        S+ IN  PR K   SRGLRQGDPLSPFLFT+V D L +     +    + GF VGK RV+VS LQFADDT+ F  NE+  + K   ++ A    +GL +N
Subjt:  SIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMN

Query:  LAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSLLKAPKTIIM
        L+K+ ++GI +D+E V+  A ++GC V ++P+ YLG PLGG+ ++ +FW+P++DK   +LD W+   LS GGR+TL QS+L+S+ +Y  SL KAP+++  
Subjt:  LAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSLLKAPKTIIM

Query:  KLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLG
         LEK++RDF+W G     G +LV W+    P + G LG
Subjt:  KLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLG

A0A803P8A0 Uncharacterized protein | e value: 4.3e-103 | % identity: 40.9
Query:  SAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPD-----------
        +A K++  I+ +E  +G  + S+ EI EE++ F+S LYT +      ++G+ W+ I E S   LE PF E+E+   V   E  KA GPD           
Subjt:  SAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPD-----------

Query:  ------------------------------------------------------VMTAKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEY
                                                               + AK LA RL+ VL +TISE Q+AFV GRQILD +L+ANEAVE+Y
Subjt:  ------------------------------------------------------VMTAKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEY

Query:  RINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNI
        R   K+G ++K+D EKAYD+V W FLD +L  KGFG  WR WIRGC+ +T+FSI +NG+ R K H SRGLRQGDPLSPFLFT+V D L + +   +E   
Subjt:  RINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNI

Query:  LKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAF
          GF +GK+ + +S LQFADDT+ F   +E+ + K  +I+ A    +GL +NL K+ ++GI L DE V   A  +GC V  +P+ YLG PLGG+ ++  F
Subjt:  LKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAF

Query:  WDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRL
        W+P++DK   ++D W+   LS GGR+TL QS+L+S+ IY+ SL K PK ++ +LEK++RDF W GG    G +LV W+    P   G L
Subjt:  WDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRL

A5B978 Reverse transcriptase domain-containing protein | e value: 6.6e-104 | % identity: 43.52
Query:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMT-----
        + A+  +S+ YI +L S+ G  L++   I EEIV F+ NLY++ +   + I+G++W PI E+S   L+ PFSEE +  AV  +  +KA GPD  T     
Subjt:  KCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETPFSEEEIFRAVKGMEDQKASGPDVMT-----

Query:  -----------------------AKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKG
                               AK L+ RL+ VL +TI   Q AFV GRQILD +L+ANE V+E R + +EG++ K+D EKAYD V W FLD +L  KG
Subjt:  -----------------------AKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKG

Query:  FGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMD
        F   WR W+RGCL +++F+I++NG  +  + ASRGLRQGDPLSPFLFTIV D LS+ +    E  I +GF VG++R  VS+LQFADDT+ F     +++ 
Subjt:  FGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMD

Query:  KWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILN
            IL      +GL +NL K+ I GIN   E ++  A+ + CRV  +P++YLG PLGGN K I FWDP+V++   +LD W+   LS+GGR+TL QS L+
Subjt:  KWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILN

Query:  SILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGWDPLPTAMWRSLLNG
         I  Y  SL K P +I  K+EK+ RDF+WSG       +L+ WE  + P   G LG+    T+M  S L G
Subjt:  SILIYHFSLLKAPKTIIMKLEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGWDPLPTAMWRSLLNG

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 4.5e-25 | % identity: 25.13
Query:  EEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETP---FSEEEIFRAVKGME-DQKASGPDVMTAKFLADRLKNVLPQTISECQAAFVYGRQIL
        +EE+V F   L+   +      +G+      E S  L+  P    +++E FR +  M  D K      +  K LA+R++  + + I   Q  F+ G Q  
Subjt:  EEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDEQSNALLETP---FSEEEIFRAVKGME-DQKASGPDVMTAKFLADRLKNVLPQTISECQAAFVYGRQIL

Query:  DLILVANEAVEEY-RINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGD
          I  +   ++   R   K  +++ +D EKA+DK+   F+   L   G    +   IR        +II+NG+         G RQG PLSP LF IV +
Subjt:  DLILVANEAVEEY-RINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGD

Query:  ALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINY
         L+++I+   +   +KG  +GK  V++S+  FADD +++  N         +++      +G  +N+ K+     N + +  +    ++   + +  I Y
Subjt:  ALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINY

Query:  LGFPLGGNHKRI--AFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSL--LKAPKTIIMKLEKIIRDFVWS
        LG  L  + K +    + PL+ + K   + W++   S  GR+ + +  +   +IY F+   +K P T   +LEK    F+W+
Subjt:  LGFPLGGNHKRI--AFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSL--LKAPKTIIMKLEKIIRDFVWS

P08548 LINE-1 reverse transcriptase homolog | e value: 1.4e-23 | % identity: 24.92
Query:  KFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEY-RINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSIIIN
        K L +R++  + + I   Q  F+ G Q    I  +   ++   ++  K+ +++ +D EKA+D +   F+   L   G   T+   I         +II+N
Subjt:  KFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEY-RINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSIIIN

Query:  GKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTN
        G   +      G RQG PLSP LF IV + L+ +I+   E   +KG  +G   +++S+  FADD +++  N  +   K  E+++   + +G  +N  K+ 
Subjt:  GKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTN

Query:  IIGINLDDEKVNDRAIKMGCRVENFP--INYLGFPLGGNHKRI--AFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSL--LKAPKTII
         +     +    ++ +K        P  + YLG  L  + K +    ++ L  +    ++ W++   S  GR+ + +  +    IY+F+   +KAP +  
Subjt:  IIGINLDDEKVNDRAIKMGCRVENFP--INYLGFPLGGNHKRI--AFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSL--LKAPKTII

Query:  MKLEKIIRDFVWS
          LEKII  F+W+
Subjt:  MKLEKIIRDFVWS

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 5.1e-21 | % identity: 24.22
Query:  KFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEY-RINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSIIIN
        K LA+R++  +   I   Q  F+ G Q    I  +   +    ++  K  +++ LD EKA+DK+   F+  +L   G    +   I+        +I +N
Subjt:  KFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEY-RINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSIIIN

Query:  GKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTN
        G+    I    G RQG PLSP+LF IV + L+++I+   ++  +KG  +GK  V++S+L  ADD +++  + +    +   ++ +  +  G  +N  K+ 
Subjt:  GKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAKTN

Query:  IIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRI--AFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSL--LKAPKTIIMK
              + +   +        +    I YLG  L    K +    +  L  + K  L  W+    S  GR+ + +  +    IY F+   +K P     +
Subjt:  IIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRI--AFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSL--LKAPKTIIMK

Query:  LEKIIRDFVWSGGSYKLGGNLV
        LE  I  FVW+    ++  +L+
Subjt:  LEKIIRDFVWSGGSYKLGGNLV

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 5.0e-16 | % identity: 23.22
Query:  MTAKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSII
        + AK ++ RLK+VL + I   Q+  V GR I D + +  + +   R        + LD EKA+D+V  ++L   L    FG  +  +++    +    + 
Subjt:  MTAKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCNTNFSII

Query:  INGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAK
        IN      +   RG+RQG PLS  L+++  +        C+ R  L G  + +  + V +  +ADD ++   +  + +++  E        +   +N +K
Subjt:  INGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAVLDGAGLSMNLAK

Query:  TNII---GINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIA-FWDPLVDKFKAKLDVWRHF--LLSMGGRVTLAQSILNSILIYHFSLLKAPKT
        ++ +    + +D      R I    ++    I YLG  L      ++  +  L +    +L  W+ F  +LSM GR  +   ++ S + Y    L   + 
Subjt:  TNII---GINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIA-FWDPLVDKFKAKLDVWRHF--LLSMGGRVTLAQSILNSILIYHFSLLKAPKT

Query:  IIMKLEKIIRDFVWSGGSYKLGG
         I K+++ + DF+W G  +   G
Subjt:  IIMKLEKIIRDFVWSGGSYKLGG

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) | e value: 1.4e-10 | % identity: 30.39
Query:  ASGPDVMTAKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCN
        AS    +  + LA RL+  +    ++   A + G  +  L+L  +  +   R  +K   ++ LD+ KA+D VS   +   L   G       +I G L +
Subjt:  ASGPDVMTAKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTWRMWIRGCLCN

Query:  TNFSIIIN-GKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNE
        +  +I +  G   RKI   RG++QGDPLSPFLF  V D L  S+Q         G ++G+ ++ V  L FADD ++   N+
Subjt:  TNFSIIIN-GKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNE

Arabidopsis top hits (e value, % identity, alignment)
AT4G20520.1 RNA binding;RNA-directed DNA polymerases | e value: 2.4e-10 | % identity: 40.74
Query:  LADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINK--KEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTW
        + +RLK ++   I   QA+F+ GR   D I+   EAV   R  K  K  +L+KLDLEKAYD++ W++L+  L   GF   W
Subjt:  LADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINK--KEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | e value: 3.8e-11 | % identity: 43.42
Query:  LCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDT
        +C+     IING P+  +  SRGLRQGDPLSP+LF +  + LS   +   E+  L G  V  N   ++ L FADDT
Subjt:  LCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDT
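The % identity figures listed with each hit can be approximately recomputed from the alignment text. The sketch below assumes the usual BLAST display convention, which this page does not state explicitly: a letter on the middle line marks an identical residue, "+" marks a conservative substitution, and the alignment length counts every column of the Query line, gaps included. The function name `percent_identity` is hypothetical. It is applied to the AT4G20520.1 alignment, where the result is expected to agree with the listed 40.74.

```python
def percent_identity(query: str, midline: str) -> tuple[int, float]:
    """Return (identical positions, percent identity) for one alignment row.

    Assumes BLAST-style output: letters on the midline are identities,
    '+' and spaces are not. Alignment length is taken from the query row,
    so gap characters ('-') count as columns.
    """
    identities = sum(ch.isalpha() for ch in midline)
    return identities, 100.0 * identities / len(query)


# Query row and middle row of the AT4G20520.1 hit, copied from this record.
query = "LADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINK--KEGLLMKLDLEKAYDKVSWEFLDTILALKGFGHTW"
midline = "+ +RLK ++   I   QA+F+ GR   D I+   EAV   R  K  K  +L+KLDLEKAYD++ W++L+  L   GF   W"

count, pct = percent_identity(query, midline)
print(count, round(pct, 2))
```

For hits whose alignment spans several rows, the same count would need to be summed over all rows before dividing by the total alignment length.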


Sequences
CDS sequence
ATGCTAGGGCAACAACGTCTCAACGCTAGGGCAACAGCGTCTCGACGCGACGTGCTTCAGCGCGTATTCCAGATCCTTTCCTTAATCAAACGGCGTTCTAACGCTACGTG
GAAGGGTCTCGACGTTGGAGCTGTTTCAGGGGCAGAAAAATGCGCCTCTGCCATGAAAAGTAAGGCTTACATTGCTGCCTTGGAAAGCCAAGATGGTAGGTTTTTATCCT
CAAAAGCAGAGATTGAGGAGGAAATCGTCCAGTTTTATTCAAATCTCTATACTAGGGATGATAGCCCTCGGTTTGTCATTCAAGGGGTCAATTGGGAGCCTATTGATGAG
CAGAGCAATGCCTTGCTTGAGACTCCCTTCAGTGAGGAAGAGATATTTAGAGCTGTAAAAGGCATGGAAGACCAAAAAGCCTCGGGTCCCGACGTCATGACTGCAAAATT
CCTTGCTGATAGATTGAAGAATGTGCTCCCACAGACAATTAGTGAGTGTCAAGCTGCTTTTGTTTATGGTAGACAGATTTTAGATCTGATTCTTGTGGCTAACGAAGCAG
TGGAAGAGTATAGGATCAATAAGAAAGAGGGCCTTTTGATGAAACTTGATCTTGAGAAAGCCTACGATAAAGTGAGCTGGGAGTTCCTCGACACCATTCTTGCCCTCAAA
GGTTTTGGTCATACTTGGAGAATGTGGATAAGGGGATGTCTTTGTAATACAAACTTCTCCATCATTATCAATGGTAAGCCAAGAAGGAAAATTCATGCTTCTAGAGGGCT
TCGGCAAGGGGACCCCCTCTCCCCTTTTCTTTTCACCATTGTGGGTGATGCTTTGAGTCAATCTATTCAGTATTGCATCGAGAGGAATATTTTGAAAGGTTTCTCGGTGG
GGAAGAATAGAGTTGAGGTGTCTATGCTCCAATTTGCGGATGACACCATGATTTTTTGCCCAAATGAGGAAGAGATCATGGATAAGTGGTGGGAGATTCTGAGGGCGGTT
TTAGATGGTGCTGGCCTTTCCATGAACCTTGCCAAGACTAACATCATTGGTATCAACCTAGATGATGAGAAAGTGAATGATAGGGCGATCAAAATGGGTTGCAGAGTGGA
AAATTTCCCGATTAACTACCTTGGATTTCCGTTAGGTGGAAACCACAAGAGGATTGCATTTTGGGATCCTCTGGTGGACAAATTCAAAGCAAAGTTAGATGTGTGGAGGC
ATTTTTTGTTATCTATGGGTGGTAGAGTTACCTTGGCTCAATCCATTCTAAATAGCATCCTAATTTATCACTTCTCTCTGTTGAAGGCTCCTAAAACAATTATTATGAAG
CTGGAAAAGATTATTAGAGATTTCGTGTGGAGTGGTGGTTCATATAAACTGGGTGGCAACTTAGTCAACTGGGAGTGGACAACTTCACCTACTTATCATGGGAGATTGGG
GTGGGATCCCTTGCCCACCGCAATGTGGCGCTCCTTACTAAATGGCTTTGGAGATTTACCCAAGAAAAAAGCTCCCTTTGGAGAAGAGTGA
mRNA sequence
ATGCTAGGGCAACAACGTCTCAACGCTAGGGCAACAGCGTCTCGACGCGACGTGCTTCAGCGCGTATTCCAGATCCTTTCCTTAATCAAACGGCGTTCTAACGCTACGTG
GAAGGGTCTCGACGTTGGAGCTGTTTCAGGGGCAGAAAAATGCGCCTCTGCCATGAAAAGTAAGGCTTACATTGCTGCCTTGGAAAGCCAAGATGGTAGGTTTTTATCCT
CAAAAGCAGAGATTGAGGAGGAAATCGTCCAGTTTTATTCAAATCTCTATACTAGGGATGATAGCCCTCGGTTTGTCATTCAAGGGGTCAATTGGGAGCCTATTGATGAG
CAGAGCAATGCCTTGCTTGAGACTCCCTTCAGTGAGGAAGAGATATTTAGAGCTGTAAAAGGCATGGAAGACCAAAAAGCCTCGGGTCCCGACGTCATGACTGCAAAATT
CCTTGCTGATAGATTGAAGAATGTGCTCCCACAGACAATTAGTGAGTGTCAAGCTGCTTTTGTTTATGGTAGACAGATTTTAGATCTGATTCTTGTGGCTAACGAAGCAG
TGGAAGAGTATAGGATCAATAAGAAAGAGGGCCTTTTGATGAAACTTGATCTTGAGAAAGCCTACGATAAAGTGAGCTGGGAGTTCCTCGACACCATTCTTGCCCTCAAA
GGTTTTGGTCATACTTGGAGAATGTGGATAAGGGGATGTCTTTGTAATACAAACTTCTCCATCATTATCAATGGTAAGCCAAGAAGGAAAATTCATGCTTCTAGAGGGCT
TCGGCAAGGGGACCCCCTCTCCCCTTTTCTTTTCACCATTGTGGGTGATGCTTTGAGTCAATCTATTCAGTATTGCATCGAGAGGAATATTTTGAAAGGTTTCTCGGTGG
GGAAGAATAGAGTTGAGGTGTCTATGCTCCAATTTGCGGATGACACCATGATTTTTTGCCCAAATGAGGAAGAGATCATGGATAAGTGGTGGGAGATTCTGAGGGCGGTT
TTAGATGGTGCTGGCCTTTCCATGAACCTTGCCAAGACTAACATCATTGGTATCAACCTAGATGATGAGAAAGTGAATGATAGGGCGATCAAAATGGGTTGCAGAGTGGA
AAATTTCCCGATTAACTACCTTGGATTTCCGTTAGGTGGAAACCACAAGAGGATTGCATTTTGGGATCCTCTGGTGGACAAATTCAAAGCAAAGTTAGATGTGTGGAGGC
ATTTTTTGTTATCTATGGGTGGTAGAGTTACCTTGGCTCAATCCATTCTAAATAGCATCCTAATTTATCACTTCTCTCTGTTGAAGGCTCCTAAAACAATTATTATGAAG
CTGGAAAAGATTATTAGAGATTTCGTGTGGAGTGGTGGTTCATATAAACTGGGTGGCAACTTAGTCAACTGGGAGTGGACAACTTCACCTACTTATCATGGGAGATTGGG
GTGGGATCCCTTGCCCACCGCAATGTGGCGCTCCTTACTAAATGGCTTTGGAGATTTACCCAAGAAAAAAGCTCCCTTTGGAGAAGAGTGA
Protein sequence
MLGQQRLNARATASRRDVLQRVFQILSLIKRRSNATWKGLDVGAVSGAEKCASAMKSKAYIAALESQDGRFLSSKAEIEEEIVQFYSNLYTRDDSPRFVIQGVNWEPIDE
QSNALLETPFSEEEIFRAVKGMEDQKASGPDVMTAKFLADRLKNVLPQTISECQAAFVYGRQILDLILVANEAVEEYRINKKEGLLMKLDLEKAYDKVSWEFLDTILALK
GFGHTWRMWIRGCLCNTNFSIIINGKPRRKIHASRGLRQGDPLSPFLFTIVGDALSQSIQYCIERNILKGFSVGKNRVEVSMLQFADDTMIFCPNEEEIMDKWWEILRAV
LDGAGLSMNLAKTNIIGINLDDEKVNDRAIKMGCRVENFPINYLGFPLGGNHKRIAFWDPLVDKFKAKLDVWRHFLLSMGGRVTLAQSILNSILIYHFSLLKAPKTIIMK
LEKIIRDFVWSGGSYKLGGNLVNWEWTTSPTYHGRLGWDPLPTAMWRSLLNGFGDLPKKKAPFGEE
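The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a hypothetical helper named `translate`, the following Python checks the first and last codons of the CDS against the start and end of the protein; the same function can be run on the full CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 and last 12 nucleotides of the CDS shown in this record.
start = translate("ATGCTAGGGCAACAACGTCTCAAC")
end = translate("GGAGAAGAGTGA")
print(start, end)
```

The two fragments should match the first eight residues and the last three residues of the protein sequence, with the final TGA read as the stop codon.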