; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg023742 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg023742
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold13:455671..459363
RNA-Seq ExpressionSpg023742
SyntenySpg023742
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0005488 - binding (molecular function)
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN61757.1 hypothetical protein VITISV_030741 [Vitis vinifera]2.6e-7130.78Show/hide
Query:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL
        +K K+KEWNK S+G+  KRK  ++  +   D LE++  +    +  R   K EL ++ + E+    QK +++ + EG  NS FF K A+   NR  I  L
Subjt:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL

Query:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE-----------------------EPFPINYLGFPLGGKYRRKALWEPLIERLRFKLDKWRKFP
        EN++G +L++ E I++E+L ++ KLY       + +E                         +PI YLG P+GG  +    W+P+IER+  +LD W+K  
Subjt:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE-----------------------EPFPINYLGFPLGGKYRRKALWEPLIERLRFKLDKWRKFP

Query:  ISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQEKKSLWRS
        +S GGR+TL Q+ L  +P Y  SL K   SV   +E++ RDF+W+G       +L+                     RN AL+ KWLWR+ +E  +LW  
Subjt:  ISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQEKKSLWRS

Query:  VIASIYGVDPFGWK----------------------------------------SDRWCDNVPLQVSFPDLYSLSGKKGNFIFEC--WDQPNQTWNLALR
        VI SIYG    GW                                          D W  + PL   +P L S+   K   I     + +P  +WN   R
Subjt:  VIASIYGVDPFGWK----------------------------------------SDRWCDNVPLQVSFPDLYSLSGKKGNFIFEC--WDQPNQTWNLALR

Query:  RGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQKKFQHWY
        R L D EIE   +L+  +  + +   + D R W I  SG FT KSFF A  +            +W    P KVK F+W +A++ +NT++ LQ +  H  
Subjt:  RGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQKKFQHWY

Query:  LSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED
        LSP+ C+LC++  E +DHLF+HC      W  +  L  +    P+ I D
Subjt:  LSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED

CAN74312.1 hypothetical protein VITISV_037520 [Vitis vinifera]2.9e-0929.2Show/hide
Query:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL
        +K K+KEWN   +G   +RK  ++  + +ID +E++ ++    +S+R   + EL D+ + E+    QK +++ + EG  NS FF + A+  ++R +I  L
Subjt:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL

Query:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE
         ++ G+ L++ E I +E++ F+  LY +     + IE
Subjt:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE

CAN74312.1 hypothetical protein VITISV_037520 [Vitis vinifera]2.5e-6632.53Show/hide
Query:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL
        L +  +++ FFSK +  +     I LL    G +     ++EK  +   +   E   +   ++E     +P++YLG PLGG  +    W+P++ER+  +L
Subjt:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL

Query:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE
        D W+K  +S GGR+TL Q+ L+ IP Y  SL K P S+   +EK+ RDF+W+G       +L++WE  + P + GGLG G    RN AL+ KWLWRF +E
Subjt:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE

Query:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT
        +  LW  VIASIYG  P GW +                                        D W  N  L   F +LY +S  +   +          +
Subjt:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT

Query:  WNLALRRGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK
        WN   RR L D EI+    L+  + +V L     D R W + SSG+F+ KSFF A  K    +       +W    P KVK   W +A+  +NT++KLQ 
Subjt:  WNLALRRGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK

Query:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED
        +  +  L P  C LC R  E +DHLF+HC      W  +  L+G+    P+ IED
Subjt:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED

CAN75609.1 hypothetical protein VITISV_002943 [Vitis vinifera]5.1e-6732.53Show/hide
Query:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL
        L +  +++ FFSK +  +     I LL    G +     ++EK  +   +   E   +   ++E     +P++YLG PLGG  +    W+P++ER+  +L
Subjt:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL

Query:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE
        D W+K  +S GGR+TL Q+ L+ IP Y  SL K P S+   +EK+ RDF+W+G       +L++WE  + P + GGLG G    RN AL+ KWLWRF +E
Subjt:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE

Query:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT
        +  LW  VIASIYG  P GW +                                        D W  N  L   F +LY +S  +   +          +
Subjt:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT

Query:  WNLALRRGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK
        WN   RR L D EI+ +  L+  + +V L     D R W + SSG+F+ KSFF A  K    +       +W    P KVK   W +A+  +NT++KLQ 
Subjt:  WNLALRRGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK

Query:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED
        +  +  L P  C LC R  E +DHLF+HC      W  +  L+G+    P+ IED
Subjt:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED

CAN75609.1 hypothetical protein VITISV_002943 [Vitis vinifera]3.4e-1025.98Show/hide
Query:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL
        +K K+KEWN   +G   +RK  ++  + +ID +E++ ++    +S+R   + EL D+ + E+    QK +++ + EG  NS FF + A+  ++R +I  L
Subjt:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL

Query:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIEEPFPINYLGFPLGGKYRRKALW------EPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSI
         ++ G+ L++ E I +E++ F+  LY +     + IE        G          A+W      E  +    F+L+K  K P   G  + + Q   + I
Subjt:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIEEPFPINYLGFPLGGKYRRKALW------EPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSI

Query:  PQYL
         + L
Subjt:  PQYL

CAN75609.1 hypothetical protein VITISV_002943 [Vitis vinifera]1.5e-6632.53Show/hide
Query:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL
        L +  +++ FFSK +  +     I LL    G +     ++EK  +   +   E   +   ++E     +P++YLG PLGG  +    W+P++ER+  +L
Subjt:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL

Query:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE
        D W+K  +S GGR+TL Q+ L+ IP Y  SL K P S+   +EK+ RDF+W+G       +L++WE  + P + GGLG G    RN AL+ KWLWRF +E
Subjt:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE

Query:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT
        +  LW  VIASIYG  P GW +                                        D W  N  L   F +LY +S  +   +          +
Subjt:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT

Query:  WNLALRRGLFDREIESWMALVEKISNVQLG-SGLDDIRWQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK
        WN   RR L D EI+    L+  + +V L  S  D   W + SSG+F+ KSFF A  K    +       +W    P KVK   W +A+  +NT++KLQ 
Subjt:  WNLALRRGLFDREIESWMALVEKISNVQLG-SGLDDIRWQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK

Query:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED
        +  +  L P  C LC R  E +DHLF+HC      W  +  L+G+    P+ IED
Subjt:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]4.8e-6532.31Show/hide
Query:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL
        L +  +++ FFSK  ++M++  ++ ++    G +     ++EK  +   +   E   +   + +     +P++YLG PLGG  +    W+P++ER+  +L
Subjt:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL

Query:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE
        D W+K  +S GGR+TL Q+ L+ IP Y  SL K P S+   +EK+ R+F+W+G       +LV+WE  + P + GGLG G +  RN AL+ KWLWRF +E
Subjt:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE

Query:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT
        +  LW  VI SIYG  P GW +                                        D W  N  L   F DLY +   K   +           
Subjt:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT

Query:  WNLALRRGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK
        WNL  RR L D EI+    L+  +S+V+    L D R W + SSG FT KSFF A  K    I       +W    P KVK   W +A+  +NT++KLQ 
Subjt:  WNLALRRGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK

Query:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED
        +  +  L P  C LC    E +DHLF+HC      W+ +  L G+    P+  ED
Subjt:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]8.0e-1226.87Show/hide
Query:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL
        +K K+KEWN  ++G   +RK  ++  + +IDL+E++ ++    + +R   + EL D+ + E+    QK +++ + EG  NS FF + A+  ++R  I  L
Subjt:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL

Query:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIEEPFPINYLGFPLGGK---YRRKALWEPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSIPQY
         ++ G+ L++ EDI +E++ F+  LY + +   + +E    I+++  P+ G+   +  +   E  + R  F+L+K  K P   G  + + Q   + I + 
Subjt:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIEEPFPINYLGFPLGGK---YRRKALWEPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSIPQY

Query:  L
        L
Subjt:  L

TrEMBL top hitse value%identityAlignment
A0A803P465 Uncharacterized protein3.8e-6836.5Show/hide
Query:  FPINYLGFPLGGKYRRKALWEPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTA
        +P+ YLG PLGG  R+ + WEP++++   +LD W+   +SKGGR+TL Q++L+S+P Y  SL KAP+SVTKA+EK+ RDF+W G     G +LV W+   
Subjt:  FPINYLGFPLGGKYRRKALWEPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTA

Query:  LPIKFGGLGVGALYQRNYALITKWLWRFAQEKKSLWRSVIASIYG-------------VDPFG-WK--------------------------SDRWCDNV
         P   GGLG+G L  RN +L+ KWLWRF  E+ SLW  V+ S YG             + P G W+                           D W D+ 
Subjt:  LPIKFGGLGVGALYQRNYALITKWLWRFAQEKKSLWRSVIASIYG-------------VDPFG-WK--------------------------SDRWCDNV

Query:  PLQVSFPDLYSLSGKKGNFIFECWDQPN------QTWNLALRRGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIK
        PL  +FPDL  +S  +   I E            ++WN   RR L DRE+ S ++L++K+ +V++ S  +D R W+ D SG F+ KS F   V  P    
Subjt:  PLQVSFPDLYSLSGKKGNFIFECWDQPN------QTWNLALRRGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIK

Query:  PTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQKKFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKI
              +WK  +P KVK+F W LA   +N  +K+QK+    Y+SP  C  C    E + HLF+ C F  + W ++ G  G+S  +P+ +
Subjt:  PTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQKKFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKI

A0A803P465 Uncharacterized protein3.4e-0829.63Show/hide
Query:  NQKRKNREVTNLLRTWEKEAEAKSEIVLDQGFENRRLKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRS
        NQ  ++   +    +W ++AE           + R +K  I EW+K +YG K   KI + +++  +D LE  +      + +R  +K E   +   E+R 
Subjt:  NQKRKNREVTNLLRTWEKEAEAKSEIVLDQGFENRRLKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRS

Query:  LNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLY
        +  K K +   EG  NS FF    +A K+R  IS +E ++G  L   E+I KE++ F+S LY
Subjt:  LNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLY

A0A803P465 Uncharacterized protein2.5e-6732.53Show/hide
Query:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL
        L +  +++ FFSK +  +     I LL    G +     ++EK  +   +   E   +   ++E     +P++YLG PLGG  +    W+P++ER+  +L
Subjt:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL

Query:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE
        D W+K  +S GGR+TL Q+ L+ IP Y  SL K P S+   +EK+ RDF+W+G       +L++WE  + P + GGLG G    RN AL+ KWLWRF +E
Subjt:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE

Query:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT
        +  LW  VIASIYG  P GW +                                        D W  N  L   F +LY +S  +   +          +
Subjt:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT

Query:  WNLALRRGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK
        WN   RR L D EI+ +  L+  + +V L     D R W + SSG+F+ KSFF A  K    +       +W    P KVK   W +A+  +NT++KLQ 
Subjt:  WNLALRRGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK

Query:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED
        +  +  L P  C LC R  E +DHLF+HC      W  +  L+G+    P+ IED
Subjt:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED

A5AFR7 Uncharacterized protein1.6e-1025.98Show/hide
Query:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL
        +K K+KEWN   +G   +RK  ++  + +ID +E++ ++    +S+R   + EL D+ + E+    QK +++ + EG  NS FF + A+  ++R +I  L
Subjt:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL

Query:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIEEPFPINYLGFPLGGKYRRKALW------EPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSI
         ++ G+ L++ E I +E++ F+  LY +     + IE        G          A+W      E  +    F+L+K  K P   G  + + Q   + I
Subjt:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIEEPFPINYLGFPLGGKYRRKALW------EPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSI

Query:  PQYL
         + L
Subjt:  PQYL

A5AFR7 Uncharacterized protein7.2e-6732.53Show/hide
Query:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL
        L +  +++ FFSK +  +     I LL    G +     ++EK  +   +   E   +   ++E     +P++YLG PLGG  +    W+P++ER+  +L
Subjt:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL

Query:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE
        D W+K  +S GGR+TL Q+ L+ IP Y  SL K P S+   +EK+ RDF+W+G       +L++WE  + P + GGLG G    RN AL+ KWLWRF +E
Subjt:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE

Query:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT
        +  LW  VIASIYG  P GW +                                        D W  N  L   F +LY +S  +   +          +
Subjt:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT

Query:  WNLALRRGLFDREIESWMALVEKISNVQLG-SGLDDIRWQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK
        WN   RR L D EI+    L+  + +V L  S  D   W + SSG+F+ KSFF A  K    +       +W    P KVK   W +A+  +NT++KLQ 
Subjt:  WNLALRRGLFDREIESWMALVEKISNVQLG-SGLDDIRWQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK

Query:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED
        +  +  L P  C LC R  E +DHLF+HC      W  +  L+G+    P+ IED
Subjt:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED

A5B978 Reverse transcriptase domain-containing protein1.4e-0929.2Show/hide
Query:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL
        +K K+KEWN   +G   +RK  ++  + +ID +E++ ++    +S+R   + EL D+ + E+    QK +++ + EG  NS FF + A+  ++R +I  L
Subjt:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL

Query:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE
         ++ G+ L++ E I +E++ F+  LY +     + IE
Subjt:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE

A5B978 Reverse transcriptase domain-containing protein1.2e-6632.53Show/hide
Query:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL
        L +  +++ FFSK +  +     I LL    G +     ++EK  +   +   E   +   ++E     +P++YLG PLGG  +    W+P++ER+  +L
Subjt:  LTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE---EPFPINYLGFPLGGKYRRKALWEPLIERLRFKL

Query:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE
        D W+K  +S GGR+TL Q+ L+ IP Y  SL K P S+   +EK+ RDF+W+G       +L++WE  + P + GGLG G    RN AL+ KWLWRF +E
Subjt:  DKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE

Query:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT
        +  LW  VIASIYG  P GW +                                        D W  N  L   F +LY +S  +   +          +
Subjt:  KKSLWRSVIASIYGVDPFGWKS----------------------------------------DRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQP-NQT

Query:  WNLALRRGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK
        WN   RR L D EI+    L+  + +V L     D R W + SSG+F+ KSFF A  K    +       +W    P KVK   W +A+  +NT++KLQ 
Subjt:  WNLALRRGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQK

Query:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED
        +  +  L P  C LC R  E +DHLF+HC      W  +  L+G+    P+ IED
Subjt:  KFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED

A5BTG1 zf-RVT domain-containing protein1.3e-7130.78Show/hide
Query:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL
        +K K+KEWNK S+G+  KRK  ++  +   D LE++  +    +  R   K EL ++ + E+    QK +++ + EG  NS FF K A+   NR  I  L
Subjt:  LKMKIKEWNKDSYGKKVKRKIEVIQQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLL

Query:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE-----------------------EPFPINYLGFPLGGKYRRKALWEPLIERLRFKLDKWRKFP
        EN++G +L++ E I++E+L ++ KLY       + +E                         +PI YLG P+GG  +    W+P+IER+  +LD W+K  
Subjt:  ENDNGDILSSDEDIEKEVLGFYSKLYERDINPRFIIE-----------------------EPFPINYLGFPLGGKYRRKALWEPLIERLRFKLDKWRKFP

Query:  ISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQEKKSLWRS
        +S GGR+TL Q+ L  +P Y  SL K   SV   +E++ RDF+W+G       +L+                     RN AL+ KWLWR+ +E  +LW  
Subjt:  ISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQEKKSLWRS

Query:  VIASIYGVDPFGWK----------------------------------------SDRWCDNVPLQVSFPDLYSLSGKKGNFIFEC--WDQPNQTWNLALR
        VI SIYG    GW                                          D W  + PL   +P L S+   K   I     + +P  +WN   R
Subjt:  VIASIYGVDPFGWK----------------------------------------SDRWCDNVPLQVSFPDLYSLSGKKGNFIFEC--WDQPNQTWNLALR

Query:  RGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQKKFQHWY
        R L D EIE   +L+  +  + +   + D R W I  SG FT KSFF A  +            +W    P KVK F+W +A++ +NT++ LQ +  H  
Subjt:  RGLFDREIESWMALVEKISNVQLGSGLDDIR-WQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQKKFQHWY

Query:  LSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED
        LSP+ C+LC++  E +DHLF+HC      W  +  L  +    P+ I D
Subjt:  LSPSGCRLCLREEEHLDHLFIHCGFAWKAWSFIAGLLGISLCVPQKIED

SwissProt top hitse value%identityAlignment
P08548 LINE-1 reverse transcriptase homolog2.2e-0427.54Show/hide
Query:  PFPINYLGFPL--GGKYRRKALWEPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSIPQYLFSL----LKAPKSVTKAMEKISRDFIWNGGTYKPGSNL
        P  + YLG  L    K   K  +E L + +   ++KW+  P S  GR+ + +  ++ +P+ +++     +KAP S  K +EKI   FIWN    +    L
Subjt:  PFPINYLGFPL--GGKYRRKALWEPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSIPQYLFSL----LKAPKSVTKAMEKISRDFIWNGGTYKPGSNL

Query:  VKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE
        +  +  A  I    L    LY ++  + T W W   +E
Subjt:  VKWEWTALPIKFGGLGVGALYQRNYALITKWLWRFAQE

P0C2F6 Putative ribonuclease H protein At1g657501.3e-2827.42Show/hide
Query:  PLGGKYRRKALWEPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGL
        P+  K   K  +  ++ER+  ++  WR+  +S  GR+TLT+A+L+S+P +  S +  P+S+   ++++SR F+W     K   +LVKW     P K GGL
Subjt:  PLGGKYRRKALWEPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKFGGL

Query:  GVGALYQRNYALITKWLWRFAQEKKSLWRSVIASIYGVD---------PFG-----WKSDR------------WCDNVPLQVSFPDLYSLSGK------K
        GV A    N ALI+K  WR  QEK SLW  V+   Y V          P G     W+S              W      Q+ F     +SGK       
Subjt:  GVGALYQRNYALITKWLWRFAQEKKSLWRSVIASIYGVD---------PFG-----WKSDR------------WCDNVPLQVSFPDLYSLSGK------K

Query:  GNFIFEC--------WDQPNQTWNLALRRGLFDREIESWMALVEKISNVQLGSGL-DDIRWQIDSSGTFTTKSFFQATV--KTPVKIKPTLINLIWKHNS
        G    +C        W  P + W+ A      D    +   L  +   + L +G  D + W+    G F+ +S ++     + P     +  N +WK   
Subjt:  GNFIFEC--------WDQPNQTWNLALRRGLFDREIESWMALVEKISNVQLGSGL-DDIRWQIDSSGTFTTKSFFQATV--KTPVKIKPTLINLIWKHNS

Query:  PKKVKIFLWSLAYRNINTDEKLQKKFQHWYLSPSG-CRLCLREEEHLDHLFIHCGFAWKAW
        P++VK FLW +  + + T+E+  ++    +LS S  C++C    E + H+   C      W
Subjt:  PKKVKIFLWSLAYRNINTDEKLQKKFQHWYLSPSG-CRLCLREEEHLDHLFIHCGFAWKAW

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein1.3e-0429.73Show/hide
Query:  DDIRWQIDSSGTFTTKSFFQATVKTPVKIKPTL------INL---IWKHNSPKKVKIFLWSLAYRNINTDEKLQKKFQHWYLSPSGCRLCLREEEHLDHL
        D I W  +++G +T +S +      P    P +      I+L   IW      K+K FLW    + + T E+L  +     + PS C  C RE E ++H 
Subjt:  DDIRWQIDSSGTFTTKSFFQATVKTPVKIKPTL------INL---IWKHNSPKKVKIFLWSLAYRNINTDEKLQKKFQHWYLSPSGCRLCLREEEHLDHL

Query:  FIHCGFAWKAW
           C FA  AW
Subjt:  FIHCGFAWKAW

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.7e-2124.64Show/hide
Query:  PINYLGFPLGGKYRRKALWEPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTAL
        P+ YLG PL  K    + + PL+E++R ++ KW    +S  GR+ L  ++++S+  +  S  + P +  K ++ I   F+W+G         V W     
Subjt:  PINYLGFPLGGKYRRKALWEPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTAL

Query:  PIKFGGLGVGALYQRN---------YALITKWLWRFAQEKKSLWRSVIASIYGVDPF--GWKSDRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQ----
        P   GGLG+ +L + N            +  W+W     KK L    +AS +       G  +  W DN         L  ++G +G     C D     
Subjt:  PIKFGGLGVGALYQRN---------YALITKWLWRFAQEKKSLWRSVIASIYGVDPF--GWKSDRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQ----

Query:  ----PNQTWNLALRRGLFDR--EIESWMALVEKISNVQLGSGLDDIRWQIDS---SGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLA
                 N   RR   D    IE    ++ ++ +  L SG D +RW+ +       F TK  + AT + P K+K      +W  ++  K  +  W   
Subjt:  ----PNQTWNLALRRGLFDR--EIESWMALVEKISNVQLGSGLDDIRWQIDS---SGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLA

Query:  YRNINTDEKLQKKFQHWYL-SPSGCRLCLREEEHLDHLFIHCGFA
           + T +++      W   + S C LC    E  DHLF  C ++
Subjt:  YRNINTDEKLQKKFQHWYL-SPSGCRLCLREEEHLDHLFIHCGFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCCCTCAGCAGGGAAATCAGACTATCATTAGAGATACTACGATCCCCGAAGGCTTTGTGATTAGCAAAGATATAGTCCTTACCTTGAGGAAGAACAATCTGTGTAT
TAGGCCAATTTCAAACTCCAATATGAAGAAGGGGAACACCAATCAGAAGAGGAAAAACAGAGAGGTAACCAACCTCTTAAGAACTTGGGAGAAAGAAGCAGAGGCCAAGT
CAGAAATTGTCTTAGACCAGGGATTTGAGAATAGGAGGCTAAAGATGAAGATCAAAGAGTGGAACAAAGACTCTTATGGCAAGAAAGTTAAAAGGAAAATTGAAGTTATT
CAGCAAATAGAGCAAATAGACCTTTTAGAAGAGCAAGATAGCATTCTCCCACATCATATATCTGATAGAAACAGACTTAAAGCTGAGCTTCTGGATATCACCATTAATGA
GCAGAGAAGCCTAAACCAAAAATGCAAGATTAGACGGTTAACAGAAGGTGGCGAGAATTCTGCCTTTTTTAGCAAATGGGCTTCGGCAATGAAAAACAGAGCTCACATCT
CTTTGTTAGAGAATGATAACGGAGATATTCTTTCCTCTGATGAGGACATTGAAAAGGAGGTTTTGGGTTTCTACAGCAAGCTTTATGAGAGAGATATCAATCCCCGGTTC
ATTATAGAAGAGCCGTTTCCTATAAATTATCTTGGATTTCCCTTGGGGGGAAAGTATCGCCGGAAAGCTCTGTGGGAACCTTTGATAGAAAGGCTCAGATTCAAGCTTGA
TAAATGGAGAAAATTTCCTATATCGAAGGGTGGGAGAGTGACGCTAACTCAAGCTATCCTCAATAGCATACCTCAGTACCTCTTTTCCCTTTTAAAAGCTCCTAAATCAG
TTACCAAAGCCATGGAGAAAATCAGCAGAGACTTTATATGGAATGGTGGGACTTATAAGCCGGGTAGCAATCTGGTTAAATGGGAATGGACTGCTCTTCCCATCAAGTTT
GGTGGTTTAGGGGTGGGTGCCTTATATCAAAGAAATTATGCTCTAATCACAAAATGGCTCTGGAGATTTGCCCAGGAGAAAAAGTCCTTGTGGAGATCAGTGATAGCTAG
TATTTATGGAGTTGATCCCTTCGGCTGGAAATCTGATAGGTGGTGTGACAATGTGCCTCTACAGGTTTCCTTTCCGGATCTTTACTCCCTCTCGGGGAAAAAAGGCAACT
TTATTTTTGAATGTTGGGACCAACCTAACCAAACCTGGAATTTGGCTCTTCGAAGAGGGCTGTTTGATAGAGAAATAGAAAGCTGGATGGCGTTGGTTGAGAAAATCAGT
AATGTCCAACTGGGCTCGGGTCTGGATGATATTCGATGGCAGATTGATAGCAGTGGGACGTTTACAACTAAATCTTTCTTCCAGGCTACTGTTAAAACTCCTGTTAAGAT
CAAGCCAACTCTAATAAACCTTATTTGGAAGCATAACAGTCCCAAAAAAGTTAAAATCTTTTTGTGGTCCTTGGCATACCGCAACATTAACACTGATGAAAAGCTTCAAA
AGAAATTTCAGCATTGGTACCTCTCTCCCTCGGGTTGTCGTTTGTGTTTGAGGGAGGAAGAACATCTTGACCACCTTTTTATCCACTGTGGCTTTGCTTGGAAGGCTTGG
AGTTTTATTGCTGGGTTGCTGGGCATTTCTCTTTGTGTCCCTCAAAAGATTGAAGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGATCCCTCAGCAGGGAAATCAGACTATCATTAGAGATACTACGATCCCCGAAGGCTTTGTGATTAGCAAAGATATAGTCCTTACCTTGAGGAAGAACAATCTGTGTAT
TAGGCCAATTTCAAACTCCAATATGAAGAAGGGGAACACCAATCAGAAGAGGAAAAACAGAGAGGTAACCAACCTCTTAAGAACTTGGGAGAAAGAAGCAGAGGCCAAGT
CAGAAATTGTCTTAGACCAGGGATTTGAGAATAGGAGGCTAAAGATGAAGATCAAAGAGTGGAACAAAGACTCTTATGGCAAGAAAGTTAAAAGGAAAATTGAAGTTATT
CAGCAAATAGAGCAAATAGACCTTTTAGAAGAGCAAGATAGCATTCTCCCACATCATATATCTGATAGAAACAGACTTAAAGCTGAGCTTCTGGATATCACCATTAATGA
GCAGAGAAGCCTAAACCAAAAATGCAAGATTAGACGGTTAACAGAAGGTGGCGAGAATTCTGCCTTTTTTAGCAAATGGGCTTCGGCAATGAAAAACAGAGCTCACATCT
CTTTGTTAGAGAATGATAACGGAGATATTCTTTCCTCTGATGAGGACATTGAAAAGGAGGTTTTGGGTTTCTACAGCAAGCTTTATGAGAGAGATATCAATCCCCGGTTC
ATTATAGAAGAGCCGTTTCCTATAAATTATCTTGGATTTCCCTTGGGGGGAAAGTATCGCCGGAAAGCTCTGTGGGAACCTTTGATAGAAAGGCTCAGATTCAAGCTTGA
TAAATGGAGAAAATTTCCTATATCGAAGGGTGGGAGAGTGACGCTAACTCAAGCTATCCTCAATAGCATACCTCAGTACCTCTTTTCCCTTTTAAAAGCTCCTAAATCAG
TTACCAAAGCCATGGAGAAAATCAGCAGAGACTTTATATGGAATGGTGGGACTTATAAGCCGGGTAGCAATCTGGTTAAATGGGAATGGACTGCTCTTCCCATCAAGTTT
GGTGGTTTAGGGGTGGGTGCCTTATATCAAAGAAATTATGCTCTAATCACAAAATGGCTCTGGAGATTTGCCCAGGAGAAAAAGTCCTTGTGGAGATCAGTGATAGCTAG
TATTTATGGAGTTGATCCCTTCGGCTGGAAATCTGATAGGTGGTGTGACAATGTGCCTCTACAGGTTTCCTTTCCGGATCTTTACTCCCTCTCGGGGAAAAAAGGCAACT
TTATTTTTGAATGTTGGGACCAACCTAACCAAACCTGGAATTTGGCTCTTCGAAGAGGGCTGTTTGATAGAGAAATAGAAAGCTGGATGGCGTTGGTTGAGAAAATCAGT
AATGTCCAACTGGGCTCGGGTCTGGATGATATTCGATGGCAGATTGATAGCAGTGGGACGTTTACAACTAAATCTTTCTTCCAGGCTACTGTTAAAACTCCTGTTAAGAT
CAAGCCAACTCTAATAAACCTTATTTGGAAGCATAACAGTCCCAAAAAAGTTAAAATCTTTTTGTGGTCCTTGGCATACCGCAACATTAACACTGATGAAAAGCTTCAAA
AGAAATTTCAGCATTGGTACCTCTCTCCCTCGGGTTGTCGTTTGTGTTTGAGGGAGGAAGAACATCTTGACCACCTTTTTATCCACTGTGGCTTTGCTTGGAAGGCTTGG
AGTTTTATTGCTGGGTTGCTGGGCATTTCTCTTTGTGTCCCTCAAAAGATTGAAGACTAG
Protein sequenceShow/hide protein sequence
MIPQQGNQTIIRDTTIPEGFVISKDIVLTLRKNNLCIRPISNSNMKKGNTNQKRKNREVTNLLRTWEKEAEAKSEIVLDQGFENRRLKMKIKEWNKDSYGKKVKRKIEVI
QQIEQIDLLEEQDSILPHHISDRNRLKAELLDITINEQRSLNQKCKIRRLTEGGENSAFFSKWASAMKNRAHISLLENDNGDILSSDEDIEKEVLGFYSKLYERDINPRF
IIEEPFPINYLGFPLGGKYRRKALWEPLIERLRFKLDKWRKFPISKGGRVTLTQAILNSIPQYLFSLLKAPKSVTKAMEKISRDFIWNGGTYKPGSNLVKWEWTALPIKF
GGLGVGALYQRNYALITKWLWRFAQEKKSLWRSVIASIYGVDPFGWKSDRWCDNVPLQVSFPDLYSLSGKKGNFIFECWDQPNQTWNLALRRGLFDREIESWMALVEKIS
NVQLGSGLDDIRWQIDSSGTFTTKSFFQATVKTPVKIKPTLINLIWKHNSPKKVKIFLWSLAYRNINTDEKLQKKFQHWYLSPSGCRLCLREEEHLDHLFIHCGFAWKAW
SFIAGLLGISLCVPQKIED