; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021704 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021704
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold2:3426574..3436484
RNA-Seq ExpressionSpg021704
SyntenySpg021704
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW99869.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]2.6e-8229.94Show/hide
Query:  EVWALGKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAI
        E W L  +R+    T++ + P  V++QETKK   D + + S+W+     W  L + GASGGIL +W       +EV+ G FSIS+   +      W+SAI
Subjt:  EVWALGKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAI

Query:  YGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLH
        YGP+    R DFW EL+D+ GL    W +GGDFNV R S EK  G  +T SMR F+ +I+   LLD PL+N S+TWS+   +      LDRFL +N+   
Subjt:  YGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLH

Query:  KFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGH--------------GFMM-----------------
         F       L R TSDH+P+AL      WGP PFRF+N WL   SF+E  +NWW      GW GH              G ++                 
Subjt:  KFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGH--------------GFMM-----------------

Query:  ------KLKGLKMELRK---------------WNI---------------------TNRNDVSQLP--------------SLISQLKS---------LDS
               L  LKM+  K               W++                     TN + +  LP              SLI+ L           L  
Subjt:  ------KLKGLKMELRK---------------WNI---------------------TNRNDVSQLP--------------SLISQLKS---------LDS

Query:  IGDEHILSTD-------QKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQFADDTLLFSIYDSKALDNLFEIIK
        +  E I ST        Q +   L+  +I D+  R         KL   +      ++       RN R  ++HLQFADDT+ FS    + L  L  ++ 
Subjt:  IGDEHILSTD-------QKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQFADDTLLFSIYDSKALDNLFEIIK

Query:  LFEMASGLNINFAK------NLVASRIQRRLGN-GCSTLFWHDSWLSCGV-----------------------LSEAFPRLYRLSNRSDGTVADFWVSLN
         F   SGL +N  K      NL  + I R      C    W   +L   +                       L   +P L+R+       V D  + ++
Subjt:  LFEMASGLNINFAK------NLVASRIQRRLGN-GCSTLFWHDSWLSCGV-----------------------LSEAFPRLYRLSNRSDGTVADFWVSLN

Query:  S--------AWDLSLRRNLNDSETNEWASLSHLLSSIRIR-VIDDTWSWPIDSSNAFTVKSL---MGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWEL
        S        +W+L+ RRNL+DSE  +   L   L  + +   + D   WP+ SS  F+VKS    +    G      SK    VW    P K++ FIW +
Subjt:  S--------AWDLSLRRNLNDSETNEWASLSHLLSSIRIR-VIDDTWSWPIDSSNAFTVKSL---MGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWEL

Query:  SLGAINTSDRLQRRMPYLHLSPSWCVMCCSDAENTCHLFVHCSFASRYWSTIFNAFEWSLALPNNIYDVLASIFVGHPFHGVKKILWLALNRVFLWFLWG
        +   +NT+D LQ R PY  LSP  C++C    E+  H+F+HCS     W  +F   +     P +I D++   F G         LW A N   +  +W 
Subjt:  SLGAINTSDRLQRRMPYLHLSPSWCVMCCSDAENTCHLFVHCSFASRYWSTIFNAFEWSLALPNNIYDVLASIFVGHPFHGVKKILWLALNRVFLWFLWG

Query:  ERNGRIFRDSFSSFENFMDLILFYA
        ERN RIF D   + E+  D I+F A
Subjt:  ERNGRIFRDSFSSFENFMDLILFYA

RVX15530.1 putative ribonuclease H protein [Vitis vinifera]7.0e-8826.3Show/hide
Query:  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR
        KKR ++++ +  QNP  V+LQETK+ + D +F+ S+W    + WA+L + GASGGI+ILW    F   E + G FS+++     +  SFWL+++YGP   
Subjt:  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR

Query:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN
          R DFW EL DL GL   RW +GGDFNV R   EK     +T +MR F+++I    LLD PL+N ++TWS+   D      LDRFL +++    F  + 
Subjt:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN

Query:  LLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLPSLI-SQLKSLDSI
           L R TSDH P+ L    + WGP PFRF+N WL    F+E  + WW +   +GW GH FM KLK +K +L++WNI    D+ +   LI   L  +D I
Subjt:  LLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLPSLI-SQLKSLDSI

Query:  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLS-------------------------------------
          E  L+ D  ++R L R ++ED   ++ + W+Q+ +++W+KEGD N++FFHR+    + R+                                      
Subjt:  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLS-------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  LTHLQFA-----------------------------------DDTLLFSIYDSKALDNLFEIIKLFEMASGLNINFAKNL------------------VA
        L+   FA                                   +DT+ FS    + L NL  I+ +F   SGL IN  K+                   V 
Subjt:  LTHLQFA-----------------------------------DDTLLFSIYDSKALDNLFEIIKLFEMASGLNINFAKNL------------------VA

Query:  SRIQRRLGNGCST----LFWHDSWLSCG-------VLSEAFPR-----------------------LYRLSNRSDG----TVADFWVSLNSAWDLSLRRN
         RI RRL    ++    +  +  W   G       V  E   R                       L+R      G     +   + +  + WD ++   
Subjt:  SRIQRRLGNGCST----LFWHDSWLSCG-------VLSEAFPR-----------------------LYRLSNRSDG----TVADFWVSLNSAWDLSLRRN

Query:  LNDSETNEWASLSHLLSSIRI---------------RVIDDTWSWPIDSSNAFTVKSLMGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWELSLGAINT
        +  S    W +++ +     +               ++      W + SS  F+VKS    +   S+P        +W    P K+K   W ++ G +NT
Subjt:  LNDSETNEWASLSHLLSSIRI---------------RVIDDTWSWPIDSSNAFTVKSLMGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWELSLGAINT

Query:  SDRLQRRMPYLHLSPSWCVMCCSDAENTCHLFVHCSFASRYWSTIFNAFEWSLALPNNIYDVLASIF--VGHPFHGVKKILWLALNRVFLWFLWGERNGR
        +D+LQ R PY  L P WC++C  + E+  HLF++C      W+ +F         P +  D+L   F  +G+   G  K LW       +W +W ERN R
Subjt:  SDRLQRRMPYLHLSPSWCVMCCSDAENTCHLFVHCSFASRYWSTIFNAFEWSLALPNNIYDVLASIF--VGHPFHGVKKILWLALNRVFLWFLWGERNGR

Query:  IFRDSFSSFENFMDLILFYA
        IF D   S E   DLILFY+
Subjt:  IFRDSFSSFENFMDLILFYA

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.4e-8025.42Show/hide
Query:  RSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINHLLNSGDRRRL
        RS  ++RK F +  D+ S+ +   +TE     + S+ +S + L W+  + K+L   P + +F  + R  +  +W+ K  N  G   EI  +     +  +
Subjt:  RSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINHLLNSGDRRRL

Query:  LIPSEDNKQGWFSFFSLIS---------------------------DYPGEAHR---------STKSYKDVLQQKESHVVTTHPSSSVPSPQPLDSEIIV
        L+P   +K GW SF S+I+                           DY   ++          +T    D     +S   +++     PS   L++ +++
Subjt:  LIPSEDNKQGWFSFFSLIS---------------------------DYPGEAHR---------STKSYKDVLQQKESHVVTTHPSSSVPSPQPLDSEIIV

Query:  VQRFHKKDDWPSIRNTILAGISHRCSINPFQDNKALLHVYDQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIF
        V+RF   DDW  I   +        + N F   KAL+H       + LC NK WS++GKY ++F   +        +  S+GGW     +PL LW    F
Subjt:  VQRFHKKDDWPSIRNTILAGISHRCSINPFQDNKALLHVYDQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIF

Query:  RYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAG-----------GDVTVEIKGLTASLFK---SARFEESPSFSEQNNLEIKR
        + IG  C G  +++  T    NL  A+IK+R N  GF+PA +++  +              G   +E        FK   +A F++    SEQ   E   
Subjt:  RYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAG-----------GDVTVEIKGLTASLFK---SARFEESPSFSEQNNLEIKR

Query:  -------SEKLDGKNLKLPEE-------IKSPPKNQEFIPILAE----------TASP------KGLSPCLVSPQTISQPRSILQAADHSNISPKKKTAH
               S   DG+    P++       I  P +N      L E          TA+        G+S   V  +   +    LQ     N+   K+   
Subjt:  -------SEKLDGKNLKLPEE-------IKSPPKNQEFIPILAE----------TASP------KGLSPCLVSPQTISQPRSILQAADHSNISPKKKTAH

Query:  SNKGKSPLHVASPIEA-KNHSNYFLPVGPTTLGLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTS
         N   +  ++ +P  A  NHS         +L   EKK   ++  +   ++  + P N K+        T P        D      S  + L  LP   
Subjt:  SNKGKSPLHVASPIEA-KNHSNYFLPVGPTTLGLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTS

Query:  HIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPLSLPTYLCHLAPMLSKHGLCIMVLPTGSIPKPPTKKTKATTGKKSKLKREVWALGKKRALIKKTIQQQN
           S      S   +V  +      P  P  + +P      +   + +     V       +   +K K    +  K +   W   KK  L   T    +
Subjt:  HIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPLSLPTYLCHLAPMLSKHGLCIMVLPTGSIPKPPTKKTKATTGKKSKLKREVWALGKKRALIKKTIQQQN

Query:  ----PSFVLLQETKK--TSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQ
             + VLL +        + + IKS+W S+ I W + ++ G+SGGILILW   + ++    +G FS+S +  + +  S+WL+ +YGP +R  R  FW 
Subjt:  ----PSFVLLQETKK--TSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQ

Query:  ELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVT
        ELH+L  L    WILGGD NV R   E +   + + + R  N +I+N  L+D PL N  +TWS+  +   + S +DRFL  +   + F       L R T
Subjt:  ELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVT

Query:  SDHYPLAL--SFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQL-PSLISQLKSLDSIGDEHIL
        SDH+PL    S   ++WGP PFR ++  L    F+  +  WW  +   G+PG  F+ +LK L   ++ W     + ++    ++I ++ S+D    +  L
Subjt:  SDHYPLAL--SFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQL-PSLISQLKSLDSIGDEHIL

Query:  STDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQ
        + ++  +R  L+  + + + ++   W QR K  WL+EGDEN+ FFHRI ++R  R  +  +Q
Subjt:  STDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQ

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.3e-1733.54Show/hide
Query:  LGNGCSTLFWHDSWLSCGVLSEAFPRLYRLSNRSDGTVADFWVSLNSAWDLSLRRNLNDSETNEWASLSHLLSSIRIRVIDDTWSWPIDSSNAFTVKSLM
        L NG    FW+ +W   G LS A+PRL+ L+   + +V D W + ++ W++  RR LND E   W  +  +L + R        +W  DS+N+F++ S  
Subjt:  LGNGCSTLFWHDSWLSCGVLSEAFPRLYRLSNRSDGTVADFWVSLNSAWDLSLRRNLNDSETNEWASLSHLLSSIRIRVIDDTWSWPIDSSNAFTVKSLM

Query:  GDMVGDSDPT----SSKLYNVVWKDVYPKKIKIFIWELSLGAINTSDRLQRRMPYLHLSPS
          +    D T     +KL  ++WK   P KIK F+W L    INT + +Q++MP   L P+
Subjt:  GDMVGDSDPT----SSKLYNVVWKDVYPKKIKIFIWELSLGAINTSDRLQRRMPYLHLSPS

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.6e-7639.49Show/hide
Query:  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR
        KKR ++K  +  + P  V++QETKK   D + + S+WS     WA+L + GASGGILI+W       +EV+ G FS+SI   M    S WLSA+YGP+  
Subjt:  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR

Query:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN
          R DFW EL D+AGL   RW +GGDFNV R S EK  G  +T  M+ F+++I +  L+D PL++ SYTWS+  ++      LDRFL +N+    F  + 
Subjt:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN

Query:  LLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVS-QLPSLISQLKSLDSI
           L R TSDH+P+ L      WGP PFRF+N WL   SF+E    WW++    GW GH FM KL+ +K +L++WN T+  ++S +   +++ L + DS+
Subjt:  LLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVS-QLPSLISQLKSLDSI

Query:  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQFADDTLLFSIYDSKALDNLFEIIKLFE
          E  LS +  VQR   + ++E+   R+ I W+Q+ +++W+KEGD N++FFH++   R +R  +  L+     +L +    K      EI+K FE
Subjt:  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQFADDTLLFSIYDSKALDNLFEIIKLFE

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]1.7e-9447.41Show/hide
Query:  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR
        KK ALIK+ I + NP+ V+LQETK + +D   +KS+WS+  I W++LD+ G + GILILW+DPD    E+I+G FS++I+  ++DGF FW+S IYGPS  
Subjt:  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR

Query:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN
        E    FWQEL DL+ L  + WIL GDFNVTRWSWEKS+GR +T+SM  FN +I +  L+D+PL NG +TWS         SL+D FLLTN C+ K G   
Subjt:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN

Query:  LLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDV-SQLPSLISQLKSLDSI
          R+ R TSDH+P+ L FG   WG  PFRF+N WL  ++F+  L+ WW   PL GWPGHG MMKLK LK  ++ W   +   + SQ   L + + SLD +
Subjt:  LLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDV-SQLPSLISQLKSLDSI

Query:  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHL
             ++ DQ   R   +E +    A++   W+QRCK +WL EGDENT+FFHR +A +  R  +T +
Subjt:  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHL

TrEMBL top hitse value%identityAlignment
A0A438IT16 Transposon TX1 uncharacterized 149 kDa protein1.3e-8229.94Show/hide
Query:  EVWALGKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAI
        E W L  +R+    T++ + P  V++QETKK   D + + S+W+     W  L + GASGGIL +W       +EV+ G FSIS+   +      W+SAI
Subjt:  EVWALGKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAI

Query:  YGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLH
        YGP+    R DFW EL+D+ GL    W +GGDFNV R S EK  G  +T SMR F+ +I+   LLD PL+N S+TWS+   +      LDRFL +N+   
Subjt:  YGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLH

Query:  KFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGH--------------GFMM-----------------
         F       L R TSDH+P+AL      WGP PFRF+N WL   SF+E  +NWW      GW GH              G ++                 
Subjt:  KFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGH--------------GFMM-----------------

Query:  ------KLKGLKMELRK---------------WNI---------------------TNRNDVSQLP--------------SLISQLKS---------LDS
               L  LKM+  K               W++                     TN + +  LP              SLI+ L           L  
Subjt:  ------KLKGLKMELRK---------------WNI---------------------TNRNDVSQLP--------------SLISQLKS---------LDS

Query:  IGDEHILSTD-------QKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQFADDTLLFSIYDSKALDNLFEIIK
        +  E I ST        Q +   L+  +I D+  R         KL   +      ++       RN R  ++HLQFADDT+ FS    + L  L  ++ 
Subjt:  IGDEHILSTD-------QKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQFADDTLLFSIYDSKALDNLFEIIK

Query:  LFEMASGLNINFAK------NLVASRIQRRLGN-GCSTLFWHDSWLSCGV-----------------------LSEAFPRLYRLSNRSDGTVADFWVSLN
         F   SGL +N  K      NL  + I R      C    W   +L   +                       L   +P L+R+       V D  + ++
Subjt:  LFEMASGLNINFAK------NLVASRIQRRLGN-GCSTLFWHDSWLSCGV-----------------------LSEAFPRLYRLSNRSDGTVADFWVSLN

Query:  S--------AWDLSLRRNLNDSETNEWASLSHLLSSIRIR-VIDDTWSWPIDSSNAFTVKSL---MGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWEL
        S        +W+L+ RRNL+DSE  +   L   L  + +   + D   WP+ SS  F+VKS    +    G      SK    VW    P K++ FIW +
Subjt:  S--------AWDLSLRRNLNDSETNEWASLSHLLSSIRIR-VIDDTWSWPIDSSNAFTVKSL---MGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWEL

Query:  SLGAINTSDRLQRRMPYLHLSPSWCVMCCSDAENTCHLFVHCSFASRYWSTIFNAFEWSLALPNNIYDVLASIFVGHPFHGVKKILWLALNRVFLWFLWG
        +   +NT+D LQ R PY  LSP  C++C    E+  H+F+HCS     W  +F   +     P +I D++   F G         LW A N   +  +W 
Subjt:  SLGAINTSDRLQRRMPYLHLSPSWCVMCCSDAENTCHLFVHCSFASRYWSTIFNAFEWSLALPNNIYDVLASIFVGHPFHGVKKILWLALNRVFLWFLWG

Query:  ERNGRIFRDSFSSFENFMDLILFYA
        ERN RIF D   + E+  D I+F A
Subjt:  ERNGRIFRDSFSSFENFMDLILFYA

A0A438K2W1 Putative ribonuclease H protein3.4e-8826.3Show/hide
Query:  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR
        KKR ++++ +  QNP  V+LQETK+ + D +F+ S+W    + WA+L + GASGGI+ILW    F   E + G FS+++     +  SFWL+++YGP   
Subjt:  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR

Query:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN
          R DFW EL DL GL   RW +GGDFNV R   EK     +T +MR F+++I    LLD PL+N ++TWS+   D      LDRFL +++    F  + 
Subjt:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN

Query:  LLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLPSLI-SQLKSLDSI
           L R TSDH P+ L    + WGP PFRF+N WL    F+E  + WW +   +GW GH FM KLK +K +L++WNI    D+ +   LI   L  +D I
Subjt:  LLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLPSLI-SQLKSLDSI

Query:  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLS-------------------------------------
          E  L+ D  ++R L R ++ED   ++ + W+Q+ +++W+KEGD N++FFHR+    + R+                                      
Subjt:  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLS-------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  LTHLQFA-----------------------------------DDTLLFSIYDSKALDNLFEIIKLFEMASGLNINFAKNL------------------VA
        L+   FA                                   +DT+ FS    + L NL  I+ +F   SGL IN  K+                   V 
Subjt:  LTHLQFA-----------------------------------DDTLLFSIYDSKALDNLFEIIKLFEMASGLNINFAKNL------------------VA

Query:  SRIQRRLGNGCST----LFWHDSWLSCG-------VLSEAFPR-----------------------LYRLSNRSDG----TVADFWVSLNSAWDLSLRRN
         RI RRL    ++    +  +  W   G       V  E   R                       L+R      G     +   + +  + WD ++   
Subjt:  SRIQRRLGNGCST----LFWHDSWLSCG-------VLSEAFPR-----------------------LYRLSNRSDG----TVADFWVSLNSAWDLSLRRN

Query:  LNDSETNEWASLSHLLSSIRI---------------RVIDDTWSWPIDSSNAFTVKSLMGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWELSLGAINT
        +  S    W +++ +     +               ++      W + SS  F+VKS    +   S+P        +W    P K+K   W ++ G +NT
Subjt:  LNDSETNEWASLSHLLSSIRI---------------RVIDDTWSWPIDSSNAFTVKSLMGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWELSLGAINT

Query:  SDRLQRRMPYLHLSPSWCVMCCSDAENTCHLFVHCSFASRYWSTIFNAFEWSLALPNNIYDVLASIF--VGHPFHGVKKILWLALNRVFLWFLWGERNGR
        +D+LQ R PY  L P WC++C  + E+  HLF++C      W+ +F         P +  D+L   F  +G+   G  K LW       +W +W ERN R
Subjt:  SDRLQRRMPYLHLSPSWCVMCCSDAENTCHLFVHCSFASRYWSTIFNAFEWSLALPNNIYDVLASIF--VGHPFHGVKKILWLALNRVFLWFLWGERNGR

Query:  IFRDSFSSFENFMDLILFYA
        IF D   S E   DLILFY+
Subjt:  IFRDSFSSFENFMDLILFYA

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein1.2e-8025.42Show/hide
Query:  RSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINHLLNSGDRRRL
        RS  ++RK F +  D+ S+ +   +TE     + S+ +S + L W+  + K+L   P + +F  + R  +  +W+ K  N  G   EI  +     +  +
Subjt:  RSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINHLLNSGDRRRL

Query:  LIPSEDNKQGWFSFFSLIS---------------------------DYPGEAHR---------STKSYKDVLQQKESHVVTTHPSSSVPSPQPLDSEIIV
        L+P   +K GW SF S+I+                           DY   ++          +T    D     +S   +++     PS   L++ +++
Subjt:  LIPSEDNKQGWFSFFSLIS---------------------------DYPGEAHR---------STKSYKDVLQQKESHVVTTHPSSSVPSPQPLDSEIIV

Query:  VQRFHKKDDWPSIRNTILAGISHRCSINPFQDNKALLHVYDQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIF
        V+RF   DDW  I   +        + N F   KAL+H       + LC NK WS++GKY ++F   +        +  S+GGW     +PL LW    F
Subjt:  VQRFHKKDDWPSIRNTILAGISHRCSINPFQDNKALLHVYDQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIF

Query:  RYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAG-----------GDVTVEIKGLTASLFK---SARFEESPSFSEQNNLEIKR
        + IG  C G  +++  T    NL  A+IK+R N  GF+PA +++  +              G   +E        FK   +A F++    SEQ   E   
Subjt:  RYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAG-----------GDVTVEIKGLTASLFK---SARFEESPSFSEQNNLEIKR

Query:  -------SEKLDGKNLKLPEE-------IKSPPKNQEFIPILAE----------TASP------KGLSPCLVSPQTISQPRSILQAADHSNISPKKKTAH
               S   DG+    P++       I  P +N      L E          TA+        G+S   V  +   +    LQ     N+   K+   
Subjt:  -------SEKLDGKNLKLPEE-------IKSPPKNQEFIPILAE----------TASP------KGLSPCLVSPQTISQPRSILQAADHSNISPKKKTAH

Query:  SNKGKSPLHVASPIEA-KNHSNYFLPVGPTTLGLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTS
         N   +  ++ +P  A  NHS         +L   EKK   ++  +   ++  + P N K+        T P        D      S  + L  LP   
Subjt:  SNKGKSPLHVASPIEA-KNHSNYFLPVGPTTLGLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTS

Query:  HIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPLSLPTYLCHLAPMLSKHGLCIMVLPTGSIPKPPTKKTKATTGKKSKLKREVWALGKKRALIKKTIQQQN
           S      S   +V  +      P  P  + +P      +   + +     V       +   +K K    +  K +   W   KK  L   T    +
Subjt:  HIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPLSLPTYLCHLAPMLSKHGLCIMVLPTGSIPKPPTKKTKATTGKKSKLKREVWALGKKRALIKKTIQQQN

Query:  ----PSFVLLQETKK--TSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQ
             + VLL +        + + IKS+W S+ I W + ++ G+SGGILILW   + ++    +G FS+S +  + +  S+WL+ +YGP +R  R  FW 
Subjt:  ----PSFVLLQETKK--TSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQ

Query:  ELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVT
        ELH+L  L    WILGGD NV R   E +   + + + R  N +I+N  L+D PL N  +TWS+  +   + S +DRFL  +   + F       L R T
Subjt:  ELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVT

Query:  SDHYPLAL--SFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQL-PSLISQLKSLDSIGDEHIL
        SDH+PL    S   ++WGP PFR ++  L    F+  +  WW  +   G+PG  F+ +LK L   ++ W     + ++    ++I ++ S+D    +  L
Subjt:  SDHYPLAL--SFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQL-PSLISQLKSLDSIGDEHIL

Query:  STDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQ
        + ++  +R  L+  + + + ++   W QR K  WL+EGDEN+ FFHRI ++R  R  +  +Q
Subjt:  STDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQ

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein6.3e-1833.54Show/hide
Query:  LGNGCSTLFWHDSWLSCGVLSEAFPRLYRLSNRSDGTVADFWVSLNSAWDLSLRRNLNDSETNEWASLSHLLSSIRIRVIDDTWSWPIDSSNAFTVKSLM
        L NG    FW+ +W   G LS A+PRL+ L+   + +V D W + ++ W++  RR LND E   W  +  +L + R        +W  DS+N+F++ S  
Subjt:  LGNGCSTLFWHDSWLSCGVLSEAFPRLYRLSNRSDGTVADFWVSLNSAWDLSLRRNLNDSETNEWASLSHLLSSIRIRVIDDTWSWPIDSSNAFTVKSLM

Query:  GDMVGDSDPT----SSKLYNVVWKDVYPKKIKIFIWELSLGAINTSDRLQRRMPYLHLSPS
          +    D T     +KL  ++WK   P KIK F+W L    INT + +Q++MP   L P+
Subjt:  GDMVGDSDPT----SSKLYNVVWKDVYPKKIKIFIWELSLGAINTSDRLQRRMPYLHLSPS

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein7.9e-7739.49Show/hide
Query:  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR
        KKR ++K  +  + P  V++QETKK   D + + S+WS     WA+L + GASGGILI+W       +EV+ G FS+SI   M    S WLSA+YGP+  
Subjt:  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR

Query:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN
          R DFW EL D+AGL   RW +GGDFNV R S EK  G  +T  M+ F+++I +  L+D PL++ SYTWS+  ++      LDRFL +N+    F  + 
Subjt:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN

Query:  LLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVS-QLPSLISQLKSLDSI
           L R TSDH+P+ L      WGP PFRF+N WL   SF+E    WW++    GW GH FM KL+ +K +L++WN T+  ++S +   +++ L + DS+
Subjt:  LLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVS-QLPSLISQLKSLDSI

Query:  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQFADDTLLFSIYDSKALDNLFEIIKLFE
          E  LS +  VQR   + ++E+   R+ I W+Q+ +++W+KEGD N++FFH++   R +R  +  L+     +L +    K      EI+K FE
Subjt:  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQFADDTLLFSIYDSKALDNLFEIIKLFE

A0A6J1E2G6 uncharacterized protein LOC1110254058.4e-9547.41Show/hide
Query:  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR
        KK ALIK+ I + NP+ V+LQETK + +D   +KS+WS+  I W++LD+ G + GILILW+DPD    E+I+G FS++I+  ++DGF FW+S IYGPS  
Subjt:  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR

Query:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN
        E    FWQEL DL+ L  + WIL GDFNVTRWSWEKS+GR +T+SM  FN +I +  L+D+PL NG +TWS         SL+D FLLTN C+ K G   
Subjt:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN

Query:  LLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDV-SQLPSLISQLKSLDSI
          R+ R TSDH+P+ L FG   WG  PFRF+N WL  ++F+  L+ WW   PL GWPGHG MMKLK LK  ++ W   +   + SQ   L + + SLD +
Subjt:  LLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDV-SQLPSLISQLKSLDSI

Query:  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHL
             ++ DQ   R   +E +    A++   W+QRCK +WL EGDENT+FFHR +A +  R  +T +
Subjt:  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein6.7e-1224.08Show/hide
Query:  RSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDI--AWGPCPFRFDNAWLHIESFR
        R +  F   + +  L+DIP +   YTWS+  DD   +  LDR +   D    F SA  +      SDH P  +   ++      C   F     H     
Subjt:  RSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDI--AWGPCPFRFDNAWLHIESFR

Query:  EVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLPSLISQ-LKSLDSIGDEHILSTDQKVQR--RLLREQIEDQTARDHIAWQQRCKLQ
         +   W  Q P+     H F +  + LK   +   + NR     +     + L SL+SI  + + +    + R   + R++     A     ++Q+ +++
Subjt:  EVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLPSLISQ-LKSLDSIGDEHILSTDQKVQR--RLLREQIEDQTARDHIAWQQRCKLQ

Query:  WLKEGDENTRFFHRIMAARNSRLSLTHLQFADDTLLFSIYDSKAL
        WL++GD NTRFFH+++ A  ++  +  L+  DD  + ++   K +
Subjt:  WLKEGDENTRFFHRIMAARNSRLSLTHLQFADDTLLFSIYDSKAL

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.3e-0924.05Show/hide
Query:  LGNGCSTLFWHDSWLSCGVLSEAF----PRLYRLSNRSDGTVADFWVSLNSAWDLSLRRNLNDSETNEWASLSHLLSSIRIRV---IDDTWSWPID---S
        +G+G +  FWHD+W+  G L E      PR   L    D  V D      ++W ++  R+ N         L +LL   +  +    DD++ W  D    
Subjt:  LGNGCSTLFWHDSWLSCGVLSEAF----PRLYRLSNRSDGTVADFWVSLNSAWDLSLRRNLNDSETNEWASLSHLLSSIRIRV---IDDTWSWPID---S

Query:  SNAFTVKSLMGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWELSLGAINTSDRLQRRMPYLHLSPSWCVMCCSDAENTCHLFVHCSFASRYWSTIFNAF
        SN F+       +   S   +   +  VW   +  K     W ++   ++T DRLQ    +    P+ C++C +  ++  HLF  C F+   W   F   
Subjt:  SNAFTVKSLMGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWELSLGAINTSDRLQRRMPYLHLSPSWCVMCCSDAENTCHLFVHCSFASRYWSTIFNAF

Query:  EWSLALPNNIYDVLASIFVGHPFHGVKKILWLALNRVFLWFLWGERNGRIFRDSFSSFENFM
          +L  P  + D L  +        +  I+ LA +   ++ +W ERN R+      S E+ +
Subjt:  EWSLALPNNIYDVLASIFVGHPFHGVKKILWLALNRVFLWFLWGERNGRIFRDSFSSFENFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCACTGCTATCTCTGCCACAGCTCAACCTTGGAACCACTCCACCAGATCTATCTACATTGATCGAAAAACTTTCTCCATTGAATTTGATGAACCTTCTAGGGGAAG
CCGAGCAAAAATCACAGAGCATAGTAGAGCCTCCTCCCATTCCTTAACTTTGTCTTGGAAATCTCTCCATTGGCTAGCATCCTCCTTCAAAACTCTTGCCCATGAACCGT
GCTCCTACAAATTCTCCTCCAAGATAAGAACTGATGACTATGTTCTCTGGTTGGAAAAACTCAGCAATAAGTATGGCTTCTTTGTGGAAATTAATCATCTGTTGAATTCA
GGTGATCGACGTCGACTCCTTATACCATCTGAAGACAACAAGCAAGGTTGGTTCTCCTTTTTCTCCCTCATCTCTGATTACCCAGGAGAGGCTCATCGATCAACAAAATC
ATATAAAGATGTCCTCCAACAAAAGGAAAGTCATGTTGTCACCACTCACCCTTCATCATCTGTCCCTTCACCACAGCCTCTCGACAGTGAGATTATTGTTGTTCAACGAT
TCCATAAAAAGGATGATTGGCCTTCCATTCGGAACACCATTCTTGCCGGCATATCCCACCGTTGCTCCATCAATCCATTTCAAGATAATAAAGCTTTGTTACATGTATAT
GATCAAAATATCGTGTCAAAACTTTGCAACAACAAGGATTGGTCCTCCATTGGCAAATACCGATTGAAGTTTTACCCATTGACTACCGACTCATTTTATCAAGACACTAT
GACTAATTCTTTTGGTGGATGGATTGAAGTGCTGCAACTTCCTTTACCTTTATGGACAGAACAAATTTTCAGATATATTGGTGATGTTTGTGGAGGCTTCACTGAAATAT
CCAACCACACCAGCAGGAAGCTAAATCTCACAGCGGCAAAGATTAAAATCCGGCAAAATTCCATCGGTTTCATCCCGGCCAGAATTAAGCTACCTTCATCCCTTGCCGGC
GGCGACGTTACAGTGGAAATCAAAGGGTTGACGGCCAGCCTTTTCAAATCAGCGAGATTTGAGGAATCCCCGTCATTTTCGGAACAAAATAATTTAGAAATTAAGAGGAG
TGAGAAATTGGATGGAAAAAATTTGAAACTACCAGAGGAAATCAAATCGCCTCCTAAAAATCAGGAATTCATTCCTATTTTGGCCGAAACTGCTTCTCCAAAGGGTTTAT
CTCCTTGTCTGGTTTCTCCCCAAACTATATCACAGCCAAGAAGTATTTTGCAAGCGGCAGATCACTCCAATATATCTCCTAAAAAAAAGACAGCACACTCCAACAAAGGT
AAATCTCCCCTCCACGTGGCCTCCCCCATTGAGGCAAAGAATCATTCAAATTATTTTCTTCCAGTGGGACCCACCACTCTTGGTTTAGGAGAAAAGAAATCAACAGGCAA
CAAGATAATAGCATCTGATACTGAGGCTTACTTATCAAGTCCAGCCAATGACAAATCCCCTCATACTTCGGTTTGTGACCCTACATCGCCTCGAAACTTTGACCTTGCAA
TTTTTGATGAGTTACATTTACCCGAGTCGGAACAAATTCCATTGGCAAGCTTACCTCCGACCTCCCACATTCCATCATCCCCCCATATTATTCCCTCTCCCACAAAAGAT
GTTACCCCACTACAACAACCATCTAGCTCTCCACCCGAACCATCACCCCTTTCCCTCCCAACATATCTCTGTCATTTAGCTCCAATGCTTAGTAAACATGGTTTATGCAT
CATGGTTCTTCCAACTGGCTCAATACCCAAACCACCAACTAAGAAAACTAAAGCTACTACAGGGAAAAAGTCAAAACTTAAGAGAGAGGTTTGGGCTCTTGGAAAAAAAA
GAGCATTAATTAAGAAGACCATCCAACAACAAAATCCGAGCTTCGTGCTACTTCAAGAAACTAAAAAGACATCGGTTGATGGAAAATTTATTAAATCTATATGGAGTTCT
TCTTGCATTGGTTGGGCTTCCCTTGATTCCATTGGAGCATCCGGAGGCATCCTTATTCTTTGGAGTGATCCTGATTTCACGATCAAAGAAGTTATTCAAGGTCACTTTTC
AATCTCAATTCATGTTTTTATGGCTGACGGTTTTTCTTTTTGGCTTTCGGCTATTTATGGTCCTTCTAGGCGTGAGCACCGTGCAGATTTTTGGCAAGAACTCCATGATT
TGGCTGGTTTAGGTGGTGATCGATGGATCCTTGGAGGGGATTTTAATGTTACTCGCTGGTCTTGGGAGAAATCTCATGGTCGAAATGTTACTAGGAGCATGCGCACTTTC
AATCAATGGATTGCCAATTACCATCTCTTGGACATTCCACTACAAAATGGCTCTTACACTTGGTCTAGCTTTGGGGATGACATTGAATATCTCTCACTTCTGGATAGATT
TTTATTAACAAATGATTGCCTTCACAAATTTGGGTCAGCAAATCTCCTTCGTCTTGATAGAGTCACATCAGATCACTACCCTTTAGCTCTTTCTTTTGGAGACATAGCTT
GGGGGCCTTGTCCTTTCCGTTTTGACAATGCTTGGTTACATATTGAGTCCTTTCGTGAAGTTCTGAAAAACTGGTGGAACCAAAATCCTCTCCAAGGCTGGCCAGGGCAT
GGTTTTATGATGAAACTCAAGGGATTGAAAATGGAACTAAGAAAATGGAACATCACGAATCGTAATGATGTTTCCCAACTACCATCTCTTATTTCTCAATTGAAGAGTTT
GGACAGTATTGGGGATGAGCACATTTTATCTACAGATCAGAAAGTACAGAGACGATTATTGAGGGAACAAATTGAAGACCAGACAGCCCGTGATCATATTGCTTGGCAAC
AAAGATGTAAGTTACAATGGCTCAAGGAAGGTGATGAAAATACTAGATTTTTTCATCGTATCATGGCTGCCCGTAACTCTCGTCTCTCATTGACACACTTGCAATTTGCG
GACGATACTCTTCTTTTCTCCATCTATGATTCTAAAGCATTGGATAATCTTTTTGAGATTATCAAACTCTTTGAGATGGCTTCTGGTTTGAACATCAATTTTGCTAAGAA
TCTTGTTGCTAGTCGTATTCAGAGACGGCTTGGAAATGGTTGTTCCACCCTTTTTTGGCATGATTCTTGGCTAAGTTGTGGAGTCTTGTCTGAGGCTTTCCCTCGTCTTT
ATAGATTATCTAATCGCTCGGACGGTACAGTTGCTGACTTTTGGGTTTCATTGAATTCGGCTTGGGATTTGAGTCTTCGTCGAAATTTAAATGATTCGGAGACAAATGAG
TGGGCTAGTCTCTCTCATCTGCTTTCTTCCATCAGAATTCGAGTTATTGATGACACTTGGTCTTGGCCTATTGATTCGTCTAATGCATTCACAGTTAAATCTCTTATGGG
AGATATGGTTGGTGATTCTGACCCCACATCGAGCAAATTATATAATGTGGTGTGGAAAGACGTTTATCCAAAGAAGATCAAAATTTTTATCTGGGAGCTTAGTCTTGGAG
CTATTAATACGTCTGATCGACTTCAAAGACGAATGCCTTATTTGCACCTTTCTCCATCCTGGTGTGTTATGTGTTGTTCTGATGCTGAAAATACTTGTCATCTATTTGTG
CATTGCTCCTTTGCTTCCCGTTATTGGTCTACAATCTTCAATGCCTTTGAGTGGTCCTTGGCTCTACCAAACAACATTTATGATGTTCTTGCTTCCATTTTTGTGGGACA
TCCCTTCCATGGTGTGAAGAAGATCCTTTGGCTTGCTCTTAACCGGGTCTTCCTCTGGTTTCTTTGGGGCGAAAGGAATGGTCGAATTTTCAGGGATTCTTTCTCATCTT
TTGAGAACTTTATGGATTTGATCCTTTTTTATGCTTTATATTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCACTGCTATCTCTGCCACAGCTCAACCTTGGAACCACTCCACCAGATCTATCTACATTGATCGAAAAACTTTCTCCATTGAATTTGATGAACCTTCTAGGGGAAG
CCGAGCAAAAATCACAGAGCATAGTAGAGCCTCCTCCCATTCCTTAACTTTGTCTTGGAAATCTCTCCATTGGCTAGCATCCTCCTTCAAAACTCTTGCCCATGAACCGT
GCTCCTACAAATTCTCCTCCAAGATAAGAACTGATGACTATGTTCTCTGGTTGGAAAAACTCAGCAATAAGTATGGCTTCTTTGTGGAAATTAATCATCTGTTGAATTCA
GGTGATCGACGTCGACTCCTTATACCATCTGAAGACAACAAGCAAGGTTGGTTCTCCTTTTTCTCCCTCATCTCTGATTACCCAGGAGAGGCTCATCGATCAACAAAATC
ATATAAAGATGTCCTCCAACAAAAGGAAAGTCATGTTGTCACCACTCACCCTTCATCATCTGTCCCTTCACCACAGCCTCTCGACAGTGAGATTATTGTTGTTCAACGAT
TCCATAAAAAGGATGATTGGCCTTCCATTCGGAACACCATTCTTGCCGGCATATCCCACCGTTGCTCCATCAATCCATTTCAAGATAATAAAGCTTTGTTACATGTATAT
GATCAAAATATCGTGTCAAAACTTTGCAACAACAAGGATTGGTCCTCCATTGGCAAATACCGATTGAAGTTTTACCCATTGACTACCGACTCATTTTATCAAGACACTAT
GACTAATTCTTTTGGTGGATGGATTGAAGTGCTGCAACTTCCTTTACCTTTATGGACAGAACAAATTTTCAGATATATTGGTGATGTTTGTGGAGGCTTCACTGAAATAT
CCAACCACACCAGCAGGAAGCTAAATCTCACAGCGGCAAAGATTAAAATCCGGCAAAATTCCATCGGTTTCATCCCGGCCAGAATTAAGCTACCTTCATCCCTTGCCGGC
GGCGACGTTACAGTGGAAATCAAAGGGTTGACGGCCAGCCTTTTCAAATCAGCGAGATTTGAGGAATCCCCGTCATTTTCGGAACAAAATAATTTAGAAATTAAGAGGAG
TGAGAAATTGGATGGAAAAAATTTGAAACTACCAGAGGAAATCAAATCGCCTCCTAAAAATCAGGAATTCATTCCTATTTTGGCCGAAACTGCTTCTCCAAAGGGTTTAT
CTCCTTGTCTGGTTTCTCCCCAAACTATATCACAGCCAAGAAGTATTTTGCAAGCGGCAGATCACTCCAATATATCTCCTAAAAAAAAGACAGCACACTCCAACAAAGGT
AAATCTCCCCTCCACGTGGCCTCCCCCATTGAGGCAAAGAATCATTCAAATTATTTTCTTCCAGTGGGACCCACCACTCTTGGTTTAGGAGAAAAGAAATCAACAGGCAA
CAAGATAATAGCATCTGATACTGAGGCTTACTTATCAAGTCCAGCCAATGACAAATCCCCTCATACTTCGGTTTGTGACCCTACATCGCCTCGAAACTTTGACCTTGCAA
TTTTTGATGAGTTACATTTACCCGAGTCGGAACAAATTCCATTGGCAAGCTTACCTCCGACCTCCCACATTCCATCATCCCCCCATATTATTCCCTCTCCCACAAAAGAT
GTTACCCCACTACAACAACCATCTAGCTCTCCACCCGAACCATCACCCCTTTCCCTCCCAACATATCTCTGTCATTTAGCTCCAATGCTTAGTAAACATGGTTTATGCAT
CATGGTTCTTCCAACTGGCTCAATACCCAAACCACCAACTAAGAAAACTAAAGCTACTACAGGGAAAAAGTCAAAACTTAAGAGAGAGGTTTGGGCTCTTGGAAAAAAAA
GAGCATTAATTAAGAAGACCATCCAACAACAAAATCCGAGCTTCGTGCTACTTCAAGAAACTAAAAAGACATCGGTTGATGGAAAATTTATTAAATCTATATGGAGTTCT
TCTTGCATTGGTTGGGCTTCCCTTGATTCCATTGGAGCATCCGGAGGCATCCTTATTCTTTGGAGTGATCCTGATTTCACGATCAAAGAAGTTATTCAAGGTCACTTTTC
AATCTCAATTCATGTTTTTATGGCTGACGGTTTTTCTTTTTGGCTTTCGGCTATTTATGGTCCTTCTAGGCGTGAGCACCGTGCAGATTTTTGGCAAGAACTCCATGATT
TGGCTGGTTTAGGTGGTGATCGATGGATCCTTGGAGGGGATTTTAATGTTACTCGCTGGTCTTGGGAGAAATCTCATGGTCGAAATGTTACTAGGAGCATGCGCACTTTC
AATCAATGGATTGCCAATTACCATCTCTTGGACATTCCACTACAAAATGGCTCTTACACTTGGTCTAGCTTTGGGGATGACATTGAATATCTCTCACTTCTGGATAGATT
TTTATTAACAAATGATTGCCTTCACAAATTTGGGTCAGCAAATCTCCTTCGTCTTGATAGAGTCACATCAGATCACTACCCTTTAGCTCTTTCTTTTGGAGACATAGCTT
GGGGGCCTTGTCCTTTCCGTTTTGACAATGCTTGGTTACATATTGAGTCCTTTCGTGAAGTTCTGAAAAACTGGTGGAACCAAAATCCTCTCCAAGGCTGGCCAGGGCAT
GGTTTTATGATGAAACTCAAGGGATTGAAAATGGAACTAAGAAAATGGAACATCACGAATCGTAATGATGTTTCCCAACTACCATCTCTTATTTCTCAATTGAAGAGTTT
GGACAGTATTGGGGATGAGCACATTTTATCTACAGATCAGAAAGTACAGAGACGATTATTGAGGGAACAAATTGAAGACCAGACAGCCCGTGATCATATTGCTTGGCAAC
AAAGATGTAAGTTACAATGGCTCAAGGAAGGTGATGAAAATACTAGATTTTTTCATCGTATCATGGCTGCCCGTAACTCTCGTCTCTCATTGACACACTTGCAATTTGCG
GACGATACTCTTCTTTTCTCCATCTATGATTCTAAAGCATTGGATAATCTTTTTGAGATTATCAAACTCTTTGAGATGGCTTCTGGTTTGAACATCAATTTTGCTAAGAA
TCTTGTTGCTAGTCGTATTCAGAGACGGCTTGGAAATGGTTGTTCCACCCTTTTTTGGCATGATTCTTGGCTAAGTTGTGGAGTCTTGTCTGAGGCTTTCCCTCGTCTTT
ATAGATTATCTAATCGCTCGGACGGTACAGTTGCTGACTTTTGGGTTTCATTGAATTCGGCTTGGGATTTGAGTCTTCGTCGAAATTTAAATGATTCGGAGACAAATGAG
TGGGCTAGTCTCTCTCATCTGCTTTCTTCCATCAGAATTCGAGTTATTGATGACACTTGGTCTTGGCCTATTGATTCGTCTAATGCATTCACAGTTAAATCTCTTATGGG
AGATATGGTTGGTGATTCTGACCCCACATCGAGCAAATTATATAATGTGGTGTGGAAAGACGTTTATCCAAAGAAGATCAAAATTTTTATCTGGGAGCTTAGTCTTGGAG
CTATTAATACGTCTGATCGACTTCAAAGACGAATGCCTTATTTGCACCTTTCTCCATCCTGGTGTGTTATGTGTTGTTCTGATGCTGAAAATACTTGTCATCTATTTGTG
CATTGCTCCTTTGCTTCCCGTTATTGGTCTACAATCTTCAATGCCTTTGAGTGGTCCTTGGCTCTACCAAACAACATTTATGATGTTCTTGCTTCCATTTTTGTGGGACA
TCCCTTCCATGGTGTGAAGAAGATCCTTTGGCTTGCTCTTAACCGGGTCTTCCTCTGGTTTCTTTGGGGCGAAAGGAATGGTCGAATTTTCAGGGATTCTTTCTCATCTT
TTGAGAACTTTATGGATTTGATCCTTTTTTATGCTTTATATTGA
Protein sequenceShow/hide protein sequence
MTTAISATAQPWNHSTRSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINHLLNS
GDRRRLLIPSEDNKQGWFSFFSLISDYPGEAHRSTKSYKDVLQQKESHVVTTHPSSSVPSPQPLDSEIIVVQRFHKKDDWPSIRNTILAGISHRCSINPFQDNKALLHVY
DQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIFRYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAG
GDVTVEIKGLTASLFKSARFEESPSFSEQNNLEIKRSEKLDGKNLKLPEEIKSPPKNQEFIPILAETASPKGLSPCLVSPQTISQPRSILQAADHSNISPKKKTAHSNKG
KSPLHVASPIEAKNHSNYFLPVGPTTLGLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTSHIPSSPHIIPSPTKD
VTPLQQPSSSPPEPSPLSLPTYLCHLAPMLSKHGLCIMVLPTGSIPKPPTKKTKATTGKKSKLKREVWALGKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSS
SCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTF
NQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGH
GFMMKLKGLKMELRKWNITNRNDVSQLPSLISQLKSLDSIGDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQFA
DDTLLFSIYDSKALDNLFEIIKLFEMASGLNINFAKNLVASRIQRRLGNGCSTLFWHDSWLSCGVLSEAFPRLYRLSNRSDGTVADFWVSLNSAWDLSLRRNLNDSETNE
WASLSHLLSSIRIRVIDDTWSWPIDSSNAFTVKSLMGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWELSLGAINTSDRLQRRMPYLHLSPSWCVMCCSDAENTCHLFV
HCSFASRYWSTIFNAFEWSLALPNNIYDVLASIFVGHPFHGVKKILWLALNRVFLWFLWGERNGRIFRDSFSSFENFMDLILFYALY