CuGenDBv2

Spg036808 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg036808
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: scaffold5:47785974..47790979
RNA-Seq Expression: Spg036808
Synteny: Spg036808
Gene Ontology terms:
    GO:0006281 - DNA repair (biological process)
    GO:0110165 - cellular anatomical structure (cellular component)
    GO:0004518 - nuclease activity (molecular function)
InterPro domains:
    IPR004808 - AP endonuclease 1
    IPR025558 - Domain of unknown function DUF4283
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
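The genome location above is given as a sequence name followed by a start..end range. The following is a minimal sketch of how such a string could be split into its parts; the `Location` type and `parse_location` helper are hypothetical names introduced here, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed location string (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings of the form "<seqid>:<start>..<end>".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three fields."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("scaffold5:47785974..47790979")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span reported for this gene covers 5006 positions on scaffold5.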


Homology
GenBank top hits | e value | % identity | Alignment
KAA0044449.1 hypothetical protein E6C27_scaffold46G001820 [Cucumis melo var. makuwa] | 2.8e-71 | 27.2
Query:  NHSTRSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINQLLNSGD
        N   R   +++K F +  D+ SR S   ITE     S S+ ++  SL WL  +FK L + P + +F  + R  D+ LW++ + N+ G+  EI ++ + G 
Subjt:  NHSTRSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINQLLNSGD

Query:  RRRLLIPSEDNKQGWFSFFSLIS--------DYPGEAHRSTKSYKDVLQQK--------------ESHVVTTHPSSSVPSPQPLDSETIVVQRFHHKDDW
        +  +L+P   +K GW  F  +++        + P   + +    K+ +QQ                  V ++  SSS  S       T  ++R    DDW
Subjt:  RRRLLIPSEDNKQGWFSFFSLIS--------DYPGEAHRSTKSYKDVLQQK--------------ESHVVTTHPSSSVPSPQPLDSETIVVQRFHHKDDW

Query:  NSIRNTILAGISHRCS---INPFQDNKALLHVYDQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIFRYIGDVC
          I + +      + S     PF  +KALL + D+ +   LC N  W+++G + +KF   + ++     +  S+GGW     +PL +W    F  IG+  
Subjt:  NSIRNTILAGISHRCS---INPFQDNKALLHVYDQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIFRYIGDVC

Query:  GGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAGGDVTMEI----KG---------LTASFFKSA--RFEESPSFSEQ----NNLEIKRSE
        GGF + +  +  KL LT A IK+++N  GF+PA I++     G D  ++     KG         +  SF K+A   F E   ++EQ     NL +    
Subjt:  GGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAGGDVTMEI----KG---------LTASFFKSA--RFEESPSFSEQ----NNLEIKRSE

Query:  KLDGKNLKLPEEIKSPPKNQEFIPILAETASPKDLSPCLVSPQTISQPRSILQAADH-SNISPKKKTAHSNKGKSPLHVASPIEAKNHTNYFLPVGPTTL
         L   + K  +++ +  K      I     +   L+         S     +Q  DH S ++ KKK      G+    + SPI  K   ++  P   T L
Subjt:  KLDGKNLKLPEEIKSPPKNQEFIPILAETASPKDLSPCLVSPQTISQPRSILQAADH-SNISPKKKTAHSNKGKSPLHVASPIEAKNHTNYFLPVGPTTL

Query:  GLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTSHIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPL
               T    + S   +Y    A  K P  S             I  +++  +S  +   +               S T      +Q      +P+  
Subjt:  GLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTSHIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPL

Query:  SLPTYLCHLAPMLSKHGLCIMALPTGSIPKPPTKKTKATTGKKSKLKREFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSS
         L   L H++ +LS         P+  IP  PT  T++   K S       +   R     KK+  I++  +    SF      K+   D     ++  +
Subjt:  SLPTYLCHLAPMLSKHGLCIMALPTGSIPKPPTKKTKATTGKKSKLKREFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSS

Query:  SCIGWASLDSIGASGGILILWSDPDFTI----KEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWE
        +    +  +S+     I IL   P+  +    ++VI G FS+SI V   +G S+WLSAIYGP++R++R  FW+EL +L  +    WILGGDFNV RW  E
Subjt:  SCIGWASLDSIGASGGILILWSDPDFTI----KEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWE

Query:  KSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWL
         S     + SM+ FN +I+N +L+D PL N  +TWS+       LS LDRFL +    + F       L R TSDH+P+ L    I+WGP PFRF NA+L
Subjt:  KSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWL

Query:  HIESFREVLKNWWNQNPLQGWPGHGFMMKLK-------------GLKMELRKWNIT
            +++ ++ WW      G+ G+ FM +LK             GLK  L+ +N+T
Subjt:  HIESFREVLKNWWNQNPLQGWPGHGFMMKLK-------------GLKMELRKWNIT

RVW25035.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | 1.4e-67 | 42.9
Query:  EFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGF
        + LSWN RGLGS KKR ++++ +  QNP  V+LQETK+ + D +F+ S+W+   + W +L + GASGGI+ILW    F   E + G FS+++     +  
Subjt:  EFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGF

Query:  SFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFL
        SFWL+++YGP     R DFW EL DL GL   RW +GGDFNV R   EK     +T +MR F+++I    LLD PL+N ++TWS+   D      LDRFL
Subjt:  SFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFL

Query:  LTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLP
         +++    F  +    L R TSDH P+ L    + WGP PFRF+N WL    F+E  + WW +   +GW GH FM KLK +K++L+KWNI    D+ +  
Subjt:  LTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLP

Query:  SLI
         LI
Subjt:  SLI

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 1.2e-69 | 25.49
Query:  RSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINQLLNSGDRRRL
        RS  ++RK F +  D+ S+ +   +TE     + S+ +S + L W+  + K+L   P + +F  + R  +  +W+ K  N  G   EI ++     +  +
Subjt:  RSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINQLLNSGDRRRL

Query:  LIPSEDNKQGWFSFFSLIS---------------------------DYPGEAHR---------STKSYKDVLQQKESHVVTTHPSSSVPSPQPLDSETIV
        L+P   +K GW SF S+I+                           DY   ++          +T    D     +S   +++     PS   L++  ++
Subjt:  LIPSEDNKQGWFSFFSLIS---------------------------DYPGEAHR---------STKSYKDVLQQKESHVVTTHPSSSVPSPQPLDSETIV

Query:  VQRFHHKDDWNSIRNTILAGISHRCSINPFQDNKALLHVYDQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIF
        V+RF H DDW+ I   +        + N F   KAL+H       + LC NK WS++GKY ++F   +        +  S+GGW     +PL LW    F
Subjt:  VQRFHHKDDWNSIRNTILAGISHRCSINPFQDNKALLHVYDQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIF

Query:  RYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAG-----------GDVTMEIKGLTASFFK---SARFEESPSFSEQNNLEIKR
        + IG  C G  +++  T    NL  A+IK+R N  GF+PA +++  +              G   +E        FK   +A F++    SEQ   E   
Subjt:  RYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAG-----------GDVTMEIKGLTASFFK---SARFEESPSFSEQNNLEIKR

Query:  -------SEKLDGKNLKLPEEIKSPPKNQEFIPILAETASPKDLSPCLVSPQTI-----SQPRSILQAADHSNISPKKK-----------TAHSNKGKSP
               S   DG+    P++  S  K+    P    T  P  L+  LV+   +          IL    +  +  K K             + +K K  
Subjt:  -------SEKLDGKNLKLPEEIKSPPKNQEFIPILAETASPKDLSPCLVSPQTI-----SQPRSILQAADHSNISPKKK-----------TAHSNKGKSP

Query:  LHVASPIEAKNHTNYFLPVG-----PTTLGLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLP---PTS
        +   SP    N TN F P         +L   EKK   ++  +   ++  + P N K+        T P        D      S  + L  LP   P  
Subjt:  LHVASPIEAKNHTNYFLPVG-----PTTLGLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLP---PTS

Query:  HIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPLSLPTYLCHLAPMLSKHGLCIMALPTGSIPKPPTKKTKATTGKKSKLKREFLSWNVRGLGSWKKRALIK
         +    +   +   D+T  +    +P    P++  +     A                  PK   K+      K+ K K        + L SW K+  +K
Subjt:  HIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPLSLPTYLCHLAPMLSKHGLCIMALPTGSIPKPPTKKTKATTGKKSKLKREFLSWNVRGLGSWKKRALIK

Query:  KTIQQQN-----PSFVLLQETKK--TSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR
         +    +      + VLL +        + + IKS+W S+ I W + ++ G+SGGILILW   + ++    +G FS+S +  + +  S+WL+ +YGP +R
Subjt:  KTIQQQN-----PSFVLLQETKK--TSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR

Query:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN
          R  FW ELH+L  L    WILGGD NV R   E +   + + + R  N +I+N  L+D PL N  +TWS+  +   + S +DRFL  +   + F    
Subjt:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN

Query:  LLRLDRVTSDHYPLAL--SFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKW
           L R TSDH+PL    S   ++WGP PFR ++  L    F+  +  WW  +   G+PG  F+ +LK L   ++ W
Subjt:  LLRLDRVTSDHYPLAL--SFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKW

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 2.2e-07 | 34.74
Query:  LGNGCSTLFWHDSWLSCGVLSEAFPRLYRLSNRSEGTVADFWVSLNSAWDLSLRRNLNDSETNEWASLSHLLSSIRIRVIDDTWSWPIDSSNAFT
        L NG    FW+ +W   G LS A+PRL+ L+   E +V D W + ++ W++  RR LND E   W  +  +L + R        +W  DS+N+F+
Subjt:  LGNGCSTLFWHDSWLSCGVLSEAFPRLYRLSNRSEGTVADFWVSLNSAWDLSLRRNLNDSETNEWASLSHLLSSIRIRVIDDTWSWPIDSSNAFT

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 2.2e-68 | 43.96
Query:  EFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGF
        + +SWN RGLGS KKR ++K  +  + P  V++QETKK   D + + S+WS     WA+L + GASGGILI+W       +EV+ G FS+SI   M +  
Subjt:  EFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGF

Query:  SFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFL
        S WLSA+YGP+    R DFW EL D+AGL   RW +GGDFNV R S EK  G  +T  M+ F+++I +  L+D PL++ SYTWS+  ++      LDRFL
Subjt:  SFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFL

Query:  LTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQ
         +N+    F  +    L R TSDH+P+ L      WGP PFRF+N WL   SF+E    WW++    GW GH FM KL+ +K +L++WN T+  ++S+
Subjt:  LTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQ

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia] | 6.2e-87 | 49.84
Query:  EFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGF
        +FL+WNVRGL SWKK ALIK+ I + NP+ V+LQETK + +D   +KS+WS+  I W++LD+ G + GILILW+DPD    E+I+G FS++I+  ++DGF
Subjt:  EFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGF

Query:  SFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFL
         FW+S IYGPS  E    FWQEL DL+ L  + WIL GDFNVTRWSWEKS+GR +T+SM  FN +I +  L+D+PL NG +TWS         SL+D FL
Subjt:  SFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFL

Query:  LTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITN----RNDV
        LTN C+ K G     R+ R TSDH+P+ L FG   WG  PFRF+N WL  ++F+  L+ WW   PL GWPGHG MMKLK LK  ++ W   +     +  
Subjt:  LTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITN----RNDV

Query:  SQLPSLISQLKNLVASR
          L +L++ L +L  S+
Subjt:  SQLPSLISQLKNLVASR

TrEMBL top hits | e value | % identity | Alignment
A0A438CP96 LINE-1 retrotransposable element ORF2 protein | 6.9e-68 | 42.9
Query:  EFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGF
        + LSWN RGLGS KKR ++++ +  QNP  V+LQETK+ + D +F+ S+W+   + W +L + GASGGI+ILW    F   E + G FS+++     +  
Subjt:  EFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGF

Query:  SFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFL
        SFWL+++YGP     R DFW EL DL GL   RW +GGDFNV R   EK     +T +MR F+++I    LLD PL+N ++TWS+   D      LDRFL
Subjt:  SFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFL

Query:  LTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLP
         +++    F  +    L R TSDH P+ L    + WGP PFRF+N WL    F+E  + WW +   +GW GH FM KLK +K++L+KWNI    D+ +  
Subjt:  LTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLP

Query:  SLI
         LI
Subjt:  SLI

A0A5A7TTA1 DUF4283 domain-containing protein | 1.3e-71 | 27.2
Query:  NHSTRSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINQLLNSGD
        N   R   +++K F +  D+ SR S   ITE     S S+ ++  SL WL  +FK L + P + +F  + R  D+ LW++ + N+ G+  EI ++ + G 
Subjt:  NHSTRSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINQLLNSGD

Query:  RRRLLIPSEDNKQGWFSFFSLIS--------DYPGEAHRSTKSYKDVLQQK--------------ESHVVTTHPSSSVPSPQPLDSETIVVQRFHHKDDW
        +  +L+P   +K GW  F  +++        + P   + +    K+ +QQ                  V ++  SSS  S       T  ++R    DDW
Subjt:  RRRLLIPSEDNKQGWFSFFSLIS--------DYPGEAHRSTKSYKDVLQQK--------------ESHVVTTHPSSSVPSPQPLDSETIVVQRFHHKDDW

Query:  NSIRNTILAGISHRCS---INPFQDNKALLHVYDQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIFRYIGDVC
          I + +      + S     PF  +KALL + D+ +   LC N  W+++G + +KF   + ++     +  S+GGW     +PL +W    F  IG+  
Subjt:  NSIRNTILAGISHRCS---INPFQDNKALLHVYDQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIFRYIGDVC

Query:  GGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAGGDVTMEI----KG---------LTASFFKSA--RFEESPSFSEQ----NNLEIKRSE
        GGF + +  +  KL LT A IK+++N  GF+PA I++     G D  ++     KG         +  SF K+A   F E   ++EQ     NL +    
Subjt:  GGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAGGDVTMEI----KG---------LTASFFKSA--RFEESPSFSEQ----NNLEIKRSE

Query:  KLDGKNLKLPEEIKSPPKNQEFIPILAETASPKDLSPCLVSPQTISQPRSILQAADH-SNISPKKKTAHSNKGKSPLHVASPIEAKNHTNYFLPVGPTTL
         L   + K  +++ +  K      I     +   L+         S     +Q  DH S ++ KKK      G+    + SPI  K   ++  P   T L
Subjt:  KLDGKNLKLPEEIKSPPKNQEFIPILAETASPKDLSPCLVSPQTISQPRSILQAADH-SNISPKKKTAHSNKGKSPLHVASPIEAKNHTNYFLPVGPTTL

Query:  GLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTSHIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPL
               T    + S   +Y    A  K P  S             I  +++  +S  +   +               S T      +Q      +P+  
Subjt:  GLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTSHIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPL

Query:  SLPTYLCHLAPMLSKHGLCIMALPTGSIPKPPTKKTKATTGKKSKLKREFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSS
         L   L H++ +LS         P+  IP  PT  T++   K S       +   R     KK+  I++  +    SF      K+   D     ++  +
Subjt:  SLPTYLCHLAPMLSKHGLCIMALPTGSIPKPPTKKTKATTGKKSKLKREFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSS

Query:  SCIGWASLDSIGASGGILILWSDPDFTI----KEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWE
        +    +  +S+     I IL   P+  +    ++VI G FS+SI V   +G S+WLSAIYGP++R++R  FW+EL +L  +    WILGGDFNV RW  E
Subjt:  SCIGWASLDSIGASGGILILWSDPDFTI----KEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWE

Query:  KSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWL
         S     + SM+ FN +I+N +L+D PL N  +TWS+       LS LDRFL +    + F       L R TSDH+P+ L    I+WGP PFRF NA+L
Subjt:  KSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWL

Query:  HIESFREVLKNWWNQNPLQGWPGHGFMMKLK-------------GLKMELRKWNIT
            +++ ++ WW      G+ G+ FM +LK             GLK  L+ +N+T
Subjt:  HIESFREVLKNWWNQNPLQGWPGHGFMMKLK-------------GLKMELRKWNIT

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein | 5.6e-70 | 25.49
Query:  RSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINQLLNSGDRRRL
        RS  ++RK F +  D+ S+ +   +TE     + S+ +S + L W+  + K+L   P + +F  + R  +  +W+ K  N  G   EI ++     +  +
Subjt:  RSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINQLLNSGDRRRL

Query:  LIPSEDNKQGWFSFFSLIS---------------------------DYPGEAHR---------STKSYKDVLQQKESHVVTTHPSSSVPSPQPLDSETIV
        L+P   +K GW SF S+I+                           DY   ++          +T    D     +S   +++     PS   L++  ++
Subjt:  LIPSEDNKQGWFSFFSLIS---------------------------DYPGEAHR---------STKSYKDVLQQKESHVVTTHPSSSVPSPQPLDSETIV

Query:  VQRFHHKDDWNSIRNTILAGISHRCSINPFQDNKALLHVYDQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIF
        V+RF H DDW+ I   +        + N F   KAL+H       + LC NK WS++GKY ++F   +        +  S+GGW     +PL LW    F
Subjt:  VQRFHHKDDWNSIRNTILAGISHRCSINPFQDNKALLHVYDQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIF

Query:  RYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAG-----------GDVTMEIKGLTASFFK---SARFEESPSFSEQNNLEIKR
        + IG  C G  +++  T    NL  A+IK+R N  GF+PA +++  +              G   +E        FK   +A F++    SEQ   E   
Subjt:  RYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAG-----------GDVTMEIKGLTASFFK---SARFEESPSFSEQNNLEIKR

Query:  -------SEKLDGKNLKLPEEIKSPPKNQEFIPILAETASPKDLSPCLVSPQTI-----SQPRSILQAADHSNISPKKK-----------TAHSNKGKSP
               S   DG+    P++  S  K+    P    T  P  L+  LV+   +          IL    +  +  K K             + +K K  
Subjt:  -------SEKLDGKNLKLPEEIKSPPKNQEFIPILAETASPKDLSPCLVSPQTI-----SQPRSILQAADHSNISPKKK-----------TAHSNKGKSP

Query:  LHVASPIEAKNHTNYFLPVG-----PTTLGLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLP---PTS
        +   SP    N TN F P         +L   EKK   ++  +   ++  + P N K+        T P        D      S  + L  LP   P  
Subjt:  LHVASPIEAKNHTNYFLPVG-----PTTLGLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLP---PTS

Query:  HIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPLSLPTYLCHLAPMLSKHGLCIMALPTGSIPKPPTKKTKATTGKKSKLKREFLSWNVRGLGSWKKRALIK
         +    +   +   D+T  +    +P    P++  +     A                  PK   K+      K+ K K        + L SW K+  +K
Subjt:  HIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPLSLPTYLCHLAPMLSKHGLCIMALPTGSIPKPPTKKTKATTGKKSKLKREFLSWNVRGLGSWKKRALIK

Query:  KTIQQQN-----PSFVLLQETKK--TSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR
         +    +      + VLL +        + + IKS+W S+ I W + ++ G+SGGILILW   + ++    +G FS+S +  + +  S+WL+ +YGP +R
Subjt:  KTIQQQN-----PSFVLLQETKK--TSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRR

Query:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN
          R  FW ELH+L  L    WILGGD NV R   E +   + + + R  N +I+N  L+D PL N  +TWS+  +   + S +DRFL  +   + F    
Subjt:  EHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSAN

Query:  LLRLDRVTSDHYPLAL--SFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKW
           L R TSDH+PL    S   ++WGP PFR ++  L    F+  +  WW  +   G+PG  F+ +LK L   ++ W
Subjt:  LLRLDRVTSDHYPLAL--SFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKW

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein | 1.0e-07 | 34.74
Query:  LGNGCSTLFWHDSWLSCGVLSEAFPRLYRLSNRSEGTVADFWVSLNSAWDLSLRRNLNDSETNEWASLSHLLSSIRIRVIDDTWSWPIDSSNAFT
        L NG    FW+ +W   G LS A+PRL+ L+   E +V D W + ++ W++  RR LND E   W  +  +L + R        +W  DS+N+F+
Subjt:  LGNGCSTLFWHDSWLSCGVLSEAFPRLYRLSNRSEGTVADFWVSLNSAWDLSLRRNLNDSETNEWASLSHLLSSIRIRVIDDTWSWPIDSSNAFT

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein | 1.1e-68 | 43.96
Query:  EFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGF
        + +SWN RGLGS KKR ++K  +  + P  V++QETKK   D + + S+WS     WA+L + GASGGILI+W       +EV+ G FS+SI   M +  
Subjt:  EFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGF

Query:  SFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFL
        S WLSA+YGP+    R DFW EL D+AGL   RW +GGDFNV R S EK  G  +T  M+ F+++I +  L+D PL++ SYTWS+  ++      LDRFL
Subjt:  SFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFL

Query:  LTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQ
         +N+    F  +    L R TSDH+P+ L      WGP PFRF+N WL   SF+E    WW++    GW GH FM KL+ +K +L++WN T+  ++S+
Subjt:  LTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQ

A0A6J1E2G6 uncharacterized protein LOC111025405 | 3.0e-87 | 49.84
Query:  EFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGF
        +FL+WNVRGL SWKK ALIK+ I + NP+ V+LQETK + +D   +KS+WS+  I W++LD+ G + GILILW+DPD    E+I+G FS++I+  ++DGF
Subjt:  EFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGF

Query:  SFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFL
         FW+S IYGPS  E    FWQEL DL+ L  + WIL GDFNVTRWSWEKS+GR +T+SM  FN +I +  L+D+PL NG +TWS         SL+D FL
Subjt:  SFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFL

Query:  LTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITN----RNDV
        LTN C+ K G     R+ R TSDH+P+ L FG   WG  PFRF+N WL  ++F+  L+ WW   PL GWPGHG MMKLK LK  ++ W   +     +  
Subjt:  LTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITN----RNDV

Query:  SQLPSLISQLKNLVASR
          L +L++ L +L  S+
Subjt:  SQLPSLISQLKNLVASR

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGACCACTGCTATCTCTGCCACAGCTCAACCTTGGAACCACTCCACCAGATCTATCTACATTGATCGAAAAACTTTCTCCATTGAATTTGATGAACCTTCTAGGGGAAG
CCGAGCAAAAATCACAGAGCATAGTAGAGCCTCCTCCCATTCCTTAACTTTGTCTTGGAAATCTCTCCATTGGCTAGCATCCTCCTTCAAAACTCTTGCCCATGAACCGT
GCTCTTACAAATTCTCCTCCAAGATAAGAACTGATGACTATGTTCTCTGGTTGGAAAAACTCAGCAATAAGTATGGCTTCTTTGTGGAAATTAATCAACTGTTGAATTCA
GGTGATCGACGTCGACTCCTTATACCATCTGAAGACAACAAGCAAGGTTGGTTCTCCTTTTTCTCCCTCATCTCTGATTACCCAGGAGAGGCTCATCGATCAACAAAATC
ATATAAAGATGTCCTCCAACAAAAGGAAAGTCATGTTGTCACCACTCACCCTTCATCATCTGTCCCTTCACCACAGCCTCTCGACAGTGAGACTATTGTTGTTCAACGAT
TCCATCATAAGGATGATTGGAATTCCATTCGGAACACCATTCTTGCTGGCATATCCCACCGTTGCTCCATCAATCCATTTCAAGATAATAAAGCTTTGTTACATGTATAT
GATCAAAATATCGTGTCAAAACTTTGCAACAACAAGGATTGGTCCTCCATTGGCAAATACCGATTGAAGTTTTACCCATTGACTACCGACTCATTTTATCAAGACACTAT
GACTAATTCTTTTGGTGGATGGATTGAAGTGCTGCAACTTCCTTTACCTTTATGGACAGAACAAATTTTCAGATATATTGGTGATGTTTGTGGAGGCTTCACCGAAATAT
CCAACCACACCAGCAGGAAGCTAAATCTCACAGCGGCAAAGATTAAAATCCGGCAAAATTCCATCGGTTTCATCCCGGCCAGAATTAAGCTTCCTTCATCCCTTGCCGGC
GGCGACGTTACAATGGAAATCAAAGGGTTGACGGCCAGCTTTTTCAAATCAGCGAGATTTGAGGAATCCCCGTCATTTTCGGAACAAAATAATTTAGAAATTAAGAGGAG
TGAGAAATTGGATGGAAAAAATTTGAAACTACCAGAGGAAATCAAATCGCCTCCTAAAAATCAAGAATTCATTCCTATTTTGGCCGAAACTGCTTCTCCAAAGGATTTAT
CTCCTTGTCTGGTTTCTCCCCAAACTATATCACAGCCAAGAAGTATTTTGCAAGCGGCAGATCACTCCAATATATCTCCTAAAAAAAAGACAGCACACTCCAACAAAGGT
AAATCTCCCCTCCACGTGGCCTCCCCCATTGAGGCAAAGAATCATACAAATTATTTTCTTCCAGTGGGACCCACCACTCTTGGTTTAGGAGAAAAGAAATCAACAGGCAA
CAAGATAATAGCTTCTGATACTGAGGCTTACTTATCAAGTCCAGCCAATGACAAATCCCCTCATACTTCGGTTTGTGACCCTACATCGCCTCGAAACTTTGACCTTGCAA
TTTTTGATGAGTTACATTTACCCGAGTCGGAACAAATTCCATTGGCAAGCTTACCTCCGACCTCCCACATTCCATCATCCCCCCATATTATTCCCTCTCCCACAAAAGAT
GTTACCCCACTACAACAACCATCTAGCTCTCCACCCGAACCATCACCCCTTTCCCTCCCAACATATCTCTGTCATTTAGCTCCAATGCTTAGTAAACATGGTTTATGCAT
CATGGCTCTTCCAACTGGCTCAATACCCAAACCGCCAACTAAGAAAACTAAAGCTACTACAGGGAAAAAGTCAAAACTTAAGAGAGAGTTTCTTTCATGGAATGTTAGAG
GTTTGGGCTCTTGGAAAAAAAGAGCATTAATTAAGAAGACCATCCAACAACAAAATCCGAGCTTCGTGCTTCTTCAAGAAACTAAAAAGACATCGGTTGATGGAAAATTT
ATTAAATCTATATGGAGTTCTTCTTGCATTGGTTGGGCTTCCCTTGATTCCATTGGAGCATCCGGAGGCATCCTTATTCTTTGGAGTGATCCTGATTTCACGATCAAAGA
AGTTATTCAAGGTCACTTTTCAATCTCAATTCATGTTTTTATGGCTGACGGTTTTTCTTTTTGGCTTTCGGCTATTTATGGTCCTTCTAGGCGTGAGCACCGTGCAGATT
TTTGGCAAGAACTCCATGATTTGGCTGGTTTAGGTGGTGATCGATGGATCCTTGGAGGGGATTTTAATGTTACTCGCTGGTCTTGGGAGAAATCTCATGGTCGAAATGTT
ACTAGGAGCATGCGCACTTTCAATCAATGGATTGCCAATTACCATCTCTTGGACATTCCACTACAAAATGGCTCTTACACTTGGTCTAGCTTTGGGGATGACATTGAATA
TCTCTCACTTCTGGATAGATTTTTATTAACAAATGATTGCCTTCACAAATTTGGGTCAGCAAATCTCCTTCGTCTTGATAGAGTCACATCAGATCACTACCCTTTAGCTC
TTTCTTTTGGAGACATAGCTTGGGGGCCTTGTCCTTTCCGTTTTGACAATGCTTGGTTACATATTGAGTCCTTTCGTGAAGTTCTGAAAAACTGGTGGAACCAAAATCCT
CTCCAAGGCTGGCCAGGGCATGGTTTTATGATGAAACTCAAGGGATTGAAAATGGAACTAAGAAAATGGAACATCACGAATCGTAATGATGTTTCCCAACTACCATCTCT
TATTTCTCAATTGAAGAATCTTGTTGCTAGTCGTATTCAGAGACGGCTTGGAAATGGTTGTTCCACCCTTTTTTGGCATGATTCTTGGCTAAGTTGTGGAGTCTTGTCTG
AGGCTTTCCCTCGTCTTTATAGATTATCTAATCGCTCGGAAGGTACAGTTGCTGACTTTTGGGTTTCATTGAATTCGGCTTGGGATTTGAGTCTTCGTCGAAATTTAAAT
GATTCGGAGACAAATGAGTGGGCTAGTCTCTCTCATCTGCTTTCTTCCATCAGAATTCGAGTTATTGATGACACTTGGTCTTGGCCTATTGATTCATCTAATGCATTCAC
A
mRNA sequence
ATGACCACTGCTATCTCTGCCACAGCTCAACCTTGGAACCACTCCACCAGATCTATCTACATTGATCGAAAAACTTTCTCCATTGAATTTGATGAACCTTCTAGGGGAAG
CCGAGCAAAAATCACAGAGCATAGTAGAGCCTCCTCCCATTCCTTAACTTTGTCTTGGAAATCTCTCCATTGGCTAGCATCCTCCTTCAAAACTCTTGCCCATGAACCGT
GCTCTTACAAATTCTCCTCCAAGATAAGAACTGATGACTATGTTCTCTGGTTGGAAAAACTCAGCAATAAGTATGGCTTCTTTGTGGAAATTAATCAACTGTTGAATTCA
GGTGATCGACGTCGACTCCTTATACCATCTGAAGACAACAAGCAAGGTTGGTTCTCCTTTTTCTCCCTCATCTCTGATTACCCAGGAGAGGCTCATCGATCAACAAAATC
ATATAAAGATGTCCTCCAACAAAAGGAAAGTCATGTTGTCACCACTCACCCTTCATCATCTGTCCCTTCACCACAGCCTCTCGACAGTGAGACTATTGTTGTTCAACGAT
TCCATCATAAGGATGATTGGAATTCCATTCGGAACACCATTCTTGCTGGCATATCCCACCGTTGCTCCATCAATCCATTTCAAGATAATAAAGCTTTGTTACATGTATAT
GATCAAAATATCGTGTCAAAACTTTGCAACAACAAGGATTGGTCCTCCATTGGCAAATACCGATTGAAGTTTTACCCATTGACTACCGACTCATTTTATCAAGACACTAT
GACTAATTCTTTTGGTGGATGGATTGAAGTGCTGCAACTTCCTTTACCTTTATGGACAGAACAAATTTTCAGATATATTGGTGATGTTTGTGGAGGCTTCACCGAAATAT
CCAACCACACCAGCAGGAAGCTAAATCTCACAGCGGCAAAGATTAAAATCCGGCAAAATTCCATCGGTTTCATCCCGGCCAGAATTAAGCTTCCTTCATCCCTTGCCGGC
GGCGACGTTACAATGGAAATCAAAGGGTTGACGGCCAGCTTTTTCAAATCAGCGAGATTTGAGGAATCCCCGTCATTTTCGGAACAAAATAATTTAGAAATTAAGAGGAG
TGAGAAATTGGATGGAAAAAATTTGAAACTACCAGAGGAAATCAAATCGCCTCCTAAAAATCAAGAATTCATTCCTATTTTGGCCGAAACTGCTTCTCCAAAGGATTTAT
CTCCTTGTCTGGTTTCTCCCCAAACTATATCACAGCCAAGAAGTATTTTGCAAGCGGCAGATCACTCCAATATATCTCCTAAAAAAAAGACAGCACACTCCAACAAAGGT
AAATCTCCCCTCCACGTGGCCTCCCCCATTGAGGCAAAGAATCATACAAATTATTTTCTTCCAGTGGGACCCACCACTCTTGGTTTAGGAGAAAAGAAATCAACAGGCAA
CAAGATAATAGCTTCTGATACTGAGGCTTACTTATCAAGTCCAGCCAATGACAAATCCCCTCATACTTCGGTTTGTGACCCTACATCGCCTCGAAACTTTGACCTTGCAA
TTTTTGATGAGTTACATTTACCCGAGTCGGAACAAATTCCATTGGCAAGCTTACCTCCGACCTCCCACATTCCATCATCCCCCCATATTATTCCCTCTCCCACAAAAGAT
GTTACCCCACTACAACAACCATCTAGCTCTCCACCCGAACCATCACCCCTTTCCCTCCCAACATATCTCTGTCATTTAGCTCCAATGCTTAGTAAACATGGTTTATGCAT
CATGGCTCTTCCAACTGGCTCAATACCCAAACCGCCAACTAAGAAAACTAAAGCTACTACAGGGAAAAAGTCAAAACTTAAGAGAGAGTTTCTTTCATGGAATGTTAGAG
GTTTGGGCTCTTGGAAAAAAAGAGCATTAATTAAGAAGACCATCCAACAACAAAATCCGAGCTTCGTGCTTCTTCAAGAAACTAAAAAGACATCGGTTGATGGAAAATTT
ATTAAATCTATATGGAGTTCTTCTTGCATTGGTTGGGCTTCCCTTGATTCCATTGGAGCATCCGGAGGCATCCTTATTCTTTGGAGTGATCCTGATTTCACGATCAAAGA
AGTTATTCAAGGTCACTTTTCAATCTCAATTCATGTTTTTATGGCTGACGGTTTTTCTTTTTGGCTTTCGGCTATTTATGGTCCTTCTAGGCGTGAGCACCGTGCAGATT
TTTGGCAAGAACTCCATGATTTGGCTGGTTTAGGTGGTGATCGATGGATCCTTGGAGGGGATTTTAATGTTACTCGCTGGTCTTGGGAGAAATCTCATGGTCGAAATGTT
ACTAGGAGCATGCGCACTTTCAATCAATGGATTGCCAATTACCATCTCTTGGACATTCCACTACAAAATGGCTCTTACACTTGGTCTAGCTTTGGGGATGACATTGAATA
TCTCTCACTTCTGGATAGATTTTTATTAACAAATGATTGCCTTCACAAATTTGGGTCAGCAAATCTCCTTCGTCTTGATAGAGTCACATCAGATCACTACCCTTTAGCTC
TTTCTTTTGGAGACATAGCTTGGGGGCCTTGTCCTTTCCGTTTTGACAATGCTTGGTTACATATTGAGTCCTTTCGTGAAGTTCTGAAAAACTGGTGGAACCAAAATCCT
CTCCAAGGCTGGCCAGGGCATGGTTTTATGATGAAACTCAAGGGATTGAAAATGGAACTAAGAAAATGGAACATCACGAATCGTAATGATGTTTCCCAACTACCATCTCT
TATTTCTCAATTGAAGAATCTTGTTGCTAGTCGTATTCAGAGACGGCTTGGAAATGGTTGTTCCACCCTTTTTTGGCATGATTCTTGGCTAAGTTGTGGAGTCTTGTCTG
AGGCTTTCCCTCGTCTTTATAGATTATCTAATCGCTCGGAAGGTACAGTTGCTGACTTTTGGGTTTCATTGAATTCGGCTTGGGATTTGAGTCTTCGTCGAAATTTAAAT
GATTCGGAGACAAATGAGTGGGCTAGTCTCTCTCATCTGCTTTCTTCCATCAGAATTCGAGTTATTGATGACACTTGGTCTTGGCCTATTGATTCATCTAATGCATTCAC
A
Protein sequence
MTTAISATAQPWNHSTRSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINQLLNS
GDRRRLLIPSEDNKQGWFSFFSLISDYPGEAHRSTKSYKDVLQQKESHVVTTHPSSSVPSPQPLDSETIVVQRFHHKDDWNSIRNTILAGISHRCSINPFQDNKALLHVY
DQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIFRYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAG
GDVTMEIKGLTASFFKSARFEESPSFSEQNNLEIKRSEKLDGKNLKLPEEIKSPPKNQEFIPILAETASPKDLSPCLVSPQTISQPRSILQAADHSNISPKKKTAHSNKG
KSPLHVASPIEAKNHTNYFLPVGPTTLGLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTSHIPSSPHIIPSPTKD
VTPLQQPSSSPPEPSPLSLPTYLCHLAPMLSKHGLCIMALPTGSIPKPPTKKTKATTGKKSKLKREFLSWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKF
IKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNV
TRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNP
LQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLPSLISQLKNLVASRIQRRLGNGCSTLFWHDSWLSCGVLSEAFPRLYRLSNRSEGTVADFWVSLNSAWDLSLRRNLN
DSETNEWASLSHLLSSIRIRVIDDTWSWPIDSSNAFT
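
The protein sequence above corresponds codon by codon to the CDS sequence. The following is a minimal sketch of that relationship, assuming the standard genetic code; the `translate` helper is a hypothetical name introduced here, not something the database provides. It reads the CDS in frame from the first base, stops at the first stop codon, and writes `X` for any codon it cannot resolve.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases of the CDS listed in this record.
print(translate("ATGACCACTGCTATCTCTGCCACAGCTCAACCTTGG"))
```

Applied to the opening bases of the CDS, the sketch yields `MTTAISATAQPW`, which matches the start of the listed protein sequence.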