CuGenDBv2

Lag0011069 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011069
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr1:13650623..13653850
RNA-Seq Expression: Lag0011069
Synteny: Lag0011069
Gene Ontology terms: NA
InterPro domains: IPR025836 - Zinc knuckle CX2CX4HX4C
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
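The genome location in this record is written as chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the location string can be split and the genomic span computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr1:13650623..13653850")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 3228 bp on chr1.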


Homology
GenBank top hits
XP_018816058.1 uncharacterized protein LOC108987582 [Juglans regia] | e value: 2.0e-53 | % identity: 26.34
Query:  VSTEDSGKIKGESLRVRVKIDIGEPIKRGTNIRVGSKATKTWIPITYEKLPDFCYSSGRLGHVKQDCDEADSEIIE-RGVYGVKLRETQGSKGY----YR
        V T+D G   G  LR +V++DI +P+ RG  + V    ++ WIP  YE+LP FC+  G + H KQ         +E    YG  LR     KGY    +R
Subjt:  VSTEDSGKIKGESLRVRVKIDIGEPIKRGTNIRVGSKATKTWIPITYEKLPDFCYSSGRLGHVKQDCDEADSEIIE-RGVYGVKLRETQGSKGY----YR

Query:  TKKTEWWGERNQNQWNNSRGRGV--RGRGRWFGNPRNEDRWEQETGTEENGGRQIT-------------QPPEVGKRDLSE---ISRRDKSRER-SLGQE
           T+    ++  +  NS  R        R +        WE+++  EE  G  +T              P E  K  L +   +S  D++       Q 
Subjt:  TKKTEWWGERNQNQWNNSRGRGV--RGRGRWFGNPRNEDRWEQETGTEENGGRQIT-------------QPPEVGKRDLSE---ISRRDKSRER-SLGQE

Query:  TISIPKEKTCQRLVYCQK---------------EELLTENGKEEREWEFWRRTVRANRNKVKSV----------------------------KKSLNR--
        T SIPK     +L                     + + E  K  R+   W+   R    +V  +                            KK  N   
Subjt:  TISIPKEKTCQRLVYCQK---------------EELLTENGKEEREWEFWRRTVRANRNKVKSV----------------------------KKSLNR--

Query:  -------RDISGGWKSAPPNAMKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVT
               + +   W+  PP  M  +SWN RG+GNPRT+  L+L +K  SP+++F +E+KC   +   L + L +D+C AV+S G  GGL L WK   D++
Subjt:  -------RDISGGWKSAPPNAMKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVT

Query:  ISSFSNGHIDAVIKE--DKVIWRFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLR-----------ILS-----------
        I ++S  HI A +KE  D+  W  TGFYG+PE  KR  SW LL  +     + W+  GDFNEI    EK G   R           I+            
Subjt:  ISSFSNGHIDAVIKE--DKVIWRFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLR-----------ILS-----------

Query:  --SWT--PLGKVWTIAILE------------------------------ILAFRTREII----------ETTW--KSEQGNDIE--------------SL
          +W+    G+ +T   L+                              ++   T+E+I          E  W  K+E  N I+               +
Subjt:  --SWT--PLGKVWTIAILE------------------------------ILAFRTREII----------ETTW--KSEQGNDIE--------------SL

Query:  KRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALDLRDKRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLL
         R+ K C   L  WN +       KA+  K   ++ +   ++ +    + + +KEL+  L E++  WK  +++ WL+ G +          + ++  K+ 
Subjt:  KRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALDLRDKRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLL

Query:  VFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVGGTL
                 I +K++IG + T +F  LF+SS P    I+  LE   P VS   N  + + FT+ +++VA+  M    + GPDG  ALF+Q++WDI+G  +
Subjt:  VFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVGGTL

Query:  RDFAL
          FAL
Subjt:  RDFAL

XP_023907370.1 uncharacterized protein LOC112019076 [Quercus suber] | e value: 2.5e-51 | % identity: 31.07
Query:  MKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVI-KEDKVIW
        M +L WN RG+ N RT + L + ++   P +VF+ E+  D+ +  +++RS+ +DN F V  + +GGGL LYWKN  D+ + SFS  HID++I K     W
Subjt:  MKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVI-KEDKVIW

Query:  RFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLRILS------------SWTPLG------------------KVW-TIAI
        RFTGFYG P   +R ++W  L +LN   NIPW+  GDFNE++   EK G S R  S             +  LG                   +W T+  
Subjt:  RFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLRILS------------SWTPLG------------------KVW-TIAI

Query:  LEI----LAFRTRE----------IIETTWKS-EQGNDIESL-KRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALDLRDKRSPSPELLRA-EKEL
        L++      FR  E          I+E  W S E+ +DIE +   + K C + L  WN+    G++ + + +K  E+   +    RS   + +R+ ++++
Subjt:  LEI----LAFRTRE----------IIETTWKS-EQGNDIESL-KRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALDLRDKRSPSPELLRA-EKEL

Query:  DNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLVFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRD
          L+++E + W   S+  W K+G            + KRK  +       G W  DK +I      Y++ LFS++  +  Y D +      V++A  N  
Subjt:  DNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLVFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRD

Query:  ISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVG
        +S  F++ +++ A+  M+P KA GPDG+  LFFQ YW+++G
Subjt:  ISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVG

XP_028068804.1 uncharacterized protein LOC114271378 [Camellia sinensis] | e value: 2.5e-51 | % identity: 34.47
Query:  MKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVIKEDKVI--
        MK LSWN RG+GNPRT+RNL L +K+ +PD+VF++E+KC   K    +  LG      V+  G  GGL L+W+    V+I ++S+GH+DAV++ D  I  
Subjt:  MKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVIKEDKVI--

Query:  WRFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLRILSSWTPLGKVWTIAILEILAF----------RTREIIETTWKSEQ
        WRFTGFYG+PEV K+ DSW+LL RL  L ++PW+V  DFNEIL   EK G   R  +         +   L  L F          R   ++        
Subjt:  WRFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLRILSSWTPLGKVWTIAILEILAF----------RTREIIETTWKSEQ

Query:  GNDIESL---KRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALD-----LRDKRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSG
         ND       K +        S    I+++    + +  K NE++ L+      RD+     +L     E+D LLE EE  W   +R  WLK G    S 
Subjt:  GNDIESL---KRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALD-----LRDKRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSG

Query:  STRGRVKEKRKTKLLVFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLH
             ++  +K ++         W    +D+  I   YF  LF +    +  I + L     VV    N +++RP+T  ++  AL  M PTKA GPDG  
Subjt:  STRGRVKEKRKTKLLVFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLH

Query:  ALFFQTYWDIVG
         LF+Q +W IVG
Subjt:  ALFFQTYWDIVG

XP_030924668.1 uncharacterized protein LOC115951644 [Quercus lobata] | e value: 2.5e-51 | % identity: 31.05
Query:  MKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVIKED-KVIW
        M  LSWN RG+GNP+T   L   ++   P L+F++E+K +      + R + Y N F V  +  GGGL LYW  D++V + SFS  HIDA+I       W
Subjt:  MKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVIKED-KVIW

Query:  RFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLR-----------------------------------ILSSWTPLGKVW
        RFTGFYG+PE   R++SW LL  L+  + +PW+  GDFNEIL+  EK+G   R                                   +LS  + L + +
Subjt:  RFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLR-----------------------------------ILSSWTPLGKVW

Query:  TIA---ILEILAFRTR---EIIETTWKSEQGNDIE-SLKRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALDLRDKRSPSPELLRA-EKELDNLLE
                E +  R R   E++  +W  E           +   C   L  WNK I  G +  +++KK  ++K  +       +P  ++A   E+  L  
Subjt:  TIA---ILEILAFRTR---EIIETTWKSEQGNDIE-SLKRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALDLRDKRSPSPELLRA-EKELDNLLE

Query:  EEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLV-FFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRD-ISR
        +EE  WK  SR  WLK G +  +     R  ++ +  L+     + G+W+ED+  +G +   YF+ +F+SS PS+     ++   I  V+  D+RD +  
Subjt:  EEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLV-FFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRD-ISR

Query:  PFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVG
         F   +++ AL SM+P  A GPDG+  +F++++W IVG
Subjt:  PFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVG

XP_030936723.1 uncharacterized protein LOC115961982 [Quercus lobata] | e value: 7.2e-51 | % identity: 33.91
Query:  MKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVI-KEDKVIW
        M  + WN RG+GNP  ++ L   V+  +P +VF+ E+  D+ +   +K  + +D  F V    +GGGL+LYWKND +V + S S  HIDAVI K     W
Subjt:  MKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVI-KEDKVIW

Query:  RFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLRILSSWTPLGKVWTIAILEILAFRTREIIETTW-KSEQGNDIESLKRQ
        +FTGFYGNPE  +R +SW LL +L+   ++PW+  GDFNEI+   EK G  LR   +     K++    + +      + +E+ W  + +GN+   + ++
Subjt:  RFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLRILSSWTPLGKVWTIAILEILAFRTREIIETTW-KSEQGNDIESLKRQ

Query:  FKGCLEGLSHWNKIILEGSISKAVEKKFNEI-KALDLRDKRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLVF
         + C   L+ W++    G+I + +EKK  E+ KA  +  +   S ++ + +KE++ L+++EE+ W+  SR  +LK G               R TK   F
Subjt:  FKGCLEGLSHWNKIILEGSISKAVEKKFNEI-KALDLRDKRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLVF

Query:  FSKGGSWIEDKKDIGVIATD---YFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVGGT
        F    +  + K  I  IA     YF  LF+SS P  D +   LE    VV+   N  +   F   ++EVALK M+P KA GPDG+  LF+Q +W++V   
Subjt:  FSKGGSWIEDKKDIGVIATD---YFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVGGT

Query:  LRDFALI
        +    L+
Subjt:  LRDFALI

TrEMBL top hits
A0A2N9EV43 Reverse transcriptase domain-containing protein | e value: 8.3e-61 | % identity: 26.58
Query:  LDNAIGEFEEVSTEDSGKIKGESLRVRVKIDIGEPIKRGTNIRVGSKATKTWIPITYEKLPDFCYSSGRLGHVKQDCD----EADSEIIERGVYGVKLRE
        + + +G  E V     G  KG  +RVRV IDI +P+ RG  + +G K  + W+   YE+L +FCY  GR+ H ++DC+       S   ++  YG  ++ 
Subjt:  LDNAIGEFEEVSTEDSGKIKGESLRVRVKIDIGEPIKRGTNIRVGSKATKTWIPITYEKLPDFCYSSGRLGHVKQDCD----EADSEIIERGVYGVKLRE

Query:  TQGSKGYYRTKKTEWWGER--------------NQNQWNNSRGRGVRGRGRWFGN--PRNEDRWEQETGTEENGGRQITQPPEVGKRDLSEISRRDKSRE
             G + ++  +  G+R              N  +   +   G     R  GN  PRN+++   E   E N G Q     E  K+   E+   +    
Subjt:  TQGSKGYYRTKKTEWWGER--------------NQNQWNNSRGRGVRGRGRWFGN--PRNEDRWEQETGTEENGGRQITQPPEVGKRDLSEISRRDKSRE

Query:  RSLGQETISIPKEKTCQRLVYCQKEELLTENGKEEREWEFWRRTVRANRNKVKSVK------KSLNRRDI---SGGWKSAPPNAMKILSWNVRGMGNPRT
           G  T++ P  K     +  +   L     K++     W+R+V   ++ +  V             DI   +G W        ++ +  V+G+GNP+ 
Subjt:  RSLGQETISIPKEKTCQRLVYCQKEELLTENGKEEREWEFWRRTVRANRNKVKSVK------KSLNRRDI---SGGWKSAPPNAMKILSWNVRGMGNPRT

Query:  IRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVIKEDKV-IWRFTGFYGNPEVGKRQD
        +R L   VK   P ++F+LE+K +  +   ++  LG++N F+V S G+ GGL L W+N+ ++ I +FS  HIDA ++  +V  WR TGFYG PE  +R++
Subjt:  IRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVIKEDKV-IWRFTGFYGNPEVGKRQD

Query:  SWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLRIL-----------------------------------------------SSW-----------
        SW LL  L+ + + PW+  GDFNE+L   EK+G + R L                                               S+W           
Subjt:  SWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLRIL-----------------------------------------------SSW-----------

Query:  ----------------TPLG----KVWTIAILEILAFR--TREIIETTWKS--EQGNDIESLKRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALD
                        T  G    K W     E  A      E+I ++W    + G+ +  L ++   C + L  W++ +  G++   V+ K   +++L 
Subjt:  ----------------TPLG----KVWTIAILEILAFR--TREIIETTWKS--EQGNDIESLKRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALD

Query:  LRDKRSPSPELLRA-EKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLVFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDY
          +        +R  + E++ LL ++E +W+  SRE WLK G        +   + + K  +       G+W E +  IG IA  YFK +FSSS   E  
Subjt:  LRDKRSPSPELLRA-EKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLVFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDY

Query:  IDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVG
        ++ ++     VV+ + N  +  PFT M+I+ A   M P+K+ GPDG+ + FFQ YW IVG
Subjt:  IDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVG

A0A2N9GF83 CCHC-type domain-containing protein | e value: 9.8e-62 | % identity: 27.75
Query:  EALDNAIGEFEEVSTEDSGKIKGESLRVRVKIDIGEPIKRGTNIRVGSKATKTWIPITYEKLPDFCYSSGRLGHVKQDCDEADSEIIERGVYGVKLR---
        E +  AIG  E+V   + G   G  LRVR+ +DI +PI+RG  +  GS   + WI   YE+LP FC+  G+LGH +++C             GVKLR   
Subjt:  EALDNAIGEFEEVSTEDSGKIKGESLRVRVKIDIGEPIKRGTNIRVGSKATKTWIPITYEKLPDFCYSSGRLGHVKQDCDEADSEIIERGVYGVKLR---

Query:  -ETQGSKGYYRTKKTEWWGERNQNQWNNSRGRG------VRGRGR-------WFGNP-----------------RNEDRWEQETGTEENGGRQ-------
          T   K Y    +      R++N   + R RG      + G G+        F  P                 R  D+    +G  +  G Q       
Subjt:  -ETQGSKGYYRTKKTEWWGERNQNQWNNSRGRG------VRGRGR-------WFGNP-----------------RNEDRWEQETGTEENGGRQ-------

Query:  -----ITQPPEVGKR------------DL-----SEISRRDKSRERSLGQETISIP----------KEKTCQRL----VYCQKEELLTENGKEEREWEFW
               + P +G++            DL       +    +S   S  QE ++               TC  L     +C+   L  ++    +    W
Subjt:  -----ITQPPEVGKR------------DL-----SEISRRDKSRERSLGQETISIP----------KEKTCQRL----VYCQKEELLTENGKEEREWEFW

Query:  RRTVRANRNKVKSVKKSLNRRDISGGWKSAPPNAMKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGG
        +R  RA + KV SV     +R    G   APP+ M +LSWN +G+GNP T+R L L +K+ +P ++F+ E++ D      L+  L + N F V   G GG
Subjt:  RRTVRANRNKVKSVKKSLNRRDISGGWKSAPPNAMKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGG

Query:  GLILYWKNDYDVTISSFSNGHIDAVIKE--DKVIWRFTGFYGNPEVGKRQDSWKLLDRLNRLWNIP-WMVGGDFNEIL----------------------
        GL L W    ++ I S+S  HIDA +K+      +R TGFYGN E  KR++SW LL  L+++   P W+  GDFNE+L                      
Subjt:  GLILYWKNDYDVTISSFSNGHIDAVIKE--DKVIWRFTGFYGNPEVGKRQDSWKLLDRLNRLWNIP-WMVGGDFNEIL----------------------

Query:  -----------------WHKEKKGASL----------------RILSSW-------------------------TPLG-------KVWTIAILEILAFRT
                         W K+++ + L                 +  SW                          P+G       KV+    +     + 
Subjt:  -----------------WHKEKKGASL----------------RILSSW-------------------------TPLG-------KVWTIAILEILAFRT

Query:  REIIETTWKSE--QGNDIESLKRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALDLRDKRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWG
         ++I   W SE   G+ +  +  + KGC   L  W+K+   GS++ +++ K  ++++L        SP +L  + +L+ LLE+EE YW+  SR  W+K G
Subjt:  REIIETTWKSE--QGNDIESLKRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALDLRDKRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWG

Query:  TEAQSGSTRGRVKEKRKTKLLVFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKAS
         +          + +   K+     + G W  DK  +  +A DYFK++FSSS P+ + I  S++    VV+   N  +   FT  +I  ALK M PTKA 
Subjt:  TEAQSGSTRGRVKEKRKTKLLVFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKAS

Query:  GPDGLHALFFQTYWDIVG
        GPDG+ A+F+QTYWDIVG
Subjt:  GPDGLHALFFQTYWDIVG

A0A2N9GKW3 Reverse transcriptase domain-containing protein | e value: 2.9e-61 | % identity: 27.71
Query:  KYVEALDNAIGEFEEVSTEDSGKIKGESLRVRVKIDIGEPIKRGTNIRVGSKATKTWIPITYEKLPDFCYSSGRLGHVKQDCDEADSEIIERGVYGVKLR
        K  + L +++G+   V+  D     G+++RVRV ++I +P+ RG   ++  K  + WI   YE+LP+FCY  G + H  +DC            + ++ +
Subjt:  KYVEALDNAIGEFEEVSTEDSGKIKGESLRVRVKIDIGEPIKRGTNIRVGSKATKTWIPITYEKLPDFCYSSGRLGHVKQDCDEADSEIIERGVYGVKLR

Query:  ETQGSKGYYRTKKTEWWGERNQNQWNNSRGRGVRGRGRWFGNPRNEDRWEQE-------TGTEENGGRQITQPPEVGKRDLSEISRRDKSRERSLGQETI
        ET   +     +   W    N+  W   R   ++  G     P  + +           T T+ N    I  PP       S  S    +R       TI
Subjt:  ETQGSKGYYRTKKTEWWGERNQNQWNNSRGRGVRGRGRWFGNPRNEDRWEQE-------TGTEENGGRQITQPPEVGKRDLSEISRRDKSRERSLGQETI

Query:  SIPKEKTCQRLVYCQKEELLTENGKEEREWEFWRRTVRANRNKVKSVKKSLNRRDISGGWKSAPPNAMKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVF
        S P EKT             T    +  +          +  +        +   I GG  +APP+AM  L+WN RG+GNPRT++ ++  V+   P +VF
Subjt:  SIPKEKTCQRLVYCQKEELLTENGKEEREWEFWRRTVRANRNKVKSVKKSLNRRDISGGWKSAPPNAMKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVF

Query:  ILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVIKEDKV-IWRFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWM
        ++E+  D+     L+  L + N F   S  KGGGL L+WK +  + + SFS+ HIDA++ E +   WRFTGFYG PE  KR++SW LL RLN  + +PW 
Subjt:  ILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVIKEDKV-IWRFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWM

Query:  VGGDFNEILWHKEKKGASLRILSSWTPLGKVWTIAILEILAFRTREIIETTWKSEQGNDI--ESLKR-----------------QFKGCLEGLSHWNKII
          GDFNE++  +EK G   R          V        L F   +    TW + +  D+  E L R                   KG    L  W++  
Subjt:  VGGDFNEILWHKEKKGASLRILSSWTPLGKVWTIAILEILAFRTREIIETTWKSEQGNDI--ESLKR-----------------QFKGCLEGLSHWNKII

Query:  LEGSIS---KAVEKKFNEIKALDLRDKRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLVFFSKGGSWIEDKKD
          G+I    K VE+   + +   ++ +       LR   EL +LL +EE+ W+  SR EWL+ G            + +R+ ++     + G W      
Subjt:  LEGSIS---KAVEKKFNEIKALDLRDKRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLVFFSKGGSWIEDKKD

Query:  IGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVG
        +  +  +Y+ S+F ++  + + +++ +E    VV+   N  ++R +T  ++++ALK M+P K+ GPDGL  +F+Q YW ++G
Subjt:  IGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVG

A0A2N9IIR5 Uncharacterized protein | e value: 2.9e-61 | % identity: 26.44
Query:  EALDNAIGEFEEVSTEDSGKIKGESLRVRVKIDIGEPIKRGTNIRVGSKATKTWIPITYEKLPDFCYSSGRLGHVKQDCDE----ADSEIIERGVYGVKL
        E + +++G  E+    +     G  +R+R+ +D  +P+ RG  IR+G      W+   +E+LP+FCY  GRL H  +DCD+        + E   YG  L
Subjt:  EALDNAIGEFEEVSTEDSGKIKGESLRVRVKIDIGEPIKRGTNIRVGSKATKTWIPITYEKLPDFCYSSGRLGHVKQDCDE----ADSEIIERGVYGVKL

Query:  RETQGSKGYYRTKKTEWWGERNQ---NQWNNSRGRGVRGRGRWFGNPRN-ED---RWEQETGTEENGGRQ------------ITQPPEVG-KRDLSEISR
        R T    G  +T+     G R +     W  +R +     G    +P   ED     + ET  +EN G++             T PP V     L EI R
Subjt:  RETQGSKGYYRTKKTEWWGERNQ---NQWNNSRGRGVRGRGRWFGNPRN-ED---RWEQETGTEENGGRQ------------ITQPPEVG-KRDLSEISR

Query:  R-DKSRERSLGQETISIPKEKTCQRLVYCQKEE---------LLTENGKEEREWEFWRRTVRANRNKVKSVKKSLNRRDISGGWKSAPPNAMKILSWNVR
           K +  +   E +S   +  C   V C   +           T+    +  W+ W           +          I GG  +A P AM  + WN R
Subjt:  R-DKSRERSLGQETISIPKEKTCQRLVYCQKEE---------LLTENGKEEREWEFWRRTVRANRNKVKSVKKSLNRRDISGGWKSAPPNAMKILSWNVR

Query:  GMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVIKEDKV-IWRFTGFYGNP
        G+GNPRT++ L+  V    P+ VF++E+  D  K   ++  L + N   V    +GGG++L+WK    +TI SFS  HID++I E     WRFTGFYG P
Subjt:  GMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVIKEDKV-IWRFTGFYGNP

Query:  EVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLR-----------------------------------------------ILSSW----
        E   R  SW +L  L+R +++PW   GDFNE++   EK+G   R                                               + S W    
Subjt:  EVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLR-----------------------------------------------ILSSW----

Query:  ---------------------------TPLGKVWTIAILEILAFRTREIIETTWKS-EQGNDIESLKRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEI
                                   TP  K++    + +     R  +E  W+S   G  +  +  +   C   LS+W++    GS+ + + +K +++
Subjt:  ---------------------------TPLGKVWTIAILEILAFRTREIIETTWKS-EQGNDIESLKRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEI

Query:  KALDLRD-KRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLVFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYP
        +  +L   +     +++    EL  LL +EE  W   SR  WLK G            + +R+  +L    + G W +    I  +A  YF+ LF ++ P
Subjt:  KALDLRD-KRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLVFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYP

Query:  SEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVG
         E  I +     + VVS + N  +S+ +T  ++E+A+K M+P  A GPDG+  LF+QT+W +VG
Subjt:  SEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVG

A0A7N2LIH6 Uncharacterized protein | e value: 1.8e-63 | % identity: 26.63
Query:  IGEFEEVSTEDSGKIKGESLRVRVKIDIGEPIKRGTNIRVGSKATKTWIPITYEKLPDFCYSSGRLGHVKQDCDE-ADSE---IIERGVYGVKLRETQGS
        IG+  EV   + G   G+ LRVR++ D    + RG  + +     + W+   YE+LP+FCY  GRL H ++DC E  D E     ER  YG  LR   G 
Subjt:  IGEFEEVSTEDSGKIKGESLRVRVKIDIGEPIKRGTNIRVGSKATKTWIPITYEKLPDFCYSSGRLGHVKQDCDE-ADSE---IIERGVYGVKLRETQGS

Query:  KGYYRTKKTEWWGERNQNQWNNSRGRGVRGRGRWFGNPRNEDRWEQETGTEENGGRQITQPPEVGKRDLSE--ISRRDKSRER-----SLGQETISI---
                              S GR     G      R +D  E  T T+     ++ +   VG++ +SE  I ++D +R+       +GQ+ +S    
Subjt:  KGYYRTKKTEWWGERNQNQWNNSRGRGVRGRGRWFGNPRNEDRWEQETGTEENGGRQITQPPEVGKRDLSE--ISRRDKSRER-----SLGQETISI---

Query:  -----------PKEKTCQRLVYCQKEELL----TENGKEEREWE--------------------------------------------------------
                   PKEK   ++    K+ L      ++ +++ +WE                                                        
Subjt:  -----------PKEKTCQRLVYCQKEELL----TENGKEEREWE--------------------------------------------------------

Query:  ------FWRRTVRANRNK---VKSVKKSLNRRD---ISGGWKSAPPNAMKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLG
               W+R  R  ++      SV  S  R+D     GG  +APP++M IL+WN RG+G    +R L+ EVKK +P LVF++E+K    K    +  LG
Subjt:  ------FWRRTVRANRNK---VKSVKKSLNRRD---ISGGWKSAPPNAMKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDFKAGNLKRSLG

Query:  YDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVI--KEDKVIWRFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKG--
        +     V S+G+ GGL L WK   D+   S S+ HID V+        WR TGFYG+P+ GKR  SWKLL+ LN    +PW+V GDFNEI+   EK G  
Subjt:  YDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVI--KEDKVIWRFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKG--

Query:  --------ASLRILS-------------------------------------SWT---PLGKVWTIAILE----ILAF----------------------
                A   +LS                                     +W+   P  KV  +++      +LA                       
Subjt:  --------ASLRILS-------------------------------------SWT---PLGKVWTIAILE----ILAF----------------------

Query:  ---RTREIIETTWKSEQGNDIESLKRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALD-LRDKRSPSPELLRAEKELDNLLEEEEKYWKIGSREEW
             +EI+E  W   + +    ++ + + C + L  WN+    G++ K +++K N ++ L+ L      + E+   +KE++ L   EE  WK  SR  W
Subjt:  ---RTREIIETTWKSEQGNDIESLKRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALD-LRDKRSPSPELLRAEKELDNLLEEEEKYWKIGSREEW

Query:  LKWGTEAQSGSTRGRVKEKRKTKLLVFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSP
        L++G +          + ++K ++       G W ED++    +  DYFK ++SS+ P+    D SLE     V+   N ++ + F  +++  AL+ M P
Subjt:  LKWGTEAQSGSTRGRVKEKRKTKLLVFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSP

Query:  TKASGPDGLHALFFQTYWDIVGGTLRDFAL
        TKA GPDG+  +F+Q YWDIVG ++ +  L
Subjt:  TKASGPDGLHALFFQTYWDIVGGTLRDFAL

SwissProt top hits
P11369 LINE-1 retrotransposable element ORF2 protein | e value: 1.4e-04 | % identity: 23.49
Query:  IKALDLRD----KRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLV--FFSKGGSWIEDKKDIGVIATDYFKSL
        +KAL+ ++    KRS   E+++   E+ N +E      +I     W  +    +      R+ +  + K+L+    ++ G    D ++I      ++K L
Subjt:  IKALDLRD----KRSPSPELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLV--FFSKGGSWIEDKKDIGVIATDYFKSL

Query:  FSSSYPSEDYIDKSLECF-IPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTY
        +S+   + D +DK L+ + +P ++ +    ++ P +  +IE  + S+   K+ GPDG  A F+QT+
Subjt:  FSSSYPSEDYIDKSLECF-IPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTY

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 1.6e-05 | % identity: 28.49
Query:  KALDLRDKRSPSP------ELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRK---TKLLVFFSKGGSWIEDKKDIGVIATDYFK
        + LDL  + S S       E L  ++ L N+ + + +   + SR + L    +   GS      EK+K    ++   F++ G+ +ED + I   A  +++
Subjt:  KALDLRDKRSPSP------ELLRAEKELDNLLEEEEKYWKIGSREEWLKWGTEAQSGSTRGRVKEKRK---TKLLVFFSKGGSWIEDKKDIGVIATDYFK

Query:  SLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVG
        +LFS    S D  ++ L   +PVVS      +  P T  ++  AL+ M   K+ G DGL   FFQ +WD +G
Subjt:  SLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTKASGPDGLHALFFQTYWDIVG

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGACAACGAAGGGAAGTCTCCAGATTACCCAAGGGGACCTGAAGGTGATCCAAGGAAACTCAACAGAGGAAGCAACAATCTGGAAATGGAGGAAGAGACTTCAATGGT
CGTTGAAGAGGAATTTATCAACAAGCAAGATCAAAAACAAAACGAGGAGCATGGGGAGATGGAAGAGGTGCTAAACAACCAAATCAAGAAGCTAAGCTTGGAGGAGCAAG
AAGGAAAGAGGATAGTGGAAATTGAGGATGAGGATGTGGAGGAAAAAAACCGAGATCTTAGAGAAGCCACAACCTGCAAAATCCTCACAGCAAACCTGATTCAATGGGAA
GTTTTCTCGGATATCATGCCAAGAATATGGGTTTTGTTCGAGGAACCAAAAGGGGGCCATTGTACCAGCGAGCTAGACTTCAGGAAGTATGTTGAAGCTCTCGACAATGC
TATAGGGGAGTTTGAGGAAGTATCAACCGAGGACAGTGGAAAAATCAAAGGGGAAAGTCTCAGGGTCAGAGTTAAAATCGATATTGGTGAGCCTATAAAGAGGGGAACAA
ATATTAGAGTGGGATCAAAAGCCACAAAGACCTGGATACCGATTACCTATGAAAAATTGCCTGATTTCTGTTATTCTTCTGGAAGACTTGGGCATGTTAAGCAAGACTGC
GATGAAGCCGATTCTGAGATAATAGAAAGAGGGGTCTATGGAGTCAAACTTAGAGAGACACAAGGAAGCAAAGGATATTACAGAACCAAGAAAACAGAATGGTGGGGGGA
GAGAAACCAAAATCAGTGGAATAACAGTCGAGGTAGAGGCGTGAGGGGCAGAGGACGATGGTTCGGAAACCCAAGAAATGAAGATAGATGGGAGCAAGAGACCGGAACGG
AGGAAAATGGAGGAAGACAGATAACCCAGCCACCGGAGGTCGGGAAGAGAGATCTGTCGGAGATCAGTCGACGGGACAAGTCCAGAGAGAGGAGCCTGGGGCAAGAGACA
ATCTCGATTCCCAAGGAAAAAACATGTCAACGACTAGTTTATTGTCAGAAGGAAGAGCTTCTGACCGAAAATGGAAAAGAAGAGCGCGAATGGGAATTTTGGAGAAGAAC
AGTGAGAGCCAACAGAAACAAAGTAAAATCAGTGAAGAAGAGTCTCAACCGGAGGGATATCAGCGGAGGCTGGAAATCAGCCCCGCCGAACGCCATGAAAATCCTAAGTT
GGAACGTTCGAGGGATGGGGAATCCTCGAACGATTCGTAATCTCTCCCTGGAAGTGAAGAAAAATTCCCCGGACTTAGTTTTCATTTTGGAATCGAAATGCGATGATTTC
AAAGCTGGAAACCTCAAGAGATCGTTGGGTTATGACAACTGTTTTGCCGTGAATAGTAATGGTAAAGGTGGAGGGCTCATCCTCTACTGGAAGAATGATTACGATGTGAC
CATTAGTTCTTTCTCGAACGGCCACATTGATGCAGTTATTAAAGAAGACAAAGTGATATGGAGGTTTACGGGGTTCTACGGGAATCCAGAAGTAGGAAAGAGACAAGATT
CATGGAAGCTTTTGGACAGATTAAACAGGCTGTGGAATATTCCGTGGATGGTAGGTGGAGACTTTAATGAGATTCTTTGGCATAAGGAAAAGAAGGGGGCATCCCTAAGG
ATCCTAAGCTCATGGACTCCTTTAGGGAAAGTTTGGACAATTGCAATCTTAGAGATATTGGCTTTTCGGACAAGGGAGATTATTGAAACCACCTGGAAAAGTGAGCAAGG
AAATGACATTGAGTCGCTGAAGAGGCAATTCAAAGGCTGCCTTGAAGGCCTTTCTCATTGGAATAAAATCATATTGGAAGGCTCAATCAGCAAAGCAGTAGAAAAAAAGT
TCAACGAAATAAAGGCCTTAGATCTCAGAGACAAGAGATCCCCATCGCCAGAGCTGCTTCGAGCTGAAAAAGAACTAGATAATCTTTTGGAGGAAGAAGAGAAGTATTGG
AAAATCGGATCTCGCGAAGAATGGCTCAAGTGGGGGACAGAAGCACAAAGTGGTTCCACAAGAGGGCGAGTCAAAGAAAAAAGAAAAACGAAATTACTGGTATTTTTTAG
CAAGGGAGGATCGTGGATTGAGGATAAGAAAGATATAGGGGTCATTGCAACAGATTATTTCAAAAGTTTGTTTTCTTCTTCCTACCCTTCAGAAGATTATATTGACAAGT
CTTTGGAATGTTTTATCCCAGTCGTTTCGGCTAACGATAACAGAGATATTTCAAGACCTTTTACCAAAATGGATATTGAGGTTGCTCTAAAATCTATGAGTCCTACTAAA
GCGTCGGGTCCTGACGGCTTGCATGCCCTGTTCTTTCAAACATACTGGGACATTGTAGGGGGGACATTACGAGACTTTGCCTTGATATCCTGA
mRNA sequence
ATGGACAACGAAGGGAAGTCTCCAGATTACCCAAGGGGACCTGAAGGTGATCCAAGGAAACTCAACAGAGGAAGCAACAATCTGGAAATGGAGGAAGAGACTTCAATGGT
CGTTGAAGAGGAATTTATCAACAAGCAAGATCAAAAACAAAACGAGGAGCATGGGGAGATGGAAGAGGTGCTAAACAACCAAATCAAGAAGCTAAGCTTGGAGGAGCAAG
AAGGAAAGAGGATAGTGGAAATTGAGGATGAGGATGTGGAGGAAAAAAACCGAGATCTTAGAGAAGCCACAACCTGCAAAATCCTCACAGCAAACCTGATTCAATGGGAA
GTTTTCTCGGATATCATGCCAAGAATATGGGTTTTGTTCGAGGAACCAAAAGGGGGCCATTGTACCAGCGAGCTAGACTTCAGGAAGTATGTTGAAGCTCTCGACAATGC
TATAGGGGAGTTTGAGGAAGTATCAACCGAGGACAGTGGAAAAATCAAAGGGGAAAGTCTCAGGGTCAGAGTTAAAATCGATATTGGTGAGCCTATAAAGAGGGGAACAA
ATATTAGAGTGGGATCAAAAGCCACAAAGACCTGGATACCGATTACCTATGAAAAATTGCCTGATTTCTGTTATTCTTCTGGAAGACTTGGGCATGTTAAGCAAGACTGC
GATGAAGCCGATTCTGAGATAATAGAAAGAGGGGTCTATGGAGTCAAACTTAGAGAGACACAAGGAAGCAAAGGATATTACAGAACCAAGAAAACAGAATGGTGGGGGGA
GAGAAACCAAAATCAGTGGAATAACAGTCGAGGTAGAGGCGTGAGGGGCAGAGGACGATGGTTCGGAAACCCAAGAAATGAAGATAGATGGGAGCAAGAGACCGGAACGG
AGGAAAATGGAGGAAGACAGATAACCCAGCCACCGGAGGTCGGGAAGAGAGATCTGTCGGAGATCAGTCGACGGGACAAGTCCAGAGAGAGGAGCCTGGGGCAAGAGACA
ATCTCGATTCCCAAGGAAAAAACATGTCAACGACTAGTTTATTGTCAGAAGGAAGAGCTTCTGACCGAAAATGGAAAAGAAGAGCGCGAATGGGAATTTTGGAGAAGAAC
AGTGAGAGCCAACAGAAACAAAGTAAAATCAGTGAAGAAGAGTCTCAACCGGAGGGATATCAGCGGAGGCTGGAAATCAGCCCCGCCGAACGCCATGAAAATCCTAAGTT
GGAACGTTCGAGGGATGGGGAATCCTCGAACGATTCGTAATCTCTCCCTGGAAGTGAAGAAAAATTCCCCGGACTTAGTTTTCATTTTGGAATCGAAATGCGATGATTTC
AAAGCTGGAAACCTCAAGAGATCGTTGGGTTATGACAACTGTTTTGCCGTGAATAGTAATGGTAAAGGTGGAGGGCTCATCCTCTACTGGAAGAATGATTACGATGTGAC
CATTAGTTCTTTCTCGAACGGCCACATTGATGCAGTTATTAAAGAAGACAAAGTGATATGGAGGTTTACGGGGTTCTACGGGAATCCAGAAGTAGGAAAGAGACAAGATT
CATGGAAGCTTTTGGACAGATTAAACAGGCTGTGGAATATTCCGTGGATGGTAGGTGGAGACTTTAATGAGATTCTTTGGCATAAGGAAAAGAAGGGGGCATCCCTAAGG
ATCCTAAGCTCATGGACTCCTTTAGGGAAAGTTTGGACAATTGCAATCTTAGAGATATTGGCTTTTCGGACAAGGGAGATTATTGAAACCACCTGGAAAAGTGAGCAAGG
AAATGACATTGAGTCGCTGAAGAGGCAATTCAAAGGCTGCCTTGAAGGCCTTTCTCATTGGAATAAAATCATATTGGAAGGCTCAATCAGCAAAGCAGTAGAAAAAAAGT
TCAACGAAATAAAGGCCTTAGATCTCAGAGACAAGAGATCCCCATCGCCAGAGCTGCTTCGAGCTGAAAAAGAACTAGATAATCTTTTGGAGGAAGAAGAGAAGTATTGG
AAAATCGGATCTCGCGAAGAATGGCTCAAGTGGGGGACAGAAGCACAAAGTGGTTCCACAAGAGGGCGAGTCAAAGAAAAAAGAAAAACGAAATTACTGGTATTTTTTAG
CAAGGGAGGATCGTGGATTGAGGATAAGAAAGATATAGGGGTCATTGCAACAGATTATTTCAAAAGTTTGTTTTCTTCTTCCTACCCTTCAGAAGATTATATTGACAAGT
CTTTGGAATGTTTTATCCCAGTCGTTTCGGCTAACGATAACAGAGATATTTCAAGACCTTTTACCAAAATGGATATTGAGGTTGCTCTAAAATCTATGAGTCCTACTAAA
GCGTCGGGTCCTGACGGCTTGCATGCCCTGTTCTTTCAAACATACTGGGACATTGTAGGGGGGACATTACGAGACTTTGCCTTGATATCCTGA
Protein sequence
MDNEGKSPDYPRGPEGDPRKLNRGSNNLEMEEETSMVVEEEFINKQDQKQNEEHGEMEEVLNNQIKKLSLEEQEGKRIVEIEDEDVEEKNRDLREATTCKILTANLIQWE
VFSDIMPRIWVLFEEPKGGHCTSELDFRKYVEALDNAIGEFEEVSTEDSGKIKGESLRVRVKIDIGEPIKRGTNIRVGSKATKTWIPITYEKLPDFCYSSGRLGHVKQDC
DEADSEIIERGVYGVKLRETQGSKGYYRTKKTEWWGERNQNQWNNSRGRGVRGRGRWFGNPRNEDRWEQETGTEENGGRQITQPPEVGKRDLSEISRRDKSRERSLGQET
ISIPKEKTCQRLVYCQKEELLTENGKEEREWEFWRRTVRANRNKVKSVKKSLNRRDISGGWKSAPPNAMKILSWNVRGMGNPRTIRNLSLEVKKNSPDLVFILESKCDDF
KAGNLKRSLGYDNCFAVNSNGKGGGLILYWKNDYDVTISSFSNGHIDAVIKEDKVIWRFTGFYGNPEVGKRQDSWKLLDRLNRLWNIPWMVGGDFNEILWHKEKKGASLR
ILSSWTPLGKVWTIAILEILAFRTREIIETTWKSEQGNDIESLKRQFKGCLEGLSHWNKIILEGSISKAVEKKFNEIKALDLRDKRSPSPELLRAEKELDNLLEEEEKYW
KIGSREEWLKWGTEAQSGSTRGRVKEKRKTKLLVFFSKGGSWIEDKKDIGVIATDYFKSLFSSSYPSEDYIDKSLECFIPVVSANDNRDISRPFTKMDIEVALKSMSPTK
ASGPDGLHALFFQTYWDIVGGTLRDFALIS
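
A CDS and its protein translation can be cross-checked by length: every three nucleotides encode one residue, and a terminal stop codon encodes none. The sketch below is a hypothetical helper (`expected_protein_length` is not part of any CuGenDB tooling) and assumes the standard stop codons TAA, TAG and TGA. The toy input is the first three codons of the CDS above followed by its final TGA codon, not the full sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def expected_protein_length(cds: str) -> int:
    """Number of residues a CDS should encode, ignoring a terminal stop codon."""
    # Remove line breaks and spaces so wrapped sequences can be pasted directly.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = len(cds) // 3
    if cds[-3:] in STOP_CODONS:
        codons -= 1
    return codons


# Toy example: ATG GAC AAC (M D N) plus a TGA stop codon.
print(expected_protein_length("ATGGACAACTGA"))  # prints 3
```

Passing the full CDS text from this record and comparing the result with the length of the protein sequence is one quick way to confirm that the two listings belong together.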