; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035115 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035115
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:15173371..15177226
RNA-Seq ExpressionLag0035115
SyntenyLag0035115
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.3e-8132.18Show/hide
Query:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI
        P K LG                GP  +  CL  LN+ + ++ WN T I LIPKI Q R +S++RPISLCNVSYKI++K I N+LK V+  ++ + QS F+
Subjt:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI

Query:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP
        P R+I+DN+I+GHE LH +   + G  G AALKLD+SKA+DRVEW+YL  IM K+GF+  WI+ +++CI+T  FSI +NG   G  + SRGIRQ +PLSP
Subjt:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP

Query:  YLFLICMEGLSALLVSARTRS-MVEVSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACG------------SPKVFYPKKSALCAKF--
        YLFL+C EGLSAL+        +  +          +L FADDSL+FL+    E    + +L  Y +A G            SP V   ++  L      
Subjt:  YLFLICMEGLSALLVSARTRS-MVEVSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACG------------SPKVFYPKKSALCAKF--

Query:  ----RWGSL-----------GDKRRMQWKRWEDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRVL-------------------------
             +G+            G+ R++ W +W  +C PKE GGLNFRDL  FNQA++AK  WR L +P++ +S+VL                         
Subjt:  ----RWGSL-----------GDKRRMQWKRWEDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRVL-------------------------

Query:  ---------------------------------------------------------------------CG--------------------------RGE
                                                                             C                           RG 
Subjt:  ---------------------------------------------------------------------CG--------------------------RGE

Query:  YTVKSGYKLSMMNSQEASLLRVGREMRWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQEEMETTDHALFQRSRAREVWELILVG
        Y+V+SGYKL M     A+          WN +WK+ VP+K+K+F+W+S H  IPT  NL    +       +  +  E+  HA F   RAR++W  +   
Subjt:  YTVKSGYKLSMMNSQEASLLRVGREMRWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQEEMETTDHALFQRSRAREVWELILVG

Query:  IYALKLVDS
        +  L   D+
Subjt:  IYALKLVDS

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.6e-0545.76Show/hide
Query:  VVKRIHDFSDVAWVIGGDLNEILWQNEKSGGPIRDNRQILAFREVLDDCNLRDLGFSGG
        +++RI +     W+IGGD+N ILW  E S     D  QI AFR ++D C+L D+GF GG
Subjt:  VVKRIHDFSDVAWVIGGDLNEILWQNEKSGGPIRDNRQILAFREVLDDCNLRDLGFSGG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]4.6e-7730.75Show/hide
Query:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI
        P K  G   ++     K     G   VS  L  LN+   +   NHTNIVLIPK+     +S +RPISLCNV YKI++K +AN+LK VL +I+   QS F+
Subjt:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI

Query:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP
        P R ITDN+++ +ETLH +  ++KGK G  ALKLD+SKAYDRVEW +L  IM+K+GF A WI+ VM C+TT +FSIL+NG+ +  I+ SRGIRQ +P+SP
Subjt:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP

Query:  YLFLICMEGLSALLVSARTRSMVE-VSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACGS-----------------------------
        YLFL+C EGL+ALL  A    M+  VSI R  PK  NL FADDSL+F +    E      IL+ YE+A G                              
Subjt:  YLFLICMEGLSALLVSARTRSMVE-VSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACGS-----------------------------

Query:  ---------------------------------------------------------------------PKVFYPKKSALCAKFRWGSLGDKRRMQWKRW
                                                                             P     +  ALCA+F WG +G++R++ WK W
Subjt:  ---------------------------------------------------------------------PKVFYPKKSALCAKFRWGSLGDKRRMQWKRW

Query:  EDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRV--------------------------------------------------------------------
        + L  PK+ GG+ FRDL  FN AMLAKQ WR+                                                                    
Subjt:  EDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRV--------------------------------------------------------------------

Query:  ----------------------LTNPHITI--------------SRVLC------------------GRGEYTVKSGYKLSMMNSQEASLLRVGREM---
                              L NP   I              +  +C                   RG ++VKS Y ++     +A+  R G  M   
Subjt:  ----------------------LTNPHITI--------------SRVLC------------------GRGEYTVKSGYKLSMMNSQEASLLRVGREM---

Query:  --RWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQEEMETTDHALFQRSRAREVW
            W+ +WK+R+P+KVK+F W++ H  +PT VNL    +  +    +   E E+T HAL+  +  +++W
Subjt:  --RWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQEEMETTDHALFQRSRAREVW

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]1.3e-7629.64Show/hide
Query:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI
        P K  G   +S     K     G   V   L +LNSN SM   N TNI L+PKI     +S++RPISLCNV YK+++K +AN+LK +L +I+ E QS F+
Subjt:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI

Query:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP
          R ITDN+++  E +HYL+HK++GK G+AA+KLDMSKAYDRVEW ++ Q+M+K+GFH  WIKLVM CIT+ ++SIL+NG ++G I  +RG+RQ +P+SP
Subjt:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP

Query:  YLFLICMEGLSALLVS-ARTRSMVEVSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACGS-----------------------------
        Y+FL+C +G S+LL   AR   +  VSI R CPK  +LFFADDSL+F K  + E      IL+ YE A G                              
Subjt:  YLFLICMEGLSALLVS-ARTRSMVEVSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACGS-----------------------------

Query:  ---------------------------------------------------------------------PKVFYPKKSALCAKFRWGSLGDKRRMQWKRW
                                                                             PK    +  A+  +F WG  G + ++ W  W
Subjt:  ---------------------------------------------------------------------PKVFYPKKSALCAKFRWGSLGDKRRMQWKRW

Query:  EDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRVLCGR-----------------------------------------------------
        + LCK K+ GG+ FR+L  FN AMLAKQ WR+++NP+  ++++   R                                                     
Subjt:  EDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRVLCGR-----------------------------------------------------

Query:  ---------------------------------------------------------------------GEYTVKSGYKLS--MMNSQEASLLRVG-REM
                                                                             GE++VKS Y ++  ++++ E      G    
Subjt:  ---------------------------------------------------------------------GEYTVKSGYKLS--MMNSQEASLLRVG-REM

Query:  RWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQEEMETTDHALFQRSRAREVW
          W KLW + +P KV++F WK   N++PT +NL    V +  + P    E E+  H   +   A+ VW
Subjt:  RWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQEEMETTDHALFQRSRAREVW

XP_024043083.1 uncharacterized protein LOC112099827 [Citrus clementina]5.1e-7632.38Show/hide
Query:  GSLYSGGSDFSHEEFPPDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKL
        G+ +SG          PDK  GAR  + F   K     G      CL ILN   ++   NHT I LIPK+ + R V  +RPISLCNV Y+IV K IAN+L
Subjt:  GSLYSGGSDFSHEEFPPDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKL

Query:  KVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFG
        K +LN I+   QS FIP+R I DN+I+G++ LH ++H +  + G  ALKLD+SKAYDRVEW +L Q M  LGF A WI L+M CITT  FS+L+NG   G
Subjt:  KVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFG

Query:  FIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVEVSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACG------------SPK
         IK  +G+RQ  PLSPYLF++C E  S LL  A     +            +L FADDSLVF K +  +  H K I   Y KA G            + K
Subjt:  FIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVEVSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACG------------SPK

Query:  VFYPKKSA-------------------------------------------------------------------------------LC-------AKFR
        V   + SA                                                                               LC        +F 
Subjt:  VFYPKKSA-------------------------------------------------------------------------------LC-------AKFR

Query:  WGSLGDKRRMQWKRWEDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRVLCGR--------------------------------------
        WG+  DK  + W RW  + K K  GGL FR+L +FNQA++AKQ WR++  P+  ++RV+  R                                      
Subjt:  WGSLGDKRRMQWKRWEDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRVLCGR--------------------------------------

Query:  -------------------------GEYTVKSGYKLSM----MNSQEASLLRVGREMRWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGI
                                  EY+VKSGY+L++     N  E+S        R W   W + +P KVK+F+W++  N +PT  NLW        I
Subjt:  -------------------------GEYTVKSGYKLSM----MNSQEASLLRVGREMRWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGI

Query:  FPVFQEEMETTDHALFQRSRAREVWEL
            + ++ET  HAL +   AR+ W+L
Subjt:  FPVFQEEMETTDHALFQRSRAREVWEL

XP_030939696.1 uncharacterized protein LOC115964548 [Quercus lobata]6.0e-7735.44Show/hide
Query:  LAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGKTGYA
        L+ LN    ++  NHT I LIPK+     VS +RPISLCNV YKIV+K IAN+LK +LN I+ E QS FI DR ITDN+++  E+LH+++     KTG+ 
Subjt:  LAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGKTGYA

Query:  ALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVE-VSIAR
        ALKLDMSKAYDRVEW++L +I+ K+GF  SW+ L+M+CITT ++SIL+NGE  G I  ++G+ Q +PLSPYLFL C EGL+ALL        +   SI+R
Subjt:  ALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVE-VSIAR

Query:  VCPKSFNLFFADDSLVFLKVAADEFGHFKSIL-----------------------------------------KDYEKACGSPKVFYPKKSALCAKFR--
          PK   LFFADD L+F +    E    K +L                                         + YEK  G P      K A   + +  
Subjt:  VCPKSFNLFFADDSLVFLKVAADEFGHFKSIL-----------------------------------------KDYEKACGSPKVFYPKKSALCAKFR--

Query:  -WGSL----------------GDKRRMQWKRWEDLCKPKEIGGLNFRDLVNFNQAMLAKQA------------------------------------WRV
         W  +                G+++++ W +W  LC  K IGG+ FRD+ NFN+AMLAKQ+                                    W  
Subjt:  -WGSL----------------GDKRRMQWKRWEDLCKPKEIGGLNFRDLVNFNQAMLAKQA------------------------------------WRV

Query:  LTNPHITISRV---------LCGRGEYTVKSGYKLSMMN-SQEASLLRVGREMRWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVF
             I +S V         L   G+Y VKS Y++S    +     L  G     W ++WK+  P ++K F+W +  +S+PT  NL    +PV+    + 
Subjt:  LTNPHITISRV---------LCGRGEYTVKSGYKLSMMN-SQEASLLRVGREMRWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVF

Query:  QEEMETTDHALFQRSRAREVWE
        ++  ET  HAL+   +A+ VW+
Subjt:  QEEMETTDHALFQRSRAREVWE

TrEMBL top hitse value%identityAlignment
A0A2N9FP20 Reverse transcriptase domain-containing protein3.9e-8232.72Show/hide
Query:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI
        P K  G+  +S F   K     G    +  L++LNS + +R  N T I LIPK      +S+YRPISLCNV YKI++K IAN+LK VL+ I+ + QS F+
Subjt:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI

Query:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP
        P R ITDN+ +  E LH ++ KRKGK G  A+KLDMSKAYDRVEW ++  +M KLGF   W+ ++M+CI T  +S+L++G   G++  SRG+RQ +PLSP
Subjt:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP

Query:  YLFLICMEGLSALLVSARTRSMVE-VSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACGS-----------------------------
        YLFL+C EGLSAL+    T   ++ V  +R  P   +LFFADDSL+F K +  E   F  +L+ YE + G                              
Subjt:  YLFLICMEGLSALLVSARTRSMVE-VSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACGS-----------------------------

Query:  ---------------------------------------------------------------------PKVFYPKKSALCAKFRWGSLGDKRRMQWKRW
                                                                             PKV+  + ++L A++ WG   D+R++ W +W
Subjt:  ---------------------------------------------------------------------PKVFYPKKSALCAKFRWGSLGDKRRMQWKRW

Query:  EDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRV---------------------------LCGR--------------------------
        + LC  KE GG+ FR+L  FN A+L+KQ WR+LT       RV                           L GR                          
Subjt:  EDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRV---------------------------LCGR--------------------------

Query:  GEYTVKSGYKLSMMNSQEASLLRVGR--EMRW-WNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQEEMETTDHALFQRSRAREVWE
        G ++VKS Y+L     +E S     +    RW W   WK+ +P K+K F+W++FH+S+PT  NL+   +  +   P+  +E E+T H ++Q   AR  W 
Subjt:  GEYTVKSGYKLSMMNSQEASLLRVGR--EMRW-WNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQEEMETTDHALFQRSRAREVWE

Query:  LI
        L+
Subjt:  LI

A0A2N9HDH5 Uncharacterized protein1.6e-8333.22Show/hide
Query:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI
        P K  G+  +S F   K     G    +  L+ILNS + +R  N T + LIPK      +S+YRPISLCNV YKI++K +AN+LK VL+ I+ + QS F+
Subjt:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI

Query:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP
        P R ITDN+ +  E LH ++ KR+G+ G  A+KLDMSKAYDRVEWS++  IM KLGF   WI ++M+CI T  +SI+++G   GFI  SRGIRQ +P+SP
Subjt:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP

Query:  YLFLICMEGLSALLVSARTRSMVE-VSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACGS-----------------------------
        YLFL+C EGLSALL  +     ++ +  +R  P   +LFFADDSL+F K +  E   F  +L  YE++ G                              
Subjt:  YLFLICMEGLSALLVSARTRSMVE-VSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACGS-----------------------------

Query:  ---------------------------------------------------------------------PKVFYPKKSALCAKFRWGSLGDKRRMQWKRW
                                                                             PK +  + ++L A++ WG   DKR++ W +W
Subjt:  ---------------------------------------------------------------------PKVFYPKKSALCAKFRWGSLGDKRRMQWKRW

Query:  EDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRV---------------------------LCGR--------------------------
        + LC  KE GG+ FR+L  FN A+L+KQ WR+L N      RV                           L GR                          
Subjt:  EDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRV---------------------------LCGR--------------------------

Query:  GEYTVKSGYKL--SMMNSQEASLLRVGREMRW-WNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQEEMETTDHALFQRSRAREVWE
        G +TVKS Y+L    M  +        +  RW W K+WK+R+P K+K F+W+++H+S+PT +NL+   +  N +  +  +E E+T HA++Q + AR  W 
Subjt:  GEYTVKSGYKL--SMMNSQEASLLRVGREMRW-WNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQEEMETTDHALFQRSRAREVWE

Query:  LI
        L+
Subjt:  LI

A0A2N9HWG1 Reverse transcriptase domain-containing protein5.4e-8433.55Show/hide
Query:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI
        P K  G+  +S F   K     G       L++LNS + +R  N T + LIPK      +S+YRPISLCNV YKI++K +AN+LK VL+ I+ + QS F+
Subjt:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI

Query:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP
        P R ITDN+ +  E LH ++ KR+G+ G  A+KLDMSKAYDRVEWS++  IM KLGF   WI ++M+CI T  +SIL++G   GFI  SRGIRQ +P+SP
Subjt:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP

Query:  YLFLICMEGLSALLVSARTRSMVE-VSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACGS-----------------------------
        YLFL+C EGLSALL  +     ++ + I+R  P   +LFFADDSL+F K +  E   F  +L  YE++ G                              
Subjt:  YLFLICMEGLSALLVSARTRSMVE-VSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACGS-----------------------------

Query:  ---------------------------------------------------------------------PKVFYPKKSALCAKFRWGSLGDKRRMQWKRW
                                                                             PK +  + ++L A++ WG   DKR++ W +W
Subjt:  ---------------------------------------------------------------------PKVFYPKKSALCAKFRWGSLGDKRRMQWKRW

Query:  EDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRV---------------------------LCGR--------------------------
        + LC  KE GG+ FR+L  FN A+L+KQ WR+L N      RV                           L GR                          
Subjt:  EDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRV---------------------------LCGR--------------------------

Query:  GEYTVKSGYKL--SMMNSQEASLLRVGREMRW-WNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQEEMETTDHALFQRSRAREVWE
        G +TVKS Y+L    M  +        +  RW W K+WK+R+P K+K F+W+++H+S+PT +NL+   +  N +  +  +E E+T HAL+Q + AR  W 
Subjt:  GEYTVKSGYKL--SMMNSQEASLLRVGREMRW-WNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQEEMETTDHALFQRSRAREVWE

Query:  LI
        L+
Subjt:  LI

A0A2N9I5I3 Reverse transcriptase domain-containing protein2.1e-8337.33Show/hide
Query:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI
        P K  G   ++ F   K     GP   +  L++LNS   +R  N T+I  IPK      +S+YRPISLCNV YKI++K +AN+LK VL  I+   QS F+
Subjt:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI

Query:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP
        P R ITDN+ +  E +H L+ KRKGK G  ALKLDMSKAYDRVEW +L  IM KLGF   W+ ++M C+ T  +S+L++G   G+I  SRGIRQ +PLSP
Subjt:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP

Query:  YLFLICMEGLSALL----VSARTRSMVEVSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILK----------------DYEKACGS-PKVFYPKKS
        YLFL+C EGLSALL    +S R R    +  +   P   +LFFADDSL F      +   F  IL                  Y   C   PK +  + +
Subjt:  YLFLICMEGLSALL----VSARTRSMVEVSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILK----------------DYEKACGS-PKVFYPKKS

Query:  ALCAKFRWGSLGDKRRMQWKRWEDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRV---------------------------LCGR----
        +L A++ WG   ++R++ W RW+ LC  K  GGL FR+L  FN A+LA Q+WR+L NP     RV                           L GR    
Subjt:  ALCAKFRWGSLGDKRRMQWKRWEDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRV---------------------------LCGR----

Query:  -----------------------GEYTVKSGYKL---SMMNSQEASLLRVGREMRWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPV
                               G +TV+S Y L       S       V     +W K WK+ +P K+K F+W+++H  +PT  NL+   +  N    +
Subjt:  -----------------------GEYTVKSGYKL---SMMNSQEASLLRVGREMRWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPV

Query:  FQEEMETTDHALFQRSRAREVWELI
         +++ +TT HAL+Q   AR  W L+
Subjt:  FQEEMETTDHALFQRSRAREVWELI

A0A803QH76 Uncharacterized protein1.3e-8231.45Show/hide
Query:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI
        PDK  G   +S           GP    + L ILN  +SM   N   I LIPKI Q +LVS++RPISLC V YK+V+K IA + K VL  ++ + QS F+
Subjt:  PDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFI

Query:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP
        P+R ITDN++L  E +H L++K++G+ GYAALKLDMSKA+DRVEW +LS++M K+GFH++W+ LVM C+T+ T S  +NG   G +   RG+RQ +PLSP
Subjt:  PDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSP

Query:  YLFLICMEGLSALLVSARTRSMVE-VSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACGS-----------------------------
        YLFLIC EGLSALL        ++ +++AR  P   +L FADDSL+F +   D       +L  Y +A G                              
Subjt:  YLFLICMEGLSALLVSARTRSMVE-VSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACGS-----------------------------

Query:  --------------------------PKVFYPKKSALCAKFRWGSLGDKRRMQWKRWEDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRV
                                  PK F  +  ++ A F WGS  +  ++ WK+W+ +C  K  GG+ FR  ++FNQA+LAKQAWR+L  P   ++RV
Subjt:  --------------------------PKVFYPKKSALCAKFRWGSLGDKRRMQWKRWEDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRV

Query:  L------------------------------------------------CGR------------------------------------------------
        L                                                C R                                                
Subjt:  L------------------------------------------------CGR------------------------------------------------

Query:  -----------------------GEYTVKSGYKLSMMNSQEASLLRVGREMRWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQE
                               G YTV+SGY L++               +WWN+LW +++P KVK+F W+  ++++PT VNL +  +  +    + + 
Subjt:  -----------------------GEYTVKSGYKLSMMNSQEASLLRVGREMRWWNKLWKMRVPSKVKLFVWKSFHNSIPTMVNLWNHHVPVNGIFPVFQE

Query:  EMETTDHALFQRSRAREVWE
          E+  HALF+  RA+ VW+
Subjt:  EMETTDHALFQRSRAREVWE

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.2e-1726.55Show/hide
Query:  NIVLIPKI-LQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGK-TGYAALKLDMSKAYDRVE
        +I+LIPK         N+RPISL N+  KI+ K +AN+++  + +++   Q  FIP      N+    ++++ +QH  + K   +  + +D  KA+D+++
Subjt:  NIVLIPKI-LQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGK-TGYAALKLDMSKAYDRVE

Query:  WSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVEVSIARVCPKSFNL-FFADDS
          ++ + ++KLG    ++K++       T +I++NG+         G RQ  PLSP LF I +E L+  +     R   E+   ++  +   L  FADD 
Subjt:  WSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVEVSIARVCPKSFNL-FFADDS

Query:  LVFLKVAADEFGHFKSILKDYEKACG
        +V+L+       +   ++ ++ K  G
Subjt:  LVFLKVAADEFGHFKSILKDYEKACG

P08548 LINE-1 reverse transcriptase homolog2.2e-1828.44Show/hide
Query:  NIVLIPKI-LQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGKT-GYAALKLDMSKAYDRVE
        NI LIPK         NYRPISL N+  KI+ K + N+++  + +I+   Q  FIP      N+    ++++ +QH  K K   +  L +D  KA+D ++
Subjt:  NIVLIPKI-LQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGKT-GYAALKLDMSKAYDRVE

Query:  WSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVEVSIARVCPKSFNLFFADDSL
          ++ + + K+G   +++KL+    +  T +I++NG          G RQ  PLSP LF I ME L+  +   +    + +    +        FADD +
Subjt:  WSYLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVEVSIARVCPKSFNLFFADDSL

Query:  VFLKVAADEFGHFKSILKDYEKACG
        V+L+   D       ++K+Y    G
Subjt:  VFLKVAADEFGHFKSILKDYEKACG

P11369 LINE-1 retrotransposable element ORF2 protein2.7e-1626.34Show/hide
Query:  IVLIPKILQ-ARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWS
        I LIPK  +    + N+RPISL N+  KI+ K +AN+++  +  I+   Q  FIP      N+      +HY+ +K K K  +  + LD  KA+D+++  
Subjt:  IVLIPKILQ-ARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWS

Query:  YLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVEVSIARVCPKSFNL-FFADDSLV
        ++ +++++ G    ++ ++    +    +I +NGE    I    G RQ  PLSPYLF I +E L+  +     R   E+   ++  +   +   ADD +V
Subjt:  YLSQIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVEVSIARVCPKSFNL-FFADDSLV

Query:  FLKVAADEFGHFKSILKDYEKACG
        ++    +      +++  + +  G
Subjt:  FLKVAADEFGHFKSILKDYEKACG

P14381 Transposon TX1 uncharacterized 149 kDa protein1.5e-1731.17Show/hide
Query:  LIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLS
        L+PK    RL+ N+RP+SL +  YKIV K I+ +LK VL E++   QS  +P R+I DN+ L  + LH+    R+     A L LD  KA+DRV+  YL 
Subjt:  LIPKILQARLVSNYRPISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLS

Query:  QIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVEVSIARVCPKSFNLFFADDSLVFLKV
          +    F   ++  +     +    + +N      +   RG+RQ  PLS  L+ + +E    LL    T  +++    RV   +    +ADD ++   V
Subjt:  QIMDKLGFHASWIKLVMKCITTTTFSILMNGESFGFIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVEVSIARVCPKSFNLFFADDSLVFLKV

Query:  AADEFGHFKSILKDYEKACGSPKVFYPKKSA
        A D        L D E+A    +V+    SA
Subjt:  AADEFGHFKSILKDYEKACGSPKVFYPKKSA

P93295 Uncharacterized mitochondrial protein AtMg003102.4e-1246.15Show/hide
Query:  KVFYPKKSALCAKFRWGSLGDKRRMQWKRWEDLCKPKE-IGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRVLCGR
        K+   K ++   +F W S  +KR++ W  W+ LCK KE  GGL FRDL  FNQA+LAKQ++R++  PH  +SR+L  R
Subjt:  KVFYPKKSALCAKFRWGSLGDKRRMQWKRWEDLCKPKE-IGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRVLCGR

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases7.2e-1237.35Show/hide
Query:  IANKLKVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWI
        +  +LK ++  ++   Q++FIP R  TDN++   E +H ++ K KG  G+  LKLD+ KAYDR+ W YL   +   GF   W+
Subjt:  IANKLKVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWI

AT4G29090.1 Ribonuclease H-like superfamily protein3.3e-0934.83Show/hide
Query:  LKDYEKACG-SPKVFYPKKSALCAKFRWGSLGDKRRMQWKRWEDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRVLCGR
        L  Y  AC   PK    +  ++ A F W +  + + M WK W+ L   K  GG+ F+D+  FN A+L KQ WR+L+ P   +++V   R
Subjt:  LKDYEKACG-SPKVFYPKKSALCAKFRWGSLGDKRRMQWKRWEDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRVLCGR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.7e-1346.15Show/hide
Query:  KVFYPKKSALCAKFRWGSLGDKRRMQWKRWEDLCKPKE-IGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRVLCGR
        K+   K ++   +F W S  +KR++ W  W+ LCK KE  GGL FRDL  FNQA+LAKQ++R++  PH  +SR+L  R
Subjt:  KVFYPKKSALCAKFRWGSLGDKRRMQWKRWEDLCKPKE-IGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRVLCGR

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)5.3e-0739.71Show/hide
Query:  LMNGESFGFIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVE-VSIARVCPKSFNLFFADDS
        ++NG   G +  SRG+RQ +PLSPYLF++C E LS L   A+ +  +  + ++   P+  +L FADD+
Subjt:  LMNGESFGFIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVE-VSIARVCPKSFNLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCTCCACTTTTGATGGGTATAGCATCCGATAACAGTAAGGTGAATGAAGATGATGACGTGATGGAAAGTGATCTTGAGATGGAGGAGGAATTGGGGCCTTTGAG
TTCGGATGGTAGGAATTGGGTTGATTCCAACAAAGAGCTATGTTTGCAGGCTTGTACGAGTGGCTCCCTAAAGCCCAATAATGAACGGGTCGAGACCGAAGGGAAGGGTG
GGAGGGTAGTGAAGAGAATACATGATTTTAGTGATGTTGCATGGGTGATTGGTGGTGACTTGAATGAGATTCTATGGCAAAATGAGAAATCGGGAGGTCCTATTAGAGAC
AATCGTCAAATCTTGGCTTTTCGGGAAGTATTGGATGATTGTAACCTCCGGGACCTTGGTTTCTCTGGGGGAGATGAATCAGATGCTTATGGCTCCTTATACTCGGGAGG
AAGTGATTTTAGCCATGAGGAGTTTCCACCCGACAAAGACCTCGGGGCCAGATGGATTTCCTGCTTTATTCTATCAAAAATATTGGGCAATTGTGGGCCACACGACGTGT
CAAATTGTTTGGCCATTTTGAATTCGAATGAGTCGATGCGGGTGTGGAACCATACAAATATTGTGCTCATCCCAAAGATTCTTCAGGCAAGGTTAGTATCTAATTATCGC
CCAATTAGTTTATGTAACGTCTCCTATAAAATTGTTACTAAGTTCATAGCCAATAAACTCAAGGTCGTGTTAAATGAGATTGTAGATGAGTGTCAATCAACTTTTATCCC
CGATAGATCGATAACTGATAATATGATATTGGGGCATGAAACTTTACATTATCTCCAACACAAGCGTAAAGGGAAAACTGGGTATGCTGCACTAAAACTTGATATGAGTA
AAGCATACGATAGGGTGGAGTGGTCGTATTTGAGCCAAATCATGGATAAGTTGGGTTTTCATGCTAGTTGGATTAAATTGGTAATGAAGTGTATAACGACGACCACATTT
TCCATTCTTATGAATGGAGAATCTTTTGGTTTTATTAAGTCATCCCGTGGGATAAGGCAATGTAATCCTTTATCTCCTTACCTGTTCTTAATCTGTATGGAAGGTCTTTC
CGCTCTGTTGGTATCAGCTAGGACAAGATCTATGGTCGAGGTGTCAATAGCGAGAGTTTGTCCCAAAAGTTTTAATCTATTTTTTGCGGATGATAGTCTGGTTTTCCTTA
AAGTTGCGGCTGACGAATTTGGGCATTTCAAATCTATTTTGAAGGACTATGAGAAGGCATGTGGATCCCCAAAGGTATTTTATCCAAAAAAATCGGCACTCTGTGCCAAG
TTCAGATGGGGCTCTCTTGGGGATAAGCGTAGAATGCAATGGAAACGATGGGAGGATCTCTGTAAACCAAAGGAGATTGGTGGTTTAAATTTTCGAGATCTAGTCAATTT
CAACCAGGCAATGCTTGCAAAACAAGCTTGGAGGGTGTTGACTAACCCGCATATTACAATCTCCAGAGTTTTGTGCGGGAGAGGAGAGTACACTGTTAAGAGTGGGTATA
AGCTTAGTATGATGAATAGTCAAGAGGCTTCTTTGTTAAGAGTAGGACGGGAGATGAGATGGTGGAACAAGCTTTGGAAGATGAGGGTGCCAAGCAAGGTGAAACTTTTT
GTCTGGAAATCCTTCCACAATTCAATTCCAACTATGGTCAACCTATGGAATCATCATGTTCCTGTCAACGGGATTTTCCCCGTTTTCCAAGAGGAGATGGAGACTACAGA
TCATGCCCTTTTTCAGCGTTCGAGGGCTCGGGAGGTCTGGGAACTTATTCTTGTTGGGATTTATGCTCTAAAACTCGTAGATAGTGAATGTAAACAAATTGTATACTTGT
CTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCTCCACTTTTGATGGGTATAGCATCCGATAACAGTAAGGTGAATGAAGATGATGACGTGATGGAAAGTGATCTTGAGATGGAGGAGGAATTGGGGCCTTTGAG
TTCGGATGGTAGGAATTGGGTTGATTCCAACAAAGAGCTATGTTTGCAGGCTTGTACGAGTGGCTCCCTAAAGCCCAATAATGAACGGGTCGAGACCGAAGGGAAGGGTG
GGAGGGTAGTGAAGAGAATACATGATTTTAGTGATGTTGCATGGGTGATTGGTGGTGACTTGAATGAGATTCTATGGCAAAATGAGAAATCGGGAGGTCCTATTAGAGAC
AATCGTCAAATCTTGGCTTTTCGGGAAGTATTGGATGATTGTAACCTCCGGGACCTTGGTTTCTCTGGGGGAGATGAATCAGATGCTTATGGCTCCTTATACTCGGGAGG
AAGTGATTTTAGCCATGAGGAGTTTCCACCCGACAAAGACCTCGGGGCCAGATGGATTTCCTGCTTTATTCTATCAAAAATATTGGGCAATTGTGGGCCACACGACGTGT
CAAATTGTTTGGCCATTTTGAATTCGAATGAGTCGATGCGGGTGTGGAACCATACAAATATTGTGCTCATCCCAAAGATTCTTCAGGCAAGGTTAGTATCTAATTATCGC
CCAATTAGTTTATGTAACGTCTCCTATAAAATTGTTACTAAGTTCATAGCCAATAAACTCAAGGTCGTGTTAAATGAGATTGTAGATGAGTGTCAATCAACTTTTATCCC
CGATAGATCGATAACTGATAATATGATATTGGGGCATGAAACTTTACATTATCTCCAACACAAGCGTAAAGGGAAAACTGGGTATGCTGCACTAAAACTTGATATGAGTA
AAGCATACGATAGGGTGGAGTGGTCGTATTTGAGCCAAATCATGGATAAGTTGGGTTTTCATGCTAGTTGGATTAAATTGGTAATGAAGTGTATAACGACGACCACATTT
TCCATTCTTATGAATGGAGAATCTTTTGGTTTTATTAAGTCATCCCGTGGGATAAGGCAATGTAATCCTTTATCTCCTTACCTGTTCTTAATCTGTATGGAAGGTCTTTC
CGCTCTGTTGGTATCAGCTAGGACAAGATCTATGGTCGAGGTGTCAATAGCGAGAGTTTGTCCCAAAAGTTTTAATCTATTTTTTGCGGATGATAGTCTGGTTTTCCTTA
AAGTTGCGGCTGACGAATTTGGGCATTTCAAATCTATTTTGAAGGACTATGAGAAGGCATGTGGATCCCCAAAGGTATTTTATCCAAAAAAATCGGCACTCTGTGCCAAG
TTCAGATGGGGCTCTCTTGGGGATAAGCGTAGAATGCAATGGAAACGATGGGAGGATCTCTGTAAACCAAAGGAGATTGGTGGTTTAAATTTTCGAGATCTAGTCAATTT
CAACCAGGCAATGCTTGCAAAACAAGCTTGGAGGGTGTTGACTAACCCGCATATTACAATCTCCAGAGTTTTGTGCGGGAGAGGAGAGTACACTGTTAAGAGTGGGTATA
AGCTTAGTATGATGAATAGTCAAGAGGCTTCTTTGTTAAGAGTAGGACGGGAGATGAGATGGTGGAACAAGCTTTGGAAGATGAGGGTGCCAAGCAAGGTGAAACTTTTT
GTCTGGAAATCCTTCCACAATTCAATTCCAACTATGGTCAACCTATGGAATCATCATGTTCCTGTCAACGGGATTTTCCCCGTTTTCCAAGAGGAGATGGAGACTACAGA
TCATGCCCTTTTTCAGCGTTCGAGGGCTCGGGAGGTCTGGGAACTTATTCTTGTTGGGATTTATGCTCTAAAACTCGTAGATAGTGAATGTAAACAAATTGTATACTTGT
CTTAA
Protein sequenceShow/hide protein sequence
MSSPLLMGIASDNSKVNEDDDVMESDLEMEEELGPLSSDGRNWVDSNKELCLQACTSGSLKPNNERVETEGKGGRVVKRIHDFSDVAWVIGGDLNEILWQNEKSGGPIRD
NRQILAFREVLDDCNLRDLGFSGGDESDAYGSLYSGGSDFSHEEFPPDKDLGARWISCFILSKILGNCGPHDVSNCLAILNSNESMRVWNHTNIVLIPKILQARLVSNYR
PISLCNVSYKIVTKFIANKLKVVLNEIVDECQSTFIPDRSITDNMILGHETLHYLQHKRKGKTGYAALKLDMSKAYDRVEWSYLSQIMDKLGFHASWIKLVMKCITTTTF
SILMNGESFGFIKSSRGIRQCNPLSPYLFLICMEGLSALLVSARTRSMVEVSIARVCPKSFNLFFADDSLVFLKVAADEFGHFKSILKDYEKACGSPKVFYPKKSALCAK
FRWGSLGDKRRMQWKRWEDLCKPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPHITISRVLCGRGEYTVKSGYKLSMMNSQEASLLRVGREMRWWNKLWKMRVPSKVKLF
VWKSFHNSIPTMVNLWNHHVPVNGIFPVFQEEMETTDHALFQRSRAREVWELILVGIYALKLVDSECKQIVYLS