CuGenDBv2

Moc04g25860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g25860
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr4:18775036..18783677
RNA-Seq Expression: Moc04g25860
Synteny: Moc04g25860
Gene Ontology terms: NA
InterPro domains: NA
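
The genome location field uses a `chromosome:start..end` form. The following is a minimal sketch of how such a string could be parsed to get the span of the gene model. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(loc: str):
    """Split 'chr:start..end' into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The page itself does not confirm this.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("chr4:18775036..18783677"))
```

Under the inclusive-coordinate assumption, the span covers the whole gene model on chr4, not only the coding sequence listed further down.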


Homology
GenBank top hits (e value, % identity, alignment)
CDH30699.1 putative Ty1-copia-like retrotransposon [Cercis chinensis] | e value: 4.9e-62 | % identity: 33.87
Query:  PTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQIL--------DQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASE
        PT  Q+L++KL   NFL+W++QLLN V+ANG    L+GT   P Q +           S+ LNP+Y LW+R NR++M WIYSSL+EQ M +I++  SA E
Subjt:  PTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQIL--------DQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASE

Query:  IWSSLNRAYDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEE
        IW++L  +Y   + ARIM L+ QLQ+ +K GLSV  Y+ +I+ + D   AIGE +S  DQ+  +L GLG+EYN  V SI +R DS S++ ++S L  YE+
Subjt:  IWSSLNRAYDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEE

Query:  RLEKQNIVDQLNVAQANLSNLSLQ-------------------------------------HNNK------------------------------RHSPK
        RLE QN V+Q    QAN +  +                                       H+NK                              +  PK
Subjt:  RLEKQNIVDQLNVAQANLSNLSLQ-------------------------------------HNNK------------------------------RHSPK

Query:  SFFLNQSKTSFPHPVSAAQIPPSILDEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVGN--------------------------------------
        +   NQ  +S   P +    P  + +  WFLD+GATHH+T D +      P++G ++V VGN                                      
Subjt:  SFFLNQSKTSFPHPVSAAQIPPSILDEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVGN--------------------------------------

Query:  ------DNQAFVEFYSSFFLVKDLRSKTILLRGSLDDGLYSVTSSLTSSS-------VGSTPSATAAFLSSVQSSPNWHLRLGHPAVSVLQQL
              DN A+VEFY S+F VKD +++ ILL+G LD GLYSV S+ +S +       + ST S+T     S+ S   WH RLGHP+++++ ++
Subjt:  ------DNQAFVEFYSSFFLVKDLRSKTILLRGSLDDGLYSVTSSLTSSS-------VGSTPSATAAFLSSVQSSPNWHLRLGHPAVSVLQQL

GFZ12741.1 UBX domain-containing protein [Actinidia rufa] | e value: 9.2e-61 | % identity: 33.4
Query:  PTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRA
        P++ Q L+VKL D N+++WK+QLLN VIANGL  FLDG+ + PP+ LD Q    NP++  W+RYNR++M WIY+S++E  +G+I+   SAS+IW +L R 
Subjt:  PTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRA

Query:  YDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRS-LLLAYEERLEKQNI
        Y   + A +  L+  LQ+IKK+GL+   Y+ + + + +  A+IGEP++Y D L + L GLG +YN FVTSIQ++A  PS+E+  S   L  + + +  + 
Subjt:  YDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRS-LLLAYEERLEKQNI

Query:  VD--------------------------------QLNVAQANLSNLSLQHNNKRHS-----PKSFFLNQSKTSFPHPVSAAQIPPSIL----DEGWFLDS
                                          Q+ +   + +N    H N  +      P++F  N   T  P   +++  P  +L    D  W++DS
Subjt:  VD--------------------------------QLNVAQANLSNLSLQHNNKRHS-----PKSFFLNQSKTSFPHPVSAAQIPPSIL----DEGWFLDS

Query:  GATHHMTPDFSIFCNRIPYNGGEQVTVGN--------------------------------------------DNQAFVEFYSSFFLVKDLRSKTILLRG
        GA+HH TPD ++  +  PY G +QVTVGN                                            DN AF+EFY +FFLVK   +K +LLRG
Subjt:  GATHHMTPDFSIFCNRIPYNGGEQVTVGN--------------------------------------------DNQAFVEFYSSFFLVKDLRSKTILLRG

Query:  SLDDGLYSVTSSLTSSSVGSTP------SATAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSLLFWANIIATNATTNPPPTVIVAAFSPSSV
         LD GLY V SS  S     +P       +  + LS+  +SP  +L    P+       SP LS  F+      N TT PP +   +  SPS++
Subjt:  SLDDGLYSVTSSLTSSSVGSTP------SATAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSLLFWANIIATNATTNPPPTVIVAAFSPSSV

RVW59875.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 6.2e-57 | % identity: 31.36
Query:  LPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRAYD
        L  +L +KL   N++LW+ Q+ N V ANG    ++G  + PPQ     S   NPD+ +W R++R+++ WIYSSL+ + MG+I+  +S+   W +L R + 
Subjt:  LPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRAYD

Query:  FETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNIVDQ
          + AR+M L+ + Q+ +K  L++ +Y+ ++K +AD  AAIGEP++ RDQ+  +L GLG +YN+ V S+  R D  SL  V S+LL +E+RL  QN V +
Subjt:  FETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNIVDQ

Query:  LNVAQANLSNLSLQH-NNKR-----------------------------------------------------HSPKSFFLNQSKTSFPHPVSAAQIPPS
         NV  ANL+    QH NNKR                                                     ++P    +  +K +  + V A    PS
Subjt:  LNVAQANLSNLSLQH-NNKR-----------------------------------------------------HSPKSFFLNQSKTSFPHPVSAAQIPPS

Query:  -ILDEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVGN--------------------------------------------DNQAFVEFYSSFFLVK
         I DE WF D+GATHH++       +  PY G ++V VGN                                            DN  F EF+  FF VK
Subjt:  -ILDEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVGN--------------------------------------------DNQAFVEFYSSFFLVK

Query:  DLRSKTILLRGSLDDGLYSVTSSLTSSSVGSTPSA--TAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSL
        D  +K ILL+GSL+ GLY   +    S      S+   ++ LS   ++  WH RLGHPA ++L+ +  S ++
Subjt:  DLRSKTILLRGSLDDGLYSVTSSLTSSSVGSTPSA--TAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSL

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 9.5e-74 | % identity: 38.96
Query:  PTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRA
        P+L QSLS+KL +TN LL K QLLN +IANGL  F+D     PP+ LD     +NP++  W+R N+++M WIYSSL+   +G+I+   +A +IW+SLN  
Subjt:  PTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRA

Query:  YDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNIV
        Y+  + A +M L +QLQ IKK  + +S+YLS++K V D+FA IGEP+SYRD+L  IL+GL  EY+ FVTSI NR+D PSL++V SLL  YE RL ++++ 
Subjt:  YDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNIV

Query:  DQLNVAQAN---------------------LSNLSLQHNNKRHSPKSF-----FLNQSKTSFPHPVSA---AQIPPSIL-----DEGWFLDSGATHHMTP
          LN  QAN                     ++       N  + P  F     F          P+SA       P+ L     D  W++DSGATHH TP
Subjt:  DQLNVAQAN---------------------LSNLSLQHNNKRHSPKSF-----FLNQSKTSFPHPVSA---AQIPPSIL-----DEGWFLDSGATHHMTP

Query:  DFSIFCNRIPYNGGEQVTVGN--------------------------------------------DNQAFVEFYSSFFLVKDLRSKTILLRGSLDDGLYS
        +F    + + Y+ G+   VGN                                            DN+AFVEFY +FFLVKD ++K +LL+G L+ GLY 
Subjt:  DFSIFCNRIPYNGGEQVTVGN--------------------------------------------DNQAFVEFYSSFFLVKDLRSKTILLRGSLDDGLYS

Query:  VT--SSLTSSSVGSTPSA-------TAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSLLF
        +   ++LTSSSV S+PS          AFLS       WH RLGHPA  V+ Q+  S +L F
Subjt:  VT--SSLTSSSVGSTPSA-------TAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSLLF

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] | e value: 3.9e-112 | % identity: 61.06
Query:  FPTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNR
        FPTLPQ L+VKL D NFLLWK+QLLNAVIANGLRG+LDGTI+PPPQ LD   L  NP Y  WERYNR+LMCWIYSSLSE+KMGE++SLE+  +IWSSL R
Subjt:  FPTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNR

Query:  AYDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNI
         YD +TTARIMGLK +LQ+++KDG SVSQYL++IKE+ADKFAA+GEP+SYRD LAH+LDGLG+EYNAFVTSI NRADSPSLEDVRSLLLAYE RL+KQN 
Subjt:  AYDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNI

Query:  VDQLNVAQANLSNLSLQHNNKRHSPKSFFLNQSKTSFPH-PVSAAQ------------------------------------------------------
        VDQLN+AQANL NLSLQHN+KR  PK  F N  K SFP+ P+SAAQ                                                      
Subjt:  VDQLNVAQANLSNLSLQHNNKRHSPKSFFLNQSKTSFPH-PVSAAQ------------------------------------------------------

Query:  ---IPPSIL-----------DEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVGN
           + PS             DE WF+DSGATHHMTPD SI CN  PY+GGEQVTVGN
Subjt:  ---IPPSIL-----------DEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVGN

TrEMBL top hits (e value, % identity, alignment)
A0A438FIP9 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 3.0e-57 | % identity: 31.36
Query:  LPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRAYD
        L  +L +KL   N++LW+ Q+ N V ANG    ++G  + PPQ     S   NPD+ +W R++R+++ WIYSSL+ + MG+I+  +S+   W +L R + 
Subjt:  LPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRAYD

Query:  FETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNIVDQ
          + AR+M L+ + Q+ +K  L++ +Y+ ++K +AD  AAIGEP++ RDQ+  +L GLG +YN+ V S+  R D  SL  V S+LL +E+RL  QN V +
Subjt:  FETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNIVDQ

Query:  LNVAQANLSNLSLQH-NNKR-----------------------------------------------------HSPKSFFLNQSKTSFPHPVSAAQIPPS
         NV  ANL+    QH NNKR                                                     ++P    +  +K +  + V A    PS
Subjt:  LNVAQANLSNLSLQH-NNKR-----------------------------------------------------HSPKSFFLNQSKTSFPHPVSAAQIPPS

Query:  -ILDEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVGN--------------------------------------------DNQAFVEFYSSFFLVK
         I DE WF D+GATHH++       +  PY G ++V VGN                                            DN  F EF+  FF VK
Subjt:  -ILDEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVGN--------------------------------------------DNQAFVEFYSSFFLVK

Query:  DLRSKTILLRGSLDDGLYSVTSSLTSSSVGSTPSA--TAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSL
        D  +K ILL+GSL+ GLY   +    S      S+   ++ LS   ++  WH RLGHPA ++L+ +  S ++
Subjt:  DLRSKTILLRGSLDDGLYSVTSSLTSSSVGSTPSA--TAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSL

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 4.6e-74 | % identity: 38.96
Query:  PTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRA
        P+L QSLS+KL +TN LL K QLLN +IANGL  F+D     PP+ LD     +NP++  W+R N+++M WIYSSL+   +G+I+   +A +IW+SLN  
Subjt:  PTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRA

Query:  YDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNIV
        Y+  + A +M L +QLQ IKK  + +S+YLS++K V D+FA IGEP+SYRD+L  IL+GL  EY+ FVTSI NR+D PSL++V SLL  YE RL ++++ 
Subjt:  YDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNIV

Query:  DQLNVAQAN---------------------LSNLSLQHNNKRHSPKSF-----FLNQSKTSFPHPVSA---AQIPPSIL-----DEGWFLDSGATHHMTP
          LN  QAN                     ++       N  + P  F     F          P+SA       P+ L     D  W++DSGATHH TP
Subjt:  DQLNVAQAN---------------------LSNLSLQHNNKRHSPKSF-----FLNQSKTSFPHPVSA---AQIPPSIL-----DEGWFLDSGATHHMTP

Query:  DFSIFCNRIPYNGGEQVTVGN--------------------------------------------DNQAFVEFYSSFFLVKDLRSKTILLRGSLDDGLYS
        +F    + + Y+ G+   VGN                                            DN+AFVEFY +FFLVKD ++K +LL+G L+ GLY 
Subjt:  DFSIFCNRIPYNGGEQVTVGN--------------------------------------------DNQAFVEFYSSFFLVKDLRSKTILLRGSLDDGLYS

Query:  VT--SSLTSSSVGSTPSA-------TAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSLLF
        +   ++LTSSSV S+PS          AFLS       WH RLGHPA  V+ Q+  S +L F
Subjt:  VT--SSLTSSSVGSTPSA-------TAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSLLF

A0A6J1DQX7 uncharacterized protein LOC111022315 | e value: 1.9e-112 | % identity: 61.06
Query:  FPTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNR
        FPTLPQ L+VKL D NFLLWK+QLLNAVIANGLRG+LDGTI+PPPQ LD   L  NP Y  WERYNR+LMCWIYSSLSE+KMGE++SLE+  +IWSSL R
Subjt:  FPTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNR

Query:  AYDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNI
         YD +TTARIMGLK +LQ+++KDG SVSQYL++IKE+ADKFAA+GEP+SYRD LAH+LDGLG+EYNAFVTSI NRADSPSLEDVRSLLLAYE RL+KQN 
Subjt:  AYDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNI

Query:  VDQLNVAQANLSNLSLQHNNKRHSPKSFFLNQSKTSFPH-PVSAAQ------------------------------------------------------
        VDQLN+AQANL NLSLQHN+KR  PK  F N  K SFP+ P+SAAQ                                                      
Subjt:  VDQLNVAQANLSNLSLQHNNKRHSPKSFFLNQSKTSFPH-PVSAAQ------------------------------------------------------

Query:  ---IPPSIL-----------DEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVGN
           + PS             DE WF+DSGATHHMTPD SI CN  PY+GGEQVTVGN
Subjt:  ---IPPSIL-----------DEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVGN

A0A7J0GPN0 UBX domain-containing protein | e value: 4.4e-61 | % identity: 33.4
Query:  PTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRA
        P++ Q L+VKL D N+++WK+QLLN VIANGL  FLDG+ + PP+ LD Q    NP++  W+RYNR++M WIY+S++E  +G+I+   SAS+IW +L R 
Subjt:  PTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRA

Query:  YDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRS-LLLAYEERLEKQNI
        Y   + A +  L+  LQ+IKK+GL+   Y+ + + + +  A+IGEP++Y D L + L GLG +YN FVTSIQ++A  PS+E+  S   L  + + +  + 
Subjt:  YDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRS-LLLAYEERLEKQNI

Query:  VD--------------------------------QLNVAQANLSNLSLQHNNKRHS-----PKSFFLNQSKTSFPHPVSAAQIPPSIL----DEGWFLDS
                                          Q+ +   + +N    H N  +      P++F  N   T  P   +++  P  +L    D  W++DS
Subjt:  VD--------------------------------QLNVAQANLSNLSLQHNNKRHS-----PKSFFLNQSKTSFPHPVSAAQIPPSIL----DEGWFLDS

Query:  GATHHMTPDFSIFCNRIPYNGGEQVTVGN--------------------------------------------DNQAFVEFYSSFFLVKDLRSKTILLRG
        GA+HH TPD ++  +  PY G +QVTVGN                                            DN AF+EFY +FFLVK   +K +LLRG
Subjt:  GATHHMTPDFSIFCNRIPYNGGEQVTVGN--------------------------------------------DNQAFVEFYSSFFLVKDLRSKTILLRG

Query:  SLDDGLYSVTSSLTSSSVGSTP------SATAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSLLFWANIIATNATTNPPPTVIVAAFSPSSV
         LD GLY V SS  S     +P       +  + LS+  +SP  +L    P+       SP LS  F+      N TT PP +   +  SPS++
Subjt:  SLDDGLYSVTSSLTSSSVGSTP------SATAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSLLFWANIIATNATTNPPPTVIVAAFSPSSV

U6EFK2 Putative Ty1-copia-like retrotransposon | e value: 2.4e-62 | % identity: 33.87
Query:  PTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQIL--------DQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASE
        PT  Q+L++KL   NFL+W++QLLN V+ANG    L+GT   P Q +           S+ LNP+Y LW+R NR++M WIYSSL+EQ M +I++  SA E
Subjt:  PTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQIL--------DQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASE

Query:  IWSSLNRAYDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEE
        IW++L  +Y   + ARIM L+ QLQ+ +K GLSV  Y+ +I+ + D   AIGE +S  DQ+  +L GLG+EYN  V SI +R DS S++ ++S L  YE+
Subjt:  IWSSLNRAYDFETTARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEE

Query:  RLEKQNIVDQLNVAQANLSNLSLQ-------------------------------------HNNK------------------------------RHSPK
        RLE QN V+Q    QAN +  +                                       H+NK                              +  PK
Subjt:  RLEKQNIVDQLNVAQANLSNLSLQ-------------------------------------HNNK------------------------------RHSPK

Query:  SFFLNQSKTSFPHPVSAAQIPPSILDEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVGN--------------------------------------
        +   NQ  +S   P +    P  + +  WFLD+GATHH+T D +      P++G ++V VGN                                      
Subjt:  SFFLNQSKTSFPHPVSAAQIPPSILDEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVGN--------------------------------------

Query:  ------DNQAFVEFYSSFFLVKDLRSKTILLRGSLDDGLYSVTSSLTSSS-------VGSTPSATAAFLSSVQSSPNWHLRLGHPAVSVLQQL
              DN A+VEFY S+F VKD +++ ILL+G LD GLYSV S+ +S +       + ST S+T     S+ S   WH RLGHP+++++ ++
Subjt:  ------DNQAFVEFYSSFFLVKDLRSKTILLRGSLDDGLYSVTSSLTSSS-------VGSTPSATAAFLSSVQSSPNWHLRLGHPAVSVLQQL

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 2.4e-32 | % identity: 25.27
Query:  KLTDTNFLLWKDQLLNAVIANGLRGFLDG-TILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRAYDFETTAR
        KLT TN+L+W  Q+        L GFLDG T +PP  I    +  +NPDY+ W+R ++++   +  ++S      +    +A++IW +L + Y   +   
Subjt:  KLTDTNFLLWKDQLLNAVIANGLRGFLDG-TILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRAYDFETTAR

Query:  IMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERL----------EKQN
        +  L+ QL+   K   ++  Y+  +    D+ A +G+P+ + +Q+  +L+ L  EY   +  I  +   P+L ++   LL +E ++             N
Subjt:  IMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERL----------EKQN

Query:  IVDQLNVAQANLSNLSLQHN---NKRHSPKSFFLNQSKTSFPHPVS---------------------------------AAQIPPS--------------
         V   N    N +N   ++N   N+ ++  S    QS T+F HP +                                  +Q PPS              
Subjt:  IVDQLNVAQANLSNLSLQHN---NKRHSPKSFFLNQSKTSFPHPVS---------------------------------AAQIPPS--------------

Query:  --ILDEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVG--------------------------------------------NDNQAFVEFYSSFFLV
               W LDSGATHH+T DF+      PY GG+ V V                                             N N   VEF+ + F V
Subjt:  --ILDEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVG--------------------------------------------NDNQAFVEFYSSFFLV

Query:  KDLRSKTILLRGSLDDGLYSVTSSLTSSSVGSTPSATAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSL
        KDL +   LL+G   D LY    +       S P +  A  SS  +  +WH RLGHPA S+L  +  + SL
Subjt:  KDLRSKTILLRGSLDDGLYSVTSSLTSSSVGSTPSATAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 4.2e-24 | % identity: 23.24
Query:  KLTDTNFLLWKDQLLNAVIANGLRGFLDG-TILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRAYDFETTAR
        KLT TN+L+W  Q+        L GFLDG T +PP  I       +NPDY+ W R ++++   I  ++S      +    +A++IW +L + Y   +   
Subjt:  KLTDTNFLLWKDQLLNAVIANGLRGFLDG-TILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRAYDFETTAR

Query:  IMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNIVDQL----N
        +     QL+ I +                D+ A +G+P+ + +Q+  +L+ L  +Y   +  I  +   PSL ++   L+  E +L   N  + +    N
Subjt:  IMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNIVDQL----N

Query:  VAQANLSNLSLQHNNK--------------------------RHSPKSFF-------------------------LNQSKTSFP----HPVSAAQIPPSI
        V     +N +   NN+                             PK +                           NQ +++ P     P +   +    
Subjt:  VAQANLSNLSLQHNNK--------------------------RHSPKSFF-------------------------LNQSKTSFP----HPVSAAQIPPSI

Query:  LDEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVG--------------------------------------------NDNQAFVEFYSSFFLVKDL
            W LDSGATHH+T DF+      PY GG+ V +                                             N N+  VEF+ + F VKDL
Subjt:  LDEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVG--------------------------------------------NDNQAFVEFYSSFFLVKDL

Query:  RSKTILLRGSLDDGLYS-VTSSLTSSSVGSTPSATAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSL
         +   LL+G   D LY    +S  + S+ ++P + A   S       WH RLGHP++++L  +  + SL
Subjt:  RSKTILLRGSLDDGLYS-VTSSLTSSSVGSTPSATAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSL

Arabidopsis top hits (e value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 2.4e-14 | % identity: 21.93
Query:  LTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLS-EQKMGEIISLESASEIWSSLNRAYDFETTARI
        + ++N+  W++  L   ++  + G +DGT+LP            N +   W++ + I+   +Y +L+ +Q  G  ++  ++ +IW  +   +     AR 
Subjt:  LTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLS-EQKMGEIISLESASEIWSSLNRAYDFETTARI

Query:  MGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEK
        + L ++L++     + V+ Y  ++K++AD    +  P++ R+ + ++L+GL  +++  +  I++R   PS +D  ++L   E+RL++
Subjt:  MGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEK

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 3.9e-17 | % identity: 24.02
Query:  LSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLE-SASEIWSSLNRAYDFET
        +++ L   N+ +W++      ++ G+ G +DG+  P P    +           W+  + ++  WIY ++++  +  II +  +A ++W SL   +    
Subjt:  LSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLE-SASEIWSSLNRAYDFET

Query:  TARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNIVDQLNV
         AR +  + +L++   D LSV +Y  ++K ++D    +  PIS R  + H+L+GL  +Y+  +  I++++  PS  + RS+LL  E RL  ++     + 
Subjt:  TARIMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNIVDQLNV

Query:  AQANLSNL--SLQHNNKRHSPKSFFLNQS
           +LSN+  ++    +R+ P+ +  N S
Subjt:  AQANLSNL--SLQHNNKRHSPKSFFLNQS


Sequences
CDS sequence
ATGTTTCCGACTTTACCTCAGTCTTTGAGTGTCAAATTGACTGATACAAATTTTCTTCTCTGGAAGGATCAACTTCTGAATGCTGTTATTGCTAATGGGCTTCGTGGGTT
TCTTGATGGCACGATTCTTCCTCCACCACAAATTCTGGACCAGCAAAGCCTTCATCTGAATCCAGATTATTCTCTTTGGGAAAGGTATAATCGCATCCTTATGTGTTGGA
TTTATTCTTCTTTGTCTGAACAGAAGATGGGGGAAATTATATCTTTAGAGTCTGCTTCTGAAATCTGGTCTTCTCTCAATCGTGCTTATGATTTTGAAACTACTGCTAGA
ATTATGGGGTTGAAAGCACAATTACAGAGTATTAAGAAAGATGGATTATCTGTGAGTCAGTATCTTTCTCAAATAAAAGAAGTTGCGGATAAATTTGCTGCCATTGGTGA
ACCTATCTCATACCGTGATCAGTTGGCTCATATACTTGATGGCTTAGGCACTGAATATAATGCTTTTGTAACGTCTATTCAAAATAGAGCGGATTCTCCTTCTTTAGAAG
ATGTTCGCAGCCTTCTCTTGGCTTATGAAGAACGGTTGGAAAAACAAAATATTGTGGATCAGCTTAATGTTGCCCAAGCCAATCTCTCAAATCTTTCCCTTCAACACAAT
AACAAACGTCATTCTCCAAAGTCTTTTTTTCTTAATCAGTCAAAAACATCATTCCCTCACCCTGTTTCCGCTGCACAAATTCCTCCCAGTATTTTAGATGAAGGTTGGTT
TTTGGACTCTGGAGCTACTCACCATATGACTCCAGATTTTTCAATTTTCTGTAACCGAATACCTTATAATGGTGGTGAACAAGTCACAGTGGGCAATGATAACCAAGCGT
TTGTTGAGTTTTACTCTTCTTTTTTCCTTGTTAAGGATCTTCGGTCCAAGACGATTCTTCTTCGGGGATCGCTTGATGATGGTTTATATAGCGTCACTTCTAGCTTAACA
AGTTCTTCTGTTGGCTCCACTCCTTCAGCTACTGCTGCCTTTTTATCTTCTGTTCAATCGTCTCCAAATTGGCATTTGCGTCTTGGACACCCTGCTGTTTCAGTCTTGCA
GCAACTCTCTCCTTCCCTTTCTCTCCTATTTTGGGCCAACATTATTGCCACTAACGCCACCACCAACCCACCACCCACCGTCATTGTCGCTGCATTTTCTCCATCATCGG
TGACAGAGAAGGGAAAGGAAGGTCTCCCTCTCTCCTTCGAGAGCTCCTCCTCCCGTCGCGACTACGAAGGGGCATAG
mRNA sequence
ATGTTTCCGACTTTACCTCAGTCTTTGAGTGTCAAATTGACTGATACAAATTTTCTTCTCTGGAAGGATCAACTTCTGAATGCTGTTATTGCTAATGGGCTTCGTGGGTT
TCTTGATGGCACGATTCTTCCTCCACCACAAATTCTGGACCAGCAAAGCCTTCATCTGAATCCAGATTATTCTCTTTGGGAAAGGTATAATCGCATCCTTATGTGTTGGA
TTTATTCTTCTTTGTCTGAACAGAAGATGGGGGAAATTATATCTTTAGAGTCTGCTTCTGAAATCTGGTCTTCTCTCAATCGTGCTTATGATTTTGAAACTACTGCTAGA
ATTATGGGGTTGAAAGCACAATTACAGAGTATTAAGAAAGATGGATTATCTGTGAGTCAGTATCTTTCTCAAATAAAAGAAGTTGCGGATAAATTTGCTGCCATTGGTGA
ACCTATCTCATACCGTGATCAGTTGGCTCATATACTTGATGGCTTAGGCACTGAATATAATGCTTTTGTAACGTCTATTCAAAATAGAGCGGATTCTCCTTCTTTAGAAG
ATGTTCGCAGCCTTCTCTTGGCTTATGAAGAACGGTTGGAAAAACAAAATATTGTGGATCAGCTTAATGTTGCCCAAGCCAATCTCTCAAATCTTTCCCTTCAACACAAT
AACAAACGTCATTCTCCAAAGTCTTTTTTTCTTAATCAGTCAAAAACATCATTCCCTCACCCTGTTTCCGCTGCACAAATTCCTCCCAGTATTTTAGATGAAGGTTGGTT
TTTGGACTCTGGAGCTACTCACCATATGACTCCAGATTTTTCAATTTTCTGTAACCGAATACCTTATAATGGTGGTGAACAAGTCACAGTGGGCAATGATAACCAAGCGT
TTGTTGAGTTTTACTCTTCTTTTTTCCTTGTTAAGGATCTTCGGTCCAAGACGATTCTTCTTCGGGGATCGCTTGATGATGGTTTATATAGCGTCACTTCTAGCTTAACA
AGTTCTTCTGTTGGCTCCACTCCTTCAGCTACTGCTGCCTTTTTATCTTCTGTTCAATCGTCTCCAAATTGGCATTTGCGTCTTGGACACCCTGCTGTTTCAGTCTTGCA
GCAACTCTCTCCTTCCCTTTCTCTCCTATTTTGGGCCAACATTATTGCCACTAACGCCACCACCAACCCACCACCCACCGTCATTGTCGCTGCATTTTCTCCATCATCGG
TGACAGAGAAGGGAAAGGAAGGTCTCCCTCTCTCCTTCGAGAGCTCCTCCTCCCGTCGCGACTACGAAGGGGCATAG
Protein sequence
MFPTLPQSLSVKLTDTNFLLWKDQLLNAVIANGLRGFLDGTILPPPQILDQQSLHLNPDYSLWERYNRILMCWIYSSLSEQKMGEIISLESASEIWSSLNRAYDFETTAR
IMGLKAQLQSIKKDGLSVSQYLSQIKEVADKFAAIGEPISYRDQLAHILDGLGTEYNAFVTSIQNRADSPSLEDVRSLLLAYEERLEKQNIVDQLNVAQANLSNLSLQHN
NKRHSPKSFFLNQSKTSFPHPVSAAQIPPSILDEGWFLDSGATHHMTPDFSIFCNRIPYNGGEQVTVGNDNQAFVEFYSSFFLVKDLRSKTILLRGSLDDGLYSVTSSLT
SSSVGSTPSATAAFLSSVQSSPNWHLRLGHPAVSVLQQLSPSLSLLFWANIIATNATTNPPPTVIVAAFSPSSVTEKGKEGLPLSFESSSSRRDYEGA
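
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies and that the CDS is in frame from its first base; the helper name `translate` is hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; a single trailing stop codon is dropped."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    # Internal stops (if any) are kept as '*' so they remain visible.
    return protein[:-1] if protein.endswith("*") else protein


if __name__ == "__main__":
    # First seven codons of the CDS listed on this page.
    print(translate("ATGTTTCCGACTTTACCTCAG"))
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence is a quick way to confirm that the two records agree.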