; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g20660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g20660
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTransposon Tf2-2 polyprotein
Genome locationchr6:16080968..16088114
RNA-Seq ExpressionMoc06g20660
SyntenyMoc06g20660
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6478499.1 hypothetical protein ZIOFF_061942 [Zingiber officinale]3.8e-6135.62Show/hide
Query:  ANARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDAL
        ANA R +I TW+ +KKE+ D+FL  NT W+ R+  K+ K SG+VRDY+K+FS L+LD++NM EEDKL+NFL GLQPW   EL R++V++LP+AIAA DAL
Subjt:  ANARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDAL

Query:  VDFAF----MNESSSSKKKKEKNTKMFCKFK----KKSKKEKAKRKAT----------------------------------------------------
        VD       +N SSSSK      +  FCK K    KK  K+ AK+K +                                                    
Subjt:  VDFAF----MNESSSSKKKKEKNTKMFCKFK----KKSKKEKAKRKAT----------------------------------------------------

Query:  ----PDAAHRVNLLRLLNAMVTSQKY------------LPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAAQAKVSL
             + A  VN L+LLN +V  +                 +L++V V+ NG+    M+DT AT+ F+S K++   GL + +    +K+VN+ AQA V +
Subjt:  ----PDAAHRVNLLRLLNAMVTSQKY------------LPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAAQAKVSL

Query:  -----------------------------------------MPFVNEIFIWNEKSPCFV---------------HVVSA-GLVKSVRES-----------
                                                 MP ++ + +  E +PCFV               +++SA  L K +R             
Subjt:  -----------------------------------------MPFVNEIFIWNEKSPCFV---------------HVVSA-GLVKSVRES-----------

Query:  ------SLPNSLKQV-------------SDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDNVANTYFK
               LP + + V             +DASD+AIG VLVQEG+P+AFES KL  AEQ YS HEKEM+ +VH L  WR Y LGTKF+V TDNVANT+F 
Subjt:  ------SLPNSLKQV-------------SDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDNVANTYFK

Query:  TQKKLTPKQAR
        TQKKL+PKQAR
Subjt:  TQKKLTPKQAR

KAG6478499.1 hypothetical protein ZIOFF_061942 [Zingiber officinale]6.2e-0356.41Show/hide
Query:  VDKFSKYVVFMATPYACSMDVAIELFFKNIVNYFGLLED
        VD+FSKY +F+A P AC  DVA ELF++++V +FGL  D
Subjt:  VDKFSKYVVFMATPYACSMDVAIELFFKNIVNYFGLLED

KAG6478499.1 hypothetical protein ZIOFF_061942 [Zingiber officinale]3.2e-6035.26Show/hide
Query:  ANARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDAL
        ANA R +I TW+ +KKE+ D+FLP NT W+ R+  K+ K SG+VRDY+K+FS LMLD++NM EEDKL+NFL GLQPW   EL R++V++LP+AIAA DAL
Subjt:  ANARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDAL

Query:  VDFAF----MNESSSSKK---KKEKNTKMFCKFKKKSKKE---------KAKRKA------------------------------------------TPD
        VD       +N SSSSK    +K+K      + KK +KK+         KA+  A                                            +
Subjt:  VDFAF----MNESSSSKK---KKEKNTKMFCKFKKKSKKE---------KAKRKA------------------------------------------TPD

Query:  AAHRVNLLRLLNAMVTSQKY------------LPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAAQAKVSL------
         A  VN L+LLN +V  +                 +L++V V+ NG+    M+DT AT+ F+S K++   GL V +    +K+VN+ AQA V +      
Subjt:  AAHRVNLLRLLNAMVTSQKY------------LPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAAQAKVSL------

Query:  -----------------------------------MPFVNEIFIWNEKSPCFV---------------HVVSA-GLVKSVRES-----------------
                                           MP ++ + +  E +PCFV               +++SA  L K +R                   
Subjt:  -----------------------------------MPFVNEIFIWNEKSPCFV---------------HVVSA-GLVKSVRES-----------------

Query:  SLPNSLKQV---------------------------SDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTD
         LP  ++ +                           +DASD+AIG VLVQEG+P+AFES KL  AEQ YS HEKEM+ +VH L  WR Y LGTKF+V TD
Subjt:  SLPNSLKQV---------------------------SDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTD

Query:  NVANTYFKTQKKLTPKQAR
        NVANT+F TQKKL+PKQAR
Subjt:  NVANTYFKTQKKLTPKQAR

KAG6489469.1 hypothetical protein ZIOFF_050738 [Zingiber officinale]6.2e-0356.41Show/hide
Query:  VDKFSKYVVFMATPYACSMDVAIELFFKNIVNYFGLLED
        VD+FSKY +F+A P AC  DVA ELF++++V +FGL  D
Subjt:  VDKFSKYVVFMATPYACSMDVAIELFFKNIVNYFGLLED

KAG6489469.1 hypothetical protein ZIOFF_050738 [Zingiber officinale]3.5e-5934.49Show/hide
Query:  ANARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDAL
        ANA R +I TW+ +KKE+ DKFLP NT W+ R+  K+ K SG+VRDY+K+FS LMLD++NM EEDKL+NF+ GLQPW   EL R++V++LP+AIAA DAL
Subjt:  ANARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDAL

Query:  VDFAF----MNESSSSKK---KKEKNTKMFCKFKKKSKKE---------KAKRKAT------------------------------------------PD
        VD       +N SSSSK    +K+K      + KK +KK+         KA+  A                                            +
Subjt:  VDFAF----MNESSSSKK---KKEKNTKMFCKFKKKSKKE---------KAKRKAT------------------------------------------PD

Query:  AAHRVNLLRLLNAMVTSQKY------------LPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAAQAKVSL------
         A  VN L+LLN +V  +                 +L++V V+ N +    M+DT AT+ F+S K++   GL V +    +K+VN+ AQA V +      
Subjt:  AAHRVNLLRLLNAMVTSQKY------------LPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAAQAKVSL------

Query:  -----------------------------------MPFVNEIFIWNEKSPCFVHVV-------SAGLVKSVRESSLPNSLKQVS----------------
                                           MP ++ + +  E +PCFV  V       + G    +   SL   L++                  
Subjt:  -----------------------------------MPFVNEIFIWNEKSPCFVHVV-------SAGLVKSVRESSLPNSLKQVS----------------

Query:  -------------------------------------DASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTD
                                             DASD+AIG VLVQEG+P+AFES KL  AEQ YS HEKEM+ +VH L  WR Y LGTKF+V TD
Subjt:  -------------------------------------DASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTD

Query:  NVANTYFKTQKKLTPKQAR
        N+ANT+F TQKKL+PKQAR
Subjt:  NVANTYFKTQKKLTPKQAR

KAG6506505.1 hypothetical protein ZIOFF_031829 [Zingiber officinale]6.2e-0356.41Show/hide
Query:  VDKFSKYVVFMATPYACSMDVAIELFFKNIVNYFGLLED
        VD+FSKY +F+A P AC  DVA ELF++++V +FGL  D
Subjt:  VDKFSKYVVFMATPYACSMDVAIELFFKNIVNYFGLLED

KAG6506505.1 hypothetical protein ZIOFF_031829 [Zingiber officinale]2.7e-5934.41Show/hide
Query:  ANARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDAL
        ANA R +I TW+ +KKE+ D+FLP NT W+ R+  K+ K SG+VRDY+K+FS LMLD++NM EEDKL+NFL GLQPW   EL R++V++LP+AIAA DAL
Subjt:  ANARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDAL

Query:  VDFAF----MNESSSSKK---KKEKNTKMFCKFKKKSKKE---------KAKRKAT------------------------------------------PD
        VD       +N SSSSK    +K+K      + KK +KK+         KA+  A                                            +
Subjt:  VDFAF----MNESSSSKK---KKEKNTKMFCKFKKKSKKE---------KAKRKAT------------------------------------------PD

Query:  AAHRVNLLRLLNAMVTSQKY------------LPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAAQAKVSL------
         A  VN L+LLN +V  +                 +L++V V+ NG+    M+DT AT+ F+S K++   GL + +    +K+VN+ AQA V +      
Subjt:  AAHRVNLLRLLNAMVTSQKY------------LPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAAQAKVSL------

Query:  -----------------------------------MPFVNEIFIWNEKSPCFV---------------HVVSA-GLVKSVRES-----------------
                                           MP ++ + +  E +PCFV               +++SA  L K +R                   
Subjt:  -----------------------------------MPFVNEIFIWNEKSPCFV---------------HVVSA-GLVKSVRES-----------------

Query:  SLPNSLKQV----------------------------------SDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGT
         LP  ++ +                                  +DASD+AIG VLVQEG+P+AFES KL  AEQ YS HEKEM+ +VH L  WR Y LGT
Subjt:  SLPNSLKQV----------------------------------SDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGT

Query:  KFVVVTDNVANTYFKTQKKLTPKQAR
        KF+V TDN+ANT+F TQKKL+PKQAR
Subjt:  KFVVVTDNVANTYFKTQKKLTPKQAR

KAG6517981.1 hypothetical protein ZIOFF_021381 [Zingiber officinale]6.2e-0356.41Show/hide
Query:  VDKFSKYVVFMATPYACSMDVAIELFFKNIVNYFGLLED
        VD+FSKY +F+A P AC  DVA ELF++++V +FGL  D
Subjt:  VDKFSKYVVFMATPYACSMDVAIELFFKNIVNYFGLLED

KAG6517981.1 hypothetical protein ZIOFF_021381 [Zingiber officinale]2.5e-5728.13Show/hide
Query:  RIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDALVDFAFM
        +I TW+ +KKE+ D+FLP NT W+ R+  K+ K S +VRDY+K+FS LMLD++NM EEDKL+NFL GLQPW   EL R++V++LP+AIAA DALVD    
Subjt:  RIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDALVDFAFM

Query:  NESSSSKKKKEKNTKMFCKFKKKSKKEKAKRKATPDAAHRVNLLRLLNAMVTSQKY------------LPKELMNVPVLANGQVVTTMLDTRATNNFLST
         ++ +   K+EK   +    K+   ++        +    VN L+LLN +V  +                 +L++V V+ NG+    M+DT AT+ F+S 
Subjt:  NESSSSKKKKEKNTKMFCKFKKKSKKEKAKRKATPDAAHRVNLLRLLNAMVTSQKY------------LPKELMNVPVLANGQVVTTMLDTRATNNFLST

Query:  KIMTNLGLEVCESGIQVKAVNSAAQAKVSL-----------------------------------------MPFVNEIFIWNEKSPCFV-----------
        K++   GL V +    +K+VN+ AQA VS+                                         MP ++ + +  E +PCFV           
Subjt:  KIMTNLGLEVCESGIQVKAVNSAAQAKVSL-----------------------------------------MPFVNEIFIWNEKSPCFV-----------

Query:  ----HVVSA-----GL--------------------------------------------------VKSVRESSLPNSLKQV------------------
            +++SA     GL                                                  V ++ E   P+++ ++                  
Subjt:  ----HVVSA-----GL--------------------------------------------------VKSVRESSLPNSLKQV------------------

Query:  ---------------------------------------------------SDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDT
                                                           +DASD+AIG VLVQEG+P+AFES KL  AEQ YS HEKEM+ +VH L  
Subjt:  ---------------------------------------------------SDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDT

Query:  WRHYFLGTKFVVVTDNVANTYFKTQKKLTPKQAR--------------------------------------------------------ASYGILP---
        WR Y LGTKF+V TDNVANT+F TQKKL+PKQAR                                                         +YG L    
Subjt:  WRHYFLGTKFVVVTDNVANTYFKTQKKLTPKQAR--------------------------------------------------------ASYGILP---

Query:  ---------IRDRPLICMGETGPCCPPAM--TPKTYSRNCLPQEYESSFNKVRLNTTLASHLARLNHLSIETLAIPSGLLAPHYVKFKNALCYF--RSGI
                 I D  L+  G    C    M    + Y + CL  + + +  K                  ++ L IP        + F +        S I
Subjt:  ---------IRDRPLICMGETGPCCPPAM--TPKTYSRNCLPQEYESSFNKVRLNTTLASHLARLNHLSIETLAIPSGLLAPHYVKFKNALCYF--RSGI

Query:  HIPSVDKFSKYVVFMATPYACSMDVAIELFFKNIVNYFGLLED
         +  VD+FSKY +F+A P ACS DVA ELF++++V +FGL  D
Subjt:  HIPSVDKFSKYVVFMATPYACSMDVAIELFFKNIVNYFGLLED

TrEMBL top hitse value%identityAlignment
A0A438IBH1 Retrovirus-related Pol polyprotein from transposon 17.64.1e-4530.73Show/hide
Query:  ANARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDAL
        A + R +I TWET+KKEL D+FLP NT WV RE  K+ +H+G+VR+Y+K+FS LMLD++NM EEDKLFNF+SGLQ W  TEL R+ V++LPA +AA D L
Subjt:  ANARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDAL

Query:  VDFAF------------------MNESSSSK--------------KKKEKNTKM-----------FC----KFKKKSKKEKA--------KRKATPDAAH
        VD+                     NE  + K              K  EK TK+            C    + K   K+EK         K  + PD   
Subjt:  VDFAF------------------MNESSSSK--------------KKKEKNTKM-----------FC----KFKKKSKKEKA--------KRKATPDAAH

Query:  RVNLLRLLNAMVTSQKYLPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAAQ--------------------------
        RVN L+LLN ++  +  + K LM+V  + NG  V  ++D+ AT+NF++TK +  LGL++ E   ++KAVNS AQ                          
Subjt:  RVNLLRLLNAMVTSQKYLPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAAQ--------------------------

Query:  ---------------AKVSLMPFVNEIFIWNEKSPCFV-------------------------------------------HVVSAGL-------VKSVR
                       AKV+L+P +  + +  EK PCFV                                           H +S GL       V+++ 
Subjt:  ---------------AKVSLMPFVNEIFIWNEKSPCFV-------------------------------------------HVVSAGL-------VKSVR

Query:  ESSLP--------------------------------------------------NSLKQV-------------------SDASDKAIGEVLVQEGYPIA
        E S+P                                                   SLK+                    +DASD+A+G VLVQEG+P  
Subjt:  ESSLP--------------------------------------------------NSLKQV-------------------SDASDKAIGEVLVQEGYPIA

Query:  FESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDNVANTYFKTQKKLTPKQAR
                                     WRHY LG+ F VVTDNVANT+FKTQKKL+ KQAR
Subjt:  FESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDNVANTYFKTQKKLTPKQAR

A0A6I9QL46 LOW QUALITY PROTEIN: uncharacterized protein LOC1050362882.6e-4724.94Show/hide
Query:  ANARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDAL
        A+A R +I+ WET+KKEL D+FLPCNT W+ RE+ KK +H G++R Y+K+FS LMLD+ NM +EDKLFNFLSGLQPW   EL R+ +K+LP+A+AAVD L
Subjt:  ANARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDAL

Query:  VDFAFMNE---------------------------------SSSSKKKKEKNTKMFC-------KFKKKSKKEKAKRKATPD------AAHRVNLLRLLN
        VDF                                       +S+++K + +T   C       + K   KKE+       D      +  RVN L+L+N
Subjt:  VDFAFMNE---------------------------------SSSSKKKKEKNTKMFC-------KFKKKSKKEKAKRKATPD------AAHRVNLLRLLN

Query:  AMVTSQKYLPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAA------------------------------------
        AM  +++ +P  L+ + V   G+ +  M DT AT+NF++ +    LGL+V +S  ++KAVNS A                                    
Subjt:  AMVTSQKYLPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAA------------------------------------

Query:  -----QAKVSLMPFVNEIFIWNEKSPCFV-----------------------------------------------------------------------
             QAK +L+P +  + + +E+ PCF+                                                                       
Subjt:  -----QAKVSLMPFVNEIFIWNEKSPCFV-----------------------------------------------------------------------

Query:  ----------------------------------------HVVSAGLVK-------------------------------------SVRESSLPNSL---
                                                 ++ AG+++                                       RE  L NSL   
Subjt:  ----------------------------------------HVVSAGLVK-------------------------------------SVRESSLPNSL---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------KQVSDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDNVANTYFKTQKKLTPKQARASYGIL
                K  +DASDKA+G VLVQEG+P+AFESWKLK+AEQ YS HEKEM+ +VH L TW+ Y LGT+FVV TDNVANT+F TQKKL+P+QAR    ++
Subjt:  --------KQVSDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDNVANTYFKTQKKLTPKQARASYGIL

Query:  ---------PIRDRPL------------------ICMGETGPCCPPAMTPKTYSRNCLPQEYES-------SFNKVRLNTTLASHLARLNHLSIETL--A
                 P R   +                  + +  T      A + K Y +  L Q+ +        +F + ++   + +++        + +   
Subjt:  ---------PIRDRPL------------------ICMGETGPCCPPAMTPKTYSRNCLPQEYES-------SFNKVRLNTTLASHLARLNHLSIETL--A

Query:  IPSGLLAPHYV---KFKNALCYFRSGIH--------IPSVDKFSKYVVFMATPYACSMDVAIELFFKNIVNYFGLLED
          +GLL P  +    + +    F SG          +  VD+FSKY VF+  P AC  + A +LF++N+V +FGL  D
Subjt:  IPSGLLAPHYV---KFKNALCYFRSGIH--------IPSVDKFSKYVVFMATPYACSMDVAIELFFKNIVNYFGLLED

A0A803LHP5 Uncharacterized protein1.2e-4934.4Show/hide
Query:  ARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDALVD
        AR+  I +W+ MK E+  +FLP NT WV R+  K  KH+ T+R+Y+K+F  LMLD+ NM EEDKLF F+SGL+PW  T+L R+ V++L AAI A +ALVD
Subjt:  ARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDALVD

Query:  F--AFMNESSSSKKK-----------KEKNTKMFCKFKKKS---KKEKAKRKAT---------------------------------------------P
        +  + + E+   KKK           K K  K   K  +KS   K+E +  K T                                              
Subjt:  F--AFMNESSSSKKK-----------KEKNTKMFCKFKKKS---KKEKAKRKAT---------------------------------------------P

Query:  DAAHRVNLLRLLNAMVTSQKYLPKE-LMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAA----------------------
        D A R+N ++++ A +  ++  P + LM   V  NG  + TM+DT AT++FL   ++  +GL+V  S   +  VNSAA                      
Subjt:  DAAHRVNLLRLLNAMVTSQKYLPKE-LMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQVKAVNSAA----------------------

Query:  -------------------QAKVSLMPFVNEIFIWNEKSPCFVH-------------VVSAGLVKSVRESSLPNSLKQVSDASDKAIGEV------LVQE
                           QAK+ + P +  + I +E  PCFV              ++SA  VK   +   P  L  + +     I EV      L++E
Subjt:  -------------------QAKVSLMPFVNEIFIWNEKSPCFVH-------------VVSAGLVKSVRESSLPNSLKQVSDASDKAIGEV------LVQE

Query:  GYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDNVANTYFKTQKKLTPKQAR
             FE  KLK+AE+ YS HEKEM+ +VH L TWRHY LGTKF V+TDNVANT+F++QK L+PKQAR
Subjt:  GYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDNVANTYFKTQKKLTPKQAR

A0A803N8Q4 Uncharacterized protein1.7e-4631.06Show/hide
Query:  MKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDALVDFAFMNESSSSK
        MK E   +FLP NT WV R+  K  KH+ T+R+Y+K+F  LMLD+ NM EEDKLF F+SGL+PW  TEL R+ V++L  AI A +ALVD+        ++
Subjt:  MKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDALVDFAFMNESSSSK

Query:  K--KKEKNTKMFCKFKKKSKKEKAKRKATPDAAHRVNLLRLLNAMVTSQKYLPKE-LMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQ
           K+++ + +  +     +  +   +   D A R+N ++++ A +  ++  P + LM   V  NG  +  M+DT AT+NFL   ++  +GL+V  S   
Subjt:  K--KKEKNTKMFCKFKKKSKKEKAKRKATPDAAHRVNLLRLLNAMVTSQKYLPKE-LMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLEVCESGIQ

Query:  VKAVNSAA-----------------------------------------QAKVSLMPFVNEIFIWNEKSPCFV-------------HVVSAGLVK-----
        +K VNSAA                                         QAK+ + P +  + I +E  PCFV              ++SA  VK     
Subjt:  VKAVNSAA-----------------------------------------QAKVSLMPFVNEIFIWNEKSPCFV-------------HVVSAGLVK-----

Query:  ----------------------------------------------------------------SVRESSLPNSLKQ-----------------------
                                                                        S + + L + LK+                       
Subjt:  ----------------------------------------------------------------SVRESSLPNSLKQ-----------------------

Query:  ---------------VSDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDNVANTYFKTQKKLTPKQAR
                        +DA  +AIG VLVQ+G+PIAFE  KLK+AE+ YS HEKEM+ +VH L TWRHY LGTKF V+TDNVANT+F++QK L+PKQAR
Subjt:  ---------------VSDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDNVANTYFKTQKKLTPKQAR

A0A803PL58 Uncharacterized protein4.4e-5536.41Show/hide
Query:  RWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDALVDFA
        R  I+TW+ +KKEL D+FL  NT W+  E+ KK KH+G+VR Y K++S L+LD++NM EEDK+FNFLSGL+P    EL R+ V +LP AI A +  VD+ 
Subjt:  RWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDVKNLPAAIAAVDALVDFA

Query:  F-MNESSSSKKKKEKNTKMFCKFKKKSKKEKAKRKATPDAAHRVNLLRLLNAMVTSQKYLPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLE
        + +N S   K                                                   K L+ V    NG+ VTTM+D+ ATNNF++ K  T L LE
Subjt:  F-MNESSSSKKKKEKNTKMFCKFKKKSKKEKAKRKATPDAAHRVNLLRLLNAMVTSQKYLPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIMTNLGLE

Query:  VCESGIQVKAVNS-----------------AAQAKVSLMPFVNEIFIWNEKSPCFVHVVSA------GLV--------KSVRESSLPNSLKQVS------
        V +S  ++KAVNS                 ++ AK  +   +  + I  E  PCFV  +S+      G+V         S+++ +     K++S      
Subjt:  VCESGIQVKAVNS-----------------AAQAKVSLMPFVNEIFIWNEKSPCFVHVVSA------GLV--------KSVRESSLPNSLKQVS------

Query:  ------------------------------------------DASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKF
                                                  DASD A+G VLVQ+G+PIAFES KL D EQ +S HEKEM V+VH LD WRH  LGTKF
Subjt:  ------------------------------------------DASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKF

Query:  VVVTDNVANTYFKTQKKLTPKQA
        +VVT+NV NTYF++ +KLTPKQA
Subjt:  VVVTDNVANTYFKTQKKLTPKQA

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.5e-0738.55Show/hide
Query:  SDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDNVANTYFKTQKKLTPKQAR
        +DASD A+G VL Q+G+P+++ S  L + E  YST EKE++ +V    T+RHY LG  F + +D+   ++    K    K  R
Subjt:  SDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDNVANTYFKTQKKLTPKQAR

P0CT34 Transposon Tf2-1 polyprotein8.9e-0538.89Show/hide
Query:  SDASDKAIGEVLVQEG-----YPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGT--KFVVVTDN
        +DASD A+G VL Q+      YP+ + S K+  A+  YS  +KEM+ ++  L  WRHY   T   F ++TD+
Subjt:  SDASDKAIGEVLVQEG-----YPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGT--KFVVVTDN

P0CT41 Transposon Tf2-12 polyprotein8.9e-0538.89Show/hide
Query:  SDASDKAIGEVLVQEG-----YPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGT--KFVVVTDN
        +DASD A+G VL Q+      YP+ + S K+  A+  YS  +KEM+ ++  L  WRHY   T   F ++TD+
Subjt:  SDASDKAIGEVLVQEG-----YPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGT--KFVVVTDN

P20825 Retrovirus-related Pol polyprotein from transposon 2978.6e-0846.15Show/hide
Query:  SDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDN
        +DAS+ A+G VL Q G+PI+F S  L D E  YS  EKE++ +V    T+RHY LG +F++ +D+
Subjt:  SDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGTKFVVVTDN

Q9UR07 Transposon Tf2-11 polyprotein8.9e-0538.89Show/hide
Query:  SDASDKAIGEVLVQEG-----YPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGT--KFVVVTDN
        +DASD A+G VL Q+      YP+ + S K+  A+  YS  +KEM+ ++  L  WRHY   T   F ++TD+
Subjt:  SDASDKAIGEVLVQEG-----YPIAFESWKLKDAEQCYSTHEKEMMVMVHYLDTWRHYFLGT--KFVVVTDN

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGTTAGTGAGTGGCTAGCGTCCCTAAACACGAAGGTGGGCATGGGGCCAGATAGATGGCTAAAGGCTAATGCGCGTAGATGGAGAATAAAGACATGGGAAACCAT
GAAGAAGGAGCTAACGGATAAGTTCCTACCATGTAACACTTTGTGGGTTACAAGGGAGGCATTCAAGAAGTTTAAGCATAGTGGAACTGTAAGAGATTATATTAAGAAGT
TTAGCTTGCTAATGCTAGACGTTCGTAACATGTTTGAAGAGGACAAGCTGTTCAACTTTCTCTCAGGTTTGCAACCATGGGAGAATACGGAGTTGATAAGGCGTGATGTG
AAGAACTTACCAGCAGCAATTGCAGCAGTCGATGCTTTGGTGGATTTTGCTTTCATGAATGAATCTTCTTCCTCTAAAAAGAAGAAGGAGAAAAACACCAAAATGTTCTG
CAAGTTCAAGAAGAAATCCAAGAAGGAGAAAGCTAAGAGGAAGGCTACACCAGATGCAGCTCATCGTGTCAATCTACTACGACTACTTAATGCCATGGTGACAAGTCAGA
AGTATCTGCCAAAGGAGTTGATGAATGTGCCCGTTCTCGCCAACGGACAAGTAGTGACGACAATGTTGGACACTAGAGCTACAAACAACTTTTTGTCAACCAAGATCATG
ACAAATTTGGGTTTGGAGGTTTGTGAGAGCGGAATCCAAGTTAAGGCTGTCAATTCGGCAGCCCAGGCGAAAGTGTCCTTGATGCCCTTTGTTAACGAGATTTTCATATG
GAATGAAAAGTCCCCTTGTTTCGTGCATGTTGTAAGTGCAGGTTTAGTGAAATCGGTAAGAGAGTCATCTCTGCCCAATAGCTTAAAGCAGGTCTCCGATGCATCTGACA
AAGCAATTGGTGAAGTGTTGGTTCAAGAAGGTTATCCGATAGCATTCGAGAGTTGGAAGTTGAAAGATGCCGAACAATGCTATTCCACGCATGAGAAGGAAATGATGGTC
ATGGTTCACTACCTTGATACATGGAGACATTATTTTCTGGGCACAAAATTCGTTGTGGTGACGGACAATGTGGCGAATACATACTTCAAAACTCAGAAGAAGTTGACCCC
GAAGCAGGCAAGAGCGAGTTATGGAATCCTTCCTATAAGGGATCGTCCTTTGATTTGTATGGGTGAGACTGGTCCGTGTTGCCCACCGGCAATGACGCCAAAAACTTATA
GCAGAAATTGCTTACCACAAGAGTACGAATCAAGTTTTAATAAAGTACGTCTCAACACTACCCTTGCTAGTCACCTGGCTAGGCTTAATCATTTGAGCATTGAGACGCTC
GCTATCCCCTCGGGTCTTCTAGCTCCACATTATGTTAAATTTAAGAACGCGTTATGCTACTTCCGCTCCGGGATTCACATCCCTTCAGTTGACAAGTTCTCAAAGTATGT
AGTGTTCATGGCAACACCATATGCTTGTTCAATGGATGTGGCAATTGAGTTGTTCTTCAAGAACATTGTGAACTACTTTGGACTACTTGAGGATAATCAACGATCAAGGT
GCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCGTTAGTGAGTGGCTAGCGTCCCTAAACACGAAGGTGGGCATGGGGCCAGATAGATGGCTAAAGGCTAATGCGCGTAGATGGAGAATAAAGACATGGGAAACCAT
GAAGAAGGAGCTAACGGATAAGTTCCTACCATGTAACACTTTGTGGGTTACAAGGGAGGCATTCAAGAAGTTTAAGCATAGTGGAACTGTAAGAGATTATATTAAGAAGT
TTAGCTTGCTAATGCTAGACGTTCGTAACATGTTTGAAGAGGACAAGCTGTTCAACTTTCTCTCAGGTTTGCAACCATGGGAGAATACGGAGTTGATAAGGCGTGATGTG
AAGAACTTACCAGCAGCAATTGCAGCAGTCGATGCTTTGGTGGATTTTGCTTTCATGAATGAATCTTCTTCCTCTAAAAAGAAGAAGGAGAAAAACACCAAAATGTTCTG
CAAGTTCAAGAAGAAATCCAAGAAGGAGAAAGCTAAGAGGAAGGCTACACCAGATGCAGCTCATCGTGTCAATCTACTACGACTACTTAATGCCATGGTGACAAGTCAGA
AGTATCTGCCAAAGGAGTTGATGAATGTGCCCGTTCTCGCCAACGGACAAGTAGTGACGACAATGTTGGACACTAGAGCTACAAACAACTTTTTGTCAACCAAGATCATG
ACAAATTTGGGTTTGGAGGTTTGTGAGAGCGGAATCCAAGTTAAGGCTGTCAATTCGGCAGCCCAGGCGAAAGTGTCCTTGATGCCCTTTGTTAACGAGATTTTCATATG
GAATGAAAAGTCCCCTTGTTTCGTGCATGTTGTAAGTGCAGGTTTAGTGAAATCGGTAAGAGAGTCATCTCTGCCCAATAGCTTAAAGCAGGTCTCCGATGCATCTGACA
AAGCAATTGGTGAAGTGTTGGTTCAAGAAGGTTATCCGATAGCATTCGAGAGTTGGAAGTTGAAAGATGCCGAACAATGCTATTCCACGCATGAGAAGGAAATGATGGTC
ATGGTTCACTACCTTGATACATGGAGACATTATTTTCTGGGCACAAAATTCGTTGTGGTGACGGACAATGTGGCGAATACATACTTCAAAACTCAGAAGAAGTTGACCCC
GAAGCAGGCAAGAGCGAGTTATGGAATCCTTCCTATAAGGGATCGTCCTTTGATTTGTATGGGTGAGACTGGTCCGTGTTGCCCACCGGCAATGACGCCAAAAACTTATA
GCAGAAATTGCTTACCACAAGAGTACGAATCAAGTTTTAATAAAGTACGTCTCAACACTACCCTTGCTAGTCACCTGGCTAGGCTTAATCATTTGAGCATTGAGACGCTC
GCTATCCCCTCGGGTCTTCTAGCTCCACATTATGTTAAATTTAAGAACGCGTTATGCTACTTCCGCTCCGGGATTCACATCCCTTCAGTTGACAAGTTCTCAAAGTATGT
AGTGTTCATGGCAACACCATATGCTTGTTCAATGGATGTGGCAATTGAGTTGTTCTTCAAGAACATTGTGAACTACTTTGGACTACTTGAGGATAATCAACGATCAAGGT
GCTAG
Protein sequenceShow/hide protein sequence
MGVSEWLASLNTKVGMGPDRWLKANARRWRIKTWETMKKELTDKFLPCNTLWVTREAFKKFKHSGTVRDYIKKFSLLMLDVRNMFEEDKLFNFLSGLQPWENTELIRRDV
KNLPAAIAAVDALVDFAFMNESSSSKKKKEKNTKMFCKFKKKSKKEKAKRKATPDAAHRVNLLRLLNAMVTSQKYLPKELMNVPVLANGQVVTTMLDTRATNNFLSTKIM
TNLGLEVCESGIQVKAVNSAAQAKVSLMPFVNEIFIWNEKSPCFVHVVSAGLVKSVRESSLPNSLKQVSDASDKAIGEVLVQEGYPIAFESWKLKDAEQCYSTHEKEMMV
MVHYLDTWRHYFLGTKFVVVTDNVANTYFKTQKKLTPKQARASYGILPIRDRPLICMGETGPCCPPAMTPKTYSRNCLPQEYESSFNKVRLNTTLASHLARLNHLSIETL
AIPSGLLAPHYVKFKNALCYFRSGIHIPSVDKFSKYVVFMATPYACSMDVAIELFFKNIVNYFGLLEDNQRSRC