CuGenDBv2

Moc01g00700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g00700
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Integrase catalytic domain-containing protein
Genome location: chr1:463156..464424
RNA-Seq Expression: Moc01g00700
Synteny: Moc01g00700
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAB2610253.1 hypothetical protein D8674_018285 [Pyrus ussuriensis x Pyrus communis] (e value: 1.5e-76; % identity: 40.19)
Query:  VCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGGEF
        VC   L GK  KLPF  S+  +  P ++VHSD+WGPAP  SI+GFK+YV+ +D+ ++F W++P+I KSD    F  F   V+    + IKI +SDGGGE+
Subjt:  VCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGGEF

Query:  VNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRSN-YSFLKTFGGAC
        +NH L ++L+  G++H  SC YTP+QNG+ ERKH H++E  + L+  A LP  FW F    A ++INR+P+  L +KSPF+LLF  S   + L+ FG +C
Subjt:  VNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRSN-YSFLKTFGGAC

Query:  YPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPL-AISSSATHSVTPSSSSPSTSNNFPLSILLSSPVP-LLNEVPLN
        +PLLKPY  +KLQPKTT+C FLGY+S  KG+I Y +    +YISRHV FDES+FP  ++ + +T       SSP++S  +P  + +  P P L+   P++
Subjt:  YPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPL-AISSSATHSVTPSSSSPSTSNNFPLSILLSSPVP-LLNEVPLN

Query:  D--LPTSSDTSSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNPPAPPLVQNARSMQTRGKSGIFKRKVFVAPTISSSQV
            P +   +SS  S+ V +  S+++H   E +                        P+   P  P    N   MQTR K+GI K+KVF++    SS V
Subjt:  D--LPTSSDTSSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNPPAPPLVQNARSMQTRGKSGIFKRKVFVAPTISSSQV

Query:  -----EPFSFSKASKLPV
             EP ++  A ++PV
Subjt:  -----EPFSFSKASKLPV

KAB2617916.1 hypothetical protein D8674_013785 [Pyrus ussuriensis x Pyrus communis] (e value: 3.1e-77; % identity: 41.93)
Query:  PC--NSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKI
        PC  +S   +C   L GK  KLPF  S + A  P ++VHSD+WGPAP  S +GF+YYV+F+D+ + F WL+P+I KSD+   F  F   V N     +++
Subjt:  PC--NSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKI

Query:  FRSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLF-HRSNYS
         +SDGGGE+++H    +L+  GI HQ SC YTP+QNG+ ERKH H+VE  + L+  A LP  FW F   TA +IINR+PS++L NKSPF+LLF      S
Subjt:  FRSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLF-HRSNYS

Query:  FLKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFPLSILLSSPVPL
         L+ FG  CYPLLKPY   KLQPKTT+C FLGY+S  KGYI Y + ++  YISRHV FDE+ FP +   +  +S+   S+  ST+  F      + P+P+
Subjt:  FLKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFPLSILLSSPVPL

Query:  LNEVPLNDLPT-SSDTSSSSTSVPVP---SQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNP-------------------PAPPLVQN
         N   LN+L T  S +SS ++S+PV    S  S VV  +  S     +  SL     V ++ +  S+ +  N                    P PPL  N
Subjt:  LNEVPLNDLPT-SSDTSSSSTSVPVP---SQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNP-------------------PAPPLVQN

Query:  ARSMQTRGKSGIFKRKVFV-----APTISSSQVEPFSFSKASKLPV
           MQTR KSGI K+K  +     +P +  S VEP ++  A K+PV
Subjt:  ARSMQTRGKSGIFKRKVFV-----APTISSSQVEPFSFSKASKLPV

PRQ55598.1 putative RNA-directed DNA polymerase [Rosa chinensis] (e value: 5.3e-77; % identity: 40.72)
Query:  MNISPCNSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRI
        + +    S   +C + + GK H+LPFS SSSI S PL L+H+DVWGPA  +S  G +Y++S VDD SKF W+ P+ +KSDVP  F +FK  VENLL + I
Subjt:  MNISPCNSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRI

Query:  KIFRSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-N
        K  RSDGGGEF ++S   YL  HGI HQ SC +TPQQNGVVERKH H++E+A  +++++ LP  +W     TA F INRLP  ++ +KSPF++L++++ +
Subjt:  KIFRSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-N

Query:  YSFLKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFPLSILLSSPV
        Y FLK FG  C+P L+PY  +K  P+++ C F+GYS D KGY      +  +Y SRHV FDE+ FP       TH+   SS   ST ++ P+ +   SP 
Subjt:  YSFLKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFPLSILLSSPV

Query:  PLLNEVPLNDLPTSSDTSSSS-TSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNPPAPPLVQNAR-------------SMQTR
                  +PT+S  SS S  S+P P   S   H  + S  +  S  S P+  +  +      +P+   P  PPL   +R             +M TR
Subjt:  PLLNEVPLNDLPTSSDTSSSS-TSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNPPAPPLVQNAR-------------SMQTR

Query:  GKSGIFKRKVFVAPT---------ISSSQVEPFSFSKASKLP
         K+G  K KVF A            +S  + P S+S+ASK P
Subjt:  GKSGIFKRKVFVAPT---------ISSSQVEPFSFSKASKLP

TQD88914.1 hypothetical protein C1H46_025506 [Malus baccata] (e value: 2.6e-76; % identity: 41.75)
Query:  SPCNSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIF
        S CNS    CT  L GK  KLPF + +S +  PL+++H+DVWGP+P  SI G+ YYVSF+D+ +++TW++P+  K+ V  IF  F   ++N   + +K+ 
Subjt:  SPCNSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIF

Query:  RSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHR-SNYSF
        +SDGGGE+V+    ++L T GI+HQKSC YTP+QNG+ ERK+ H+VE A+ L+ KA L   FW     T+ +++NRLP+S L   SPF++L+        
Subjt:  RSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHR-SNYSF

Query:  LKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSN-NFPLSILLSSPVPL
        L+ FG ACYP LKPY  NKL PKTT C FLGY++  KGYI Y +    L +SRHV FDESVFP      +T+ ++ SS SP+ S+ + P+S+   +P   
Subjt:  LKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSN-NFPLSILLSSPVPL

Query:  LNEVPLNDLPTSSDTSSSSTSVPVPSQDSNVVHDVAESAG--VNESAGSLPIDGVVPN------SDIQH-SVPSCDNPPAPPLVQNARSMQTRGKSGIFK
         +  P   +P    +SS  +   V   D   + D+    G   + SA SL +    P+       D+QH SV S           N   MQTR KSGIFK
Subjt:  LNEVPLNDLPTSSDTSSSSTSVPVPSQDSNVVHDVAESAG--VNESAGSLPIDGVVPN------SDIQH-SVPSCDNPPAPPLVQNARSMQTRGKSGIFK

Query:  RKVFVAPTISSSQVEPFSFSKASK
        +KVF A  +     EP SFS A++
Subjt:  RKVFVAPTISSSQVEPFSFSKASK

TQD95848.1 hypothetical protein C1H46_018590 [Malus baccata] (e value: 3.7e-78; % identity: 41.42)
Query:  VCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGGEF
        VC   L GK  KLPFS S   +  P D+VHSDVWGPAP  S+ GFKYYV+F++  +KF W++PI  KSDV S F  F   + N   + IKI  SDGGGE+
Subjt:  VCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGGEF

Query:  VNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFH-RSNYSFLKTFGGAC
        +  +   +L   GI HQ SCL+TP+QNG+ ERKH H++E ++ L+  A LP  FW F   T+V++INR+PS +LGNKSPF+L+++       LK FG +C
Subjt:  VNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFH-RSNYSFLKTFGGAC

Query:  YPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFPLSILLSSPVPLLNE---VPL
        YP L+PY   KL P+TT+C FLGY+S  KG++ Y      +Y+SRHV FDE+ FP        HS+  S+S PS+ +  P  I +  PVP  N    VP 
Subjt:  YPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFPLSILLSSPVPLLNE---VPL

Query:  NDLPTSSDTSSSS---TSVPVPS--QDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSV--PSCDNPPAPPLVQ-------------NARSMQTRGK
        +  P++S +SS S    SVP+ S   ++ + H   +S+ V  +  S   DG +    I  S   P     P  P  Q             N   MQT  K
Subjt:  NDLPTSSDTSSSS---TSVPVPS--QDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSV--PSCDNPPAPPLVQ-------------NARSMQTRGK

Query:  SGIFKRKVFVAPT-----ISSSQVEPFSFSKASKLPV
        SGI K+   +A       +  +QVEP ++  A K PV
Subjt:  SGIFKRKVFVAPT-----ISSSQVEPFSFSKASKLPV

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EFT0 Uncharacterized protein (e value: 6.5e-89; % identity: 45.02)
Query:  MNISPCNSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRI
        ++++    +   CTH + GK+H+ PF  SS  A+ PL+LVHSDVWGPAP TSING ++YVSFVD  ++FTWL+PI  KS V + FQ F   +EN+L +RI
Subjt:  MNISPCNSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRI

Query:  KIFRSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-N
        K+ R+D GGE+ N +  S+  T GILHQ SC +TPQQNGV ERKH H+VE AL L+S++ LP+ +WP+ F+TA+++INR+P+ +L   SP++LLFH + +
Subjt:  KIFRSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-N

Query:  YSFLKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFPLSILLSSPV
        YSFLKTFG  C+PLL+PY  +KL+P+++ C FLGY+ ++KGY+  +L+T  L ISRHVAF E+ FP         S T  SS  + SN +  S+L   P 
Subjt:  YSFLKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFPLSILLSSPV

Query:  PLLNEV-PLNDLPTSSDTSSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNPPAPPLVQ-NARSMQTRGKSGIFKRKVFV
           + + P   LP  S ++  S+S+P          DV+      ++    P+     +  I   VPSC  P  P L   N+  MQTRGKSGI KRK+ +
Subjt:  PLLNEV-PLNDLPTSSDTSSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNPPAPPLVQ-NARSMQTRGKSGIFKRKVFV

Query:  -APTISSSQVEPFSFSKASKLP
           T++  + EP S+  ASK P
Subjt:  -APTISSSQVEPFSFSKASKLP

A0A2N9FMC6 Integrase catalytic domain-containing protein (e value: 6.5e-89; % identity: 45.02)
Query:  MNISPCNSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRI
        ++++    +   CTH + GK+H+ PF  SS  A+ PL+LVHSDVWGPAP TSING ++YVSFVD  ++FTWL+PI  KS V + FQ F   +EN+L +RI
Subjt:  MNISPCNSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRI

Query:  KIFRSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-N
        K+ R+D GGE+ N +  S+  T GILHQ SC +TPQQNGV ERKH H+VE AL L+S++ LP+ +WP+ F+TA+++INR+P+ +L   SP++LLFH + +
Subjt:  KIFRSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-N

Query:  YSFLKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFPLSILLSSPV
        YSFLKTFG  C+PLL+PY  +KL+P+++ C FLGY+ ++KGY+  +L+T  L ISRHVAF E+ FP         S T  SS  + SN +  S+L   P 
Subjt:  YSFLKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFPLSILLSSPV

Query:  PLLNEV-PLNDLPTSSDTSSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNPPAPPLVQ-NARSMQTRGKSGIFKRKVFV
           + + P   LP  S ++  S+S+P          DV+      ++    P+     +  I   VPSC  P  P L   N+  MQTRGKSGI KRK+ +
Subjt:  PLLNEV-PLNDLPTSSDTSSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNPPAPPLVQ-NARSMQTRGKSGIFKRKVFV

Query:  -APTISSSQVEPFSFSKASKLP
           T++  + EP S+  ASK P
Subjt:  -APTISSSQVEPFSFSKASKLP

A0A2N9GG32 Integrase catalytic domain-containing protein (e value: 9.7e-85; % identity: 44.52)
Query:  NISPCNSA-----KCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLL
        ++SPC SA     +  C H L GKMHKLPF  S   ++ PL+LVHSDVWGPAP  S NG++YY+ FVDD S+F+WLY +  KSDV S F+ F+  VENLL
Subjt:  NISPCNSA-----KCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLL

Query:  LSRIKIFRSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFH
          +IKI R+D GGE+ +++  ++  +HGI H  SC +TPQQNG+VERKH H+VE AL L+S A L I  + +   T V +INRLP+  L +K+P++LLFH
Subjt:  LSRIKIFRSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFH

Query:  R-SNYSFLKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFP-LSIL
        +  + + LKTFG  C+PLL+PY  +KLQP++T C FLGY S SKGYI        +YISRHV F+E+ F   +S  ++HS    S   ST +  P LS+ 
Subjt:  R-SNYSFLKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFP-LSIL

Query:  LSSPVPLLNEVPLND---LPTSSDTSSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLP--IDGVVPNSDIQHSVPSCDNPPAPPLVQNARSMQTRGKSG
          +  P LN  PL     +PTS D  SSS      S  +  +   +      ES    P  ++ ++P S    S+ S D    PP   N   M TR K+G
Subjt:  LSSPVPLLNEVPLND---LPTSSDTSSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLP--IDGVVPNSDIQHSVPSCDNPPAPPLVQNARSMQTRGKSG

Query:  IFKRKVFVAPTISSSQVEPFSFSKASKLP
        I+K K F   T+  +Q EP ++  ASK P
Subjt:  IFKRKVFVAPTISSSQVEPFSFSKASKLP

A0A2N9GRJ0 Uncharacterized protein (e value: 6.5e-89; % identity: 45.02)
Query:  MNISPCNSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRI
        ++++    +   CTH + GK+H+ PF  SS  A+ PL+LVHSDVWGPAP TSING ++YVSFVD  ++FTWL+PI  KS V + FQ F   +EN+L +RI
Subjt:  MNISPCNSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRI

Query:  KIFRSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-N
        K+ R+D GGE+ N +  S+  T GILHQ SC +TPQQNGV ERKH H+VE AL L+S++ LP+ +WP+ F+TA+++INR+P+ +L   SP++LLFH + +
Subjt:  KIFRSDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-N

Query:  YSFLKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFPLSILLSSPV
        YSFLKTFG  C+PLL+PY  +KL+P+++ C FLGY+ ++KGY+  +L+T  L ISRHVAF E+ FP         S T  SS  + SN +  S+L   P 
Subjt:  YSFLKTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFPLSILLSSPV

Query:  PLLNEV-PLNDLPTSSDTSSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNPPAPPLVQ-NARSMQTRGKSGIFKRKVFV
           + + P   LP  S ++  S+S+P          DV+      ++    P+     +  I   VPSC  P  P L   N+  MQTRGKSGI KRK+ +
Subjt:  PLLNEV-PLNDLPTSSDTSSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNPPAPPLVQ-NARSMQTRGKSGIFKRKVFV

Query:  -APTISSSQVEPFSFSKASKLP
           T++  + EP S+  ASK P
Subjt:  -APTISSSQVEPFSFSKASKLP

A0A2N9HKM9 Uncharacterized protein (e value: 4.4e-85; % identity: 45.73)
Query:  PCNSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFR
        P ++   +C H L GKMHKLPF  S SI S PL++VHSDVWGPAP TS N  +YYV+FVDD ++FTW +P+  KS V S F  FK  +ENLL  ++KI R
Subjt:  PCNSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFR

Query:  SDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-NYSFL
        +D GGE+  H   S+  + G+ HQ +C +T QQNGV ERKH H+V++ L LMS+A LP+ FWP+ F+TAVF+INRLPS   G  SP++ LF  S  YS  
Subjt:  SDGGGEFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-NYSFL

Query:  KTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPL-AISSSATHSVTPSSSSPST-----------SNNFPL
        ++FG ACYPLL+PY  +KL P++ QC FLGY S++KG++ +   +   ++SRHV FDESVFP   +SS+ + S  P+ S  S            S + P 
Subjt:  KTFGGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPL-AISSSATHSVTPSSSSPST-----------SNNFPL

Query:  SILLSSPVPLLN--EVPLNDLPTSSDTSSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNPPAP--PLVQNARSMQTRGK
        S+L + P P+LN   +P+  L  +S T  SS  V VPS  + V    + +A V  S+        VP+S       S   P AP  PLV NA  MQTRGK
Subjt:  SILLSSPVPLLN--EVPLNDLPTSSDTSSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNPPAP--PLVQNARSMQTRGK

Query:  SGIFKRKVFVAPTISSS--QVEPFSFSKASKLP
        SGI K+K  +    +      EP SFS A  +P
Subjt:  SGIFKRKVFVAPTISSS--QVEPFSFSKASKLP

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 1.2e-23; % identity: 30.07)
Query:  VCTHYLHGKMHKLPFS--LSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGG
        +C   L+GK  +LPF      +    PL +VHSDV GP    +++   Y+V FVD  + +   Y I  KSDV S+FQ F    E     ++     D G 
Subjt:  VCTHYLHGKMHKLPFS--LSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGG

Query:  EFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSL--GNKSPFKLLFHRSNY-SFLKTF
        E++++ +  +    GI +  +  +TPQ NGV ER    + E A  ++S A L   FW     TA ++INR+PS +L   +K+P+++  ++  Y   L+ F
Subjt:  EFVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSL--GNKSPFKLLFHRSNY-SFLKTF

Query:  GGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSAT--HSVTPSSSSPSTSNNFP
        G   Y  +K     K   K+ +  F+GY  +  G+  +    +   ++R V  DE+     ++S A    +V    S  S + NFP
Subjt:  GGACYPLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSAT--HSVTPSSSSPSTSNNFP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 6.3e-41; % identity: 37.14)
Query:  CTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGGEFV
        C + L GK H++ F  SS      LDLV+SDV GP    S+ G KY+V+F+DD S+  W+Y +  K  V  +FQ F  LVE     ++K  RSD GGE+ 
Subjt:  CTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGGEFV

Query:  NHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-NYSFLKTFGGACY
        +     Y  +HGI H+K+   TPQ NGV ER +  +VE   +++  A LP  FW     TA ++INR PS  L  + P ++  ++  +YS LK FG   +
Subjt:  NHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-NYSFLKTFGGACY

Query:  PLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLA--ISSSATHSVTPS-SSSPSTSNN
          +      KL  K+  C F+GY  +  GY  +    K +  SR V F ES    A  +S    + + P+  + PSTSNN
Subjt:  PLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLA--ISSSATHSVTPS-SSSPSTSNN

Q12490 Transposon Ty1-BL Gag-Pol polyprotein (e value: 9.5e-13; % identity: 23.43)
Query:  THYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPI--IRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGGEF
        T + H K  +L +  S      P   +H+D++GP      +   Y++SF D+ +KF W+YP+   R+  +  +F      ++N   + + + + D G E+
Subjt:  THYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPI--IRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGGEF

Query:  VNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRSNYSFLKTFGGACY
         N +L  +L+ +GI    +     + +GV ER +  +++     +  + LP   W      +  + N L S      +         + S L  FG    
Subjt:  VNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRSNYSFLKTFGGACY

Query:  PLLKPYILN------KLQPKTTQCSFLGYSSDSKGYIYY
           +P I+N      K+ P+      L  S +S GYI Y
Subjt:  PLLKPYILN------KLQPKTTQCSFLGYSSDSKGYIYY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 5.0e-62; % identity: 36.75)
Query:  CTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGGEFV
        C+  L  K +K+PFS S+  ++ PL+ ++SDVW  +P  S + ++YYV FVD  +++TWLYP+ +KS V   F  FK L+EN   +RI  F SD GGEFV
Subjt:  CTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGGEFV

Query:  NHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-NYSFLKTFGGACY
          +L  Y   HGI H  S  +TP+ NG+ ERKH H+VE  L L+S A +P  +WP+ F  AV++INRLP+  L  +SPF+ LF  S NY  L+ FG ACY
Subjt:  NHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRS-NYSFLKTFGGACY

Query:  PLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLA------------------------------------ISSSATHS
        P L+PY  +KL  K+ QC FLGYS     Y+  HL+T  LYISRHV FDE+ FP +                                      S   H+
Subjt:  PLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLA------------------------------------ISSSATHS

Query:  VTPSSS-------SPSTSNNFPLSILLSSP------VPLLNEVPLNDLPTSSDT---SSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSD
         TP SS       S  +S+N   S   S P       P  N       PT + T   SS +TS   P+ +S      + S     S+ S        +S 
Subjt:  VTPSSS-------SPSTSNNFPLSILLSSP------VPLLNEVPLNDLPTSSDT---SSSSTSVPVPSQDSNVVHDVAESAGVNESAGSLPIDGVVPNSD

Query:  IQHSVPSCDNPPAPPLVQ----------NARSMQTRGKSGIFKRKVFVAPTIS-SSQVEPFSFSKASK
           + PS    P PPL Q          N  SM TR K+GI K     +  +S +++ EP +  +A K
Subjt:  IQHSVPSCDNPPAPPLVQ----------NARSMQTRGKSGIFKRKVFVAPTIS-SSQVEPFSFSKASK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.1e-64; % identity: 36.86)
Query:  CTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGGEFV
        C+     K HK+PFS S+  +S PL+ ++SDVW  +P  SI+ ++YYV FVD  +++TWLYP+ +KS V   F +FK LVEN   +RI    SD GGEFV
Subjt:  CTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGGEFV

Query:  NHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHR-SNYSFLKTFGGACY
           L  YL  HGI H  S  +TP+ NG+ ERKH H+VE+ L L+S A +P  +WP+ F+ AV++INRLP+  L  +SPF+ LF +  NY  LK FG ACY
Subjt:  NHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHR-SNYSFLKTFGGACY

Query:  PLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISS--SATHSVTPSSSSPS--TSNNFPLSILL------------
        P L+PY  +KL+ K+ QC+F+GYS     Y+  H+ T  LY SRHV FDE  FP + ++   +T     S S+P+  +    P + L+            
Subjt:  PLLKPYILNKLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISS--SATHSVTPSSSSPS--TSNNFPLSILL------------

Query:  ------SSPVPL-LNEVPLNDLPTSSDTSSSSTSVPVPS-------------QDSNVVHDVAESAGVNESAGSLP-IDGVVPNSDIQH--------SVPS
              SSP PL   +V  ++LP+SS +S SS+    PS             Q+SN    +  +   N  + + P  +  +P S I          S+  
Subjt:  ------SSPVPL-LNEVPLNDLPTSSDTSSSSTSVPVPS-------------QDSNVVHDVAESAGVNESAGSLP-IDGVVPNSDIQH--------SVPS

Query:  CDNP--------------PAPPLVQ-------NARSMQTRGKSGIFK-RKVFVAPTISSSQVEPFSFSKASK
         ++P              PAPP++Q       N  SM TR K GI K  + +   T  ++  EP +  +A K
Subjt:  CDNP--------------PAPPLVQ-------NARSMQTRGKSGIFK-RKVFVAPTISSSQVEPFSFSKASK

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAATATTTCTCCTTGTAATTCTGCAAAATGTGTTTGTACACACTACTTACATGGCAAAATGCATAAATTGCCATTTTCGTTGTCTTCTTCTATTGCTTCTTTTCCTCT
TGATTTAGTACATAGTGATGTATGGGGCCCTGCTCCTCAAACTTCAATAAATGGTTTCAAATATTACGTATCCTTTGTTGATGATATGTCCAAATTCACTTGGCTTTACC
CTATTATACGAAAATCTGATGTACCTTCTATTTTTCAACTTTTCAAACCATTAGTTGAAAATCTCCTCTTATCTAGAATCAAAATCTTTCGAAGTGATGGCGGTGGTGAG
TTTGTCAATCACTCTCTTGGGTCTTATCTTCAAACACATGGTATTCTTCATCAAAAATCGTGTCTTTACACTCCTCAACAAAACGGTGTTGTTGAGCGTAAGCACTGCCA
TGTTGTTGAAGTTGCCCTGAATCTCATGTCCAAAGCCTTCCTTCCTATTCCGTTTTGGCCTTTTACTTTCAATACTGCTGTCTTTATTATAAATCGCTTACCATCCTCGT
CTCTTGGAAATAAATCTCCCTTTAAACTTTTGTTTCACCGTTCCAATTATTCGTTTCTCAAGACCTTTGGGGGTGCATGTTATCCTTTGCTCAAACCATACATTTTAAAT
AAACTTCAACCCAAAACAACCCAATGTTCATTTCTTGGTTATTCTTCTGACTCCAAAGGATATATATACTATCACTTAGAAACCAAGTGTCTTTACATTTCCCGTCATGT
AGCATTTGATGAATCCGTGTTCCCACTTGCTATTTCCTCTTCTGCTACTCATTCTGTTACTCCGTCTTCCTCTTCTCCTTCTACATCAAACAACTTTCCTTTGTCCATTC
TTCTTTCATCCCCTGTACCTCTACTAAATGAAGTTCCTTTAAACGATTTGCCCACTTCTTCTGATACTTCTTCTTCATCAACATCTGTCCCAGTACCGTCCCAAGATTCA
AATGTTGTGCATGATGTTGCTGAATCTGCTGGGGTAAATGAATCTGCTGGTTCCTTACCAATTGATGGTGTTGTTCCAAATTCTGATATACAACATTCTGTTCCATCATG
TGATAACCCACCTGCTCCTCCTCTAGTTCAGAATGCTCGTTCTATGCAAACCAGAGGGAAGTCTGGGATTTTTAAAAGGAAGGTATTTGTTGCCCCTACCATCTCCTCTA
GTCAGGTTGAGCCATTTTCCTTTTCCAAGGCTTCAAAGCTCCCTGTTTGTTTGGTGTGA
mRNA sequence
ATGAATATTTCTCCTTGTAATTCTGCAAAATGTGTTTGTACACACTACTTACATGGCAAAATGCATAAATTGCCATTTTCGTTGTCTTCTTCTATTGCTTCTTTTCCTCT
TGATTTAGTACATAGTGATGTATGGGGCCCTGCTCCTCAAACTTCAATAAATGGTTTCAAATATTACGTATCCTTTGTTGATGATATGTCCAAATTCACTTGGCTTTACC
CTATTATACGAAAATCTGATGTACCTTCTATTTTTCAACTTTTCAAACCATTAGTTGAAAATCTCCTCTTATCTAGAATCAAAATCTTTCGAAGTGATGGCGGTGGTGAG
TTTGTCAATCACTCTCTTGGGTCTTATCTTCAAACACATGGTATTCTTCATCAAAAATCGTGTCTTTACACTCCTCAACAAAACGGTGTTGTTGAGCGTAAGCACTGCCA
TGTTGTTGAAGTTGCCCTGAATCTCATGTCCAAAGCCTTCCTTCCTATTCCGTTTTGGCCTTTTACTTTCAATACTGCTGTCTTTATTATAAATCGCTTACCATCCTCGT
CTCTTGGAAATAAATCTCCCTTTAAACTTTTGTTTCACCGTTCCAATTATTCGTTTCTCAAGACCTTTGGGGGTGCATGTTATCCTTTGCTCAAACCATACATTTTAAAT
AAACTTCAACCCAAAACAACCCAATGTTCATTTCTTGGTTATTCTTCTGACTCCAAAGGATATATATACTATCACTTAGAAACCAAGTGTCTTTACATTTCCCGTCATGT
AGCATTTGATGAATCCGTGTTCCCACTTGCTATTTCCTCTTCTGCTACTCATTCTGTTACTCCGTCTTCCTCTTCTCCTTCTACATCAAACAACTTTCCTTTGTCCATTC
TTCTTTCATCCCCTGTACCTCTACTAAATGAAGTTCCTTTAAACGATTTGCCCACTTCTTCTGATACTTCTTCTTCATCAACATCTGTCCCAGTACCGTCCCAAGATTCA
AATGTTGTGCATGATGTTGCTGAATCTGCTGGGGTAAATGAATCTGCTGGTTCCTTACCAATTGATGGTGTTGTTCCAAATTCTGATATACAACATTCTGTTCCATCATG
TGATAACCCACCTGCTCCTCCTCTAGTTCAGAATGCTCGTTCTATGCAAACCAGAGGGAAGTCTGGGATTTTTAAAAGGAAGGTATTTGTTGCCCCTACCATCTCCTCTA
GTCAGGTTGAGCCATTTTCCTTTTCCAAGGCTTCAAAGCTCCCTGTTTGTTTGGTGTGA
Protein sequence
MNISPCNSAKCVCTHYLHGKMHKLPFSLSSSIASFPLDLVHSDVWGPAPQTSINGFKYYVSFVDDMSKFTWLYPIIRKSDVPSIFQLFKPLVENLLLSRIKIFRSDGGGE
FVNHSLGSYLQTHGILHQKSCLYTPQQNGVVERKHCHVVEVALNLMSKAFLPIPFWPFTFNTAVFIINRLPSSSLGNKSPFKLLFHRSNYSFLKTFGGACYPLLKPYILN
KLQPKTTQCSFLGYSSDSKGYIYYHLETKCLYISRHVAFDESVFPLAISSSATHSVTPSSSSPSTSNNFPLSILLSSPVPLLNEVPLNDLPTSSDTSSSSTSVPVPSQDS
NVVHDVAESAGVNESAGSLPIDGVVPNSDIQHSVPSCDNPPAPPLVQNARSMQTRGKSGIFKRKVFVAPTISSSQVEPFSFSKASKLPVCLV