; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0096301 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0096301
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationCMiso1.1chr04:10513110..10514297
RNA-Seq ExpressionCmc04g0096301
SyntenyCmc04g0096301
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBG99318.1 transposable element gene [Prunus dulcis]3.4e-7842.71Show/hide
Query:  CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEFC
        C +C   K +KLPF  S S +  PLEL+HSDVWGP  + S +GN+YYV F+D+F+K+ W++P+ YKSDV  I + F   +EN +  K+K  +SDSGGEF 
Subjt:  CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEFC

Query:  NTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCACY
        +++ Q F   +GI HQ SCP+TPEQN  AERKHRHIVE A +L+  S VP +FW  AF TA++LINR+  +S + LSPFE+LF    +   LK+FGC C+
Subjt:  NTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCACY

Query:  PLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANSTNSSPSQSLSISDPSILLTLLNIPQHIPLNSVLIPNIQSAPI
        P L+PY+  KL PK+   VF+GY     GY C +P+T +  VSRHV FHE+IFP+  + +SS   S+S               H P+  +  P + S+P 
Subjt:  PLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANSTNSSPSQSLSISDPSILLTLLNIPQHIPLNSVLIPNIQSAPI

Query:  RPPTVDITCSSFMDTNATCQSLNVLSTVDTGNEINATTINLPDPP-PPRNTHVMQTRAKSGIFKPKAFHIT----------TAL--PTPTSYTKASKY
        + P   ++ S             +   V + +  + +++++P    P +NTH M TR+K GIFKPKA   T          TAL  PTPT+Y +A+K+
Subjt:  RPPTVDITCSSFMDTNATCQSLNVLSTVDTGNEINATTINLPDPP-PPRNTHVMQTRAKSGIFKPKAFHIT----------TAL--PTPTSYTKASKY

KAB2617916.1 hypothetical protein D8674_013785 [Pyrus ussuriensis x Pyrus communis]2.6e-7841.23Show/hide
Query:  CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEFC
        C SCL+ K +KLPF  S++ ++ P E+VHSD+WGP   IS++G RYYV+F+D  + F WLFP+  KSD+      F   + NQ    L+  QSD GGE+ 
Subjt:  CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEFC

Query:  NTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCACY
        +   Q F   KGI HQ SCPYTPEQN +AERKHRH+VET ++L+ ++ +P  FW +   TA ++INRM S++L+  SPFE+LFG    + HL+VFGC CY
Subjt:  NTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCACY

Query:  PLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYA------NSTNSSPSQSLSISD-------PSILLTLLNIPQHIPL
        PLLKPY   KL+PKTT+ VF+GY   +KGYICY  L+++  +SRHVIF ET FPY+      NS +   +QS + +         + L  L+  P H   
Subjt:  PLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYA------NSTNSSPSQSLSISD-------PSILLTLLNIPQHIPL

Query:  NSVLIPNIQSAPIRPPTV-----------DITCSSFMDTNATCQSL----NVLSTVDTGNEINATTINLPDPPPPRNTHVMQTRAKSGIFKPKAFHIT--
        ++  IP   S     P V            +  S  + +N   QS+    N    +    + +  ++ +  P PP N H MQTR+KSGI K KA   T  
Subjt:  NSVLIPNIQSAPIRPPTV-----------DITCSSFMDTNATCQSL----NVLSTVDTGNEINATTINLPDPPPPRNTHVMQTRAKSGIFKPKAFHIT--

Query:  ------TALPTPTSYTKASKYP
               +L  P +Y  A K P
Subjt:  ------TALPTPTSYTKASKYP

KAG5532217.1 hypothetical protein RHGRI_026743 [Rhododendron griersonianum]3.8e-7742.71Show/hide
Query:  NCISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEF
        +C SC +AK  KLPFP+S++TS  P +L+H DVWGP  + S  G RYY+  +D+FS+++WLFP+ YKS+V +++ +F   ++ Q    +K  +SD+GGEF
Subjt:  NCISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEF

Query:  CNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCAC
         N  L   F   G+ HQ SCP+TPEQN I ERKHRH++ET ++L+ H+ +P  FW  A  TAVFL NR+  SSL+   P+ +LF  T D   LK FGCAC
Subjt:  CNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCAC

Query:  YPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPY----ANSTNSSPSQSLSISDPSILLTLLNIPQHIPLNSVLIPNI
        +P LKPY  HKL PK+   VF+GY    KGY CY+P+  +  VSRHV F ET FP+    A + +SS +   S S P  L +  +I   +PL + +  +I
Subjt:  YPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPY----ANSTNSSPSQSLSISDPSILLTLLNIPQHIPLNSVLIPNI

Query:  QSAPIRPPTVDITCSSFMDTNATCQSLNVLSTVDTGNEINATTINLPDPPPPRNTHVMQTRAKSGIFKPK--AFHITTALPT--PTSYTKA
         S+ + PPT            +  Q  +  S+ D   E    T+++P P P    H M TR+K+GIFKPK      T+  PT  P S+++A
Subjt:  QSAPIRPPTVDITCSSFMDTNATCQSLNVLSTVDTGNEINATTINLPDPPPPRNTHVMQTRAKSGIFKPK--AFHITTALPT--PTSYTKA

TQD88914.1 hypothetical protein C1H46_025506 [Malus baccata]1.7e-7742.96Show/hide
Query:  CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEFC
        C +CL+ K TKLPFP+  S S  PLE++H+DVWGP    S  G  YYVSF+D  +++TW+FP+  K+ V  I  +F   I+N     +K  QSD GGE+ 
Subjt:  CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEFC

Query:  NTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCACY
        +T  Q+F  +KGI HQKSCPYTPEQN +AERK+RH+VETA++L+  +S+  +FW +A +T+ +L+NR+ +S LKM SPFE+L+     L HL+VFGCACY
Subjt:  NTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCACY

Query:  PLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANSTNSSPSQSLSISDPSILLTLL--NIPQHIPLNSVLIPNIQSA
        P LKPY  +KL+PKTT  +F+GY   +KGYICY     + IVSRHV+F E++FP     +SS S S ++S  SI ++L       H P   +  P+I S+
Subjt:  PLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANSTNSSPSQSLSISDPSILLTLL--NIPQHIPLNSVLIPNIQSA

Query:  PIRPPTVDITCSSFMD------TNATCQSLNVLSTVDTGNEINATTINLPD----PPPPRNTHVMQTRAKSGIFKPKAFH--ITTALPTPTSYTKASK
                +T S F              S +  S + + +  + T+  +PD         NTH MQTR+KSGIFK K F   +   +  P S++ A++
Subjt:  PIRPPTVDITCSSFMD------TNATCQSLNVLSTVDTGNEINATTINLPD----PPPPRNTHVMQTRAKSGIFKPKAFH--ITTALPTPTSYTKASK

TQE01264.1 hypothetical protein C1H46_013171 [Malus baccata]2.6e-7841.18Show/hide
Query:  CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEFC
        C SCL+ K  KLPF    + ++ PLE++HSDVWGP   +S  G ++YVSFVD  ++FTW+FP+  KS+V  +   F   +  Q    +K FQSD GGE+ 
Subjt:  CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEFC

Query:  NTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCACY
        +   + +   KGI HQKSCPYTP+QN +AERKHRHI+ETA++L+  +S+P + W +A + +V+LINRM+  +L+M SPF+ LFG +  + HLKVFGCAC+
Subjt:  NTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCACY

Query:  PLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANSTNSSPSQSLSISDPSILLTLLNIPQHIPLNSVLIPNI-----
        PLLK     KL+PKT+Q +FIGY   +KGY+C NPLT +  VSRHV+F ET FPY +S  +S S S  IS PS+ +    +P  +  ++ ++  I     
Subjt:  PLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANSTNSSPSQSLSISDPSILLTLLNIPQHIPLNSVLIPNI-----

Query:  QSAPIRPP-------TVDITCSSFMDTNATCQSLNVLSTVDTGN------EINATTINLPDPPPPRNTHVMQTRAKSGIFKPKAF----HITTALPTPTS
        + +P  PP       +  +  S F+D+  +  S   L    T +      +     +++  P P  + H MQTR+KSGI K K F      +  +  P +
Subjt:  QSAPIRPP-------TVDITCSSFMDTNATCQSLNVLSTVDTGN------EINATTINLPDPPPPRNTHVMQTRAKSGIFKPKAF----HITTALPTPTS

Query:  YTKASKYP
        +  A K P
Subjt:  YTKASKYP

TrEMBL top hitse value%identityAlignment
A0A2N9EFT0 Uncharacterized protein2.3e-8846.53Show/hide
Query:  SSSISTYNCISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQ
        ++  S+  C  C++ K+ + PFP S  T+  PLELVHSDVWGP  + S NG R+YVSFVD+F++FTWLFPI +KS V +  Q F   +EN +  ++K  +
Subjt:  SSSISTYNCISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQ

Query:  SDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHL
        +D GGE+ N+  +SF  ++GI HQ SCP+TP+QN +AERKHRHIVETA++LI  SS+PL++WPYAFSTA++LINRM + +LK  SP+++LF    D   L
Subjt:  SDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHL

Query:  KVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANSTNSSPSQSLSISDPSILLTLLNIPQHIPLNSVLI
        K FGC C+PLL+PY KHKLEP+++  VF+GY  N KGY+C N  T + ++SRHV FHE  FP+ + T  SPS   + S+  +   L   P   P  S+L 
Subjt:  KVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANSTNSSPSQSLSISDPSILLTLLNIPQHIPLNSVLI

Query:  PNIQSAPIRPPTVDIT--CSSFMDTNATCQSLNVLSTVDTGNEIN--ATTINLPDPP--PPRNTHVMQTRAKSGIFKPK-AFHITTALP---TPTSYTKA
        P     P  PP    T   SS  D +A  +  + L    +   I+    + ++P  P  PP N+H MQTR KSGI K K   H  T  P    P SY  A
Subjt:  PNIQSAPIRPPTVDIT--CSSFMDTNATCQSLNVLSTVDTGNEIN--ATTINLPDPP--PPRNTHVMQTRAKSGIFKPK-AFHITTALP---TPTSYTKA

Query:  SKYP
        SKYP
Subjt:  SKYP

A0A2N9FMC6 Integrase catalytic domain-containing protein2.3e-8846.53Show/hide
Query:  SSSISTYNCISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQ
        ++  S+  C  C++ K+ + PFP S  T+  PLELVHSDVWGP  + S NG R+YVSFVD+F++FTWLFPI +KS V +  Q F   +EN +  ++K  +
Subjt:  SSSISTYNCISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQ

Query:  SDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHL
        +D GGE+ N+  +SF  ++GI HQ SCP+TP+QN +AERKHRHIVETA++LI  SS+PL++WPYAFSTA++LINRM + +LK  SP+++LF    D   L
Subjt:  SDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHL

Query:  KVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANSTNSSPSQSLSISDPSILLTLLNIPQHIPLNSVLI
        K FGC C+PLL+PY KHKLEP+++  VF+GY  N KGY+C N  T + ++SRHV FHE  FP+ + T  SPS   + S+  +   L   P   P  S+L 
Subjt:  KVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANSTNSSPSQSLSISDPSILLTLLNIPQHIPLNSVLI

Query:  PNIQSAPIRPPTVDIT--CSSFMDTNATCQSLNVLSTVDTGNEIN--ATTINLPDPP--PPRNTHVMQTRAKSGIFKPK-AFHITTALP---TPTSYTKA
        P     P  PP    T   SS  D +A  +  + L    +   I+    + ++P  P  PP N+H MQTR KSGI K K   H  T  P    P SY  A
Subjt:  PNIQSAPIRPPTVDIT--CSSFMDTNATCQSLNVLSTVDTGNEIN--ATTINLPDPP--PPRNTHVMQTRAKSGIFKPK-AFHITTALP---TPTSYTKA

Query:  SKYP
        SKYP
Subjt:  SKYP

A0A2N9FT93 Uncharacterized protein7.7e-8444.17Show/hide
Query:  MSSSISTYN------CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIP
        +SS IS  N      C  CL  KM KLPF  S   S  PLELVHSDVWGP  I SSNG RYY+ FVD+FS+F+WLF + +KS+V +  + F   +ENQ+ 
Subjt:  MSSSISTYN------CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIP

Query:  KKLKRFQSDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGY
         ++K  ++D GGE+ +     F  S GI HQ SCP+TP+QN I ERKHRHIVE+A++++ H+S+P+ +W YA STAV LINR+ +  L  +SP+E LF  
Subjt:  KKLKRFQSDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGY

Query:  TRDLHHLKVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIF-PYANSTNSSPSQSL-SISDP-SILLTLLNIP
          DL HLK FGC C+P L+PY  HKL+P++T  +F+GYP + KGYIC +P++ +  +SRH +F+E+ F P+ +  +++ +  + S  DP   LL +++  
Subjt:  TRDLHHLKVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIF-PYANSTNSSPSQSL-SISDP-SILLTLLNIP

Query:  QHIPLNSVLIPNIQ---SAPIRPPTVDITCSSFMDTN--ATCQSLNVLSTVDTGNEINATTINLPDPPPPRNTHVMQTRAKSGIFKPKAFHITTA---LP
         HIPL   L P++    + P  PP   I   S +      T  S ++     +G+    +   LP P  P NTH M TR+K GIFKPK FH  T      
Subjt:  QHIPLNSVLIPNIQ---SAPIRPPTVDITCSSFMDTN--ATCQSLNVLSTVDTGNEINATTINLPDPPPPRNTHVMQTRAKSGIFKPKAFHITTA---LP

Query:  TPTSYTKASKYP
         P +Y  ASKYP
Subjt:  TPTSYTKASKYP

A0A2N9GRJ0 Uncharacterized protein2.3e-8846.53Show/hide
Query:  SSSISTYNCISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQ
        ++  S+  C  C++ K+ + PFP S  T+  PLELVHSDVWGP  + S NG R+YVSFVD+F++FTWLFPI +KS V +  Q F   +EN +  ++K  +
Subjt:  SSSISTYNCISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQ

Query:  SDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHL
        +D GGE+ N+  +SF  ++GI HQ SCP+TP+QN +AERKHRHIVETA++LI  SS+PL++WPYAFSTA++LINRM + +LK  SP+++LF    D   L
Subjt:  SDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHL

Query:  KVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANSTNSSPSQSLSISDPSILLTLLNIPQHIPLNSVLI
        K FGC C+PLL+PY KHKLEP+++  VF+GY  N KGY+C N  T + ++SRHV FHE  FP+ + T  SPS   + S+  +   L   P   P  S+L 
Subjt:  KVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANSTNSSPSQSLSISDPSILLTLLNIPQHIPLNSVLI

Query:  PNIQSAPIRPPTVDIT--CSSFMDTNATCQSLNVLSTVDTGNEIN--ATTINLPDPP--PPRNTHVMQTRAKSGIFKPK-AFHITTALP---TPTSYTKA
        P     P  PP    T   SS  D +A  +  + L    +   I+    + ++P  P  PP N+H MQTR KSGI K K   H  T  P    P SY  A
Subjt:  PNIQSAPIRPPTVDIT--CSSFMDTNATCQSLNVLSTVDTGNEIN--ATTINLPDPP--PPRNTHVMQTRAKSGIFKPK-AFHITTALP---TPTSYTKA

Query:  SKYP
        SKYP
Subjt:  SKYP

A0A2N9J837 Uncharacterized protein5.9e-8444.17Show/hide
Query:  MSSSISTYN------CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIP
        +SS IS  N      C  CL  KM KLPF  S   S  PLELVHSDVWGP  I SSNG RYY+ FVD+FS+F+WLF + +KS+V +  + F   +ENQ+ 
Subjt:  MSSSISTYN------CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIP

Query:  KKLKRFQSDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGY
         ++K  ++D GGE+ +    +F  S GI HQ SCP+TP+QN I ERKHRHIVE+A++++ H+S+P+ +W YA STAV LIN++ +  L  +SP+E LF  
Subjt:  KKLKRFQSDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGY

Query:  TRDLHHLKVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIF-PYANSTNSSPSQSL-SISDP-SILLTLLNIP
          DL HLK FGC C+P L+PY  HKL+P++T  +F+GYP + KGYIC +P++ +  +SRHV+F+E+ F P+ +  +++ +  + S  DP   LL +++  
Subjt:  TRDLHHLKVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIF-PYANSTNSSPSQSL-SISDP-SILLTLLNIP

Query:  QHIPLNSVLIPNIQ---SAPIRPPTVDITCSSFMDTN--ATCQSLNVLSTVDTGNEINATTINLPDPPPPRNTHVMQTRAKSGIFKPKAFHITTA---LP
         HIPL   L P++    + P  PP   I   S +      T  S ++     +G+    +   LP P  P NTH M TR+K GIFKPK FH  T      
Subjt:  QHIPLNSVLIPNIQ---SAPIRPPTVDITCSSFMDTN--ATCQSLNVLSTVDTGNEINATTINLPDPPPPRNTHVMQTRAKSGIFKPKAFHITTA---LP

Query:  TPTSYTKASKYP
         P +Y  ASKYP
Subjt:  TPTSYTKASKYP

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.0e-3233.73Show/hide
Query:  CISCLKAKMTKLPFPMSLSTS--LTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGE
        C  CL  K  +LPF      +    PL +VHSDV GP   ++ +   Y+V FVD F+ +   + I YKSDV S+ Q FV   E     K+     D+G E
Subjt:  CISCLKAKMTKLPFPMSLSTS--LTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGE

Query:  FCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSL--KMLSPFEMLFGYTRDLHHLKVFG
        + +  ++ F   KGI +  + P+TP+ N ++ER  R I E A +++  + +   FW  A  TA +LINR+ S +L     +P+EM       L HL+VFG
Subjt:  FCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSL--KMLSPFEMLFGYTRDLHHLKVFG

Query:  CACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHET
           Y  +K   + K + K+ + +F+GY  N  G+  ++ + ++ IV+R V+  ET
Subjt:  CACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHET

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.3e-4337.85Show/hide
Query:  CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEFC
        C  CL  K  ++ F  S    L  L+LV+SDV GP  I S  GN+Y+V+F+D+ S+  W++ +  K  V  + QKF  L+E +  +KLKR +SD+GGE+ 
Subjt:  CISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEFC

Query:  NTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCACY
        +   + +  S GI H+K+ P TP+ N +AER +R IVE   S++  + +P  FW  A  TA +LINR  S  L    P  +         HLKVFGC  +
Subjt:  NTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCACY

Query:  PLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHET
          +    + KL+ K+   +FIGY     GY  ++P+ K+ I SR V+F E+
Subjt:  PLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHET

Q12491 Transposon Ty2-B Gag-Pol polyprotein3.0e-1625.65Show/hide
Query:  SSISTYNCISCLKAKMTK----LPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPI--AYKSDVPSIVQKFVPLIENQIPKK
        S+ STY C  CL  K TK        +    S  P + +H+D++GP   +  +   Y++SF D  ++F W++P+    +  + ++    +  I+NQ   +
Subjt:  SSISTYNCISCLKAKMTK----LPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPI--AYKSDVPSIVQKFVPLIENQIPKK

Query:  LKRFQSDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRM-SSSSLKMLSPFEMLFGYT
        +   Q D G E+ N TL  FF ++GI    +       + +AER +R ++    +L+  S +P   W  A   +  + N + S  + K       L G  
Subjt:  LKRFQSDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRM-SSSSLKMLSPFEMLFGYT

Query:  RDLHHLKVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPY----NFKGYICYNPLTKQTI-VSRHVIFHE
         D+  +  FG    P++     H  + K       GY      N  GYI Y P  K+T+  + +VI  +
Subjt:  RDLHHLKVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPY----NFKGYICYNPLTKQTI-VSRHVIFHE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.2e-6436.72Show/hide
Query:  NCISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEF
        +C  CL  K  K+PF  S   S  PLE ++SDVW    I+S +  RYYV FVD+F+++TWL+P+  KS V      F  L+EN+   ++  F SD+GGEF
Subjt:  NCISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEF

Query:  CNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCAC
            L  +F   GI H  S P+TPE N ++ERKHRHIVET ++L+ H+S+P  +WPYAF+ AV+LINR+ +  L++ SPF+ LFG + +   L+VFGCAC
Subjt:  CNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCAC

Query:  YPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYAN-STNSSPSQSLSISDPSILLTLLNIPQHIPL----------N
        YP L+PY +HKL+ K+ Q VF+GY      Y+C +  T +  +SRHV F E  FP++N     SP Q        +      +P   P+          +
Subjt:  YPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYAN-STNSSPSQSLSISDPSILLTLLNIPQHIPL----------N

Query:  SVLIPNIQSAPIRPPTVDITCSSFMDTNATCQSLNVLSTVDTGNEINATTINLPDP-----PPPRNTHVMQTRAKSGIFKPKAFHITTALPTPTSYTKAS
        +   P+  SAP R   V    SS +D++ +       S+  +  E  A   N P P          TH  Q  +++         +  +L TP   + +S
Subjt:  SVLIPNIQSAPIRPPTVDITCSSFMDTNATCQSLNVLSTVDTGNEINATTINLPDP-----PPPRNTHVMQTRAKSGIFKPKAFHITTALPTPTSYTKAS

Query:  KYP
          P
Subjt:  KYP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.6e-6235.49Show/hide
Query:  MSSSISTYNCISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRF
        ++ S    +C  C   K  K+PF  S  TS  PLE ++SDVW    I+S +  RYYV FVD+F+++TWL+P+  KS V      F  L+EN+   ++   
Subjt:  MSSSISTYNCISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRF

Query:  QSDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHH
         SD+GGEF    L+ +    GI H  S P+TPE N ++ERKHRHIVE  ++L+ H+SVP  +WPYAFS AV+LINR+ +  L++ SPF+ LFG   +   
Subjt:  QSDSGGEFCNTTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHH

Query:  LKVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANS----TNSSPSQSLSISDPSILLTLLNIPQHIPL
        LKVFGCACYP L+PY +HKLE K+ Q  F+GY      Y+C +  T +   SRHV F E  FP++ +    + S   +S S  +     TL   P  +P 
Subjt:  LKVFGCACYPLLKPYTKHKLEPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANS----TNSSPSQSLSISDPSILLTLLNIPQHIPL

Query:  NSVLIPNIQSAPIRPPTVDITCSS-FMDTNATCQSLNVLSTVD------------------TGNEINATTINLPDP--PPPRNTHVMQTRAKSGIFKPKA
           L P++ ++P  P +    C++    +N    S++  S+ +                    +  N+  +N P+P  P P + +      +S I  P  
Subjt:  NSVLIPNIQSAPIRPPTVDITCSS-FMDTNATCQSLNVLSTVD------------------TGNEINATTINLPDP--PPPRNTHVMQTRAKSGIFKPKA

Query:  FHITTALPTPTSYTKAS
           +T++  P S + +S
Subjt:  FHITTALPTPTSYTKAS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCTTCTATTTCTACTTACAATTGCATAAGTTGTTTAAAAGCAAAAATGACAAAATTACCTTTTCCTATGTCTTTATCAACTTCTCTTACACCACTTGAACTTGT
TCACAGCGATGTATGGGGACCCTTTCTAATAATATCTTCTAATGGAAATCGATATTATGTTAGCTTTGTTGACAACTTTAGCAAATTTACTTGGCTTTTTCCTATTGCTT
ACAAATCTGATGTTCCATCTATTGTTCAAAAATTTGTTCCTCTCATTGAAAATCAAATACCTAAAAAATTAAAAAGATTTCAATCAGATAGTGGTGGTGAATTTTGCAAC
ACCACTTTACAATCCTTTTTTTATTCCAAAGGTATTTTTCACCAAAAATCATGTCCTTACACTCCTGAACAAAACAGTATAGCCGAACGTAAACATCGTCACATAGTTGA
AACTGCAATGTCTCTAATTTTCCATTCCTCTGTTCCTCTTGAATTTTGGCCTTACGCCTTCTCCACTGCAGTCTTTCTTATTAATCGAATGTCCTCTTCCTCACTTAAAA
TGTTATCACCATTTGAAATGCTTTTTGGTTATACTCGTGATTTGCATCATTTAAAAGTTTTTGGATGTGCATGTTACCCCCTTCTCAAGCCTTACACCAAACACAAACTT
GAACCAAAAACCACCCAACATGTATTTATAGGCTATCCTTATAACTTCAAAGGCTATATTTGTTACAATCCATTAACCAAACAAACCATAGTTTCACGACATGTTATCTT
TCATGAAACAATCTTTCCCTATGCAAATTCCACCAACTCTTCTCCATCTCAATCCCTATCCATCTCTGATCCAAGTATACTACTCACCTTACTAAACATTCCACAACATA
TACCCTTAAATTCTGTTCTCATTCCCAATATCCAATCTGCACCTATTCGCCCTCCCACTGTTGACATTACTTGCTCATCTTTTATGGATACTAATGCCACTTGTCAGTCT
TTGAATGTTTTATCTACAGTCGATACAGGAAATGAGATAAATGCTACCACAATTAACTTACCTGATCCACCTCCTCCTCGAAATACTCATGTTATGCAAACTCGAGCAAA
ATCAGGTATTTTTAAGCCAAAGGCCTTTCACATTACAACTGCTCTCCCAACTCCCACATCATATACAAAAGCCTCCAAATATCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCTTCTATTTCTACTTACAATTGCATAAGTTGTTTAAAAGCAAAAATGACAAAATTACCTTTTCCTATGTCTTTATCAACTTCTCTTACACCACTTGAACTTGT
TCACAGCGATGTATGGGGACCCTTTCTAATAATATCTTCTAATGGAAATCGATATTATGTTAGCTTTGTTGACAACTTTAGCAAATTTACTTGGCTTTTTCCTATTGCTT
ACAAATCTGATGTTCCATCTATTGTTCAAAAATTTGTTCCTCTCATTGAAAATCAAATACCTAAAAAATTAAAAAGATTTCAATCAGATAGTGGTGGTGAATTTTGCAAC
ACCACTTTACAATCCTTTTTTTATTCCAAAGGTATTTTTCACCAAAAATCATGTCCTTACACTCCTGAACAAAACAGTATAGCCGAACGTAAACATCGTCACATAGTTGA
AACTGCAATGTCTCTAATTTTCCATTCCTCTGTTCCTCTTGAATTTTGGCCTTACGCCTTCTCCACTGCAGTCTTTCTTATTAATCGAATGTCCTCTTCCTCACTTAAAA
TGTTATCACCATTTGAAATGCTTTTTGGTTATACTCGTGATTTGCATCATTTAAAAGTTTTTGGATGTGCATGTTACCCCCTTCTCAAGCCTTACACCAAACACAAACTT
GAACCAAAAACCACCCAACATGTATTTATAGGCTATCCTTATAACTTCAAAGGCTATATTTGTTACAATCCATTAACCAAACAAACCATAGTTTCACGACATGTTATCTT
TCATGAAACAATCTTTCCCTATGCAAATTCCACCAACTCTTCTCCATCTCAATCCCTATCCATCTCTGATCCAAGTATACTACTCACCTTACTAAACATTCCACAACATA
TACCCTTAAATTCTGTTCTCATTCCCAATATCCAATCTGCACCTATTCGCCCTCCCACTGTTGACATTACTTGCTCATCTTTTATGGATACTAATGCCACTTGTCAGTCT
TTGAATGTTTTATCTACAGTCGATACAGGAAATGAGATAAATGCTACCACAATTAACTTACCTGATCCACCTCCTCCTCGAAATACTCATGTTATGCAAACTCGAGCAAA
ATCAGGTATTTTTAAGCCAAAGGCCTTTCACATTACAACTGCTCTCCCAACTCCCACATCATATACAAAAGCCTCCAAATATCCATAA
Protein sequenceShow/hide protein sequence
MSSSISTYNCISCLKAKMTKLPFPMSLSTSLTPLELVHSDVWGPFLIISSNGNRYYVSFVDNFSKFTWLFPIAYKSDVPSIVQKFVPLIENQIPKKLKRFQSDSGGEFCN
TTLQSFFYSKGIFHQKSCPYTPEQNSIAERKHRHIVETAMSLIFHSSVPLEFWPYAFSTAVFLINRMSSSSLKMLSPFEMLFGYTRDLHHLKVFGCACYPLLKPYTKHKL
EPKTTQHVFIGYPYNFKGYICYNPLTKQTIVSRHVIFHETIFPYANSTNSSPSQSLSISDPSILLTLLNIPQHIPLNSVLIPNIQSAPIRPPTVDITCSSFMDTNATCQS
LNVLSTVDTGNEINATTINLPDPPPPRNTHVMQTRAKSGIFKPKAFHITTALPTPTSYTKASKYP