CuGenDBv2

Clc09G16950 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G16950
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Reverse transcriptase
Genome location: ClcChr09:27092559..27096301
RNA-Seq Expression: Clc09G16950
Synteny: Clc09G16950
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR041373 - Reverse transcriptase, RNase H-like domain
  IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
  IPR043502 - DNA/RNA polymerase superfamily
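The genome location string `ClcChr09:27092559..27096301` appears to follow a `sequence:start..end` pattern. A minimal Python sketch for splitting it into parts is given below. The names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # ASSUMPTION: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Pattern for strings such as 'ClcChr09:27092559..27096301'.
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a location string into sequence ID, start, and end."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("ClcChr09:27092559..27096301")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene spans 3743 bases on ClcChr09.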


Homology
GenBank top hits (e value, % identity, alignment)
PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus] (e value: 1.3e-149; % identity: 53.52)
Query:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR
        MCDAS+ AVGAVLGQRK+KI   IYYASKTLN +Q NYTTTEKE+LA+VFA DKF                              RS             
Subjt:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR

Query:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD
                                                        G+KV +Y+DH+AI+YL+ KK+AKPRLIRWVLLLQEFDLEI+DRKGTENQ+AD
Subjt:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD

Query:  HLSRLENKEVQESWSDIEEQLPDEHIMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILRSC
        HLSRLE+    +  + I +  PDE ++    S  PWYADIVNYL C   P + +AQQKKK  + ++ Y WD+P+L++   D+ILRRCVP+ E + IL  C
Subjt:  HLSRLENKEVQESWSDIEEQLPDEHIMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILRSC

Query:  HEALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAA
        H + YGGHF G RTAAK+LQSG                      TGNIS R+EMPLN++LEVELFDVW IDFMGPF PS GN YILVAVDYVSKWVEAAA
Subjt:  HEALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAA

Query:  CAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWAYR
           ND+  V  F+KK IF+RF TPRAIISD GTHF NR    LL+K+ V H+++T YHPQT+GQ E++N EIK ILE K VS++RKDW+++LDEALWAYR
Subjt:  CAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWAYR

Query:  TAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD
        TA+KTPIGMSPY LVFGKACHLP+ELEH A WA++KLN D
Subjt:  TAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD

WP_217833161.1 DDE-type integrase/transposase/recombinase, partial [Synechococcus sp. PCC 7002] (e value: 2.4e-164; % identity: 85.97)
Query:  LENKEVQESWSDIEEQLPDEHIMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILRSCHEALY
        +ENKEVQ+SWSDIEE+ PDEH+M A+SQEPWY DIVNYLVCNQWPEEFNA QKKKL+++SKFYCWDEPYLYRL  DHILRRCVP+YETHSIL+SCHEA Y
Subjt:  LENKEVQESWSDIEEQLPDEHIMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILRSCHEALY

Query:  GGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKND
        GGHFGGQRTAAKVLQSG                      TGNISNRNEMPLNSMLEVELFDVW IDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKND
Subjt:  GGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKND

Query:  ANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRTAFKT
        ANTVSKFLKKQIFSRF TPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTN QAEITN EIKSILE KVVSTSRKDWTE+LDEALWAYRT FKT
Subjt:  ANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRTAFKT

Query:  PIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD
        PIGMSPYALVFGKACHL LELEHKAIWAMKKLNLD
Subjt:  PIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber] (e value: 1.7e-149; % identity: 53.06)
Query:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR
        MCDAS+ A+GAVLGQR++K+   IYYAS+TLN +Q NYTTTEKEMLA+VFA DKF                              RS   C         
Subjt:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR

Query:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD
                                                         +KV +++DH+A++YL +KK+AKPRLIRW+LLLQEFDLE++D+KG+EN VAD
Subjt:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD

Query:  HLSRLENKEVQESWSDIEEQLPDEHIMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILRSCH
        HLSRLE +EV+     I+E  PDE +   E + PWYADIVN+L C   P +    Q+KK  +  K+Y WDEP L++   D I+RRCVP+ E  +IL  CH
Subjt:  HLSRLENKEVQESWSDIEEQLPDEHIMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILRSCH

Query:  EALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAAC
         + YGGHFG  RTAAKVLQSG                       GNIS R E+PL ++LEVELFDVW IDFMGPFPPS G  YIL+AVDYVSKWVEA A 
Subjt:  EALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAAC

Query:  AKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRT
          NDA  V KFL K IF+RF TPRAIISDEGTHF N++  NLL+K+ V H++A AYHPQTNGQAEI+N EIK+ILE K V+T+RKDW +KLD+ALWAYRT
Subjt:  AKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRT

Query:  AFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD
        AFKTPIGMSPY LVFGKACHLP+ELEHKA WA+KK NLD
Subjt:  AFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD

XP_042003745.1 uncharacterized protein LOC121752711 [Salvia splendens] (e value: 1.6e-147; % identity: 52.21)
Query:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR
        MCDAS++AVGAVLGQR++K++H +YYASK LN +Q NYTTTEKEMLA+V+A +KF                                             
Subjt:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR

Query:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD
                 LL                                     G+KV +++DHSAIKYLM KK+AKPRL+RW+LLLQEFD+EIKD+KGTEN VAD
Subjt:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD

Query:  HLSRLENKE--VQESWSDIEEQLPDEHIMNAESQE---PWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSI
        HLSRLE  E    E    I E+ PDE ++  E++E   PW+A++ NYLV    PE  ++ QKKK    ++ Y W++P+L+R+ SD ++RRCV ++E   I
Subjt:  HLSRLENKE--VQESWSDIEEQLPDEHIMNAESQE---PWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSI

Query:  LRSCHEALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWV
        L +CH++LYGGHFG +RTA KVLQSG                       GNIS RNEMP+N++ EVELFDVW IDFMGPFP S G QYILVAVDYVSKWV
Subjt:  LRSCHEALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWV

Query:  EAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEAL
        EA A A NDA  V KF+K  IF+RF TPRAIISD GTHF N++  NLL K+ V H+VAT YHPQT+GQ E++N EIK +LE KVV  SRKDW +KLD+AL
Subjt:  EAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEAL

Query:  WAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD
        WAYRTA+KTPIG SPY LVFGKACHLP+ELEHKA WA++KLNLD
Subjt:  WAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD

XP_042009195.1 uncharacterized protein LOC121757770 [Salvia splendens] (e value: 1.6e-147; % identity: 52.21)
Query:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR
        MCDAS++AVGAVLGQR++K++H +YYASK LN +Q NYTTTEKEMLA+V+A +KF                                             
Subjt:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR

Query:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD
                 LL                                     G+KV +++DHSAIKYLM KK+AKPRL+RW+LLLQEFD+EIKD+KGTEN VAD
Subjt:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD

Query:  HLSRLENKE--VQESWSDIEEQLPDEHIMNAESQE---PWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSI
        HLSRLE  E    E    I E+ PDE ++  E++E   PW+A++ NYLV    PE  ++ QKKK    ++ Y W++P+L+R+ SD ++RRCV ++E   I
Subjt:  HLSRLENKE--VQESWSDIEEQLPDEHIMNAESQE---PWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSI

Query:  LRSCHEALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWV
        L +CH++LYGGHFG +RTA KVLQSG                       GNIS RNEMP+N++ EVELFDVW IDFMGPFP S G QYILVAVDYVSKWV
Subjt:  LRSCHEALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWV

Query:  EAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEAL
        EA A A NDA  V KF+K  IF+RF TPRAIISD GTHF N++  NLL K+ V H+VAT YHPQT+GQ E++N EIK +LE KVV  SRKDW +KLD+AL
Subjt:  EAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEAL

Query:  WAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD
        WAYRTA+KTPIG SPY LVFGKACHLP+ELEHKA WA++KLNLD
Subjt:  WAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD

TrEMBL top hits (e value, % identity, alignment)
A0A251UM01 Putative reverse transcriptase domain, Ribonuclease H-like domain protein (e value: 6.6e-144; % identity: 51.49)
Query:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR
        MCDASNHAVGAVLGQRK+++ H IYYASKTL+ +Q NY+TTEKE+LAIVFA++KF             +Q+                             
Subjt:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR

Query:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD
                                                        G+KV +YSDH+A++YLM KK+AKPRLIRWVLLLQEFDLEI+D+ G +N VAD
Subjt:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD

Query:  HLSRLENKEVQESWSDIEEQLPDEHIMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILRSCH
        HLSR+ N E       + +  PDEH+  AE   PWYADIVNYLV N +P E +  QK K++ +++ Y WDEPYL++  +D ++RRCV K E  SIL  CH
Subjt:  HLSRLENKEVQESWSDIEEQLPDEHIMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILRSCH

Query:  EALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAAC
            GGHFG +RTA KVL+SG                      TGN+S+R++MPL  +L  E+FDVW IDFMGPFP S GN YIL+AVDYVSKWVEA A 
Subjt:  EALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAAC

Query:  AKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRT
          ND+  VS F+K  IFSRF TP+A ISD G+HF NR I  L  K+ V+HRV+TAYHPQTNGQAEI+N EIKSILE K V+ +RKDW+ +LD+ALWAYRT
Subjt:  AKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRT

Query:  AFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNL
        A+KTPIGMSP+ LVFGKACHLP+ELEHKA WA+K+ NL
Subjt:  AFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNL

A0A2G9FWY3 Reverse transcriptase (e value: 6.2e-150; % identity: 53.52)
Query:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR
        MCDAS+ AVGAVLGQRK+KI   IYYASKTLN +Q NYTTTEKE+LA+VFA DKF                              RS             
Subjt:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR

Query:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD
                                                        G+KV +Y+DH+AI+YL+ KK+AKPRLIRWVLLLQEFDLEI+DRKGTENQ+AD
Subjt:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD

Query:  HLSRLENKEVQESWSDIEEQLPDEHIMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILRSC
        HLSRLE+    +  + I +  PDE ++    S  PWYADIVNYL C   P + +AQQKKK  + ++ Y WD+P+L++   D+ILRRCVP+ E + IL  C
Subjt:  HLSRLENKEVQESWSDIEEQLPDEHIMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILRSC

Query:  HEALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAA
        H + YGGHF G RTAAK+LQSG                      TGNIS R+EMPLN++LEVELFDVW IDFMGPF PS GN YILVAVDYVSKWVEAAA
Subjt:  HEALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAA

Query:  CAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWAYR
           ND+  V  F+KK IF+RF TPRAIISD GTHF NR    LL+K+ V H+++T YHPQT+GQ E++N EIK ILE K VS++RKDW+++LDEALWAYR
Subjt:  CAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWAYR

Query:  TAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD
        TA+KTPIGMSPY LVFGKACHLP+ELEH A WA++KLN D
Subjt:  TAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD

A0A2K3LHD8 Integrase catalytic domain-containing protein (e value: 3.9e-144; % identity: 52.13)
Query:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR
        MCDAS+ AVGAVLGQRK+K++H IYYAS  LN +Q NY TTEKE+LA+V+A DKF                              RS             
Subjt:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR

Query:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD
                 LL                                     GSKV +Y+DH+A++YL AK+ +KPRL+RW+LLLQEFDLEI+D+KG+EN VAD
Subjt:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD

Query:  HLSRLENKEVQESWSDIEEQLPDEHIMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILRSCH
        HLSRLE     E    I++   DEHI+ A +  PW+AD  NY+V    P +F  QQ+KK  +  KFY WDEP+LY+   D +LRRCVP+ E   +L  CH
Subjt:  HLSRLENKEVQESWSDIEEQLPDEHIMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILRSCH

Query:  EALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAAC
        ++ YGGHF G RTAAKVLQSG                      TGNIS RNEMP N +LEVE+FDVW IDFMGPFP S    YILVAVDYVSKWVEA A 
Subjt:  EALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAAC

Query:  AKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRT
          NDA  V  FLK+ IFSRF  PRA+ISDEGTHF+NR +  LL K+NV HR+AT YHPQT+GQ E++N +IK ILE K V++SRKDW+ KLD+ALWAYRT
Subjt:  AKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRT

Query:  AFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD
        AFKTPIGMSP+ +V+GKACHLPLELEHKA+WA K LN D
Subjt:  AFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD

A0A6P6GGL5 LOW QUALITY PROTEIN: uncharacterized protein LOC112492878 (e value: 3.9e-144; % identity: 52.58)
Query:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR
        MCDAS++A+GAVLGQ K+K +H IYYAS+TLN +Q NY TT+KEM AIVFA DKF                                             
Subjt:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR

Query:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD
                          +A+                           GSK  +Y+DHS IKYLM+KK +KPRLIRWVLLLQEFDLEI D+KG EN VAD
Subjt:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD

Query:  HLSRLENKEVQESWSDIEEQLPDEHIMNAES--QEPWYADIVNYLVCNQWPEEF-NAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILR
        HLSRLE  E +E   DIEE  PDE +   +     PWYADIVNYLV N  P       +K K   KS++Y WD+PYL++  +D I+RRCV + ET SI++
Subjt:  HLSRLENKEVQESWSDIEEQLPDEHIMNAES--QEPWYADIVNYLVCNQWPEEF-NAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILR

Query:  SCHEALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEA
        SCH + YGGHFG ++T AK+L SG                      TGNIS +NEMPL ++LEVELFDVW IDFMGPFP SCGN+YILVAVDYVSKWVEA
Subjt:  SCHEALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEA

Query:  AACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWA
        +    NDA  V KFLKK IF+RF TPRAIISD GTHF N+   +LL K+ V H++AT YHPQT+GQ EI+N EIK ILE K V+ SRKDW+ KLD+ALWA
Subjt:  AACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWA

Query:  YRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD
        YRTA+KTPIG SPY LVFGK CHLP+ELEHKA WA K LN D
Subjt:  YRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD

A0A803R2M6 Uncharacterized protein (e value: 5.4e-146; % identity: 54.06)
Query:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR
        MCDAS++A+GAVLGQR +K+   IYYASKTLN +Q NY TTEKEMLAIVFA DKF +P                                          
Subjt:  MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSR

Query:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD
                                                        G+KV +Y+DHSAIKYLM KK+AKPRLIRWVLLLQEFDL+IKD+KGTEN VAD
Subjt:  FSLEYCWVSLLSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVAD

Query:  HLSRLENKEVQESWS-DIEEQLPDEHIMNAES--QEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILR
        HLSRLE +E Q +    I EQ PDE + +       PWYAD VN+L  N  P E + QQ KK   + K Y W+EP LY+  +D I+RRCVP+ E +SIL 
Subjt:  HLSRLENKEVQESWS-DIEEQLPDEHIMNAES--QEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILR

Query:  SCHEALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEA
         CH    GGHF G RTAAKVLQSG                      TGNIS RNEMPL  +LEVELFDVW IDFMGPFP S  N YIL+AVDYVSKWVEA
Subjt:  SCHEALYGGHFGGQRTAAKVLQSG----------------------TGNISNRNEMPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEA

Query:  AACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWA
        AA   ND  TV +FL+K IF+RF TPRAIISDEG+HF N+    LL+++ V HR A  YHPQ+NGQAEI+N EIK ILE K V  SRKDW+ KLD+ALWA
Subjt:  AACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWA

Query:  YRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD
        YRTAFKTPIGMSPY LVFGKACHLP+ELEHKA WAMK LN+D
Subjt:  YRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD

SwissProt top hits (e value, % identity, alignment)
P03359 Gag-Pol polyprotein (e value: 1.3e-19; % identity: 36.18)
Query:  WRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEI
        W +DF    P   GN+Y+LV +D  S WVEA       A TV K + ++I  RF  P+ + SD G  F+ ++   L T+  ++ ++  AY PQ++GQ E 
Subjt:  WRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEI

Query:  TNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRTAFKTP--IGMSPYALVFG
         N  IK  L K  + T  KDW   L  AL   R    TP   G++PY +++G
Subjt:  TNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRTAFKTP--IGMSPYALVFG

P10272 Gag-Pol polyprotein (e value: 2.2e-19; % identity: 36)
Query:  WRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEI
        W IDF    P   G +Y+LV VD  S WVEA    +  A+ V+K + ++IF RF  P+ I SD G  F++++   L     ++ ++  AY PQ++GQ E 
Subjt:  WRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEI

Query:  TNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRTAFKTPIGMSPYALVFG
         N  IK  L K  + T  KDW   L  AL   R       G++PY +++G
Subjt:  TNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRTAFKTPIGMSPYALVFG

P26808 Gag-Pol polyprotein (e value: 1.7e-19; % identity: 36)
Query:  WRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEI
        W IDF    P   G +Y+LV VD  S WVEA    K  A  V+K L ++IF RF  P+ + +D G  F++++   +     V  ++  AY PQ++GQ E 
Subjt:  WRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEI

Query:  TNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRTAFKTPIGMSPYALVFG
         N  IK  L K  ++T  +DW   L  AL+  R     P G++PY +++G
Subjt:  TNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRTAFKTPIGMSPYALVFG

P26810 Gag-Pol polyprotein (e value: 1.7e-19; % identity: 36)
Query:  WRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEI
        W IDF    P   G +Y+LV VD  S WVEA    K  A  V+K L ++IF RF  P+ + +D G  F++++   +     V  ++  AY PQ++GQ E 
Subjt:  WRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEI

Query:  TNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRTAFKTPIGMSPYALVFG
         N  IK  L K  ++T  +DW   L  AL+  R     P G++PY +++G
Subjt:  TNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRTAFKTPIGMSPYALVFG

P31792 Pol polyprotein (Fragment) (e value: 1.3e-19; % identity: 36)
Query:  WRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEI
        W IDF    P   G +Y+LV VD  S WVEA    +  A+ V+K + ++IF RF  P+ I SD G  F++++   L     ++ ++  AY PQ++GQ E 
Subjt:  WRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNGQAEI

Query:  TNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRTAFKTPIGMSPYALVFG
         N  IK  L K  + T  KDW   L  AL   R       G++PY +++G
Subjt:  TNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRTAFKTPIGMSPYALVFG

Arabidopsis top hits (e value, % identity, alignment)
No hits found
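
In the alignments listed in this Homology section, the middle line of each Query/Subjt block repeats a residue where the two sequences agree, shows `+` for a similar residue, and leaves a space otherwise. A rough Python sketch for estimating percent identity from one such block follows. The function name `percent_identity` is hypothetical, and this is not necessarily how the reported %identity values were computed: those presumably cover the whole alignment as scored by the search tool.

```python
def percent_identity(query: str, midline: str) -> float:
    """Estimate % identity from a query line and its alignment midline.

    A position counts as identical when the midline repeats the query
    residue. '+' (similar residue) and spaces (mismatch or gap) do not
    count. The denominator is the full query line length, gaps included.
    """
    if not query:
        raise ValueError("query line is empty")
    # zip() stops at the shorter string, so a midline whose trailing
    # spaces were stripped is handled without padding.
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# First 20 columns of the PIM97577.1 alignment shown above.
query = "MCDASNHAVGAVLGQRKEKI"
midline = "MCDAS+ AVGAVLGQRK+KI"
print(percent_identity(query, midline))
```

For a full hit, the Query and midline strings of every block would need to be concatenated before calling the function.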

Sequences
CDS sequence
ATGTGCGATGCAAGCAACCATGCAGTGGGAGCAGTATTGGGGCAAAGAAAAGAGAAAATAATGCACCCCATCTATTATGCGAGTAAAACATTGAATGCGTCTCAGGAGAA
CTACACTACTACTGAGAAGGAAATGTTAGCCATAGTCTTTGCGGTTGACAAATTCTATAAGCCCCGTCGAAGAAGGAAGAGGAAAAGAAGAGGGAAACAGTTCTGCTGCA
GCGGTAGTATCGCTGAGAGAGCGAGAGACAGGGAGCGAAGCAAGTATCGCTGTAACGCAAAGAGGCTTTGCGTCTCACGATTCTCGCTAGAATATTGCTGGGTGTCGCTC
CTAAGTGTTGAGGGACTTTTTGTGAAGGCACATGGTAATGTAGACGTAATCGAATACGGTGCTGAGGAACAGAATGTGAAGGCACTTGATAGTTGCAAATTTGTGAAACA
TGATGGGTCAAAGGTGACCATCTATAGCGATCATTCTGCGATCAAATATTTGATGGCGAAAAAGAACGCAAAGCCTAGACTCATCCGCTGGGTCCTGCTATTACAAGAAT
TTGACTTGGAGATTAAAGACAGAAAGGGAACCGAGAATCAGGTTGCGGACCACCTATCTCGGTTGGAAAATAAAGAGGTCCAGGAAAGTTGGAGTGATATAGAGGAACAA
TTGCCAGATGAGCACATCATGAATGCAGAGAGTCAGGAACCGTGGTATGCAGACATAGTAAATTACTTGGTCTGCAACCAATGGCCTGAAGAATTCAATGCTCAACAAAA
GAAAAAACTCCAATATAAAAGTAAGTTCTACTGCTGGGATGAGCCATATCTATACAGACTTAGCTCGGACCACATACTACGTCGATGCGTTCCAAAATATGAAACGCATA
GCATTTTGAGAAGCTGTCATGAAGCACTTTACGGAGGACACTTTGGGGGGCAGAGAACAGCTGCAAAGGTGTTGCAAAGTGGGACAGGCAACATTTCCAACCGAAATGAG
ATGCCTCTAAACTCTATGCTGGAAGTTGAGTTGTTTGACGTATGGAGAATCGATTTCATGGGACCATTTCCTCCCTCTTGCGGTAATCAATATATCCTAGTAGCGGTCGA
CTACGTATCAAAATGGGTAGAAGCAGCAGCCTGTGCGAAGAACGACGCAAACACAGTGTCCAAGTTCTTAAAGAAACAAATCTTCTCTCGATTTAGGACACCAAGGGCGA
TAATTAGTGATGAAGGTACACATTTTATAAATCGCATAATCACTAATTTACTGACAAAATTTAATGTCTCGCACAGGGTAGCAACTGCTTATCACCCACAGACAAACGGC
CAAGCTGAAATAACAAACTGGGAGATCAAGTCCATACTTGAAAAAAAAGTCGTGAGCACATCAAGGAAAGATTGGACGGAGAAATTAGATGAAGCTCTATGGGCATACAG
AACAGCATTCAAAACACCTATAGGCATGTCACCCTATGCGCTGGTGTTTGGGAAAGCATGCCATCTCCCGCTTGAGCTGGAACACAAGGCCATCTGGGCTATGAAGAAGC
TCAATCTAGACTAG
mRNA sequence
ATGTGCGATGCAAGCAACCATGCAGTGGGAGCAGTATTGGGGCAAAGAAAAGAGAAAATAATGCACCCCATCTATTATGCGAGTAAAACATTGAATGCGTCTCAGGAGAA
CTACACTACTACTGAGAAGGAAATGTTAGCCATAGTCTTTGCGGTTGACAAATTCTATAAGCCCCGTCGAAGAAGGAAGAGGAAAAGAAGAGGGAAACAGTTCTGCTGCA
GCGGTAGTATCGCTGAGAGAGCGAGAGACAGGGAGCGAAGCAAGTATCGCTGTAACGCAAAGAGGCTTTGCGTCTCACGATTCTCGCTAGAATATTGCTGGGTGTCGCTC
CTAAGTGTTGAGGGACTTTTTGTGAAGGCACATGGTAATGTAGACGTAATCGAATACGGTGCTGAGGAACAGAATGTGAAGGCACTTGATAGTTGCAAATTTGTGAAACA
TGATGGGTCAAAGGTGACCATCTATAGCGATCATTCTGCGATCAAATATTTGATGGCGAAAAAGAACGCAAAGCCTAGACTCATCCGCTGGGTCCTGCTATTACAAGAAT
TTGACTTGGAGATTAAAGACAGAAAGGGAACCGAGAATCAGGTTGCGGACCACCTATCTCGGTTGGAAAATAAAGAGGTCCAGGAAAGTTGGAGTGATATAGAGGAACAA
TTGCCAGATGAGCACATCATGAATGCAGAGAGTCAGGAACCGTGGTATGCAGACATAGTAAATTACTTGGTCTGCAACCAATGGCCTGAAGAATTCAATGCTCAACAAAA
GAAAAAACTCCAATATAAAAGTAAGTTCTACTGCTGGGATGAGCCATATCTATACAGACTTAGCTCGGACCACATACTACGTCGATGCGTTCCAAAATATGAAACGCATA
GCATTTTGAGAAGCTGTCATGAAGCACTTTACGGAGGACACTTTGGGGGGCAGAGAACAGCTGCAAAGGTGTTGCAAAGTGGGACAGGCAACATTTCCAACCGAAATGAG
ATGCCTCTAAACTCTATGCTGGAAGTTGAGTTGTTTGACGTATGGAGAATCGATTTCATGGGACCATTTCCTCCCTCTTGCGGTAATCAATATATCCTAGTAGCGGTCGA
CTACGTATCAAAATGGGTAGAAGCAGCAGCCTGTGCGAAGAACGACGCAAACACAGTGTCCAAGTTCTTAAAGAAACAAATCTTCTCTCGATTTAGGACACCAAGGGCGA
TAATTAGTGATGAAGGTACACATTTTATAAATCGCATAATCACTAATTTACTGACAAAATTTAATGTCTCGCACAGGGTAGCAACTGCTTATCACCCACAGACAAACGGC
CAAGCTGAAATAACAAACTGGGAGATCAAGTCCATACTTGAAAAAAAAGTCGTGAGCACATCAAGGAAAGATTGGACGGAGAAATTAGATGAAGCTCTATGGGCATACAG
AACAGCATTCAAAACACCTATAGGCATGTCACCCTATGCGCTGGTGTTTGGGAAAGCATGCCATCTCCCGCTTGAGCTGGAACACAAGGCCATCTGGGCTATGAAGAAGC
TCAATCTAGACTAG
Protein sequence
MCDASNHAVGAVLGQRKEKIMHPIYYASKTLNASQENYTTTEKEMLAIVFAVDKFYKPRRRRKRKRRGKQFCCSGSIAERARDRERSKYRCNAKRLCVSRFSLEYCWVSL
LSVEGLFVKAHGNVDVIEYGAEEQNVKALDSCKFVKHDGSKVTIYSDHSAIKYLMAKKNAKPRLIRWVLLLQEFDLEIKDRKGTENQVADHLSRLENKEVQESWSDIEEQ
LPDEHIMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQYKSKFYCWDEPYLYRLSSDHILRRCVPKYETHSILRSCHEALYGGHFGGQRTAAKVLQSGTGNISNRNE
MPLNSMLEVELFDVWRIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFRTPRAIISDEGTHFINRIITNLLTKFNVSHRVATAYHPQTNG
QAEITNWEIKSILEKKVVSTSRKDWTEKLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLD
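
The protein sequence above should be recoverable from the CDS by translation. The Python sketch below does this with the standard genetic code, which is an assumption here because the record does not name a translation table. The helper `translate` is hypothetical, and it stops at the first stop codon.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: _AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and spaces left over from wrapped sequence text.
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# The first five codons of the CDS above.
print(translate("ATGTGCGATGCAAGC"))
```

Joining the wrapped CDS lines into one string and passing it to `translate` should give a result that can be compared directly with the listed protein sequence.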