CuGenDBv2

Lag0039714 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039714
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transposon Tf2-1 polyprotein
Genome location: chr2:48837667..48843626
RNA-Seq Expression: Lag0039714
Synteny: Lag0039714
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0009987 - cellular process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains:
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
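The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return chrom, start, end, end - start + 1

# Location of Lag0039714 as listed in this record.
chrom, start, end, length = parse_location("chr2:48837667..48843626")
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption, the gene spans 5960 bases on chr2.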


Homology
GenBank top hits | e value | % identity
KAF8393178.1 hypothetical protein HHK36_021419 [Tetracentron sinense] | 1.2e-42 | 32.97
Query:  VEKETGVDALEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIASAL------------------------------------------
        VE E     + Q+ E++LNS+VGL +PKT+K+KG+I  +EVVVLID   THNFI+  L                                          
Subjt:  VEKETGVDALEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIASAL------------------------------------------

Query:  --DFLPLNLGSADVILGVQWLATLGDVTTNHLNLEMFFCLG----------NLG---WKFRRRRRTVQNGSSNAGAEDNGSGRATDRRRWCCDSQRRRVA
          +FLPL LGS++VILG+QWL TLG   TN     M F LG          +LG      +   RT+++  +    E N    + +          + +A
Subjt:  --DFLPLNLGSADVILGVQWLATLGDVTTNHLNLEMFFCLG----------NLG---WKFRRRRRTVQNGSSNAGAEDNGSGRATDRRRWCCDSQRRRVA

Query:  AHVRSF--------IEGDERE--LRDRKTETREREKEHPSEETEGGDTERWRRRTTICDGGGGGNQNDGRRRNVYSPTLESHAQHLAKVMAVLQENKLVD
         H   F        + G E    L++       R   +P  + +  + ER  +     D                    ESH QHLA+V+ VLQ N L  
Subjt:  AHVRSF--------IEGDERE--LRDRKTETREREKEHPSEETEGGDTERWRRRTTICDGGGGGNQNDGRRRNVYSPTLESHAQHLAKVMAVLQENKLVD

Query:  NQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY-------------------------------------
        N+KKCQFG  Q+ YLGHI+SGKGV+A+ +KV+AM  WP P NL+ L  FLGLTGYY KFV  Y                                     
Subjt:  NQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY-------------------------------------

Query:  ------------VEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW
                    +E DASG G+G +L Q  + +AFFS AL    R K++YE+ELM IV AV KW
Subjt:  ------------VEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW

KAF8400118.1 hypothetical protein HHK36_013414 [Tetracentron sinense] | 6.7e-41 | 32.53
Query:  VEKETGVDALEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIASAL------------------------------------------
        VE E     + Q+ E++LNS+VGL +PKT+K+KG+I  +EV+VLID   THNFI+  L                                          
Subjt:  VEKETGVDALEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIASAL------------------------------------------

Query:  --DFLPLNLGSADVILGVQWLATLGDVTTNHLNLEMFFCLG----------NLG---WKFRRRRRTVQNGSSNAGAEDNGSGRATDRRRWCCDSQRRRVA
          +FLPL LGS++VILG+QWL TLG   TN     M F LG          +LG      +   RT+++  +    E N    + +          + +A
Subjt:  --DFLPLNLGSADVILGVQWLATLGDVTTNHLNLEMFFCLG----------NLG---WKFRRRRRTVQNGSSNAGAEDNGSGRATDRRRWCCDSQRRRVA

Query:  AHVRSF-IEGDERELRDRKTETREREKEHPSEETEGGDTERWRRRTTICDGGGGGNQNDGRRRNVYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGL
         H   F +      +R  +     +E   P    +  + ER  +     D                    ESH QHLA+V+ VLQ N L  N+KKC+FG 
Subjt:  AHVRSF-IEGDERELRDRKTETREREKEHPSEETEGGDTERWRRRTTICDGGGGGNQNDGRRRNVYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGL

Query:  KQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY----------------------------------------------
         Q+ YLGHI+SGKGV+A+ +KV+AM  WP P NL+ L  FLGLTGYY KFV  Y                                              
Subjt:  KQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY----------------------------------------------

Query:  ---VEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW
           +E DASG G+GV+L Q  + +AFFS  L      K++YE+ELM IV AV KW
Subjt:  ---VEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW

KFK36440.1 hypothetical protein AALP_AA4G124900 [Arabis alpina] | 4.6e-42 | 33.41
Query:  LEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIASAL--------------------------------------------DFLPLNL
        + ++ EL+LNS+VG+ SP T+K+ G +   +VVV+IDS  +HNFI++ L                                            DFLPL L
Subjt:  LEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIASAL--------------------------------------------DFLPLNL

Query:  GSADVILGVQWLATLGDVTTNHLNLEMFFCLGNLGWKFRRRRRTVQNGSSNAGAEDNGSGRATDRRRWCCDSQRRRVAAHVRSFIEGDERELRDRKTETR
        GSADVILG+QWLA+LG++  N              W  +  R TV N      AE    G  T     CC      + A +++ +E  E  +       +
Subjt:  GSADVILGVQWLATLGDVTTNHLNLEMFFCLGNLGWKFRRRRRTVQNGSSNAGAEDNGSGRATDRRRWCCDSQRRRVAAHVRSFIEGDERELRDRKTETR

Query:  EREKEHPSEETEGGDTERWRRRTTICDGGGGGNQNDGRRR--NVYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAK
          E+  P         +   R   +     G   + GR    N+       H  HL  V++VLQE +L  N+KKCQFG K+IEYLGHI+SG GV+ N  K
Subjt:  EREKEHPSEETEGGDTERWRRRTTICDGGGGGNQNDGRRR--NVYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAK

Query:  VEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY-------------------------------------------------VEADASGVGVGVILMQDS
        + AM  W  P+N+K L  FLGLTGYY KFV+ Y                                                 +++DASG+G+G +LMQ  
Subjt:  VEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY-------------------------------------------------VEADASGVGVGVILMQDS

Query:  KSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW
        K +AFFS ALT  ++ K+VYERELM IVLA+Q+W
Subjt:  KSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW

KFK36442.1 hypothetical protein AALP_AA4G124900 [Arabis alpina] | 4.6e-42 | 33.41
Query:  LEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIASAL--------------------------------------------DFLPLNL
        + ++ EL+LNS+VG+ SP T+K+ G +   +VVV+IDS  +HNFI++ L                                            DFLPL L
Subjt:  LEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIASAL--------------------------------------------DFLPLNL

Query:  GSADVILGVQWLATLGDVTTNHLNLEMFFCLGNLGWKFRRRRRTVQNGSSNAGAEDNGSGRATDRRRWCCDSQRRRVAAHVRSFIEGDERELRDRKTETR
        GSADVILG+QWLA+LG++  N              W  +  R TV N      AE    G  T     CC      + A +++ +E  E  +       +
Subjt:  GSADVILGVQWLATLGDVTTNHLNLEMFFCLGNLGWKFRRRRRTVQNGSSNAGAEDNGSGRATDRRRWCCDSQRRRVAAHVRSFIEGDERELRDRKTETR

Query:  EREKEHPSEETEGGDTERWRRRTTICDGGGGGNQNDGRRR--NVYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAK
          E+  P         +   R   +     G   + GR    N+       H  HL  V++VLQE +L  N+KKCQFG K+IEYLGHI+SG GV+ N  K
Subjt:  EREKEHPSEETEGGDTERWRRRTTICDGGGGGNQNDGRRR--NVYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAK

Query:  VEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY-------------------------------------------------VEADASGVGVGVILMQDS
        + AM  W  P+N+K L  FLGLTGYY KFV+ Y                                                 +++DASG+G+G +LMQ  
Subjt:  VEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY-------------------------------------------------VEADASGVGVGVILMQDS

Query:  KSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW
        K +AFFS ALT  ++ K+VYERELM IVLA+Q+W
Subjt:  KSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW

KFK43655.1 hypothetical protein AALP_AA1G155400 [Arabis alpina] | 5.9e-37 | 47.12
Query:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY----------
        VYS  LE H +HL  V+ +LQ NKL  N KKCQFG  +IEYLGHI+SG+GVSA+  K++AM +WP PRN+K L  FLGLTGYY KFV  Y          
Subjt:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY----------

Query:  ---------------------------------------VEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW
                                               VE+DASG G+G +LMQ  K +AFFS ALT  +R K+VYERELM IV A+QKW
Subjt:  ---------------------------------------VEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW

TrEMBL top hits | e value | % identity
A0A087H2T8 Uncharacterized protein | 2.2e-42 | 33.41
Query:  LEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIASAL--------------------------------------------DFLPLNL
        + ++ EL+LNS+VG+ SP T+K+ G +   +VVV+IDS  +HNFI++ L                                            DFLPL L
Subjt:  LEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIASAL--------------------------------------------DFLPLNL

Query:  GSADVILGVQWLATLGDVTTNHLNLEMFFCLGNLGWKFRRRRRTVQNGSSNAGAEDNGSGRATDRRRWCCDSQRRRVAAHVRSFIEGDERELRDRKTETR
        GSADVILG+QWLA+LG++  N              W  +  R TV N      AE    G  T     CC      + A +++ +E  E  +       +
Subjt:  GSADVILGVQWLATLGDVTTNHLNLEMFFCLGNLGWKFRRRRRTVQNGSSNAGAEDNGSGRATDRRRWCCDSQRRRVAAHVRSFIEGDERELRDRKTETR

Query:  EREKEHPSEETEGGDTERWRRRTTICDGGGGGNQNDGRRR--NVYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAK
          E+  P         +   R   +     G   + GR    N+       H  HL  V++VLQE +L  N+KKCQFG K+IEYLGHI+SG GV+ N  K
Subjt:  EREKEHPSEETEGGDTERWRRRTTICDGGGGGNQNDGRRR--NVYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAK

Query:  VEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY-------------------------------------------------VEADASGVGVGVILMQDS
        + AM  W  P+N+K L  FLGLTGYY KFV+ Y                                                 +++DASG+G+G +LMQ  
Subjt:  VEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY-------------------------------------------------VEADASGVGVGVILMQDS

Query:  KSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW
        K +AFFS ALT  ++ K+VYERELM IVLA+Q+W
Subjt:  KSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW

A0A087H2U0 Uncharacterized protein | 2.2e-42 | 33.41
Query:  LEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIASAL--------------------------------------------DFLPLNL
        + ++ EL+LNS+VG+ SP T+K+ G +   +VVV+IDS  +HNFI++ L                                            DFLPL L
Subjt:  LEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIASAL--------------------------------------------DFLPLNL

Query:  GSADVILGVQWLATLGDVTTNHLNLEMFFCLGNLGWKFRRRRRTVQNGSSNAGAEDNGSGRATDRRRWCCDSQRRRVAAHVRSFIEGDERELRDRKTETR
        GSADVILG+QWLA+LG++  N              W  +  R TV N      AE    G  T     CC      + A +++ +E  E  +       +
Subjt:  GSADVILGVQWLATLGDVTTNHLNLEMFFCLGNLGWKFRRRRRTVQNGSSNAGAEDNGSGRATDRRRWCCDSQRRRVAAHVRSFIEGDERELRDRKTETR

Query:  EREKEHPSEETEGGDTERWRRRTTICDGGGGGNQNDGRRR--NVYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAK
          E+  P         +   R   +     G   + GR    N+       H  HL  V++VLQE +L  N+KKCQFG K+IEYLGHI+SG GV+ N  K
Subjt:  EREKEHPSEETEGGDTERWRRRTTICDGGGGGNQNDGRRR--NVYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAK

Query:  VEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY-------------------------------------------------VEADASGVGVGVILMQDS
        + AM  W  P+N+K L  FLGLTGYY KFV+ Y                                                 +++DASG+G+G +LMQ  
Subjt:  VEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY-------------------------------------------------VEADASGVGVGVILMQDS

Query:  KSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW
        K +AFFS ALT  ++ K+VYERELM IVLA+Q+W
Subjt:  KSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW

A0A087HNF3 Uncharacterized protein | 2.8e-37 | 47.12
Query:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY----------
        VYS  LE H +HL  V+ +LQ NKL  N KKCQFG  +IEYLGHI+SG+GVSA+  K++AM +WP PRN+K L  FLGLTGYY KFV  Y          
Subjt:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY----------

Query:  ---------------------------------------VEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW
                                               VE+DASG G+G +LMQ  K +AFFS ALT  +R K+VYERELM IV A+QKW
Subjt:  ---------------------------------------VEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW

D1GEG7 Disease resistance protein | 1.1e-36 | 43.98
Query:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY----------
        +YS T+E HA+HL  V++VLQE+KL+ N+KKC FGL+QIEYLGHI+S  GV+ +  K + M++WP+P+++K+L  FLGLTGYY  +V+ Y          
Subjt:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY----------

Query:  ---------------------------------------VEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW
                                               +E+DASG GVG +LMQD K +AFFSH LT   + K  YERELM +VLAVQKW
Subjt:  ---------------------------------------VEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQKW

D1GEG7 Disease resistance protein | 8.6e-02 | 41.18
Query:  LEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIA-SALDFLPLNL---GSADVILG
        ++++  L+LNS +G+GSPKT K+ G I   +V+V++DS  +HNFI  S +  L L +    S D++LG
Subjt:  LEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIA-SALDFLPLNL---GSADVILG

D1GEG7 Disease resistance protein | 2.0e-35 | 29.25
Query:  ETGVDALEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNF-------------IASALDFLPLNLGSADVILGVQWLATLGDVTTNHLNL
        E GV   E M  L+LNSLVG+ SP+T+K+K  +   EVVV+IDS                   I    DFLP+ LG ADVILG+QWL TLG++  N    
Subjt:  ETGVDALEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNF-------------IASALDFLPLNLGSADVILGVQWLATLGDVTTNHLNL

Query:  EMFFCLGNLGWKFRRRRRTVQN-----------------------------GSSNAGAEDNGSGRATDRR------------------------------
           + L    +K   ++ T+Q                              GS   G +         RR                              
Subjt:  EMFFCLGNLGWKFRRRRRTVQN-----------------------------GSSNAGAEDNGSGRATDRR------------------------------

Query:  --------RWCCDSQRRRVAAHVRSF----IEGDERELRDRKTET-------------REREKEHPSEETEGGDTERWRRRTTICDGGG--GGNQNDGRR
                R+C D +    A    SF    I+    EL   K  +             ++ +    +  T  G  E       + +         N+  +
Subjt:  --------RWCCDSQRRRVAAHVRSF----IEGDERELRDRKTET-------------REREKEHPSEETEGGDTERWRRRTTICDGGG--GGNQNDGRR

Query:  RN-------------VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCK
        ++             VYS T+ +H  HL  V+ VL+E++L  NQKKC FG + +EYLGH++S +GVSA+  K+  ME+WP+PRN+K L  F+GLTGYY K
Subjt:  RN-------------VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCK

Query:  FVQNY-------------------------------------------------VEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIV
        FVQ Y                                                 VE++ASGVG+G +LMQ  + +A+FS ALT  +  K++YERELM IV
Subjt:  FVQNY-------------------------------------------------VEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIV

Query:  LAVQKW
         A+QKW
Subjt:  LAVQKW

SwissProt top hits | e value | % identity
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | 1.5e-14 | 27.23
Query:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNYVE--------
        V+S +L+ H Q L  V   L +  L     KC+F  ++  +LGH+++  G+  N  K+EA++++PIP   KE+  FLGLTGYY KF+ N+ +        
Subjt:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNYVE--------

Query:  -------------------------------------------ADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQ
                                                    DAS V +G +L QD   +++ S  L       +  E+EL+ IV A +
Subjt:  -------------------------------------------ADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQ

P20825 Retrovirus-related Pol polyprotein from transposon 297 | 2.1e-13 | 27.23
Query:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNYVE--------
        ++S +L  H   +  V   L +  L     KC+F  K+  +LGHIV+  G+  N  KV+A+  +PIP   KE+  FLGLTGYY KF+ NY +        
Subjt:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNYVE--------

Query:  -------------------------------------------ADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQ
                                                    DAS + +G +L Q+   ++F S  L       +  E+EL+ IV A +
Subjt:  -------------------------------------------ADASGVGVGVILMQDSKSMAFFSHALTSLRRAKAVYERELMTIVLAVQ

P92523 Uncharacterized mitochondrial protein AtMg00860 | 3.7e-18 | 55.56
Query:  HLAKVMAVLQENKLVDNQKKCQFGLKQIEYLG--HIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY
        HL  V+ + ++++   N+KKC FG  QI YLG  HI+SG+GVSA+ AK+EAM  WP P+N  EL  FLGLTGYY +FV+NY
Subjt:  HLAKVMAVLQENKLVDNQKKCQFGLKQIEYLG--HIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus | 6.1e-13 | 40.22
Query:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNYVE
        V+S   ++H ++L  V+A L +  L  N +K  F   Q+E+LG+IV+  G+ A+  KV A+ + P P ++KEL  FLG+T YY KF+Q+Y +
Subjt:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNYVE

Q9UR07 Transposon Tf2-11 polyprotein | 2.2e-10 | 22.84
Query:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQN-----------
        ++S +   H +H+  V+  L+   L+ NQ KC+F   Q++++G+ +S KG +     ++ + QW  P+N KEL +FLG   Y  KF+             
Subjt:  VYSPTLESHAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQN-----------

Query:  ---------------------------------------YVEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKA-----VYERELMTIVLAVQKW
                                                +E DAS V VG +L Q      ++     S + +KA     V ++E++ I+ +++ W
Subjt:  ---------------------------------------YVEADASGVGVGVILMQDSKSMAFFSHALTSLRRAKA-----VYERELMTIVLAVQKW

Arabidopsis top hits | e value | % identity
ATMG00860.1 DNA/RNA polymerases superfamily protein | 2.6e-19 | 55.56
Query:  HLAKVMAVLQENKLVDNQKKCQFGLKQIEYLG--HIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY
        HL  V+ + ++++   N+KKC FG  QI YLG  HI+SG+GVSA+ AK+EAM  WP P+N  EL  FLGLTGYY +FV+NY
Subjt:  HLAKVMAVLQENKLVDNQKKCQFGLKQIEYLG--HIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNY


Sequences
CDS sequence
ATGCAACTTGTGGAGATGGTTGAAAAAGAAACTGGTGTGGATGCTCTTGAACAAATGGGAGAACTCGCTTTGAATTCTTTAGTAGGGTTGGGATCACCTAAAACGCTTAA
GGTCAAGGGATTAATTAGCGTGAAGGAAGTTGTGGTGTTGATTGATAGCGACACAACCCATAATTTCATAGCTTCTGCCTTAGATTTCTTACCATTGAATTTGGGTAGCG
CTGATGTGATCTTGGGAGTACAGTGGTTAGCCACTCTTGGAGATGTTACTACTAACCATTTGAATTTGGAAATGTTTTTTTGTCTTGGAAATCTAGGCTGGAAATTCAGA
CGGCGGCGAAGAACAGTTCAAAATGGCAGCAGCAACGCTGGGGCTGAAGACAATGGCAGCGGCAGAGCTACAGATCGGCGGCGGTGGTGCTGCGATTCACAACGACGGAG
GGTGGCAGCTCACGTGCGATCTTTTATAGAGGGAGATGAGCGAGAGCTGAGGGACCGAAAGACAGAAACGAGAGAGAGGGAAAAAGAACATCCATCGGAGGAAACAGAGG
GCGGCGACACCGAAAGGTGGCGACGACGCACGACGATCTGTGACGGAGGAGGAGGCGGCAATCAGAATGATGGACGGCGGCGCAACGTTTACAGTCCAACTCTCGAATCT
CATGCTCAACATCTTGCTAAAGTTATGGCAGTATTACAAGAAAATAAGTTGGTGGATAACCAAAAGAAATGTCAGTTTGGGCTTAAACAAATAGAGTATTTAGGGCACAT
TGTATCTGGAAAGGGAGTTTCTGCTAATCTGGCTAAAGTGGAAGCTATGGAACAATGGCCAATTCCAAGAAATTTAAAAGAATTGTGTGAATTTCTTGGGCTTACAGGCT
ATTACTGCAAATTTGTCCAAAATTACGTTGAAGCTGATGCTTCAGGAGTGGGAGTTGGAGTGATATTGATGCAGGATTCGAAATCAATGGCTTTCTTCAGTCATGCCCTT
ACATCTTTACGTCGTGCTAAGGCCGTTTATGAAAGAGAACTAATGACAATTGTCCTGGCTGTTCAAAAATGGTGA
mRNA sequence
ATGCAACTTGTGGAGATGGTTGAAAAAGAAACTGGTGTGGATGCTCTTGAACAAATGGGAGAACTCGCTTTGAATTCTTTAGTAGGGTTGGGATCACCTAAAACGCTTAA
GGTCAAGGGATTAATTAGCGTGAAGGAAGTTGTGGTGTTGATTGATAGCGACACAACCCATAATTTCATAGCTTCTGCCTTAGATTTCTTACCATTGAATTTGGGTAGCG
CTGATGTGATCTTGGGAGTACAGTGGTTAGCCACTCTTGGAGATGTTACTACTAACCATTTGAATTTGGAAATGTTTTTTTGTCTTGGAAATCTAGGCTGGAAATTCAGA
CGGCGGCGAAGAACAGTTCAAAATGGCAGCAGCAACGCTGGGGCTGAAGACAATGGCAGCGGCAGAGCTACAGATCGGCGGCGGTGGTGCTGCGATTCACAACGACGGAG
GGTGGCAGCTCACGTGCGATCTTTTATAGAGGGAGATGAGCGAGAGCTGAGGGACCGAAAGACAGAAACGAGAGAGAGGGAAAAAGAACATCCATCGGAGGAAACAGAGG
GCGGCGACACCGAAAGGTGGCGACGACGCACGACGATCTGTGACGGAGGAGGAGGCGGCAATCAGAATGATGGACGGCGGCGCAACGTTTACAGTCCAACTCTCGAATCT
CATGCTCAACATCTTGCTAAAGTTATGGCAGTATTACAAGAAAATAAGTTGGTGGATAACCAAAAGAAATGTCAGTTTGGGCTTAAACAAATAGAGTATTTAGGGCACAT
TGTATCTGGAAAGGGAGTTTCTGCTAATCTGGCTAAAGTGGAAGCTATGGAACAATGGCCAATTCCAAGAAATTTAAAAGAATTGTGTGAATTTCTTGGGCTTACAGGCT
ATTACTGCAAATTTGTCCAAAATTACGTTGAAGCTGATGCTTCAGGAGTGGGAGTTGGAGTGATATTGATGCAGGATTCGAAATCAATGGCTTTCTTCAGTCATGCCCTT
ACATCTTTACGTCGTGCTAAGGCCGTTTATGAAAGAGAACTAATGACAATTGTCCTGGCTGTTCAAAAATGGTGA
Protein sequence
MQLVEMVEKETGVDALEQMGELALNSLVGLGSPKTLKVKGLISVKEVVVLIDSDTTHNFIASALDFLPLNLGSADVILGVQWLATLGDVTTNHLNLEMFFCLGNLGWKFR
RRRRTVQNGSSNAGAEDNGSGRATDRRRWCCDSQRRRVAAHVRSFIEGDERELRDRKTETREREKEHPSEETEGGDTERWRRRTTICDGGGGGNQNDGRRRNVYSPTLES
HAQHLAKVMAVLQENKLVDNQKKCQFGLKQIEYLGHIVSGKGVSANLAKVEAMEQWPIPRNLKELCEFLGLTGYYCKFVQNYVEADASGVGVGVILMQDSKSMAFFSHAL
TSLRRAKAVYERELMTIVLAVQKW
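
The protein sequence above corresponds to the CDS sequence read codon by codon. As a minimal sketch of that relationship (assuming the standard nuclear genetic code applies, which the record does not state; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of CuGenDBv2), the first 33 and last 9 bases of the CDS can be translated like this:

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 33 bases and last 9 bases of the CDS listed in this record.
print(translate("ATGCAACTTGTGGAGATGGTTGAAAAAGAAACT"))  # start of the protein
print(translate("AAATGGTGA"))                          # end of the protein
```

Passing the full CDS (with line breaks removed) to `translate` should give a sequence that can be compared against the protein sequence listed above.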