; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC04G068010 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC04G068010
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE2
Genome locationCiama_Chr04:4316814..4318542
RNA-Seq ExpressionCaUC04G068010
SyntenyCaUC04G068010
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]9.5e-5336.32Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFV----QPEATAVLSPLPATPAAPSSVTTAQTIIN
        PS++S G      +++PPLNQ+LNQ+ ++KLD+ N+LLWK L  PILK Y+L GHLT E   P  FV        T       AT  A SS+T    I+N
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFV----QPEATAVLSPLPATPAAPSSVTTAQTIIN

Query:  PHYESWIANDQLLLGWLYNSMMPEVATQVMGRETAKGV---------VECIART--VQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQ
          +E W+  D LLLGWLYNSM P+VA Q+MG    + +         V+  A    ++Q+ QTTRKG+ KM +YL  MKT+ DNL QV SPV  RAL+SQ
Subjt:  PHYESWIANDQLLLGWLYNSMMPEVATQVMGRETAKGV---------VECIART--VQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQ

Query:  VLLGLDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNI-----TMGNVVMIEEEAEGEEITCYQLYN----KLFGQN-QH--GEKGNR
        VLLGLDE YN ++  IQGK D++W       L  + ++ I    + H N        GN+         +        N    K +G N QH  G++GN 
Subjt:  VLLGLDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNI-----TMGNVVMIEEEAEGEEITCYQLYN----KLFGQN-QH--GEKGNR

Query:  GQNFQGKRIPNQGHN-----------FPGP--------------TPNQA-FMATQNTNPFAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKER
              +     GH+           F  P              +PN A F++TQN  PFA TP +V+D  WY++SGA+NHV  + +++TN  EY G+  
Subjt:  GQNFQGKRIPNQGHN-----------FPGP--------------TPNQA-FMATQNTNPFAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKER

Query:  VIVGNGRALPITHIGSCHIPADDINTGQMVLKGELDDGLYIFEKTG
                                  G+ +L+G L DG Y  E+ G
Subjt:  VIVGNGRALPITHIGSCHIPADDINTGQMVLKGELDDGLYIFEKTG

KAA0057475.1 uncharacterized protein E6C27_scaffold280G003560 [Cucumis melo var. makuwa]7.8e-4737.89Show/hide
Query:  EATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECIART-----------VQQVFQTTRKGSLKMVD
        EA++V+S       A SS +    I+NP YE W+ +D LLLG +YNSM+P+VA Q+MG  TAK + E I              ++  FQTTR+G+ KM D
Subjt:  EATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECIART-----------VQQVFQTTRKGSLKMVD

Query:  YLRAMKTHADNLEQVESPVVLRALVSQVLLGLDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLY
        YLR MK +ADNL Q  SPV  R L+SQVLLGLDE YNP+ A IQGK D++W   ++  L  ++ ++I  + ++   I M   V+ EE             
Subjt:  YLRAMKTHADNLEQVESPVVLRALVSQVLLGLDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLY

Query:  NKLFGQNQHGEKGNRGQNFQGKRIPNQGHNFPGPTPNQAFMATQNTNPFAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPI
        N+ F  NQ+G++                       P+ AF+ TQ ++  A TP++V+D+  YV+SGA+NHV +DH++L N  +Y G E V+VGN   L I
Subjt:  NKLFGQNQHGEKGNRGQNFQGKRIPNQGHNFPGPTPNQAFMATQNTNPFAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPI

Query:  THIGSCHIP-------ADDI------------NTGQMVLKGELDDGLYIFE
        + +G   +         D I            +TG+++LKG L DGLY  E
Subjt:  THIGSCHIP-------ADDI------------NTGQMVLKGELDDGLYIFE

TXG67243.1 hypothetical protein EZV62_008518 [Acer yangbiense]1.8e-4333.65Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE
        P+V   G  +N +  S P    LNQ  +IKLD+ NF+LWK +   I+K +RL GHL   +  PP F+ P  T    P P TP     V+ + +  NP YE
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE

Query:  SWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLG
         W+ NDQLL+GWLY+SM   VA  VMG  TA G+ + +           A T++   QTTRKGS  M +YL  MKT AD+L     P     L +  L G
Subjt:  SWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLG

Query:  LDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIK-IPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQN----------FQ
        LD EY PIV  I+ +   TWQ+     L    +++ I +++     ++  +  +   +      T     NK   Q    + GNR  N          F+
Subjt:  LDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIK-IPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQN----------FQ

Query:  GKRIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGN
        G+   N             GH           N+ G  P     A  N N    F  TP++V D+ WY +SGA++HV ND  +L    +Y G E ++VGN
Subjt:  GKRIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGN

Query:  GRALPITHIGSCHIPA
        G+ L I+H+G   +P+
Subjt:  GRALPITHIGSCHIPA

TXG69253.1 hypothetical protein EZV62_004188 [Acer yangbiense]4.7e-4433.89Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE
        P+V   G  +N +  S P    LNQ  +IKLD+ NF+LWK +   I+K +RL GHL   +  PP F+ P  T    P P TP     V+ + +  NP YE
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE

Query:  SWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLG
         W+ NDQLL+GWLY+SM   VA  VMG  TA G+ + +           A T++   QTTRKGS  M +YL  MKT AD+L     P     L +  L G
Subjt:  SWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLG

Query:  LDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIK-IPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQN----------FQ
        LD EY PIV  I+ +   TWQ+     L    +++ I +++     ++  +  +   +      T     NK   Q    + GNR  N          F+
Subjt:  LDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIK-IPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQN----------FQ

Query:  GKRIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGN
        G+   N             GH           N+ G  P     A  N N    F  TP++V D+ WY +SGA+NHV ND  +L    +Y G E ++VGN
Subjt:  GKRIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGN

Query:  GRALPITHIGSCHIPA
        G+ L I+H+G   +P+
Subjt:  GRALPITHIGSCHIPA

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]3.0e-6738.89Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE
        P V    V +   + SPPLNQLLNQITSIK+D+ NFLLW+NL  PIL+SY+LF +LT +K  PP  + P  T        T    S+ + +   +NP YE
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE

Query:  SWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECIART-----------VQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLG
        +WI  D+LLLGWLYNSM  +VA QVMG  T++ +   +              ++QVFQ T KGSL+M++YL+ MK+HADNL    S V +R LVSQVL G
Subjt:  SWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECIART-----------VQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLG

Query:  LDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQNFQ----GKR----
        LDEEYNPIV  +QGK++L+W +     L  + R++  +     + I       +    +G      Q  N   G N HG   +RG  +Q    G+R    
Subjt:  LDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQNFQ----GKR----

Query:  --IPNQGHNFPGPTPNQAFMATQNTNPFAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPAD----------
           P Q  NF          A  +T+    TP++VID  WY +SGA++HV  + N++    +Y G E VIV NG  L I+HIGS +I A           
Subjt:  --IPNQGHNFPGPTPNQAFMATQNTNPFAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPAD----------

Query:  ---------DINTGQMVLKGELDDGLYIFEKT
                 D  +G+ +LKG L D LY  +++
Subjt:  ---------DINTGQMVLKGELDDGLYIFEKT

TrEMBL top hitse value%identityAlignment
A0A5A7SIT7 Uncharacterized protein4.6e-5336.32Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFV----QPEATAVLSPLPATPAAPSSVTTAQTIIN
        PS++S G      +++PPLNQ+LNQ+ ++KLD+ N+LLWK L  PILK Y+L GHLT E   P  FV        T       AT  A SS+T    I+N
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFV----QPEATAVLSPLPATPAAPSSVTTAQTIIN

Query:  PHYESWIANDQLLLGWLYNSMMPEVATQVMGRETAKGV---------VECIART--VQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQ
          +E W+  D LLLGWLYNSM P+VA Q+MG    + +         V+  A    ++Q+ QTTRKG+ KM +YL  MKT+ DNL QV SPV  RAL+SQ
Subjt:  PHYESWIANDQLLLGWLYNSMMPEVATQVMGRETAKGV---------VECIART--VQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQ

Query:  VLLGLDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNI-----TMGNVVMIEEEAEGEEITCYQLYN----KLFGQN-QH--GEKGNR
        VLLGLDE YN ++  IQGK D++W       L  + ++ I    + H N        GN+         +        N    K +G N QH  G++GN 
Subjt:  VLLGLDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNI-----TMGNVVMIEEEAEGEEITCYQLYN----KLFGQN-QH--GEKGNR

Query:  GQNFQGKRIPNQGHN-----------FPGP--------------TPNQA-FMATQNTNPFAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKER
              +     GH+           F  P              +PN A F++TQN  PFA TP +V+D  WY++SGA+NHV  + +++TN  EY G+  
Subjt:  GQNFQGKRIPNQGHN-----------FPGP--------------TPNQA-FMATQNTNPFAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKER

Query:  VIVGNGRALPITHIGSCHIPADDINTGQMVLKGELDDGLYIFEKTG
                                  G+ +L+G L DG Y  E+ G
Subjt:  VIVGNGRALPITHIGSCHIPADDINTGQMVLKGELDDGLYIFEKTG

A0A5C7ID32 Uncharacterized protein8.7e-4433.65Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE
        P+V   G  +N +  S P    LNQ  +IKLD+ NF+LWK +   I+K +RL GHL   +  PP F+ P  T    P P TP     V+ + +  NP YE
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE

Query:  SWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLG
         W+ NDQLL+GWLY+SM   VA  VMG  TA G+ + +           A T++   QTTRKGS  M +YL  MKT AD+L     P     L +  L G
Subjt:  SWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLG

Query:  LDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIK-IPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQN----------FQ
        LD EY PIV  I+ +   TWQ+     L    +++ I +++     ++  +  +   +      T     NK   Q    + GNR  N          F+
Subjt:  LDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIK-IPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQN----------FQ

Query:  GKRIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGN
        G+   N             GH           N+ G  P     A  N N    F  TP++V D+ WY +SGA++HV ND  +L    +Y G E ++VGN
Subjt:  GKRIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGN

Query:  GRALPITHIGSCHIPA
        G+ L I+H+G   +P+
Subjt:  GRALPITHIGSCHIPA

A0A5C7IJ06 Uncharacterized protein2.3e-4433.89Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE
        P+V   G  +N +  S P    LNQ  +IKLD+ NF+LWK +   I+K +RL GHL   +  PP F+ P  T    P P TP     V+ + +  NP YE
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE

Query:  SWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLG
         W+ NDQLL+GWLY+SM   VA  VMG  TA G+ + +           A T++   QTTRKGS  M +YL  MKT AD+L     P     L +  L G
Subjt:  SWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLG

Query:  LDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIK-IPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQN----------FQ
        LD EY PIV  I+ +   TWQ+     L    +++ I +++     ++  +  +   +      T     NK   Q    + GNR  N          F+
Subjt:  LDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIK-IPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQN----------FQ

Query:  GKRIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGN
        G+   N             GH           N+ G  P     A  N N    F  TP++V D+ WY +SGA+NHV ND  +L    +Y G E ++VGN
Subjt:  GKRIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGN

Query:  GRALPITHIGSCHIPA
        G+ L I+H+G   +P+
Subjt:  GRALPITHIGSCHIPA

A0A5D3E3L7 Uncharacterized protein3.8e-4737.89Show/hide
Query:  EATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECIART-----------VQQVFQTTRKGSLKMVD
        EA++V+S       A SS +    I+NP YE W+ +D LLLG +YNSM+P+VA Q+MG  TAK + E I              ++  FQTTR+G+ KM D
Subjt:  EATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECIART-----------VQQVFQTTRKGSLKMVD

Query:  YLRAMKTHADNLEQVESPVVLRALVSQVLLGLDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLY
        YLR MK +ADNL Q  SPV  R L+SQVLLGLDE YNP+ A IQGK D++W   ++  L  ++ ++I  + ++   I M   V+ EE             
Subjt:  YLRAMKTHADNLEQVESPVVLRALVSQVLLGLDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLY

Query:  NKLFGQNQHGEKGNRGQNFQGKRIPNQGHNFPGPTPNQAFMATQNTNPFAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPI
        N+ F  NQ+G++                       P+ AF+ TQ ++  A TP++V+D+  YV+SGA+NHV +DH++L N  +Y G E V+VGN   L I
Subjt:  NKLFGQNQHGEKGNRGQNFQGKRIPNQGHNFPGPTPNQAFMATQNTNPFAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPI

Query:  THIGSCHIP-------ADDI------------NTGQMVLKGELDDGLYIFE
        + +G   +         D I            +TG+++LKG L DGLY  E
Subjt:  THIGSCHIP-------ADDI------------NTGQMVLKGELDDGLYIFE

A0A6J1DCW4 uncharacterized protein LOC1110195981.5e-6738.89Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE
        P V    V +   + SPPLNQLLNQITSIK+D+ NFLLW+NL  PIL+SY+LF +LT +K  PP  + P  T        T    S+ + +   +NP YE
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE

Query:  SWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECIART-----------VQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLG
        +WI  D+LLLGWLYNSM  +VA QVMG  T++ +   +              ++QVFQ T KGSL+M++YL+ MK+HADNL    S V +R LVSQVL G
Subjt:  SWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECIART-----------VQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLG

Query:  LDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQNFQ----GKR----
        LDEEYNPIV  +QGK++L+W +     L  + R++  +     + I       +    +G      Q  N   G N HG   +RG  +Q    G+R    
Subjt:  LDEEYNPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQNFQ----GKR----

Query:  --IPNQGHNFPGPTPNQAFMATQNTNPFAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPAD----------
           P Q  NF          A  +T+    TP++VID  WY +SGA++HV  + N++    +Y G E VIV NG  L I+HIGS +I A           
Subjt:  --IPNQGHNFPGPTPNQAFMATQNTNPFAVTPKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPAD----------

Query:  ---------DINTGQMVLKGELDDGLYIFEKT
                 D  +G+ +LKG L D LY  +++
Subjt:  ---------DINTGQMVLKGELDDGLYIFEKT

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.9e-1723.03Show/hide
Query:  NSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLY
        N+  LN  ++ +T  KL  +N+L+W    H +   Y L G L      PP  +             T AAP         +NP Y  W   D+L+   + 
Subjt:  NSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLY

Query:  NSMMPEVATQVMGRETAKGVVECIAR-----------TVQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLGLDEEYNPIVATIQG
         ++   V   V    TA  + E + +            ++   +   KG+  + DY++ + T  D L  +  P+     V +VL  L EEY P++  I  
Subjt:  NSMMPEVATQVMGRETAKGVVECIAR-----------TVQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLGLDEEYNPIVATIQG

Query:  K----------MDLTWQQARTLSLPTKDRIKIPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYN-----KLFGQNQHGEKGNRGQN--FQGK--RIPN
        K            L   +++ L++ +   I I + A+ H N T  N         G     Y   N     K + Q+      N  Q+  + GK      
Subjt:  K----------MDLTWQQARTLSLPTKDRIKIPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYN-----KLFGQNQHGEKGNRGQN--FQGK--RIPN

Query:  QGHNFPGPTPNQAFMATQNTN--PFAVTP----------KSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHI------
        QGH+    +  Q F+++ N+   P   TP               + W ++SGA++H+ +D N+L+    Y G + V+V +G  +PI+H GS  +      
Subjt:  QGHNFPGPTPNQAFMATQNTN--PFAVTP----------KSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHI------

Query:  ---------------------------------PAD----DINTGQMVLKGELDDGLYIFEKTGATGSASNVGKTNLKSIGRIMACSTESILHQMLGHPS
                                         PA     D+NTG  +L+G+  D LY +    ++   S     + K        +T S  H  LGHP+
Subjt:  ---------------------------------PAD----DINTGQMVLKGELDDGLYIFEKTGATGSASNVGKTNLKSIGRIMACSTESILHQMLGHPS

Query:  SKVLNDVV
          +LN V+
Subjt:  SKVLNDVV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-1722.49Show/hide
Query:  NSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLY
        N+  LN  ++ +T  KL  +N+L+W    H +   Y L G L      PP  +             T A P         +NP Y  W   D+L+   + 
Subjt:  NSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLY

Query:  NSMMPEVATQVMGRETAKGVVECIARTVQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLGLDEEYNPIVATIQGK----------
         ++   V   V    TA  + E    T+++++     G +  + ++    T  D L  +  P+     V +VL  L ++Y P++  I  K          
Subjt:  NSMMPEVATQVMGRETAKGVVECIARTVQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLGLDEEYNPIVATIQGK----------

Query:  MDLTWQQARTLSLPTKDRIKIPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQNFQGKRIPN-------QGHNFPGPTPNQA
          L  ++++ L+L + + + I +  + H N          +   G+    Y   N      Q    G+R  N Q K           QGH+         
Subjt:  MDLTWQQARTLSLPTKDRIKIPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQNFQGKRIPN-------QGHNFPGPTPNQA

Query:  FMAT----QNTNPFAV-TPKSVI-------DSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPAD----------------
        F +T    Q+T+PF    P++ +        + W ++SGA++H+ +D N+L+    Y G + V++ +G  +PITH GS  +P                  
Subjt:  FMAT----QNTNPFAV-TPKSVI-------DSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPAD----------------

Query:  ---------------------------DINTGQMVLKGELDDGLY---IFEKTGATGSASNVGKTNLKSIGRIMACSTESILHQMLGHPSSKVLNDVV
                                   D+NTG  +L+G+  D LY   I      +  AS   K            +T S  H  LGHPS  +LN V+
Subjt:  ---------------------------DINTGQMVLKGELDDGLY---IFEKTGATGSASNVGKTNLKSIGRIMACSTESILHQMLGHPSSKVLNDVV

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.3e-0428.88Show/hide
Query:  GSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLGLDEEYNPIVATIQGKMDL-TWQQARTLSLPTKDRIKIPSMAI----DHLNITMGNVVMIEEE
        G +++ DY R MK  AD+L  V+ PV  R LV  VL GL+ +++ I+  I+ +    ++  A T+    +DR+K    AI     H++ +  + V+   E
Subjt:  GSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLGLDEEYNPIVATIQGKMDL-TWQQARTLSLPTKDRIKIPSMAI----DHLNITMGNVVMIEEE

Query:  AEGEEITCYQLYNKLFGQNQHGEKG-NRGQN-FQGKRIPNQGHNFPGPTPNQAFMATQNTNPFAVTPKSVIDSGW----YVNSGASN
        A    +T +Q      G NQ G +G  RG N F+G+      +N P          + N  PF      + +  W    YVN+   N
Subjt:  AEGEEITCYQLYNKLFGQNQHGEKG-NRGQN-FQGKRIPNQGHNFPGPTPNQAFMATQNTNPFAVTPKSVIDSGW----YVNSGASN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAACGTCTTTCCTTCTGTCAACTCAAATGGAGTATTTACTAACCCTACCTACAATAGCCCACCTCTAAATCAACTTTTAAACCAAATTACTTCCATCAAA
CTAGATAAGAGTAATTTTCTATTGTGGAAGAACTTAACCCATCCTATTTTGAAGAGTTATCGGCTGTTCGGACATCTCACTAGAGAGAAGGAATCTCCCCCTATG
TTTGTTCAACCGGAAGCTACTGCCGTTTTGTCGCCTCTGCCAGCGACACCAGCTGCTCCCTCCTCAGTAACCACGGCCCAAACCATTATTAATCCTCACTATGAA
TCGTGGATAGCAAATGATCAACTACTGCTGGGATGGCTATACAACTCCATGATGCCCGAAGTTGCAACTCAAGTCATGGGTCGTGAGACGGCCAAGGGGGTTGTG
GAATGCATTGCAAGAACTGTTCAGCAAGTTTTTCAAACTACCCGTAAGGGATCTTTAAAAATGGTTGATTATCTCAGAGCTATGAAAACACACGCTGATAATCTT
GAACAAGTTGAAAGTCCTGTTGTGCTTAGAGCCTTAGTCTCTCAAGTTCTCCTAGGCCTAGATGAAGAGTACAACCCGATAGTTGCTACAATCCAAGGAAAAATG
GATTTAACATGGCAACAAGCAAGAACCCTATCTCTACCAACCAAAGATAGAATCAAAATTCCTTCAATGGCAATAGACCACCTCAACATAACAATGGGCAATGTA
GTAATGATCGAGGAAGAGGCAGAGGGCGAGGAAATAACCTGTTACCAACTGTACAACAAATTGTTTGGGCAAAATCAACATGGTGAGAAAGGGAACCGAGGCCAA
AATTTTCAAGGGAAGCGGATTCCAAACCAAGGCCACAATTTTCCAGGACCGACACCTAACCAAGCATTCATGGCAACTCAGAATACCAACCCATTTGCTGTTACA
CCTAAATCTGTCATTGATTCAGGATGGTATGTCAACAGTGGGGCTTCAAACCATGTTGCCAATGATCACAACAGTCTAACCAATGCATATGAATATGGAGGTAAA
GAAAGAGTAATTGTTGGTAATGGCAGGGCTCTACCTATAACTCACATTGGTTCTTGCCATATTCCTGCGGATGATATCAACACGGGTCAAATGGTGCTGAAGGGG
GAGCTTGATGATGGGCTGTACATATTTGAGAAAACTGGAGCCACTGGTAGTGCTTCAAATGTGGGAAAAACCAACTTGAAGTCGATTGGAAGAATAATGGCGTGC
TCTACTGAATCTATTTTGCATCAAATGCTTGGACATCCTTCCTCAAAAGTCCTGAATGATGTTGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAACGTCTTTCCTTCTGTCAACTCAAATGGAGTATTTACTAACCCTACCTACAATAGCCCACCTCTAAATCAACTTTTAAACCAAATTACTTCCATCAAA
CTAGATAAGAGTAATTTTCTATTGTGGAAGAACTTAACCCATCCTATTTTGAAGAGTTATCGGCTGTTCGGACATCTCACTAGAGAGAAGGAATCTCCCCCTATG
TTTGTTCAACCGGAAGCTACTGCCGTTTTGTCGCCTCTGCCAGCGACACCAGCTGCTCCCTCCTCAGTAACCACGGCCCAAACCATTATTAATCCTCACTATGAA
TCGTGGATAGCAAATGATCAACTACTGCTGGGATGGCTATACAACTCCATGATGCCCGAAGTTGCAACTCAAGTCATGGGTCGTGAGACGGCCAAGGGGGTTGTG
GAATGCATTGCAAGAACTGTTCAGCAAGTTTTTCAAACTACCCGTAAGGGATCTTTAAAAATGGTTGATTATCTCAGAGCTATGAAAACACACGCTGATAATCTT
GAACAAGTTGAAAGTCCTGTTGTGCTTAGAGCCTTAGTCTCTCAAGTTCTCCTAGGCCTAGATGAAGAGTACAACCCGATAGTTGCTACAATCCAAGGAAAAATG
GATTTAACATGGCAACAAGCAAGAACCCTATCTCTACCAACCAAAGATAGAATCAAAATTCCTTCAATGGCAATAGACCACCTCAACATAACAATGGGCAATGTA
GTAATGATCGAGGAAGAGGCAGAGGGCGAGGAAATAACCTGTTACCAACTGTACAACAAATTGTTTGGGCAAAATCAACATGGTGAGAAAGGGAACCGAGGCCAA
AATTTTCAAGGGAAGCGGATTCCAAACCAAGGCCACAATTTTCCAGGACCGACACCTAACCAAGCATTCATGGCAACTCAGAATACCAACCCATTTGCTGTTACA
CCTAAATCTGTCATTGATTCAGGATGGTATGTCAACAGTGGGGCTTCAAACCATGTTGCCAATGATCACAACAGTCTAACCAATGCATATGAATATGGAGGTAAA
GAAAGAGTAATTGTTGGTAATGGCAGGGCTCTACCTATAACTCACATTGGTTCTTGCCATATTCCTGCGGATGATATCAACACGGGTCAAATGGTGCTGAAGGGG
GAGCTTGATGATGGGCTGTACATATTTGAGAAAACTGGAGCCACTGGTAGTGCTTCAAATGTGGGAAAAACCAACTTGAAGTCGATTGGAAGAATAATGGCGTGC
TCTACTGAATCTATTTTGCATCAAATGCTTGGACATCCTTCCTCAAAAGTCCTGAATGATGTTGTTTAA
Protein sequenceShow/hide protein sequence
MANVFPSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDKSNFLLWKNLTHPILKSYRLFGHLTREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE
SWIANDQLLLGWLYNSMMPEVATQVMGRETAKGVVECIARTVQQVFQTTRKGSLKMVDYLRAMKTHADNLEQVESPVVLRALVSQVLLGLDEEYNPIVATIQGKM
DLTWQQARTLSLPTKDRIKIPSMAIDHLNITMGNVVMIEEEAEGEEITCYQLYNKLFGQNQHGEKGNRGQNFQGKRIPNQGHNFPGPTPNQAFMATQNTNPFAVT
PKSVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPADDINTGQMVLKGELDDGLYIFEKTGATGSASNVGKTNLKSIGRIMAC
STESILHQMLGHPSSKVLNDVV