; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G01145 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G01145
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationClcChr04:3202722..3204450
RNA-Seq ExpressionClc04G01145
SyntenyClc04G01145
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]1.5e-5035.81Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFV----QPEATAVLSPLPATPAAPSSVTTAQTIIN
        PS++S G      +++PPLNQ+LNQ+ ++KLDR N+LLWK L  PILK Y+L GHL  E   P  FV        T       AT  A SS+T    I+N
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFV----QPEATAVLSPLPATPAAPSSVTTAQTIIN

Query:  PHYESWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGV---------VECIART--VQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQ
          +E W+  D LLLGWLYNSM P+VA Q+MG    + +         V+  A    ++Q+ QTTRKG+ KM +YL  MKT+ DNL QV SPV  RAL+SQ
Subjt:  PHYESWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGV---------VECIART--VQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQ

Query:  VLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLN-----RTMGNAVMVEEEAEGEEITCYQLYN----KSFGQN-QH--GEQGNR
        VLLGLDE Y+ ++  IQGK D++W       L  + ++ I    + H N     +  GN          +        N    K +G N QH  G++GN 
Subjt:  VLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLN-----RTMGNAVMVEEEAEGEEITCYQLYN----KSFGQN-QH--GEQGNR

Query:  GQNFQGKQIPNQGHN-----------FPGP--------------TPNQA-FMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKER
              +     GH+           F  P              +PN A F++TQN  PF  TP++V+D  WY++SGA+NHV  + +++TN  EY G+  
Subjt:  GQNFQGKQIPNQGHN-----------FPGP--------------TPNQA-FMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKER

Query:  VIVGNGRALPITHIGSCHIPADDINTGQMVLKGELDDGLYIFEK
                                  G+ +L+G L DG Y  E+
Subjt:  VIVGNGRALPITHIGSCHIPADDINTGQMVLKGELDDGLYIFEK

KAA0057475.1 uncharacterized protein E6C27_scaffold280G003560 [Cucumis melo var. makuwa]6.0e-4738.18Show/hide
Query:  EATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECIART-----------VQQVFQTTRKGSLKMVD
        EA++V+S       A SS +    I+NP YE W+ +D LLLG +YNSM+P+VA Q+MG +TAK + E I              ++  FQTTR+G+ KM D
Subjt:  EATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECIART-----------VQQVFQTTRKGSLKMVD

Query:  YLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEAEGEEITCYQLY
        YLR MK +ADNL Q  SPV  R L+SQVLLGLDE Y+P+ A IQGK D++W   ++  L  ++ ++I  + ++     M  A +VEEE            
Subjt:  YLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEAEGEEITCYQLY

Query:  NKSFGQNQHGEQGNRGQNFQGKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPI
        N+ F  NQ+G+Q                       P+ AF+ TQ ++  + TPE+V+D+  YV+SGA+NHV +DH++L N  +Y G E V+VGN   L I
Subjt:  NKSFGQNQHGEQGNRGQNFQGKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPI

Query:  THIGSCHIP-------ADDI------------NTGQMVLKGELDDGLYIFE
        + +G   +         D I            +TG+++LKG L DGLY  E
Subjt:  THIGSCHIP-------ADDI------------NTGQMVLKGELDDGLYIFE

TXG67243.1 hypothetical protein EZV62_008518 [Acer yangbiense]1.9e-4534.21Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE
        P+V   G  +N +  S P    LNQ  +IKLDR NF+LWK +   I+K +RL GHL   +  PP F+ P  T    P P TP     V+ + +  NP YE
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE

Query:  SWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLG
         W+ NDQLL+GWLY+SM   VA  VMG  TA G+ + +           A T++   QTTRKGS  M +YL  MKT AD+L     P     L +  L G
Subjt:  SWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLG

Query:  LDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEA---EGEEITCYQLYNKSFGQNQHGEQGNRGQN----------
        LD EY PIV  I+ +   TWQ+         D +      ++H+N       ++   +      +       NK+  Q    + GNR  N          
Subjt:  LDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEA---EGEEITCYQLYNKSFGQNQHGEQGNRGQN----------

Query:  FQGKQIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIV
        F+G+   N             GH           N+ G  P     A  N N    FV TPE+V D+ WY +SGA++HV ND  +L    +Y G E ++V
Subjt:  FQGKQIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIV

Query:  GNGRALPITHIGSCHIPA
        GNG+ L I+H+G   +P+
Subjt:  GNGRALPITHIGSCHIPA

TXG69253.1 hypothetical protein EZV62_004188 [Acer yangbiense]5.1e-4634.45Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE
        P+V   G  +N +  S P    LNQ  +IKLDR NF+LWK +   I+K +RL GHL   +  PP F+ P  T    P P TP     V+ + +  NP YE
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE

Query:  SWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLG
         W+ NDQLL+GWLY+SM   VA  VMG  TA G+ + +           A T++   QTTRKGS  M +YL  MKT AD+L     P     L +  L G
Subjt:  SWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLG

Query:  LDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEA---EGEEITCYQLYNKSFGQNQHGEQGNRGQN----------
        LD EY PIV  I+ +   TWQ+         D +      ++H+N       ++   +      +       NK+  Q    + GNR  N          
Subjt:  LDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEA---EGEEITCYQLYNKSFGQNQHGEQGNRGQN----------

Query:  FQGKQIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIV
        F+G+   N             GH           N+ G  P     A  N N    FV TPE+V D+ WY +SGA+NHV ND  +L    +Y G E ++V
Subjt:  FQGKQIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIV

Query:  GNGRALPITHIGSCHIPA
        GNG+ L I+H+G   +P+
Subjt:  GNGRALPITHIGSCHIPA

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]8.0e-6836.75Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE
        P V    V +   + SPPLNQLLNQITSIK+DR NFLLW+NL  PIL+SY+LF +L  +K  PP  + P  T        T    S+ + +   +NP YE
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE

Query:  SWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECIART-----------VQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLG
        +WI  D+LLLGWLYNSM  +VA QVMG  T++ +   +              ++QVFQ T KGSL+M++YL+ MK+HADNL    S V++R LVSQVL G
Subjt:  SWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECIART-----------VQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLG

Query:  LDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAID--HLNRTMGNAVMVEEEAEGEEITCYQLYNKSFGQNQHGEQGNRGQNFQ---------
        LDEEY+PIV  +QGK++L+W +     L  + R++  +       +N+T   +V      +G      Q  N   G N HG   +RG  +Q         
Subjt:  LDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAID--HLNRTMGNAVMVEEEAEGEEITCYQLYNKSFGQNQHGEQGNRGQNFQ---------

Query:  --GKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPAD-------
          G Q P Q  NF          A  +T+  V TPE+VID  WY +SGA++HV  + N++    +Y G E VIV NG  L I+HIGS +I A        
Subjt:  --GKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPAD-------

Query:  ------------DINTGQMVLKGELDDGLYIFEKTKATGSAS-------------NVGKTNLKS---------IGRIMACSTESILHQMLGHPSSKVL
                    D  +G+ +LKG L D LY  +++  +  A+             ++    L S            I    + ++ H+ LGHPS +VL
Subjt:  ------------DINTGQMVLKGELDDGLYIFEKTKATGSAS-------------NVGKTNLKS---------IGRIMACSTESILHQMLGHPSSKVL

TrEMBL top hitse value%identityAlignment
A0A5A7SIT7 Uncharacterized protein7.3e-5135.81Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFV----QPEATAVLSPLPATPAAPSSVTTAQTIIN
        PS++S G      +++PPLNQ+LNQ+ ++KLDR N+LLWK L  PILK Y+L GHL  E   P  FV        T       AT  A SS+T    I+N
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFV----QPEATAVLSPLPATPAAPSSVTTAQTIIN

Query:  PHYESWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGV---------VECIART--VQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQ
          +E W+  D LLLGWLYNSM P+VA Q+MG    + +         V+  A    ++Q+ QTTRKG+ KM +YL  MKT+ DNL QV SPV  RAL+SQ
Subjt:  PHYESWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGV---------VECIART--VQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQ

Query:  VLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLN-----RTMGNAVMVEEEAEGEEITCYQLYN----KSFGQN-QH--GEQGNR
        VLLGLDE Y+ ++  IQGK D++W       L  + ++ I    + H N     +  GN          +        N    K +G N QH  G++GN 
Subjt:  VLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLN-----RTMGNAVMVEEEAEGEEITCYQLYN----KSFGQN-QH--GEQGNR

Query:  GQNFQGKQIPNQGHN-----------FPGP--------------TPNQA-FMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKER
              +     GH+           F  P              +PN A F++TQN  PF  TP++V+D  WY++SGA+NHV  + +++TN  EY G+  
Subjt:  GQNFQGKQIPNQGHN-----------FPGP--------------TPNQA-FMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKER

Query:  VIVGNGRALPITHIGSCHIPADDINTGQMVLKGELDDGLYIFEK
                                  G+ +L+G L DG Y  E+
Subjt:  VIVGNGRALPITHIGSCHIPADDINTGQMVLKGELDDGLYIFEK

A0A5C7ID32 Uncharacterized protein9.3e-4634.21Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE
        P+V   G  +N +  S P    LNQ  +IKLDR NF+LWK +   I+K +RL GHL   +  PP F+ P  T    P P TP     V+ + +  NP YE
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE

Query:  SWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLG
         W+ NDQLL+GWLY+SM   VA  VMG  TA G+ + +           A T++   QTTRKGS  M +YL  MKT AD+L     P     L +  L G
Subjt:  SWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLG

Query:  LDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEA---EGEEITCYQLYNKSFGQNQHGEQGNRGQN----------
        LD EY PIV  I+ +   TWQ+         D +      ++H+N       ++   +      +       NK+  Q    + GNR  N          
Subjt:  LDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEA---EGEEITCYQLYNKSFGQNQHGEQGNRGQN----------

Query:  FQGKQIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIV
        F+G+   N             GH           N+ G  P     A  N N    FV TPE+V D+ WY +SGA++HV ND  +L    +Y G E ++V
Subjt:  FQGKQIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIV

Query:  GNGRALPITHIGSCHIPA
        GNG+ L I+H+G   +P+
Subjt:  GNGRALPITHIGSCHIPA

A0A5C7IJ06 Uncharacterized protein2.4e-4634.45Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE
        P+V   G  +N +  S P    LNQ  +IKLDR NF+LWK +   I+K +RL GHL   +  PP F+ P  T    P P TP     V+ + +  NP YE
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE

Query:  SWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLG
         W+ NDQLL+GWLY+SM   VA  VMG  TA G+ + +           A T++   QTTRKGS  M +YL  MKT AD+L     P     L +  L G
Subjt:  SWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECI-----------ARTVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLG

Query:  LDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEA---EGEEITCYQLYNKSFGQNQHGEQGNRGQN----------
        LD EY PIV  I+ +   TWQ+         D +      ++H+N       ++   +      +       NK+  Q    + GNR  N          
Subjt:  LDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEA---EGEEITCYQLYNKSFGQNQHGEQGNRGQN----------

Query:  FQGKQIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIV
        F+G+   N             GH           N+ G  P     A  N N    FV TPE+V D+ WY +SGA+NHV ND  +L    +Y G E ++V
Subjt:  FQGKQIPNQ------------GH-----------NFPGPTPNQAFMATQNTNP---FVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIV

Query:  GNGRALPITHIGSCHIPA
        GNG+ L I+H+G   +P+
Subjt:  GNGRALPITHIGSCHIPA

A0A5D3E3L7 Uncharacterized protein2.9e-4738.18Show/hide
Query:  EATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECIART-----------VQQVFQTTRKGSLKMVD
        EA++V+S       A SS +    I+NP YE W+ +D LLLG +YNSM+P+VA Q+MG +TAK + E I              ++  FQTTR+G+ KM D
Subjt:  EATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECIART-----------VQQVFQTTRKGSLKMVD

Query:  YLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEAEGEEITCYQLY
        YLR MK +ADNL Q  SPV  R L+SQVLLGLDE Y+P+ A IQGK D++W   ++  L  ++ ++I  + ++     M  A +VEEE            
Subjt:  YLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEAEGEEITCYQLY

Query:  NKSFGQNQHGEQGNRGQNFQGKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPI
        N+ F  NQ+G+Q                       P+ AF+ TQ ++  + TPE+V+D+  YV+SGA+NHV +DH++L N  +Y G E V+VGN   L I
Subjt:  NKSFGQNQHGEQGNRGQNFQGKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPI

Query:  THIGSCHIP-------ADDI------------NTGQMVLKGELDDGLYIFE
        + +G   +         D I            +TG+++LKG L DGLY  E
Subjt:  THIGSCHIP-------ADDI------------NTGQMVLKGELDDGLYIFE

A0A6J1DCW4 uncharacterized protein LOC1110195983.9e-6836.75Show/hide
Query:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE
        P V    V +   + SPPLNQLLNQITSIK+DR NFLLW+NL  PIL+SY+LF +L  +K  PP  + P  T        T    S+ + +   +NP YE
Subjt:  PSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYE

Query:  SWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECIART-----------VQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLG
        +WI  D+LLLGWLYNSM  +VA QVMG  T++ +   +              ++QVFQ T KGSL+M++YL+ MK+HADNL    S V++R LVSQVL G
Subjt:  SWIANDQLLLGWLYNSMMPEVATQVMGRDTAKGVVECIART-----------VQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLG

Query:  LDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAID--HLNRTMGNAVMVEEEAEGEEITCYQLYNKSFGQNQHGEQGNRGQNFQ---------
        LDEEY+PIV  +QGK++L+W +     L  + R++  +       +N+T   +V      +G      Q  N   G N HG   +RG  +Q         
Subjt:  LDEEYSPIVATIQGKMDLTWQQARTLSLPTKDRIKIPSMAID--HLNRTMGNAVMVEEEAEGEEITCYQLYNKSFGQNQHGEQGNRGQNFQ---------

Query:  --GKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPAD-------
          G Q P Q  NF          A  +T+  V TPE+VID  WY +SGA++HV  + N++    +Y G E VIV NG  L I+HIGS +I A        
Subjt:  --GKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPAD-------

Query:  ------------DINTGQMVLKGELDDGLYIFEKTKATGSAS-------------NVGKTNLKS---------IGRIMACSTESILHQMLGHPSSKVL
                    D  +G+ +LKG L D LY  +++  +  A+             ++    L S            I    + ++ H+ LGHPS +VL
Subjt:  ------------DINTGQMVLKGELDDGLYIFEKTKATGSAS-------------NVGKTNLKS---------IGRIMACSTESILHQMLGHPSSKVL

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.1e-1723.68Show/hide
Query:  NSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLY
        N+  LN  ++ +T  KL  +N+L+W    H +   Y L G L      PP  +             T AAP         +NP Y  W   D+L+   + 
Subjt:  NSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLY

Query:  NSMMPEVATQVMGRDTAKGVVECIAR-----------TVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQG
         ++   V   V    TA  + E + +            ++   +   KG+  + DY++ + T  D L  +  P+     V +VL  L EEY P++  I  
Subjt:  NSMMPEVATQVMGRDTAKGVVECIAR-----------TVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQG

Query:  K----------MDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEAEGEEITCYQLYN-----KSFGQNQHGEQGNRGQN--FQGK-QIPN-
        K            L   +++ L++ +   I I + A+ H N T  N         G     Y   N     K + Q+      N  Q+  + GK QI   
Subjt:  K----------MDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEAEGEEITCYQLYN-----KSFGQNQHGEQGNRGQN--FQGK-QIPN-

Query:  QGHNFPGPTPNQAFMATQNTN--PFVVTP----------ESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHI------
        QGH+    +  Q F+++ N+   P   TP               + W ++SGA++H+ +D N+L+    Y G + V+V +G  +PI+H GS  +      
Subjt:  QGHNFPGPTPNQAFMATQNTN--PFVVTP----------ESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHI------

Query:  ---------------------------------PAD----DINTGQMVLKGELDDGLY---IFEKTKATGSASNVGKTNLKSIGRIMACSTESILHQMLG
                                         PA     D+NTG  +L+G+  D LY   I      +  AS   K            +T S  H  LG
Subjt:  ---------------------------------PAD----DINTGQMVLKGELDDGLY---IFEKTKATGSASNVGKTNLKSIGRIMACSTESILHQMLG

Query:  HPSSKVLNDVV
        HP+  +LN V+
Subjt:  HPSSKVLNDVV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.1e-1722.94Show/hide
Query:  NSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLY
        N+  LN  ++ +T  KL  +N+L+W    H +   Y L G L      PP  +             T A P         +NP Y  W   D+L+   + 
Subjt:  NSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIANDQLLLGWLY

Query:  NSMMPEVATQVMGRDTAKGVVECIARTVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQGK----------
         ++   V   V    TA  + E    T+++++     G +  + ++    T  D L  +  P+     V +VL  L ++Y P++  I  K          
Subjt:  NSMMPEVATQVMGRDTAKGVVECIARTVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQGK----------

Query:  MDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEAEGEEITCYQLYNKSFGQNQHGEQGNRGQNFQGK------QIPN-QGHNFPGPTPNQA
          L  ++++ L+L + + + I +  + H N          +   G+    Y   N      Q    G+R  N Q K      QI + QGH+         
Subjt:  MDLTWQQARTLSLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEAEGEEITCYQLYNKSFGQNQHGEQGNRGQNFQGK------QIPN-QGHNFPGPTPNQA

Query:  FMAT----QNTNPF----------VVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPAD--------------
        F +T    Q+T+PF          V +P +   + W ++SGA++H+ +D N+L+    Y G + V++ +G  +PITH GS  +P                
Subjt:  FMAT----QNTNPF----------VVTPESVIDSGWYVNSGASNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPAD--------------

Query:  -----------------------------DINTGQMVLKGELDDGLYIFEKTKATGSASNVGKTNLKSIGRIMACSTESILHQMLGHPSSKVLNDVV
                                     D+NTG  +L+G+  D LY  E   A+  A ++  +           +T S  H  LGHPS  +LN V+
Subjt:  -----------------------------DINTGQMVLKGELDDGLYIFEKTKATGSASNVGKTNLKSIGRIMACSTESILHQMLGHPSSKVLNDVV

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.8e-0428.88Show/hide
Query:  GSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDL-TWQQARTLSLPTKDRIKIPSMAI----DHLNRTMGNAVMVEEE
        G +++ DY R MK  AD+L  V+ PV  R LV  VL GL+ ++  I+  I+ +    ++  A T+    +DR+K    AI     H++ +  + V+   E
Subjt:  GSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDL-TWQQARTLSLPTKDRIKIPSMAI----DHLNRTMGNAVMVEEE

Query:  AEGEEITCYQLYNKSFGQNQHGEQG-NRGQN-FQGKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGW----YVNSGASN
        A    +T +Q      G NQ G +G  RG N F+G+      +N P          + N  PF      + +  W    YVN+   N
Subjt:  AEGEEITCYQLYNKSFGQNQHGEQG-NRGQN-FQGKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGW----YVNSGASN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAACGTCTTTCCTTCTGTCAACTCAAATGGAGTATTTACTAACCCTACCTACAACAGCCCACCTCTAAATCAACTTTTAAACCAAATTACTTCCATCAAACTAGA
TAGGAGTAATTTTCTATTGTGGAAGAACTTAACCCATCCTATTTTGAAGAGTTATCGGCTGTTCGGACATCTCATTAGAGAGAAGGAATCTCCCCCTATGTTTGTTCAAC
CGGAAGCTACTGCCGTTTTGTCGCCTTTGCCAGCGACACCAGCTGCTCCCTCCTCAGTAACCACGGCCCAAACCATTATTAATCCTCACTATGAATCGTGGATAGCAAAT
GATCAACTACTGCTGGGATGGCTATACAACTCCATGATGCCCGAAGTTGCAACTCAAGTCATGGGTCGTGATACGGCCAAAGGGGTTGTGGAATGCATTGCAAGAACTGT
TCAGCAAGTTTTTCAAACTACCCGTAAGGGATCTTTAAAAATGGTTGATTATCTCAGAACTATGAAAACACATGCTGATAATCTTGAACAAGTCGAAAGTCCTGTTGCGC
TTAGAGCCTTAGTCTCTCAAGTTCTCCTAGGCCTAGATGAAGAGTACAGCCCGATAGTTGCTACAATCCAAGGAAAAATGGATTTAACATGGCAACAAGCAAGAACCCTA
TCTCTACCAACCAAAGATAGAATCAAAATTCCTTCAATGGCAATAGACCACCTCAACAGAACAATGGGCAATGCAGTAATGGTCGAGGAAGAGGCAGAGGGCGAGGAAAT
AACCTGTTACCAACTGTACAACAAATCGTTTGGGCAAAATCAACATGGTGAGCAAGGGAACCGAGGCCAAAATTTTCAAGGGAAGCAGATTCCAAACCAAGGCCATAATT
TTCCAGGACCGACACCTAACCAAGCATTCATGGCAACTCAGAATACCAACCCATTTGTTGTTACACCTGAATCTGTCATTGATTCAGGATGGTATGTCAACAGTGGGGCT
TCAAACCATGTTGCCAATGATCACAACAGTCTAACCAATGCATATGAATATGGAGGTAAAGAAAGAGTAATTGTTGGTAATGGTAGGGCTCTACCTATAACTCACATTGG
TTCTTGCCATATTCCTGCGGATGATATCAACACGGGTCAAATGGTGCTGAAGGGGGAGCTTGATGATGGGCTGTACATATTTGAGAAAACTAAAGCCACTGGTAGTGCTT
CAAATGTGGGAAAAACCAACCTGAAGTCAATTGGAAGAATAATGGCGTGCTCTACTGAATCTATTTTGCATCAAATGCTTGGACATCCTTCCTCAAAAGTCCTGAATGAT
GTTGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAACGTCTTTCCTTCTGTCAACTCAAATGGAGTATTTACTAACCCTACCTACAACAGCCCACCTCTAAATCAACTTTTAAACCAAATTACTTCCATCAAACTAGA
TAGGAGTAATTTTCTATTGTGGAAGAACTTAACCCATCCTATTTTGAAGAGTTATCGGCTGTTCGGACATCTCATTAGAGAGAAGGAATCTCCCCCTATGTTTGTTCAAC
CGGAAGCTACTGCCGTTTTGTCGCCTTTGCCAGCGACACCAGCTGCTCCCTCCTCAGTAACCACGGCCCAAACCATTATTAATCCTCACTATGAATCGTGGATAGCAAAT
GATCAACTACTGCTGGGATGGCTATACAACTCCATGATGCCCGAAGTTGCAACTCAAGTCATGGGTCGTGATACGGCCAAAGGGGTTGTGGAATGCATTGCAAGAACTGT
TCAGCAAGTTTTTCAAACTACCCGTAAGGGATCTTTAAAAATGGTTGATTATCTCAGAACTATGAAAACACATGCTGATAATCTTGAACAAGTCGAAAGTCCTGTTGCGC
TTAGAGCCTTAGTCTCTCAAGTTCTCCTAGGCCTAGATGAAGAGTACAGCCCGATAGTTGCTACAATCCAAGGAAAAATGGATTTAACATGGCAACAAGCAAGAACCCTA
TCTCTACCAACCAAAGATAGAATCAAAATTCCTTCAATGGCAATAGACCACCTCAACAGAACAATGGGCAATGCAGTAATGGTCGAGGAAGAGGCAGAGGGCGAGGAAAT
AACCTGTTACCAACTGTACAACAAATCGTTTGGGCAAAATCAACATGGTGAGCAAGGGAACCGAGGCCAAAATTTTCAAGGGAAGCAGATTCCAAACCAAGGCCATAATT
TTCCAGGACCGACACCTAACCAAGCATTCATGGCAACTCAGAATACCAACCCATTTGTTGTTACACCTGAATCTGTCATTGATTCAGGATGGTATGTCAACAGTGGGGCT
TCAAACCATGTTGCCAATGATCACAACAGTCTAACCAATGCATATGAATATGGAGGTAAAGAAAGAGTAATTGTTGGTAATGGTAGGGCTCTACCTATAACTCACATTGG
TTCTTGCCATATTCCTGCGGATGATATCAACACGGGTCAAATGGTGCTGAAGGGGGAGCTTGATGATGGGCTGTACATATTTGAGAAAACTAAAGCCACTGGTAGTGCTT
CAAATGTGGGAAAAACCAACCTGAAGTCAATTGGAAGAATAATGGCGTGCTCTACTGAATCTATTTTGCATCAAATGCTTGGACATCCTTCCTCAAAAGTCCTGAATGAT
GTTGTTTAA
Protein sequenceShow/hide protein sequence
MANVFPSVNSNGVFTNPTYNSPPLNQLLNQITSIKLDRSNFLLWKNLTHPILKSYRLFGHLIREKESPPMFVQPEATAVLSPLPATPAAPSSVTTAQTIINPHYESWIAN
DQLLLGWLYNSMMPEVATQVMGRDTAKGVVECIARTVQQVFQTTRKGSLKMVDYLRTMKTHADNLEQVESPVALRALVSQVLLGLDEEYSPIVATIQGKMDLTWQQARTL
SLPTKDRIKIPSMAIDHLNRTMGNAVMVEEEAEGEEITCYQLYNKSFGQNQHGEQGNRGQNFQGKQIPNQGHNFPGPTPNQAFMATQNTNPFVVTPESVIDSGWYVNSGA
SNHVANDHNSLTNAYEYGGKERVIVGNGRALPITHIGSCHIPADDINTGQMVLKGELDDGLYIFEKTKATGSASNVGKTNLKSIGRIMACSTESILHQMLGHPSSKVLND
VV