; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0189741 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0189741
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr07:8483101..8484471
RNA-Seq ExpressionCmc07g0189741
SyntenyCmc07g0189741
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0006749 - glutathione metabolic process (biological process)
GO:0042221 - response to chemical (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004364 - glutathione transferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR025724 - GAG-pre-integrase domain
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045111.1 putative glutathione S-transferase isoform X1 [Cucumis melo var. makuwa]1.4e-5583.8Show/hide
Query:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM
        MQF+ KLHN+KKG +SLKEYF+KI+Q VDALASINKPIST+DHILYILAGLGNEYQSMISVI ARTD PSVQD+MSLLLTQESQIE+KITSEVSLPTVNM
Subjt:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM

Query:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGG
        TTH RDI SL KE  VTHRGG NNLSY  TNSQYHH+SRG G
Subjt:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGG

KAA0046195.1 putative Ty1-copia-like retrotransposon [Cucumis melo var. makuwa]2.0e-9189.18Show/hide
Query:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM
        MQF+ KLHN+KKGAMSLKEYF+KI+Q VDALASINKPIST+DHILYILAGLGNEYQS+IS+I ARTD PSVQD MSLLLTQESQIE+KITSEVSLPTVNM
Subjt:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM

Query:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYA
        TTHTRDI SLEKE EVTHRGGSNNL YTTTNSQYHHKSR GGRSNRGGRGNRHKTQCQIC+KFGH+ADRCYFRYTPRNPPSGYS N+SNAFPYA
Subjt:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYA

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]5.0e-12254.61Show/hide
Query:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM
        MQF+ KLHNIKKG+M LKEYF+KI Q VDALASINKP+S++DHILYILAGLG++YQSMISVI ARTD PSVQ++MSLLLTQESQ E+K+ SE +LP+VN+
Subjt:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM

Query:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNP
         T T      EK  E   R   NN  Y   +S      RG GRSNRG RGNR+K QCQIC K G+ ADRC+FRYTPR+  SGYS N+ N   Y N +++P
Subjt:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNP

Query:  QMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTANG-------------------------------------------------
        QM AMVA+ DLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGG+QIY ANG                                                 
Subjt:  QMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTANG-------------------------------------------------

Query:  ----------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRINKMNF
                         GQ+LLQG L DGLY+F ++ S +    S +N+ P +  T + K +    D+WHRRLGHPHL +++  L H+ +++  INK+NF
Subjt:  ----------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRINKMNF

Query:  CEACALGKHHALLFHNSNTQYIYPLQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR
        CEACALGKHHAL F +S T Y +PLQLI CDLWGPA + S N  RYYISFVDAYSR
Subjt:  CEACALGKHHALLFHNSNTQYIYPLQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR

KAA0059137.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.2e-12173.98Show/hide
Query:  LGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRG
        LGNEYQSMISVI ARTD  SVQDIMSLLLTQESQIE+KITS+VSLP VN+T HTRDIPSLEK+GEVTHRG SNNL+YTT NSQYHH+S GGGRS RGGRG
Subjt:  LGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRG

Query:  NRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTAN
        NR+KTQCQICTKFGHIAD CYFRYTPRN  SGYSAN S+ FPY NAS NPQM AMV  YDLN +SNWYPDSGA+NHLTHSLSNLS GSEYG GHQIY AN
Subjt:  NRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTAN

Query:  G-----------------------------------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLG
        G                                          GQILLQ HLCDGLYQFNLKSS QG MKST N NP  LTTTLSKYHVNTTDVWHRRLG
Subjt:  G-----------------------------------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLG

Query:  HPHLNVMRNALKHVHHANI
        HP+LNVMRNALKHVHHANI
Subjt:  HPHLNVMRNALKHVHHANI

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]5.0e-12254.61Show/hide
Query:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM
        MQF+ KLHNIKKG+M LKEYF+KI Q VDALASINKP+S++DHILYILAGLG++YQSMISVI ARTD PSVQ++MSLLLTQESQ E+K+ SE +LP+VN+
Subjt:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM

Query:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNP
         T T      EK  E   R   NN  Y   +S      RG GRSNRG RGNR+K QCQIC K G+ ADRC+FRYTPR+  SGYS N+ N   Y N +++P
Subjt:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNP

Query:  QMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTANG-------------------------------------------------
        QM AMVA+ DLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGG+QIY ANG                                                 
Subjt:  QMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTANG-------------------------------------------------

Query:  ----------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRINKMNF
                         GQ+LLQG L DGLY+F ++ S +    S +N+ P +  T + K +    D+WHRRLGHPHL +++  L H+ +++  INK+NF
Subjt:  ----------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRINKMNF

Query:  CEACALGKHHALLFHNSNTQYIYPLQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR
        CEACALGKHHAL F +S T Y +PLQLI CDLWGPA + S N  RYYISFVDAYSR
Subjt:  CEACALGKHHALLFHNSNTQYIYPLQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR

TrEMBL top hitse value%identityAlignment
A0A5A7TUB3 Putative glutathione S-transferase isoform X16.7e-5683.8Show/hide
Query:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM
        MQF+ KLHN+KKG +SLKEYF+KI+Q VDALASINKPIST+DHILYILAGLGNEYQSMISVI ARTD PSVQD+MSLLLTQESQIE+KITSEVSLPTVNM
Subjt:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM

Query:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGG
        TTH RDI SL KE  VTHRGG NNLSY  TNSQYHH+SRG G
Subjt:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGG

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-12254.61Show/hide
Query:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM
        MQF+ KLHNIKKG+M LKEYF+KI Q VDALASINKP+S++DHILYILAGLG++YQSMISVI ARTD PSVQ++MSLLLTQESQ E+K+ SE +LP+VN+
Subjt:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM

Query:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNP
         T T      EK  E   R   NN  Y   +S      RG GRSNRG RGNR+K QCQIC K G+ ADRC+FRYTPR+  SGYS N+ N   Y N +++P
Subjt:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNP

Query:  QMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTANG-------------------------------------------------
        QM AMVA+ DLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGG+QIY ANG                                                 
Subjt:  QMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTANG-------------------------------------------------

Query:  ----------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRINKMNF
                         GQ+LLQG L DGLY+F ++ S +    S +N+ P +  T + K +    D+WHRRLGHPHL +++  L H+ +++  INK+NF
Subjt:  ----------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRINKMNF

Query:  CEACALGKHHALLFHNSNTQYIYPLQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR
        CEACALGKHHAL F +S T Y +PLQLI CDLWGPA + S N  RYYISFVDAYSR
Subjt:  CEACALGKHHALLFHNSNTQYIYPLQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-12254.61Show/hide
Query:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM
        MQF+ KLHNIKKG+M LKEYF+KI Q VDALASINKP+S++DHILYILAGLG++YQSMISVI ARTD PSVQ++MSLLLTQESQ E+K+ SE +LP+VN+
Subjt:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM

Query:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNP
         T T      EK  E   R   NN  Y   +S      RG GRSNRG RGNR+K QCQIC K G+ ADRC+FRYTPR+  SGYS N+ N   Y N +++P
Subjt:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNP

Query:  QMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTANG-------------------------------------------------
        QM AMVA+ DLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGG+QIY ANG                                                 
Subjt:  QMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTANG-------------------------------------------------

Query:  ----------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRINKMNF
                         GQ+LLQG L DGLY+F ++ S +    S +N+ P +  T + K +    D+WHRRLGHPHL +++  L H+ +++  INK+NF
Subjt:  ----------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRINKMNF

Query:  CEACALGKHHALLFHNSNTQYIYPLQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR
        CEACALGKHHAL F +S T Y +PLQLI CDLWGPA + S N  RYYISFVDAYSR
Subjt:  CEACALGKHHALLFHNSNTQYIYPLQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR

A0A5D3CRZ7 Putative Ty1-copia-like retrotransposon9.9e-9289.18Show/hide
Query:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM
        MQF+ KLHN+KKGAMSLKEYF+KI+Q VDALASINKPIST+DHILYILAGLGNEYQS+IS+I ARTD PSVQD MSLLLTQESQIE+KITSEVSLPTVNM
Subjt:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM

Query:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYA
        TTHTRDI SLEKE EVTHRGGSNNL YTTTNSQYHHKSR GGRSNRGGRGNRHKTQCQIC+KFGH+ADRCYFRYTPRNPPSGYS N+SNAFPYA
Subjt:  TTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYA

A0A5D3DDT9 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-12173.98Show/hide
Query:  LGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRG
        LGNEYQSMISVI ARTD  SVQDIMSLLLTQESQIE+KITS+VSLP VN+T HTRDIPSLEK+GEVTHRG SNNL+YTT NSQYHH+S GGGRS RGGRG
Subjt:  LGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRG

Query:  NRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTAN
        NR+KTQCQICTKFGHIAD CYFRYTPRN  SGYSAN S+ FPY NAS NPQM AMV  YDLN +SNWYPDSGA+NHLTHSLSNLS GSEYG GHQIY AN
Subjt:  NRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTAN

Query:  G-----------------------------------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLG
        G                                          GQILLQ HLCDGLYQFNLKSS QG MKST N NP  LTTTLSKYHVNTTDVWHRRLG
Subjt:  G-----------------------------------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLG

Query:  HPHLNVMRNALKHVHHANI
        HP+LNVMRNALKHVHHANI
Subjt:  HPHLNVMRNALKHVHHANI

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.8e-1923.13Show/hide
Query:  QFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMT
        Q   +L    KG  ++ +Y   +    D LA + KP+  ++ +  +L  L  EY+ +I  I A+   P++ +I   LL  ES+I    ++ V   T N  
Subjt:  QFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMT

Query:  THTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKT---QCQICTKFGHIADRC------YFRYTPRNPPSGYS-----ANT
        +H     +        +  G+ N  Y   N+  + K      +N     N+ K    +CQIC   GH A RC            + PPS ++     AN 
Subjt:  THTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKT---QCQICTKFGHIADRC------YFRYTPRNPPSGYS-----ANT

Query:  SNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTANG------------------------------------
        +   PY++                   +NW  DSGAT+H+T   +NLS+   Y GG  +  A+G                                    
Subjt:  SNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGHQIYTANG------------------------------------

Query:  --------------------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHH
                                   G  LLQG   D LY++ + SSQ   + ++ +S               T   WH RLGHP  +++ + + +   
Subjt:  --------------------------LGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHH

Query:  ANIR-INKMNFCEACALGKHHALLFHNSNTQYIYPLQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR
        + +   +K   C  C + K + + F  S      PL+ I  D+W      S ++ RYY+ FVD ++R
Subjt:  ANIR-INKMNFCEACALGKHHALLFHNSNTQYIYPLQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.0e-2125.69Show/hide
Query:  DALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRGGSNNLSYT
        D LA + KP+  ++ +  +L  L ++Y+ +I  I A+   PS+ +I   L+ +ES++    ++EV   T N+ TH     +        +RG + N +  
Subjt:  DALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSLEKEGEVTHRGGSNNLSYT

Query:  TTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRC----YFRYTPRNPPSGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGAT
           S     S  G RS+   +   +  +CQIC+  GH A RC     F+ T     S     TS   P+   ++     A+ + Y+ N   NW  DSGAT
Subjt:  TTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRC----YFRYTPRNPPSGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPDSGAT

Query:  NHLTHSLSNLSIGSEYGGGHQIYTANG--------------------------------------------------------------LGQILLQGHLC
        +H+T   +NLS    Y GG  +  A+G                                                               G  LLQG   
Subjt:  NHLTHSLSNLSIGSEYGGGHQIYTANG--------------------------------------------------------------LGQILLQGHLC

Query:  DGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRIN---KMNFCEACALGKHHALLFHNSNTQYIYP
        D LY++ + SSQ   M ++  S               T   WH RLGHP L ++ + +   +H+   +N   K+  C  C + K H + F NS      P
Subjt:  DGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRIN---KMNFCEACALGKHHALLFHNSNTQYIYP

Query:  LQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR
        L+ I  D+W      S ++ RYY+ FVD ++R
Subjt:  LQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.8e-0624.68Show/hide
Query:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM
        ++ + +L     G M + +Y+ K+++  D+L +++ P++  + ++Y+L GL  ++ ++I+VI  R  FPS  D  ++L  +E +++  I      PT   
Subjt:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNM

Query:  TTHTRDIPSLEKEGEVTH--RGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNR
         + +  + +  +   VT+  R G N + Y         + RG G +   GRG R
Subjt:  TTHTRDIPSLEKEGEVTH--RGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNR

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)8.8e-0828.21Show/hide
Query:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVS---LPT
        +QFE +L       +S+ EY  K++   D L +++ PIS    ++++L GL  +Y  +++VI  ++ FPS  +  S+LL +ES++ NK  S +S    P+
Subjt:  MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVS---LPT

Query:  VNMTTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRH
        ++    T  +P  ++     +   ++N+       +   K+RGGG S+  GR N +
Subjt:  VNMTTHTRDIPSLEKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTTCGAAAAAAAGCTTCATAATATAAAAAAGGGAGCCATGTCTTTAAAGGAATATTTTATTAAAATTCGACAATATGTTGATGCTTTAGCCTCCATAAATAAACC
TATATCAACTGAGGATCACATATTGTACATCTTAGCTGGATTAGGAAATGAATATCAGTCTATGATATCAGTTATTTTTGCTCGTACTGATTTTCCTTCTGTTCAAGATA
TTATGTCACTCTTATTGACTCAAGAATCACAGATTGAAAATAAGATTACGAGTGAGGTCTCTTTACCTACTGTCAATATGACTACACATACTAGAGACATTCCATCATTG
GAAAAAGAGGGTGAGGTTACACACAGAGGAGGTTCGAATAATCTTAGTTATACAACCACCAATTCTCAATATCATCACAAAAGTCGTGGTGGGGGTCGATCTAATAGAGG
AGGAAGAGGAAATAGACACAAAACTCAGTGTCAAATTTGCACCAAATTTGGACATATTGCCGATAGATGTTACTTTCGTTATACTCCAAGGAATCCTCCTTCAGGTTATT
CTGCCAACACATCTAATGCTTTTCCATATGCTAATGCTTCTCATAATCCACAGATGTGTGCGATGGTCGCCTCCTATGATCTAAACATTGATAGTAATTGGTATCCTGAC
TCGGGAGCAACAAATCATTTGACACATAGCTTGAGTAATTTATCAATTGGATCTGAATATGGTGGAGGACATCAGATTTATACAGCAAATGGTTTAGGCCAAATACTTCT
ACAAGGACATTTATGTGATGGTCTATATCAATTCAATCTCAAATCCTCTCAACAAGGTTTCATGAAGTCTACTACTAATAGTAATCCACGTATTTTAACTACTACTTTAT
CTAAGTATCATGTGAACACTACTGATGTATGGCATAGGCGATTAGGCCATCCCCACCTGAATGTTATGCGAAATGCTTTGAAACATGTCCATCATGCCAATATCAGAATA
AATAAAATGAATTTTTGTGAAGCTTGTGCTTTAGGAAAACATCATGCTCTTCTCTTTCACAATTCAAATACTCAATATATCTATCCTTTGCAACTAATTGTTTGTGATCT
TTGGGGTCCTGCATTTGACACATCTAGGAATGATGTTCGATACTATATTAGTTTTGTTGATGCCTATAGTAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGTTCGAAAAAAAGCTTCATAATATAAAAAAGGGAGCCATGTCTTTAAAGGAATATTTTATTAAAATTCGACAATATGTTGATGCTTTAGCCTCCATAAATAAACC
TATATCAACTGAGGATCACATATTGTACATCTTAGCTGGATTAGGAAATGAATATCAGTCTATGATATCAGTTATTTTTGCTCGTACTGATTTTCCTTCTGTTCAAGATA
TTATGTCACTCTTATTGACTCAAGAATCACAGATTGAAAATAAGATTACGAGTGAGGTCTCTTTACCTACTGTCAATATGACTACACATACTAGAGACATTCCATCATTG
GAAAAAGAGGGTGAGGTTACACACAGAGGAGGTTCGAATAATCTTAGTTATACAACCACCAATTCTCAATATCATCACAAAAGTCGTGGTGGGGGTCGATCTAATAGAGG
AGGAAGAGGAAATAGACACAAAACTCAGTGTCAAATTTGCACCAAATTTGGACATATTGCCGATAGATGTTACTTTCGTTATACTCCAAGGAATCCTCCTTCAGGTTATT
CTGCCAACACATCTAATGCTTTTCCATATGCTAATGCTTCTCATAATCCACAGATGTGTGCGATGGTCGCCTCCTATGATCTAAACATTGATAGTAATTGGTATCCTGAC
TCGGGAGCAACAAATCATTTGACACATAGCTTGAGTAATTTATCAATTGGATCTGAATATGGTGGAGGACATCAGATTTATACAGCAAATGGTTTAGGCCAAATACTTCT
ACAAGGACATTTATGTGATGGTCTATATCAATTCAATCTCAAATCCTCTCAACAAGGTTTCATGAAGTCTACTACTAATAGTAATCCACGTATTTTAACTACTACTTTAT
CTAAGTATCATGTGAACACTACTGATGTATGGCATAGGCGATTAGGCCATCCCCACCTGAATGTTATGCGAAATGCTTTGAAACATGTCCATCATGCCAATATCAGAATA
AATAAAATGAATTTTTGTGAAGCTTGTGCTTTAGGAAAACATCATGCTCTTCTCTTTCACAATTCAAATACTCAATATATCTATCCTTTGCAACTAATTGTTTGTGATCT
TTGGGGTCCTGCATTTGACACATCTAGGAATGATGTTCGATACTATATTAGTTTTGTTGATGCCTATAGTAGATAA
Protein sequenceShow/hide protein sequence
MQFEKKLHNIKKGAMSLKEYFIKIRQYVDALASINKPISTEDHILYILAGLGNEYQSMISVIFARTDFPSVQDIMSLLLTQESQIENKITSEVSLPTVNMTTHTRDIPSL
EKEGEVTHRGGSNNLSYTTTNSQYHHKSRGGGRSNRGGRGNRHKTQCQICTKFGHIADRCYFRYTPRNPPSGYSANTSNAFPYANASHNPQMCAMVASYDLNIDSNWYPD
SGATNHLTHSLSNLSIGSEYGGGHQIYTANGLGQILLQGHLCDGLYQFNLKSSQQGFMKSTTNSNPRILTTTLSKYHVNTTDVWHRRLGHPHLNVMRNALKHVHHANIRI
NKMNFCEACALGKHHALLFHNSNTQYIYPLQLIVCDLWGPAFDTSRNDVRYYISFVDAYSR