CuGenDBv2

PI0016309 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0016309
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Transposon Tf2-9 polyprotein
Genome location: chr04:25978210..25979497
RNA-Seq Expression: PI0016309
Synteny: PI0016309
Gene Ontology terms: NA
InterPro domains: NA
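
The genome location field uses a `chromosome:start..end` layout. As a minimal sketch for working with it programmatically, the snippet below splits the string into its parts and computes the span length. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location):
    """Length of the genomic span.

    Assumption: coordinates are 1-based and inclusive, so both end
    points count toward the length.
    """
    _, start, end = parse_location(location)
    return end - start + 1


chrom, start, end = parse_location("chr04:25978210..25979497")
print(chrom, start, end, span_length("chr04:25978210..25979497"))
```

If the coordinates turn out to be 0-based or half-open, only the `+ 1` in `span_length` would need to change.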


Homology
GenBank top hits (e value, % identity, alignment)
KAA0037097.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa] | e value: 4.9e-23 | % identity: 36.44
Query:  QEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLASA---WRIGRSQ-----------------
        ++  +  RFL +KQE+ + +Y NLFDKLVAPLS +   V+  TFMNGL PWI AEV      GLA+MM+ A       I R++                 
Subjt:  QEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLASA---WRIGRSQ-----------------

Query:  ---------------------------------------GRKQASDKIWKIVADARFHF--RKISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELV
                                               G+ Q  + I  I   A  +F   K+   L+LP+ ET HY VI GS + V+GK +CE VE+ 
Subjt:  ---------------------------------------GRKQASDKIWKIVADARFHF--RKISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELV

Query:  IGEWKVKDSFLPLELRGVDVILGMQ
        +  WKVK+ FLPLEL GVDV+LGMQ
Subjt:  IGEWKVKDSFLPLELRGVDVILGMQ

KAA0051119.1 uncharacterized protein E6C27_scaffold511G00660 [Cucumis melo var. makuwa] | e value: 4.5e-24 | % identity: 35.18
Query:  MQEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLA----------------SAWRIGRSQGRK
        ++EG++ GRFL IKQE  ++EY N FDKL+AP++ L   VL  TFMNGL+PW+ +EVE  E IGLAQMMKLA                SA+ I      +
Subjt:  MQEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLA----------------SAWRIGRSQGRK

Query:  QASDKIW----KIVADARFHFRKISI------------PLK--------------------------------------------------------LPM
        Q +  I     ++  +  +  R I++            P+K                                                        L +
Subjt:  QASDKIW----KIVADARFHFRKISI------------PLK--------------------------------------------------------LPM

Query:  VETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLPLELRGVDVILGMQ
         ET +Y VI GS + VKGKRVC  VE+ + EWKV DSFLPLEL G+D+ILGMQ
Subjt:  VETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLPLELRGVDVILGMQ

KAA0053087.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 2.7e-29 | % identity: 36.16
Query:  MQEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLASAWR---IGRS--------QGRKQ----
        M EGSIC RFLAIKQ+T ++EY NLFDKLVAPL  L K VL  T MNGL PWI A +ECWE +GL QMM LA   +   + RS        +G+ Q    
Subjt:  MQEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLASAWR---IGRS--------QGRKQ----

Query:  -----------------------------ASDKIW----KIVADARFHFR--------------------------------------------------
                                     A D  W    K ++DA F  +                                                  
Subjt:  -----------------------------ASDKIW----KIVADARFHFR--------------------------------------------------

Query:  --------KISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLPLELRGVDVILGMQ
                K+   L LPM ET HY V+ GS +T+KGK +C  +ELV+G+WK+ D+FLPLEL GVDV+LG+Q
Subjt:  --------KISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLPLELRGVDVILGMQ

KAA0059263.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa] | e value: 6.2e-26 | % identity: 38.5
Query:  QEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMM--------------------------------
        ++G ICG+FL IKQET ++EY NLFDKLVAPLS L + V+  TFMNGL PWI AEV      GLA+MM                                
Subjt:  QEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMM--------------------------------

Query:  --KLASAWRIGRS-------------QGRKQASDKIWKIVADARFHF--RKISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLP
          K  + +  G +             +G+ Q  D +  I   A  +F   K++  L +P  ET H  VI GS + ++GK VC+ +E+ + +W VK+ FLP
Subjt:  --KLASAWRIGRS-------------QGRKQASDKIWKIVADARFHF--RKISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLP

Query:  LELRGVDVILGMQ
        LE  GVDVIL MQ
Subjt:  LELRGVDVILGMQ

XP_038904367.1 uncharacterized protein LOC120090724 [Benincasa hispida] | e value: 1.8e-28 | % identity: 43.85
Query:  EGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLA-------SAW-RIGRSQ-------------
        +GSICGRFL IKQET ++ Y N FDKL+A L  L  EVL  TFMNGL+ WI AEVECW+  GLAQMM          +AW   GR++             
Subjt:  EGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLA-------SAW-RIGRSQ-------------

Query:  -GRKQASDKIWKIVADARFHFRKISIP--LKLPMVETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLPLELRGVDVILGMQ
         G  +    +  +  +A  +F   S    L+L + +T +Y V+ GS +T++GK VC+ V++ +G+W+V DSF  LEL   DVILGMQ
Subjt:  -GRKQASDKIWKIVADARFHFRKISIP--LKLPMVETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLPLELRGVDVILGMQ

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T6B1 Transposon Ty3-G Gag-Pol polyprotein | e value: 2.4e-23 | % identity: 36.44
Query:  QEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLASA---WRIGRSQ-----------------
        ++  +  RFL +KQE+ + +Y NLFDKLVAPLS +   V+  TFMNGL PWI AEV      GLA+MM+ A       I R++                 
Subjt:  QEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLASA---WRIGRSQ-----------------

Query:  ---------------------------------------GRKQASDKIWKIVADARFHF--RKISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELV
                                               G+ Q  + I  I   A  +F   K+   L+LP+ ET HY VI GS + V+GK +CE VE+ 
Subjt:  ---------------------------------------GRKQASDKIWKIVADARFHF--RKISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELV

Query:  IGEWKVKDSFLPLELRGVDVILGMQ
        +  WKVK+ FLPLEL GVDV+LGMQ
Subjt:  IGEWKVKDSFLPLELRGVDVILGMQ

A0A5A7UC53 Retrotrans_gag domain-containing protein | e value: 2.2e-24 | % identity: 35.18
Query:  MQEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLA----------------SAWRIGRSQGRK
        ++EG++ GRFL IKQE  ++EY N FDKL+AP++ L   VL  TFMNGL+PW+ +EVE  E IGLAQMMKLA                SA+ I      +
Subjt:  MQEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLA----------------SAWRIGRSQGRK

Query:  QASDKIW----KIVADARFHFRKISI------------PLK--------------------------------------------------------LPM
        Q +  I     ++  +  +  R I++            P+K                                                        L +
Subjt:  QASDKIW----KIVADARFHFRKISI------------PLK--------------------------------------------------------LPM

Query:  VETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLPLELRGVDVILGMQ
         ET +Y VI GS + VKGKRVC  VE+ + EWKV DSFLPLEL G+D+ILGMQ
Subjt:  VETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLPLELRGVDVILGMQ

A0A5D3BWP0 Transposon Ty3-G Gag-Pol polyprotein | e value: 3.0e-26 | % identity: 38.5
Query:  QEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMM--------------------------------
        ++G ICG+FL IKQET ++EY NLFDKLVAPLS L + V+  TFMNGL PWI AEV      GLA+MM                                
Subjt:  QEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMM--------------------------------

Query:  --KLASAWRIGRS-------------QGRKQASDKIWKIVADARFHF--RKISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLP
          K  + +  G +             +G+ Q  D +  I   A  +F   K++  L +P  ET H  VI GS + ++GK VC+ +E+ + +W VK+ FLP
Subjt:  --KLASAWRIGRS-------------QGRKQASDKIWKIVADARFHF--RKISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLP

Query:  LELRGVDVILGMQ
        LE  GVDVIL MQ
Subjt:  LELRGVDVILGMQ

A0A5D3C860 Transposon Tf2-1 polyprotein isoform X1 | e value: 2.4e-23 | % identity: 36.44
Query:  QEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLASA---WRIGRSQ-----------------
        ++  +  RFL +KQE+ + +Y NLFDKLVAPLS +   V+  TFMNGL PWI AEV      GLA+MM+ A       I R++                 
Subjt:  QEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLASA---WRIGRSQ-----------------

Query:  ---------------------------------------GRKQASDKIWKIVADARFHF--RKISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELV
                                               G+ Q  + I  I   A  +F   K+   L+LP+ ET HY VI GS + V+GK +CE VE+ 
Subjt:  ---------------------------------------GRKQASDKIWKIVADARFHF--RKISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELV

Query:  IGEWKVKDSFLPLELRGVDVILGMQ
        +  WKVK+ FLPLEL GVDV+LGMQ
Subjt:  IGEWKVKDSFLPLELRGVDVILGMQ

A0A5D3E235 Ty3-gypsy retrotransposon protein | e value: 1.3e-29 | % identity: 36.16
Query:  MQEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLASAWR---IGRS--------QGRKQ----
        M EGSIC RFLAIKQ+T ++EY NLFDKLVAPL  L K VL  T MNGL PWI A +ECWE +GL QMM LA   +   + RS        +G+ Q    
Subjt:  MQEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVL--TFMNGLSPWINAEVECWESIGLAQMMKLASAWR---IGRS--------QGRKQ----

Query:  -----------------------------ASDKIW----KIVADARFHFR--------------------------------------------------
                                     A D  W    K ++DA F  +                                                  
Subjt:  -----------------------------ASDKIW----KIVADARFHFR--------------------------------------------------

Query:  --------KISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLPLELRGVDVILGMQ
                K+   L LPM ET HY V+ GS +T+KGK +C  +ELV+G+WK+ D+FLPLEL GVDV+LG+Q
Subjt:  --------KISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLPLELRGVDVILGMQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G30770.1 Eukaryotic aspartyl protease family protein | e value: 7.1e-04 | % identity: 34.92
Query:  KISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLPLEL--RGVDVILG
        ++++ LKLP   T    V+ G R  ++    C G+ L++ E ++ ++FL L+L    VDVILG
Subjt:  KISIPLKLPMVETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLPLEL--RGVDVILG


Sequences
CDS sequence
ATGCAAGAGGGATCAATCTGTGGACGATTCCTAGCGATTAAACAGGAAACAATGTTGAAAGAATATCATAATCTATTCGATAAGCTTGTTGCACCGCTATCGCAATTGCT
AAAGGAAGTCCTAACGTTCATGAATGGGCTTAGTCCATGGATCAATGCTGAAGTGGAATGTTGGGAGTCGATTGGACTCGCTCAAATGATGAAGTTGGCCAGCGCGTGGA
GAATTGGGAGATCACAAGGAAGGAAGCAGGCCTCGGACAAAATATGGAAGATTGTGGCCGATGCACGATTTCATTTCAGAAAAATTAGTATCCCGCTGAAACTACCTATG
GTAGAGACCTTGCACTATGAAGTGATACGAGGCTCTAGATCAACAGTAAAAGGGAAGAGGGTATGCGAAGGGGTGGAATTGGTGATTGGAGAATGGAAGGTGAAAGATAG
TTTCCTGCCTTTGGAGTTAAGAGGGGTGGACGTAATTCTGGGAATGCAGTAG
mRNA sequence
ATGCAAGAGGGATCAATCTGTGGACGATTCCTAGCGATTAAACAGGAAACAATGTTGAAAGAATATCATAATCTATTCGATAAGCTTGTTGCACCGCTATCGCAATTGCT
AAAGGAAGTCCTAACGTTCATGAATGGGCTTAGTCCATGGATCAATGCTGAAGTGGAATGTTGGGAGTCGATTGGACTCGCTCAAATGATGAAGTTGGCCAGCGCGTGGA
GAATTGGGAGATCACAAGGAAGGAAGCAGGCCTCGGACAAAATATGGAAGATTGTGGCCGATGCACGATTTCATTTCAGAAAAATTAGTATCCCGCTGAAACTACCTATG
GTAGAGACCTTGCACTATGAAGTGATACGAGGCTCTAGATCAACAGTAAAAGGGAAGAGGGTATGCGAAGGGGTGGAATTGGTGATTGGAGAATGGAAGGTGAAAGATAG
TTTCCTGCCTTTGGAGTTAAGAGGGGTGGACGTAATTCTGGGAATGCAGTAG
Protein sequence
MQEGSICGRFLAIKQETMLKEYHNLFDKLVAPLSQLLKEVLTFMNGLSPWINAEVECWESIGLAQMMKLASAWRIGRSQGRKQASDKIWKIVADARFHFRKISIPLKLPM
VETLHYEVIRGSRSTVKGKRVCEGVELVIGEWKVKDSFLPLELRGVDVILGMQ
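
The protein sequence should correspond to a translation of the CDS. The sketch below shows one way to check that by hand, assuming the standard nuclear genetic code applies to this gene (the record does not say which translation table was used). The function name `translate` and the short test fragment are illustrative; only the first 30 nucleotides of the CDS are used here to keep the example compact.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed in this record.
cds_prefix = "ATGCAAGAGGGATCAATCTGTGGACGATTC"
print(translate(cds_prefix))
```

Pasting the full CDS in place of `cds_prefix` allows the result to be compared against the complete protein sequence above.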