; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0022039 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0022039
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr07:14943601..14944481
RNA-Seq ExpressionPI0022039
SyntenyPI0022039
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]5.0e-2838.54Show/hide
Query:  SSSAPLKVLNLR-------SKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIISWLLGAMS
        S+S+PL V N         +K + VKL + NFL WK QI + L  Y LE F + E+ PP K+L S   +++S  RT NPEY  W R + LI  WLLG+MS
Subjt:  SSSAPLKVLNLR-------SKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIISWLLGAMS

Query:  KD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAM-KGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS
        ++ L++M+ C S KE+W  L   FSS+ +A+ +  K KL  + KG   ++  FL+ Q+   +L ++++ +  +DH+++IL  LG +  S +S
Subjt:  KD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAM-KGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]1.7e-2838.19Show/hide
Query:  MGDSPSCVQSSSTISSSAPLKVLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIIS
        M  + S +   +T +SS   ++    +K + VKL + NFL WK QI + L  Y LE F++ E  PP K+L+S   +++S  RT NP Y  W RQD LI S
Subjt:  MGDSPSCVQSSSTISSSAPLKVLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIIS

Query:  WLLGAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAMKGK-YEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS
        WLLG+MS++ L++ML C S KE+W  L   FSS+ +A+ +  K KL  +K +   ++  FL+ Q    +L ++++ +  +DH+++ILA LG +  S +S
Subjt:  WLLGAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAMKGK-YEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS

TYK18917.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]1.7e-2838.19Show/hide
Query:  MGDSPSCVQSSSTISSSAPLKVLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIIS
        M  + S +   +T +SS   ++    +K + VKL + NFL WK QI + L  Y LE F++ E  PP K+L+S   +++S  RT NP Y  W RQD LI S
Subjt:  MGDSPSCVQSSSTISSSAPLKVLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIIS

Query:  WLLGAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAMKGK-YEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS
        WLLG+MS++ L++ML C S KE+W  L   FSS+ +A+ +  K KL  +K +   ++  FL+ Q    +L ++++ +  +DH+++ILA LG +  S +S
Subjt:  WLLGAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAMKGK-YEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]2.0e-3237.13Show/hide
Query:  SPSCVQSSSTISSSAPLKVLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFL--VSETASSFMRTMNPEYDQWVRQDSLIISWLL
        S S +++S         K +N  SK + V+L++ N L WK QI + L+G GLE +ID   + P +F+    + +SS     NP Y +W++QD LI +WLL
Subjt:  SPSCVQSSSTISSSAPLKVLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFL--VSETASSFMRTMNPEYDQWVRQDSLIISWLL

Query:  GAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAM-KGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS----
        G+M++D LS+MLDC S +E+W++L   F+S+ +ARV+ +K KLE   KG   ++  FL+ +    SL    + +  EDH+MHILA LG E D+ +S    
Subjt:  GAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAM-KGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS----

Query:  -----------------ENRIERHSFVNPDGSTPTVNLTTQEQVKQAPPSSSNNNNDIQQNKSQ-GNKQNKR
                         E R ER + +N DGS P+VNLT  +  K+     S   N  Q N SQ G   N R
Subjt:  -----------------ENRIERHSFVNPDGSTPTVNLTTQEQVKQAPPSSSNNNNDIQQNKSQ-GNKQNKR

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia]1.8e-3045.22Show/hide
Query:  KLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIISWLLGAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDV
        K Q+ + ++G+GLE++ID ++ PP +F+ +     SS  +  NPEY  W++QD LI  WLLG+MS++ LS+MLDC   KE+W++L   F+S+N+ARV+ +
Subjt:  KLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIISWLLGAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDV

Query:  KEKLEAM-KGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS
        K KLE M KG   ++  FL+ +    SL    + +P +DH+MHILARLG E DS VS
Subjt:  KEKLEAM-KGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS

TrEMBL top hitse value%identityAlignment
A0A5A7UB21 Keratin, type II cytoskeletal 1-like2.4e-2838.54Show/hide
Query:  SSSAPLKVLNLR-------SKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIISWLLGAMS
        S+S+PL V N         +K + VKL + NFL WK QI + L  Y LE F + E+ PP K+L S   +++S  RT NPEY  W R + LI  WLLG+MS
Subjt:  SSSAPLKVLNLR-------SKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIISWLLGAMS

Query:  KD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAM-KGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS
        ++ L++M+ C S KE+W  L   FSS+ +A+ +  K KL  + KG   ++  FL+ Q+   +L ++++ +  +DH+++IL  LG +  S +S
Subjt:  KD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAM-KGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS

A0A5A7VGJ8 Keratin, type II cytoskeletal 1-like8.3e-2938.19Show/hide
Query:  MGDSPSCVQSSSTISSSAPLKVLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIIS
        M  + S +   +T +SS   ++    +K + VKL + NFL WK QI + L  Y LE F++ E  PP K+L+S   +++S  RT NP Y  W RQD LI S
Subjt:  MGDSPSCVQSSSTISSSAPLKVLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIIS

Query:  WLLGAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAMKGK-YEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS
        WLLG+MS++ L++ML C S KE+W  L   FSS+ +A+ +  K KL  +K +   ++  FL+ Q    +L ++++ +  +DH+++ILA LG +  S +S
Subjt:  WLLGAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAMKGK-YEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS

A0A5D3D5T2 Keratin, type II cytoskeletal 1-like8.3e-2938.19Show/hide
Query:  MGDSPSCVQSSSTISSSAPLKVLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIIS
        M  + S +   +T +SS   ++    +K + VKL + NFL WK QI + L  Y LE F++ E  PP K+L+S   +++S  RT NP Y  W RQD LI S
Subjt:  MGDSPSCVQSSSTISSSAPLKVLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIIS

Query:  WLLGAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAMKGK-YEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS
        WLLG+MS++ L++ML C S KE+W  L   FSS+ +A+ +  K KL  +K +   ++  FL+ Q    +L ++++ +  +DH+++ILA LG +  S +S
Subjt:  WLLGAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAMKGK-YEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS

A0A6J1DLT9 uncharacterized protein LOC1110217579.5e-3337.13Show/hide
Query:  SPSCVQSSSTISSSAPLKVLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFL--VSETASSFMRTMNPEYDQWVRQDSLIISWLL
        S S +++S         K +N  SK + V+L++ N L WK QI + L+G GLE +ID   + P +F+    + +SS     NP Y +W++QD LI +WLL
Subjt:  SPSCVQSSSTISSSAPLKVLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFL--VSETASSFMRTMNPEYDQWVRQDSLIISWLL

Query:  GAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAM-KGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS----
        G+M++D LS+MLDC S +E+W++L   F+S+ +ARV+ +K KLE   KG   ++  FL+ +    SL    + +  EDH+MHILA LG E D+ +S    
Subjt:  GAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAM-KGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS----

Query:  -----------------ENRIERHSFVNPDGSTPTVNLTTQEQVKQAPPSSSNNNNDIQQNKSQ-GNKQNKR
                         E R ER + +N DGS P+VNLT  +  K+     S   N  Q N SQ G   N R
Subjt:  -----------------ENRIERHSFVNPDGSTPTVNLTTQEQVKQAPPSSSNNNNDIQQNKSQ-GNKQNKR

A0A6J1DSS1 uncharacterized protein LOC1110235868.9e-3145.22Show/hide
Query:  KLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIISWLLGAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDV
        K Q+ + ++G+GLE++ID ++ PP +F+ +     SS  +  NPEY  W++QD LI  WLLG+MS++ LS+MLDC   KE+W++L   F+S+N+ARV+ +
Subjt:  KLQINSTLRGYGLEKFIDPEVNPPPKFLVS--ETASSFMRTMNPEYDQWVRQDSLIISWLLGAMSKD-LSEMLDCSSTKEVWSILNARFSSKNMARVLDV

Query:  KEKLEAM-KGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS
        K KLE M KG   ++  FL+ +    SL    + +P +DH+MHILARLG E DS VS
Subjt:  KEKLEAM-KGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVS

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.3e-1024.38Show/hide
Query:  VLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVSETASSFMRTMNPEYDQWVRQDSLIISWLLGAMSKDLSEMLDCSST-KEV
        +LN+     T KL   N+L W  Q+++   GY L  F+D     PP  + ++ A      +NP+Y +W RQD LI S +LGA+S  +   +  ++T  ++
Subjt:  VLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVSETASSFMRTMNPEYDQWVRQDSLIISWLLGAMSKDLSEMLDCSST-KEV

Query:  WSILNARFSSKNMARVLDVKEKLEA-MKGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVSE--------------NRIERHSFV
        W  L   +++ +   V  ++ +L+   KG   I             L  + + + H++ V  +L  L  E    + +               R+  H   
Subjt:  WSILNARFSSKNMARVLDVKEKLEA-MKGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVSE--------------NRIERHSFV

Query:  NPDGSTPTVNLTTQEQVKQAPPSSSNNNNDIQQNKSQGNKQN
            S+ TV   T   V     +++NNNN+  +N    N+ N
Subjt:  NPDGSTPTVNLTTQEQVKQAPPSSSNNNNDIQQNKSQGNKQN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.3e-0622.5Show/hide
Query:  VLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVSETASSFMRTMNPEYDQWVRQDSLIISWLLGAMSKDLSEMLDCSST-KEV
        +LN+     T KL   N+L W  Q+++   GY L  F+D     PP  + ++     +  +NP+Y +W RQD LI S +LGA+S  +   +  ++T  ++
Subjt:  VLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVSETASSFMRTMNPEYDQWVRQDSLIISWLLGAMSKDLSEMLDCSST-KEV

Query:  WSILNARFSSKNMARVLDVKEKLEAMKGKYEIRRAFLEDQESSGSLKAVDRNIP--HEDHVMHILARLGLELDSTVSENRIERHSFVNPDGSTPTVNLTT
        W  L   +++ +   V  ++        +++      +  +    ++ V  N+P  ++  +  I A+      + + E  I R S +    S   V +T 
Subjt:  WSILNARFSSKNMARVLDVKEKLEAMKGKYEIRRAFLEDQESSGSLKAVDRNIP--HEDHVMHILARLGLELDSTVSENRIERHSFVNPDGSTPTVNLTT

Query:  QEQVKQAPPSSSNNNNDIQQNKSQGNKQNKRFLEQQWKAS
           V     +++ N N+   N++  N  N+      W+ S
Subjt:  QEQVKQAPPSSSNNNNDIQQNKSQGNKQNKRFLEQQWKAS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGACTCTCCCTCCTGTGTACAGTCGTCGTCAACCATATCCTCATCTGCCCCACTGAAAGTACTCAACCTTAGAAGCAAGACCACCACAGTAAAACTTGATGAAGG
AAACTTCCTGTCATGGAAATTACAAATAAACTCCACCCTTAGAGGATATGGTCTTGAAAAGTTTATTGATCCAGAAGTAAATCCACCACCAAAATTTCTTGTATCTGAAA
CAGCATCCTCCTTTATGAGAACAATGAATCCCGAATATGATCAATGGGTAAGACAAGACAGTCTGATTATTTCTTGGCTTCTTGGGGCTATGTCCAAAGACCTATCGGAG
ATGCTTGATTGTTCATCAACAAAAGAAGTTTGGAGTATACTCAATGCAAGATTTTCATCCAAGAATATGGCAAGAGTCCTCGATGTAAAGGAAAAATTAGAAGCAATGAA
AGGGAAATATGAAATTAGAAGAGCATTTCTAGAAGATCAAGAATCTAGTGGATCGTTGAAAGCGGTTGATAGAAATATTCCTCATGAAGATCATGTGATGCACATATTGG
CTAGGCTAGGCCTTGAATTGGACTCTACTGTCTCAGAAAATAGAATCGAAAGACACTCATTTGTAAATCCTGATGGTTCTACACCCACAGTGAATCTTACTACACAAGAA
CAAGTGAAGCAAGCTCCTCCCTCCTCCTCAAATAATAACAATGACATACAACAAAACAAGAGTCAGGGAAACAAGCAGAATAAACGATTTCTGGAACAACAGTGGAAAGC
CTCAATGCCAAGTATGCAGGAAGTTTGGGCATACCGCATACAACCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGACTCTCCCTCCTGTGTACAGTCGTCGTCAACCATATCCTCATCTGCCCCACTGAAAGTACTCAACCTTAGAAGCAAGACCACCACAGTAAAACTTGATGAAGG
AAACTTCCTGTCATGGAAATTACAAATAAACTCCACCCTTAGAGGATATGGTCTTGAAAAGTTTATTGATCCAGAAGTAAATCCACCACCAAAATTTCTTGTATCTGAAA
CAGCATCCTCCTTTATGAGAACAATGAATCCCGAATATGATCAATGGGTAAGACAAGACAGTCTGATTATTTCTTGGCTTCTTGGGGCTATGTCCAAAGACCTATCGGAG
ATGCTTGATTGTTCATCAACAAAAGAAGTTTGGAGTATACTCAATGCAAGATTTTCATCCAAGAATATGGCAAGAGTCCTCGATGTAAAGGAAAAATTAGAAGCAATGAA
AGGGAAATATGAAATTAGAAGAGCATTTCTAGAAGATCAAGAATCTAGTGGATCGTTGAAAGCGGTTGATAGAAATATTCCTCATGAAGATCATGTGATGCACATATTGG
CTAGGCTAGGCCTTGAATTGGACTCTACTGTCTCAGAAAATAGAATCGAAAGACACTCATTTGTAAATCCTGATGGTTCTACACCCACAGTGAATCTTACTACACAAGAA
CAAGTGAAGCAAGCTCCTCCCTCCTCCTCAAATAATAACAATGACATACAACAAAACAAGAGTCAGGGAAACAAGCAGAATAAACGATTTCTGGAACAACAGTGGAAAGC
CTCAATGCCAAGTATGCAGGAAGTTTGGGCATACCGCATACAACCTTAA
Protein sequenceShow/hide protein sequence
MGDSPSCVQSSSTISSSAPLKVLNLRSKTTTVKLDEGNFLSWKLQINSTLRGYGLEKFIDPEVNPPPKFLVSETASSFMRTMNPEYDQWVRQDSLIISWLLGAMSKDLSE
MLDCSSTKEVWSILNARFSSKNMARVLDVKEKLEAMKGKYEIRRAFLEDQESSGSLKAVDRNIPHEDHVMHILARLGLELDSTVSENRIERHSFVNPDGSTPTVNLTTQE
QVKQAPPSSSNNNNDIQQNKSQGNKQNKRFLEQQWKASMPSMQEVWAYRIQP