CuGenDBv2

Sed0016444 (gene) of Chayote v1 genome

Gene ID: Sed0016444
Organism: Sechium edule (Chayote v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: LG01:19826644..19828659
RNA-Seq Expression: Sed0016444
Synteny: Sed0016444
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
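
The genome location above packs a linkage group and a coordinate range into one string. A minimal Python sketch for splitting it and computing the span follows; the helper name `parse_location` is hypothetical, and reading the coordinates as 1-based and inclusive is an assumption, not something this record states.

```python
import re


def parse_location(location):
    """Split 'SEQID:START..END' into (seqid, start, end, length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so both ends count.
    return seqid, start, end, end - start + 1


print(parse_location("LG01:19826644..19828659"))
```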


Homology
GenBank top hits (e value, % identity, alignment)
CAA3023561.1 retroelement pol poly [Olea europaea subsp. europaea] (e value: 1.0e-25; identity: 48.8%)
Query:  KMIGKAKNIDGLFMLTVE-----KPFFACNVSINTWHNRLGHLSTKRLELMKDILDYNDSHTSH---CSICPLAKQRRLPFAYNNKVASEIFDLIHCDIW
        KMIGK   I GL++L  +        F  +VS + WHNRLGHLS KRL+++KD L Y+   ++    C ICPLAKQRRL F  +N ++++ FDLIHCDIW
Subjt:  KMIGKAKNIDGLFMLTVE-----KPFFACNVSINTWHNRLGHLSTKRLELMKDILDYNDSHTSH---CSICPLAKQRRLPFAYNNKVASEIFDLIHCDIW

Query:  GPSKAPTYTGFRFFCTIVDDCSRYT
        GP    ++ G+R+F T+VDD  R+T
Subjt:  GPSKAPTYTGFRFFCTIVDDCSRYT

KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa] (e value: 5.0e-25; identity: 50.4%)
Query:  MIGKAKNIDGLFMLTVE--KPFFACNVSIN-----TWHNRLGHLSTKRLELMKD--ILDYNDSHTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIW
        MIGKA   +GL++L  E      A  V IN     TWH RLGHLS K L  +     L  +  H S C +CPLAKQ+RL F  NN VAS  FDL+H DIW
Subjt:  MIGKAKNIDGLFMLTVE--KPFFACNVSIN-----TWHNRLGHLSTKRLELMKD--ILDYNDSHTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIW

Query:  GPSKAPTYTGFRFFCTIVDDCSRYT
        GP K P+Y G+++F T+VDDC R+T
Subjt:  GPSKAPTYTGFRFFCTIVDDCSRYT

XP_012833844.1 PREDICTED: uncharacterized protein LOC105954710 [Erythranthe guttata] (e value: 1.9e-24; identity: 46.27%)
Query:  KMIGKAKNIDGLFMLTVEKPF------------FACN-VSINTWHNRLGHLSTKRLELMKDILDYNDSHT----SHCSICPLAKQRRLPFAYNNKVASEI
        KMIG+ + ++ L +L    PF            F CN VS    H RLGH+  K+L  +K  L  +        S C +CP+AKQ+RL F  +N VA  +
Subjt:  KMIGKAKNIDGLFMLTVEKPF------------FACN-VSINTWHNRLGHLSTKRLELMKDILDYNDSHT----SHCSICPLAKQRRLPFAYNNKVASEI

Query:  FDLIHCDIWGPSKAPTYTGFRFFCTIVDDCSRYT
        FDLIHCDIWGP K P+Y GF++F TIVDDCSRYT
Subjt:  FDLIHCDIWGPSKAPTYTGFRFFCTIVDDCSRYT

XP_022861542.1 uncharacterized protein LOC111381922 [Olea europaea var. sylvestris] (e value: 1.2e-23; identity: 47.24%)
Query:  KMIGKAKNIDGLFMLTVE-----KPFFACNVSINTWHNRLGHLSTKRLELMK-----DILDYNDSHTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCD
        KMIG+   +  L++L +         F   V  + WHNRLG+LS KRLE +K     DIL  N  H   C ICP+AKQRRL F  NN ++ + FDL+HCD
Subjt:  KMIGKAKNIDGLFMLTVE-----KPFFACNVSINTWHNRLGHLSTKRLELMK-----DILDYNDSHTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCD

Query:  IWGPSKAPTYTGFRFFCTIVDDCSRYT
        IWGP   P++ G+R+F T+VDD SR+T
Subjt:  IWGPSKAPTYTGFRFFCTIVDDCSRYT

XP_022871010.1 uncharacterized protein LOC111390234 [Olea europaea var. sylvestris] (e value: 3.8e-25; identity: 43.07%)
Query:  MKMIGKAKNIDGLFML----------TVEKPFFAC-------NVSINTWHNRLGHLSTKRLELMKDILDYN----DSHTSHCSICPLAKQRRLPFAYNNK
        MKMIGK +  + L+++           V    F+C       N++ + WH+RLGHLS K+L++MKD+L ++    D H   C +CPLAKQRRL F  NN 
Subjt:  MKMIGKAKNIDGLFML----------TVEKPFFAC-------NVSINTWHNRLGHLSTKRLELMKDILDYN----DSHTSHCSICPLAKQRRLPFAYNNK

Query:  VASEIFDLIHCDIWGPSKAPTYTGFRFFCTIVDDCSR
        ++   FDLIHCD+WGP    T+ G+++F T+VDDC+R
Subjt:  VASEIFDLIHCDIWGPSKAPTYTGFRFFCTIVDDCSR

TrEMBL top hits (e value, % identity, alignment)
A0A2N9GZ55 Integrase catalytic domain-containing protein (e value: 1.9e-22; identity: 42.25%)
Query:  KMIGKAKNIDGLFMLTVEKPFFA------------CNVSIN-------TWHNRLGHLSTKRLELMK----DILDY--NDSHTSHCSICPLAKQRRLPFAY
        K+IG+ K   GL++L  ++   A             N SIN        WH RLGH S   L  +K    D+ ++   D H  HC +CPLAKQ+RLPF  
Subjt:  KMIGKAKNIDGLFMLTVEKPFFA------------CNVSIN-------TWHNRLGHLSTKRLELMK----DILDY--NDSHTSHCSICPLAKQRRLPFAY

Query:  NNKVASEIFDLIHCDIWGPSKAPTYTGFRFFCTIVDDCSRYT
        +NK +S  F LIHCD+WGP   PT  GF++F TIVDD +R T
Subjt:  NNKVASEIFDLIHCDIWGPSKAPTYTGFRFFCTIVDDCSRYT

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 8 (e value: 2.4e-25; identity: 50.4%)
Query:  MIGKAKNIDGLFMLTVE--KPFFACNVSIN-----TWHNRLGHLSTKRLELMKD--ILDYNDSHTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIW
        MIGKA   +GL++L  E      A  V IN     TWH RLGHLS K L  +     L  +  H S C +CPLAKQ+RL F  NN VAS  FDL+H DIW
Subjt:  MIGKAKNIDGLFMLTVE--KPFFACNVSIN-----TWHNRLGHLSTKRLELMKD--ILDYNDSHTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIW

Query:  GPSKAPTYTGFRFFCTIVDDCSRYT
        GP K P+Y G+++F T+VDDC R+T
Subjt:  GPSKAPTYTGFRFFCTIVDDCSRYT

A0A5D3D1N3 Cysteine-rich RLK (Receptor-like protein kinase) 8 (e value: 5.6e-22; identity: 47.58%)
Query:  KMIGKAKNIDGLFMLTVEKPF-FACNVSIN---TWHNRLGHLSTKRLELMKDILDYNDS---HTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIWG
        K IG AK   GL++L     F   C V+ N    WH R+GH ST+ L  ++  L    S     SHC+ICPLAKQR+L F  NN ++S  FDLIH DIWG
Subjt:  KMIGKAKNIDGLFMLTVEKPF-FACNVSIN---TWHNRLGHLSTKRLELMKDILDYNDS---HTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIWG

Query:  PSKAPTYTGFRFFCTIVDDCSRYT
        P    TY+ + +F TIVDD +RYT
Subjt:  PSKAPTYTGFRFFCTIVDDCSRYT

A0A6D2HNE3 Uncharacterized protein (e value: 3.9e-23; identity: 44.27%)
Query:  MIGKAKNIDGLFML---------TVEKPFFACN---VSINTWHNRLGHLSTKRLELMKDILDYNDSHTS---HCSICPLAKQRRLPFAYNNKVASEIFDL
        MIG+AK +  L++L             P   C      ++ WH+RLGH S   L+ +K IL     H+S   HCS+CPLAKQRRL +  +N +AS+ FDL
Subjt:  MIGKAKNIDGLFML---------TVEKPFFACN---VSINTWHNRLGHLSTKRLELMKDILDYNDSHTS---HCSICPLAKQRRLPFAYNNKVASEIFDL

Query:  IHCDIWGPSKAPTYTGFRFFCTIVDDCSRYT
        IH DIWGP    +  G+R+F TIVDDC+R T
Subjt:  IHCDIWGPSKAPTYTGFRFFCTIVDDCSRYT

A0A6D2L5Q5 Uncharacterized protein (e value: 5.6e-22; identity: 51.09%)
Query:  VSINTWHNRLGHLSTKRLELMKDILDYNDSHTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIWGPSKAPTYTGFRFFCTIVDDCSRYT
        V  + WH RLGH S+ +L+ +  IL  + S  SHC +CPLAKQ+ LPF  NN++++  FDL+H DIWGP +  +  GFR+F TIVDDC+R T
Subjt:  VSINTWHNRLGHLSTKRLELMKDILDYNDSHTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIWGPSKAPTYTGFRFFCTIVDDCSRYT

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.0e-12; identity: 33.33%)
Query:  VSINTWHNRLGHLSTKRLELM--KDILDYNDSHT-SHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIWGPSKAPTYTGFRFFCTIVDDCSR
        +S++ WH R+GH+S K L+++  K ++ Y    T   C  C   KQ R+ F  +++    I DL++ D+ GP +  +  G ++F T +DD SR
Subjt:  VSINTWHNRLGHLSTKRLELM--KDILDYNDSHT-SHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIWGPSKAPTYTGFRFFCTIVDDCSR

P93293 Uncharacterized mitochondrial protein AtMg00300 (e value: 1.6e-05; identity: 32.39%)
Query:  WHNRLGHLSTKRLELM--KDILDYND-SHTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIWGPSKAP
        WH+RL H+S + +EL+  K  LD +  S    C  C   K  R+ F+          D +H D+WG    P
Subjt:  WHNRLGHLSTKRLELM--KDILDYND-SHTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIWGPSKAP

Arabidopsis top hits (e value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein (e value: 1.1e-06; identity: 32.39%)
Query:  WHNRLGHLSTKRLELM--KDILDYND-SHTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIWGPSKAP
        WH+RL H+S + +EL+  K  LD +  S    C  C   K  R+ F+          D +H D+WG    P
Subjt:  WHNRLGHLSTKRLELM--KDILDYND-SHTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIWGPSKAP


Sequences
CDS sequence
ATGAAGATGATTGGCAAGGCTAAGAACATTGATGGATTGTTTATGTTAACTGTTGAGAAACCTTTTTTTGCTTGTAATGTGTCTATTAATACGTGGCACAATAGGTTAGG
ACACTTGTCTACAAAACGCCTTGAGCTAATGAAGGATATATTAGATTACAATGATTCACACACTTCTCATTGTAGCATTTGTCCTTTAGCAAAACAAAGACGATTGCCTT
TTGCTTATAATAATAAAGTGGCTTCTGAGATTTTTGACTTGATTCATTGTGATATTTGGGGACCATCCAAGGCTCCTACTTATACTGGTTTTCGTTTCTTTTGTACTATT
GTAGATGATTGTTCTCGGTACACTTAG
mRNA sequence
GCAAATTCTCTTGTATTGCTCTGTTTCTTCAATCCAAATTATTTGAATTTTCAAGAAAAACAGGGTAGATTTCTTGTTAGATTATGATACGAGATTCAATCTACCTATCA
AGCCTTGTATAAAGTTTATTTATCTTTCTGTTGTAACAAATATCTATTAGTTATTTATCTTTAGTTACCCGAGTTAATTCTGTAATAAACCCTCTATTTAAAGGGCTTTA
ATGTGAAATGGTATCAGAGCGTACACAAAAAGCTTCCGCATAAGTCTCTCAAAACCTAATTTTCGCTCTCTCTCGCACGCACCACCACTCCGATATCTTAGTCTACGATG
AATGTGGAGGTTCCTACCAGCGAGATTCTTACCAGCAAATTTCCTACTGTTTTAGAGGCAAACTTCAATTCGTACGCTATGCATCACTCGCTCGGAGCGACAATGATCGT
AGTAAATCAACCTCTCCTTGGCGCAGAAAACTACCTTTTTTGGCAAGAACAAGGATGGGTCAAGAAACCAACCAGATGATGAGATTGGTAGCGGCGTGGAAATGTAACAA
TGACACAATATGTTCCTGGCTGTTGAATTCTGTTTCGAAGGAGATAGCTGCTAGCGTGATCTACTCTGGCTCCGTGAAGGAAGTATGAGATGATATTGCTGAAATGTTCA
AAGCAAGCAACGGTCCAAGAATCTATCAATTGAAAAGAGAGTTGATCAATACAGTTCAAGGAAATTCATCCGTCGAGTCCTATTTCATGAAATTGAAGACCGTATGGCAA
AACCTCAGTGATTTCAAACCTGATGTAGAATGTACTTGCGGAGGACTCAAATCTGAATTTACCATGGTGTTTCTCATGGGTCTTAATGACTCTTACTCGACCATCAGGAC
ACAGATTCTCTTAATCGAACCCCTTCCTCCTATTAACAAAGCCTATCCAAGAAAGGCTCTTCTTCCACATATCGCAGAAGAGAAAGACCTCATTGTATGCACTGCAATCG
ACTAGGACATACCGCTGATAAGTGCTACAAGATTCACGGCTACCCGCCCGGATATCGATCACATTTTCAAGGTCAGTCTTCAACTGATAACTCGAATGCATCAGTTGCAA
ATACAATAGCGCAAAATGATGGCAATAGCTTTCTTGGTGGTTTAAACTAAAGCCAATATGTTCAATTGTTAAACATTCTGAATTCACAAACCTCAAATGACAAATCAGTT
ACTGTAGGTGCAGTCGTCTCACACACCCCAGGTATTTTTTCTGCTTCTTTATATGCTAATAATAAAGAACCCACTTGGATTATTGATTCAGGTGCATCCCCTCATATTTG
TCGTATTCATAATATGTTTTCAAATTGGAAATGAATAAATGACATAAGCGTGGTTTTACCAAATTCCTATAAGGTCTTAGTTCAATTCACAGGAGATGTAAAGATAAATA
GCCATCTAATTTTAAAAGATGTGTTTTACATACCTAACTTTTCCTTTAATTTGCTATATGTTGGTTGCTTACTTCAAGATGGCCCTTATGAAATTAATTTTACGGCTGAT
TATTGTACTATTCAGGACAAGCCGAGTATGAAGATGATTGGCAAGGCTAAGAACATTGATGGATTGTTTATGTTAACTGTTGAGAAACCTTTTTTTGCTTGTAATGTGTC
TATTAATACGTGGCACAATAGGTTAGGACACTTGTCTACAAAACGCCTTGAGCTAATGAAGGATATATTAGATTACAATGATTCACACACTTCTCATTGTAGCATTTGTC
CTTTAGCAAAACAAAGACGATTGCCTTTTGCTTATAATAATAAAGTGGCTTCTGAGATTTTTGACTTGATTCATTGTGATATTTGGGGACCATCCAAGGCTCCTACTTAT
ACTGGTTTTCGTTTCTTTTGTACTATTGTAGATGATTGTTCTCGGTACACTTAGGTTTATTTGATGAAATATAAATATGATGTATTAACCATCATTCCTCATTTTTTTAC
GCTAGTTAAAACTCAGTTTAACAAAGTTATCAAATC
Protein sequence
MKMIGKAKNIDGLFMLTVEKPFFACNVSINTWHNRLGHLSTKRLELMKDILDYNDSHTSHCSICPLAKQRRLPFAYNNKVASEIFDLIHCDIWGPSKAPTYTGFRFFCTI
VDDCSRYT
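
As a consistency check, the CDS listed above can be translated and compared with this protein sequence. The sketch below assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling. It uses only the first 30 nucleotides of the CDS; pasting the full CDS with line breaks removed works the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence for Sed0016444.
print(translate("ATGAAGATGATTGGCAAGGCTAAGAACATT"))  # MKMIGKAKNI
```

The output matches the first ten residues of the protein sequence, and the final codons of the CDS (`TCTCGGTACACTTAG`) translate to `SRYT` followed by a stop, matching its end.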