; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G11307 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G11307
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationClcChr06:21168554..21171537
RNA-Seq ExpressionClc06G11307
SyntenyClc06G11307
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG68750.1 hypothetical protein EZV62_003685 [Acer yangbiense]5.2e-1151.72Show/hide
Query:  LRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNL
        L+K  T  N  LF+ K +  KF AIG+PLSY  +LGY+LEGLG+EY  FV +I N  D PSI DV  LL ++E RL K+TL ++ +L
Subjt:  LRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNL

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]4.7e-1248.89Show/hide
Query:  LFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSISQNS
        L K+K +  K  +IG+P+S   ++ YI+EGLG EY  FV +I N +D  ++ DV  LL AY+ RLEKQ  V+QLN++QANVANL +++ S
Subjt:  LFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSISQNS

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]6.7e-1950Show/hide
Query:  GFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANV
        G     + LRK  +  +  L K+K +  KF A+G+PLSY  +L ++L+GLG+EY  FV +IHN  D PS+ DV  LL AYEARL+KQ  V+QLN+ QAN+
Subjt:  GFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANV

Query:  ANLSISQNSCRPKWPP
         NLS+  NS RP  PP
Subjt:  ANLSISQNSCRPKWPP

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]9.4e-2145.86Show/hide
Query:  GFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANV
        GF    ++++K     +  L ++K +   F AIG+PLSY  +L YILEGLG+EY PFV +IHN T+ PSI DV  LL  Y++RLEKQT  + L LIQANV
Subjt:  GFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANV

Query:  ANLSISQNSCRPKWPPYDKSFTPTTLPMLGFVP
        A+LSI+  +  P+W  +++S   ++ P +G  P
Subjt:  ANLSISQNSCRPKWPPYDKSFTPTTLPMLGFVP

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]2.0e-1037.8Show/hide
Query:  KELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSIS
        +++RK     +  L ++K +  KF  +G+ +SY  +L +IL+GLG+EY  FV +I N  D  S+ DV  LL +YEA+LEKQ  ++ LN+ QA ++ LS  
Subjt:  KELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSIS

Query:  QNSCRPKWPPYDKSFTPTTLPMLGFVP
         NS R    P+  + +   LP   F P
Subjt:  QNSCRPKWPPYDKSFTPTTLPMLGFVP

TrEMBL top hitse value%identityAlignment
A0A5C7I9Y1 Uncharacterized protein1.5e-0851.35Show/hide
Query:  LRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEA
        LRK  T  N  LF+ K +  KF  +G+PLSY Y+LGY  EGLG EY  FV +I N  D PSI DV  LL ++E+
Subjt:  LRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEA

A0A5C7IHH0 Uncharacterized protein2.5e-1151.72Show/hide
Query:  LRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNL
        L+K  T  N  LF+ K +  KF AIG+PLSY  +LGY+LEGLG+EY  FV +I N  D PSI DV  LL ++E RL K+TL ++ +L
Subjt:  LRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNL

A0A6J1D6N7 uncharacterized protein LOC1110174382.3e-1248.89Show/hide
Query:  LFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSISQNS
        L K+K +  K  +IG+P+S   ++ YI+EGLG EY  FV +I N +D  ++ DV  LL AY+ RLEKQ  V+QLN++QANVANL +++ S
Subjt:  LFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVANLSISQNS

A0A6J1DQX7 uncharacterized protein LOC1110223153.3e-1950Show/hide
Query:  GFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANV
        G     + LRK  +  +  L K+K +  KF A+G+PLSY  +L ++L+GLG+EY  FV +IHN  D PS+ DV  LL AYEARL+KQ  V+QLN+ QAN+
Subjt:  GFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANV

Query:  ANLSISQNSCRPKWPP
         NLS+  NS RP  PP
Subjt:  ANLSISQNSCRPKWPP

A0A803NL56 Uncharacterized protein2.6e-0835.56Show/hide
Query:  FGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVA
        F ++ + L+K    ++  L KLK L     ++G P+S   +L Y+L GLG EY  FV  I      P+I +V  LL +YEARLE+Q      + +QAN A
Subjt:  FGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQANVA

Query:  NLSI---------SQNSCRPKWPPYDKSFTPTTLP
        NLS           Q S +P++P + +   P T+P
Subjt:  NLSI---------SQNSCRPKWPPYDKSFTPTTLP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCGTTTTCTTCTACAACTTGCATCACGGGGCTTTGGTCTCAGCTACAAAGAGTTAAGAAAGATGGTCACTCTATCTAACAATATGTTGTTCAAATTAAAGATGTT
GCGGATAAAGTTCTGTGCTATTGGCAAGCCTTTATCTTATTTTTATTATCTGGGTTATATCCTTGAAGGTCTAGGTAATGAGTATTATCCATTTGTCATTACTATTCATA
ATTGCACTGATGGACCCTCTATTACAGATGTTCCCATCCTTCTTTGGGCGTATGAGGCTCGCTTGGAGAAACAAACTTTGGTTAATCAACTCAATCTTATTCAGGCTAAT
GTTGCCAATTTGTCCATATCACAAAACTCTTGTCGACCAAAATGGCCTCCATACGATAAGTCTTTTACTCCTACCACTCTGCCAATGCTTGGGTTTGTGCCATTTTTTAT
TGTTGTTCTTTCTGGTCCTGATATTCTTGATAATGCTCGCCCTCTTATCCCTTTGTCCAACTTCCAAACAATTGTCCTCAATAACCTTATAAACCTCCAAACAGTTGTCC
TCAATGTTAGATTTGTAGTAAGCTTGGGCACATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCGTTTTCTTCTACAACTTGCATCACGGGGCTTTGGTCTCAGCTACAAAGAGTTAAGAAAGATGGTCACTCTATCTAACAATATGTTGTTCAAATTAAAGATGTT
GCGGATAAAGTTCTGTGCTATTGGCAAGCCTTTATCTTATTTTTATTATCTGGGTTATATCCTTGAAGGTCTAGGTAATGAGTATTATCCATTTGTCATTACTATTCATA
ATTGCACTGATGGACCCTCTATTACAGATGTTCCCATCCTTCTTTGGGCGTATGAGGCTCGCTTGGAGAAACAAACTTTGGTTAATCAACTCAATCTTATTCAGGCTAAT
GTTGCCAATTTGTCCATATCACAAAACTCTTGTCGACCAAAATGGCCTCCATACGATAAGTCTTTTACTCCTACCACTCTGCCAATGCTTGGGTTTGTGCCATTTTTTAT
TGTTGTTCTTTCTGGTCCTGATATTCTTGATAATGCTCGCCCTCTTATCCCTTTGTCCAACTTCCAAACAATTGTCCTCAATAACCTTATAAACCTCCAAACAGTTGTCC
TCAATGTTAGATTTGTAGTAAGCTTGGGCACATAG
Protein sequenceShow/hide protein sequence
MDRFLLQLASRGFGLSYKELRKMVTLSNNMLFKLKMLRIKFCAIGKPLSYFYYLGYILEGLGNEYYPFVITIHNCTDGPSITDVPILLWAYEARLEKQTLVNQLNLIQAN
VANLSISQNSCRPKWPPYDKSFTPTTLPMLGFVPFFIVVLSGPDILDNARPLIPLSNFQTIVLNNLINLQTVVLNVRFVVSLGT