CuGenDBv2

Cmc03g0071221 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0071221
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: CMiso1.1chr03:17373100..17375012
RNA-Seq Expression: Cmc03g0071221
Synteny: Cmc03g0071221
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
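
The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, it can be split into its parts as shown below. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(location: str):
    """Split a 'seqid:start..end' string into (seqid, start, end, length)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so the span
    # covers end - start + 1 bases.
    return seqid, start, end, end - start + 1


seqid, start, end, length = parse_location("CMiso1.1chr03:17373100..17375012")
print(seqid, start, end, length)
```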


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7013723.1 hypothetical protein SDJN02_23890, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 1.3e-75; % identity: 57.32)
Query:  MFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQLI
        MFLV+L+ F+PL DATS L QI+++AD++FT    S+IAS+ SPRFVA LQM+   F NY VD+ ++S++SLESFHDA+LDGG+  SM+IH+L + +Q+I
Subjt:  MFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQLI

Query:  LRFES-SSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEETKFRI
        LR+E+ SS+ P +H EL L+P Q E LG+V+Y KFF+++SK LR++I+ LP+FH D + V AT ++VKFSIASKEI +TKE+  C IVGYEGE+ET+  I
Subjt:  LRFES-SSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEETKFRI

Query:  NLNPMLFFLNFTHDALRIWFYKTTTSHGAMVVPAFGFFAQYVILFP
           PM+FFLNFT+ A R+WFYKTT S   + VPAFG + QYV+ FP
Subjt:  NLNPMLFFLNFTHDALRIWFYKTTTSHGAMVVPAFGFFAQYVILFP

XP_016903187.1 PREDICTED: uncharacterized protein LOC103502263 [Cucumis melo] (e-value: 3.3e-95; % identity: 98.37)
Query:  MAMFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQ
        MAMFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLAN KQ
Subjt:  MAMFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQ

Query:  LILRFESSSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENE
        LILRFESSSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKE +
Subjt:  LILRFESSSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENE

XP_023000630.1 uncharacterized protein LOC111494874 [Cucurbita maxima] (e-value: 4.5e-76; % identity: 57.49)
Query:  AMFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQL
        +MFLVRLK+F PL D TSRLAQIARE+DI FTPL   +  S  SPRF+A LQ+ H CF  Y V+ DH SRISLES HDALLD G+S +MTIHLL NT  +
Subjt:  AMFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQL

Query:  ILRFESSSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEETKFRI
        +LRFE+ +H P++ H+  L P QE+ + E++Y+K  ++DS+DLR+VI+ LP+FHGDS+CVT T S+V+FSIAS+E++  KE   C I+G++G+  T+FRI
Subjt:  ILRFESSSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEETKFRI

Query:  NLNPMLFFLNFTHDALRIWFYKT-TTSHGAMVVPAFGFFAQYVILFP
         L PMLFFLN T+D   +WF+KT T +H  M+ P F  FAQYVI FP
Subjt:  NLNPMLFFLNFTHDALRIWFYKT-TTSHGAMVVPAFGFFAQYVILFP

XP_031744160.1 uncharacterized protein LOC116404808 [Cucumis sativus] (e-value: 1.1e-119; % identity: 87.1)
Query:  MFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQLI
        MFLVRLK+F+P F ATSRLA IAREAD+KFTPLFFSI  SN+ PRFVAYL MT++CFINYKVDNDHTSRISLESFHDALLDGG SPSMTIHLLAN  Q+I
Subjt:  MFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQLI

Query:  LRFESSSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEETKFRIN
        LRFESSSHAP+V HELSL PSQEEDLGE+DYAKFFSIDSK LRRVIRNLPIFHGDSICVTAT SQVKFSIASKEIVLTKENEEC+IVGYEGEEETK  IN
Subjt:  LRFESSSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEETKFRIN

Query:  LNPMLFFLNFTHDALRIWFYKTTTSHGAMVVPAFGFFAQYVILFPNYN
        LNPMLFFLNFTHD LR+WFYKTTT HGAMVVP+FGF++QYVILFPNYN
Subjt:  LNPMLFFLNFTHDALRIWFYKTTTSHGAMVVPAFGFFAQYVILFPNYN

XP_038875055.1 uncharacterized protein LOC120067580 [Benincasa hispida] (e-value: 1.7e-70; % identity: 58.17)
Query:  MFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQLI
        MFLV+L +F+PL DATS LAQI+  AD+KFTPL F +IA   SPRFVA LQ++  CF NY VD++HTS++ LESFHDA+LDGG+  SMTIHLL    Q+I
Subjt:  MFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQLI

Query:  LRFES-SSHAPKVHHELSLTPSQEEDL---GEVDYAKFFSIDSKDLRRVIRNLPIFHGDS-ICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEET
        LRF++ SS  P +HHEL+ +P Q  D    G+++  KFF + S+ LRR+I+ LPIF  DS +CV  T SQ+KFSIASKEIVL  + + C IVG+E E ET
Subjt:  LRFES-SSHAPKVHHELSLTPSQEEDL---GEVDYAKFFSIDSKDLRRVIRNLPIFHGDS-ICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEET

Query:  KFRINLNPMLFFLNFTHDALRIWFYKT-TTSHGAMVVPAFGFFAQYVILFP
        +F+I L PMLFFLNFT+ A ++WFYKT   S+  M VPAFG   QYVI FP
Subjt:  KFRINLNPMLFFLNFTHDALRIWFYKT-TTSHGAMVVPAFGFFAQYVILFP

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3C8J1 uncharacterized protein LOC103498010 (e-value: 1.4e-70; % identity: 60.93)
Query:  MFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQLI
        MFLVRL+ F+PL DATS LAQ+A++AD+KFTPL   II SNRSP+FVA LQ++   F N+ VD++ +S++SL+ FHDA+LDGG+  SMTIHLL  T Q++
Subjt:  MFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQLI

Query:  LRFESSSH-APKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEETKFRI
        LRFE+ SH  P +HHEL+L+P Q E+LG+V+Y  FF++ S++LRR+I+ LP+FH D++ VT TGSQVKFSI SKEI+LTKE   C IVGYEGE ETK ++
Subjt:  LRFESSSH-APKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEETKFRI

Query:  NLNPMLFFLNFTHDA
         L PM+FFLNFT+ A
Subjt:  NLNPMLFFLNFTHDA

A0A1S3CL88 uncharacterized protein LOC103502250 (e-value: 2.4e-70; % identity: 60.08)
Query:  MFLVRLKDFDPLFDATSRLAQIARE-ADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQL
        MFLV+LK+FDPL DATS LAQI+ + AD+KFTP  F IIAS+RSPRF+A LQ++   F  + VDNDH+S++SLESFHDA+LDGG+  SMTIHLL  T Q+
Subjt:  MFLVRLKDFDPLFDATSRLAQIARE-ADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQL

Query:  ILRFES-SSHAPKVHHELSLTPSQEED--LG--EVDYAKFFSIDSKDLRRVIRNLPIFHGDS-ICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEE
        ILRF++ SS    +HHEL+L+P Q ED  +G  E+D  K+F + SK LRR+I++LPIF  DS I V  T S+VKFSIASKEI+LT E   C I G+E E 
Subjt:  ILRFES-SSHAPKVHHELSLTPSQEED--LG--EVDYAKFFSIDSKDLRRVIRNLPIFHGDS-ICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEE

Query:  ETKFRINLNPMLFFLNFTHDALRIWFYKT-TTSHGAMVVPAFGFFAQYVILFP
        ET+F+I L PM+FFLNFT+ A R+WFYKT   ++  MVVPA+G F QYVI FP
Subjt:  ETKFRINLNPMLFFLNFTHDALRIWFYKT-TTSHGAMVVPAFGFFAQYVILFP

A0A1S4E4N8 uncharacterized protein LOC103502263 (e-value: 1.6e-95; % identity: 98.37)
Query:  MAMFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQ
        MAMFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLAN KQ
Subjt:  MAMFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQ

Query:  LILRFESSSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENE
        LILRFESSSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKE +
Subjt:  LILRFESSSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENE

A0A6J1KIW5 uncharacterized protein LOC111494874 (e-value: 2.2e-76; % identity: 57.49)
Query:  AMFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQL
        +MFLVRLK+F PL D TSRLAQIARE+DI FTPL   +  S  SPRF+A LQ+ H CF  Y V+ DH SRISLES HDALLD G+S +MTIHLL NT  +
Subjt:  AMFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQL

Query:  ILRFESSSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEETKFRI
        +LRFE+ +H P++ H+  L P QE+ + E++Y+K  ++DS+DLR+VI+ LP+FHGDS+CVT T S+V+FSIAS+E++  KE   C I+G++G+  T+FRI
Subjt:  ILRFESSSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEETKFRI

Query:  NLNPMLFFLNFTHDALRIWFYKT-TTSHGAMVVPAFGFFAQYVILFP
         L PMLFFLN T+D   +WF+KT T +H  M+ P F  FAQYVI FP
Subjt:  NLNPMLFFLNFTHDALRIWFYKT-TTSHGAMVVPAFGFFAQYVILFP

A0A6J1KZ05 uncharacterized protein LOC111498887 (e-value: 3.0e-65; % identity: 55.1)
Query:  MFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQLI
        MFLVRL  FDPL +ATS LAQI+ EAD+KF+   FS+I S  S RFVA  Q++H  F NY VD +H+SR+SL+SF+DA+ DG    SMTIH    T +++
Subjt:  MFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQLI

Query:  LRFESSSHAP-KVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEETKFRI
        L+FESS+H   K+H  L L+PSQEE+LG++ + +FFSI S+D R +I  LP F  +SI V+ T S+VKF  AS+E +LTKE   C I+GYEGE E  F+I
Subjt:  LRFESSSHAP-KVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEETKFRI

Query:  NLNPMLFFLNFTHDALRIWFYKTTTSHGAMVVPAFGFFAQYVILF
        NLNP  FF N ++ A RIWFYKT  S   + VPAFG  AQYVI F
Subjt:  NLNPMLFFLNFTHDALRIWFYKTTTSHGAMVVPAFGFFAQYVILF

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
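
The % identity values listed with the hits above relate to the middle line of each alignment block. As a rough sketch, and assuming the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), identity can be estimated from that middle line alone. The function name `percent_identity` is hypothetical, and the result is not guaranteed to reproduce the values reported on this page, since the input must have the `Query:` indentation removed and its trailing spaces preserved.

```python
def percent_identity(midline: str) -> float:
    """Estimate % identity from a BLAST-style alignment midline.

    Assumption: letters mark identical residues; '+' and spaces mark
    non-identical positions. The denominator is the full midline length.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# First nine columns of the first GenBank alignment shown above.
print(round(percent_identity("MFLV+L+ F"), 2))
```

For a multi-block alignment, the midline segments would need to be concatenated before calling the function.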

Sequences
CDS sequence
ATGGCCATGTTCTTAGTGAGGCTTAAAGACTTTGATCCTCTTTTTGATGCAACTTCTCGTCTTGCTCAAATTGCTAGAGAAGCCGATATCAAATTCACACCATTA
TTCTTTTCAATAATTGCTTCCAATCGATCCCCTCGTTTTGTTGCATATCTTCAAATGACTCACCATTGCTTCATCAATTACAAAGTCGATAATGATCATACCTCA
AGAATTTCCCTCGAATCTTTCCATGACGCTCTCTTGGACGGCGGCGCTTCTCCTTCAATGACTATTCATCTTCTCGCAAACACTAAGCAATTGATCCTTAGATTT
GAATCTTCGAGCCATGCCCCGAAAGTGCATCATGAATTGTCATTGACACCGTCGCAAGAAGAAGATCTAGGAGAAGTTGATTATGCAAAATTTTTCTCAATTGAT
TCGAAGGATTTAAGGCGTGTTATAAGAAATTTACCTATATTCCACGGGGACTCAATATGTGTTACTGCAACGGGGTCACAAGTCAAATTCTCTATTGCTTCTAAA
GAGATTGTTCTTACCAAAGAGAATGAAGAATGTATGATTGTAGGTTACGAGGGAGAAGAAGAAACTAAATTCCGAATAAATCTAAATCCAATGTTGTTTTTTCTT
AATTTCACACATGATGCACTTAGAATATGGTTTTATAAGACAACCACTTCTCATGGTGCCATGGTTGTCCCAGCTTTTGGATTTTTTGCTCAATATGTAATCCTT
TTTCCCAACTATAATAATTAG
mRNA sequence
TTATTGAATTCAATTCCACATTCCACAAACTAGACTAATATAAAAAGAATGTAACATATATAGAATAATTGAAACCCCATTAGAAATCTACAAACAAAAATCACA
CCATGGCCATGTTCTTAGTGAGGCTTAAAGACTTTGATCCTCTTTTTGATGCAACTTCTCGTCTTGCTCAAATTGCTAGAGAAGCCGATATCAAATTCACACCAT
TATTCTTTTCAATAATTGCTTCCAATCGATCCCCTCGTTTTGTTGCATATCTTCAAATGACTCACCATTGCTTCATCAATTACAAAGTCGATAATGATCATACCT
CAAGAATTTCCCTCGAATCTTTCCATGACGCTCTCTTGGACGGCGGCGCTTCTCCTTCAATGACTATTCATCTTCTCGCAAACACTAAGCAATTGATCCTTAGAT
TTGAATCTTCGAGCCATGCCCCGAAAGTGCATCATGAATTGTCATTGACACCGTCGCAAGAAGAAGATCTAGGAGAAGTTGATTATGCAAAATTTTTCTCAATTG
ATTCGAAGGATTTAAGGCGTGTTATAAGAAATTTACCTATATTCCACGGGGACTCAATATGTGTTACTGCAACGGGGTCACAAGTCAAATTCTCTATTGCTTCTA
AAGAGATTGTTCTTACCAAAGAGAATGAAGAATGTATGATTGTAGGTTACGAGGGAGAAGAAGAAACTAAATTCCGAATAAATCTAAATCCAATGTTGTTTTTTC
TTAATTTCACACATGATGCACTTAGAATATGGTTTTATAAGACAACCACTTCTCATGGTGCCATGGTTGTCCCAGCTTTTGGATTTTTTGCTCAATATGTAATCC
TTTTTCCCAACTATAATAATTAGGTTCAATAATAGTAGACTTCCATATCATTTTCTCAGTTTCTT
Protein sequence
MAMFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINYKVDNDHTSRISLESFHDALLDGGASPSMTIHLLANTKQLILRF
ESSSHAPKVHHELSLTPSQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKENEECMIVGYEGEEETKFRINLNPMLFFL
NFTHDALRIWFYKTTTSHGAMVVPAFGFFAQYVILFPNYNN
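
The CDS and protein sequences above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical and not part of CuGenDB. It translates the first 42 bases and the last 18 bases of the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the CDS listed above.
print(translate("ATGGCCATGTTCTTAGTGAGGCTTAAAGACTTTGATCCTCTT"))  # MAMFLVRLKDFDPL
print(translate("CCCAACTATAATAATTAG"))  # PNYNN
```

Passing the full CDS (with its line breaks) to `translate` would allow a comparison against the complete protein sequence.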