CuGenDBv2

Cla97C10G193100 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C10G193100
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Retrotransposon protein
Genome location: Cla97Chr10:21343289..21344620
RNA-Seq Expression: Cla97C10G193100
Synteny: Cla97C10G193100
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
XP_038880837.1 uncharacterized protein LOC120072528 [Benincasa hispida] (e-value 1.3e-14, identity 80.39%)
Query:  KSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFEDE
        +SHP+AK MWNKSF HYDDLSTVFGKD+ +GQSSE  +VMATNAFREFEDE
Subjt:  KSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFEDE

XP_038892629.1 uncharacterized protein At2g29880-like [Benincasa hispida] (e-value 1.4e-24, identity 43.3%)
Query:  MGIGAMLDMLVEEDSAPVGLDRDHIQFVESSEEWTKFRDDLAV---MVGDLNTYGRRWRTLSWWK-----PYCI--WWR----PVRGPTIGHFNQ---DT
        MGI AMLD+  EE+ A VGLD DHI+FVESSEEWTKFRDDLAV   M G+       W  +   K      Y +   WR      R   + H  Q   + 
Subjt:  MGIGAMLDMLVEEDSAPVGLDRDHIQFVESSEEWTKFRDDLAV---MVGDLNTYGRRWRTLSWWK-----PYCI--WWR----PVRGPTIGHFNQ---DT

Query:  VLEWALNQNTIECK-----------------------------------------SHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVM
        V   ALN NTIECK                                         SHP+AK MWNK FPHYDDLST+FGKD+A+GQSSE  +VM
Subjt:  VLEWALNQNTIECK-----------------------------------------SHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVM

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida] (e-value 3.9e-22, identity 41.24%)
Query:  REMGIGAMLDMLVEEDSAPVGLDRD----HIQFVESSEEWTKFRDDLAVMVGDLNTYGRRWRTLSWWKPYCIWWRPVRGPTIGHFNQDTVLEWALNQNTI
        R+MGIGAM D+  EE+SA VGLD D      + V S  E  KF + L  +V             + W+     +R      +   + + VL  ALNQNTI
Subjt:  REMGIGAMLDMLVEEDSAPVGLDRD----HIQFVESSEEWTKFRDDLAVMVGDLNTYGRRWRTLSWWKPYCIWWRPVRGPTIGHFNQDTVLEWALNQNTI

Query:  ECK-----------------------------------------SHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFEDE
        ECK                                         SHP+AKGMWNK FPHYDDLSTVFGK KA+GQSSE  +VM TNAFREFEDE
Subjt:  ECK-----------------------------------------SHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFEDE

XP_038895852.1 uncharacterized protein LOC120084021 [Benincasa hispida] (e-value 4.0e-19, identity 57.29%)
Query:  DTVLEWALNQNTIECK------------------------------SHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFEDE
        + VLE ALNQNTIECK                              SHP+AKGMWNKSFPHYDDLSTVFGKD+A+GQSSE  ++MATNAFREFED+
Subjt:  DTVLEWALNQNTIECK------------------------------SHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFEDE

XP_038899910.1 uncharacterized protein LOC120087100 [Benincasa hispida] (e-value 3.2e-16, identity 84.31%)
Query:  KSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFEDE
        +SHP+AKGMWNKSFPHYDDLSTVFGKD+A+GQSSE  +VMA NAFREFEDE
Subjt:  KSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFEDE

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7SXX8 Retrotransposon protein (e-value 2.8e-10, identity 31.02%)
Query:  REMGIGAMLDMLVEEDSAPVGLDRDHIQFVESSEEWTKFRDDLAV-----------MVGDLNTYGRRWR---------------TLSWWKPYCIWWRPVR
        REM    ++D + E DS       D I ++E+S EWT++RDDL             M   L      W                    W+     +RP R
Subjt:  REMGIGAMLDMLVEEDSAPVGLDRDHIQFVESSEEWTKFRDDLAV-----------MVGDLNTYGRRWR---------------TLSWWKPYCIWWRPVR

Query:  ---------------GPTIGHFNQD-----TVLEWALNQNTIECKSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATN
                       GPT   F  +      V E  +  N +  KSHP+AKG+ NKSFPHYD+LS VFGKD+A G  +E+   +  N
Subjt:  ---------------GPTIGHFNQD-----TVLEWALNQNTIECKSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATN

A0A5A7TDS7 Retrotransposon protein (e-value 2.0e-08, identity 29.71%)
Query:  LDMLVEEDSAPVGLDRDHIQFVESSEEWTKFRDDLAVMVGDLNTYGRRWRTLSWWKPYCIWWRPVRGPTIGHFNQDTVLEW-ALNQNTIEC---------
        L+ + E DS+   +  D++ ++E+S EWT+++D+LA               ++ W+  C+  +   G T G F +  +  W  + Q  ++          
Subjt:  LDMLVEEDSAPVGLDRDHIQFVESSEEWTKFRDDLAVMVGDLNTYGRRWRTLSWWKPYCIWWRPVRGPTIGHFNQDTVLEW-ALNQNTIEC---------

Query:  ---KSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSE
           K+HP+AKG+ NK FP+YD+LS VF KD+A G+ ++
Subjt:  ---KSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSE

A0A5A7UGW0 Retrotransposon protein (e-value 1.6e-08, identity 33.83%)
Query:  DMLVEEDSAPVGLDRDHIQFVESSEEWTKFRDDLA-VMVGDLNTYGRRWRTLSWWKPYCIWWRPVRGPTIGHFNQDTVLEWALNQNTIECKSHPSAKGMW
        D + E DS       D I ++E S EWT++RD+LA  M  D          +      C WW  VR                  Q  +  +SHP+A+G+ 
Subjt:  DMLVEEDSAPVGLDRDHIQFVESSEEWTKFRDDLA-VMVGDLNTYGRRWRTLSWWKPYCIWWRPVRGPTIGHFNQDTVLEWALNQNTIECKSHPSAKGMW

Query:  NKSFPHYDDLSTVFGKDKAIGQSSEALHVMATN
        NK F HYD+LS VFGKD+A G  +E    + +N
Subjt:  NKSFPHYDDLSTVFGKDKAIGQSSEALHVMATN

A0A5D3DG22 Retrotransposon protein (e-value 1.0e-12, identity 34.62%)
Query:  REMGIGAMLDMLVEEDSAPVGLDRDHIQFVESSEEWTKFRDDLA-VMVGD-----------LNTYG--RRWRTL--SWWKPYCIWWR-PVRGPTIGHFN-
        REM    ++D L E DS       D I ++E+S EW+++RD LA  M  D            N +G  R+ + L  +WW     W+     GP +GHFN 
Subjt:  REMGIGAMLDMLVEEDSAPVGLDRDHIQFVESSEEWTKFRDDLA-VMVGD-----------LNTYG--RRWRTL--SWWKPYCIWWR-PVRGPTIGHFN-

Query:  ----------------QDTVLEWALNQNTIECKSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFED
                        Q  + E  L  + +  KSHP+ KG+ +KSFP+YDDLS VFGKD+A G  SE    + +N    F D
Subjt:  ----------------QDTVLEWALNQNTIECKSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFED

A0A5D3E3R2 Retrotransposon protein (e-value 2.7e-08, identity 30.06%)
Query:  DMLVEEDSAPVGLDRDHIQFVESSEEWTKFRDDLAVMVGDLNTYGRRWRTLSWWKPYCIWWRPV---RGPTI-----------GHFNQDTVLEWALNQ--
        D + E DS    +  D I ++E+S EWT++R+DLA  +         W  L+ W+ +  + R +   R   +             F+ +T     LNQ  
Subjt:  DMLVEEDSAPVGLDRDHIQFVESSEEWTKFRDDLAVMVGDLNTYGRRWRTLSWWKPYCIWWRPV---RGPTI-----------GHFNQDTVLEWALNQ--

Query:  -----NTIECKSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFEDEAA
                   SHP AKG+ NK FPHY++LS VFGKD+A    ++    + +N    +E  AA
Subjt:  -----NTIECKSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFEDEAA

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
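
The % identity values in the hit tables can be checked against the alignment midlines. The following is a minimal sketch, assuming the usual BLAST-style convention that a letter in the midline marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `percent_identity` is hypothetical and not part of CuGenDB; the strings are copied from the first GenBank hit (XP_038880837.1).

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Return identical positions / aligned columns, as a percentage.

    Assumption: letters in the midline are identities, '+' marks a
    conservative substitution, and a space marks a mismatch or gap.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# Query and midline lines of the XP_038880837.1 alignment.
query = "KSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFEDE"
midline = "+SHP+AK MWNKSF HYDDLSTVFGKD+ +GQSSE  +VMATNAFREFEDE"

# The aligned length is taken as the length of the query line (no gaps here).
identity = percent_identity(midline, len(query))
print(identity)  # 80.39, matching the value reported for this hit
```

Here 41 of the 51 aligned columns are identical, which reproduces the reported 80.39%.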

Sequences
CDS sequence
ATGGGGGGTTCGTCTTTGTTTTGCCAGGGTAGGAAGAGTCTGCAGTCGATTCTATGGTTTTTAGGGATGCGGTTTTGCGACGTACGAATTGAGGGTTCCGAAGGGATTCT
TGGCTCCGTACAGAGGCGAACAATACCATCTCTCAGGGAGATGGGAATCGGTGCAATGCTTGACATGCTTGTTGAGGAGGATTCTGCACCAGTTGGTCTTGATAGAGATC
ATATTCAATTTGTTGAATCCTCGGAGGAATGGACCAAGTTCAGAGATGACTTGGCAGTAATGGTAGGGGATCTAAATACGTATGGTCGAAGGTGGAGGACGCTAAGTTGG
TGGAAGCCCTATTGTATTTGGTGGAGACCAGTTAGAGGTCCGACAATAGGACATTTCAACCAGGATACCGTGCTCGAGTGGGCACTAAACCAGAACACCATTGAGTGCAA
GAGTCATCCTAGTGCGAAGGGAATGTGGAATAAGTCATTCCCCCATTACGATGACCTCTCCACCGTATTTGGGAAAGACAAAGCAATAGGACAATCAAGTGAGGCCCTGC
ACGTGATGGCAACGAATGCATTCCGAGAATTTGAAGATGAGGCGGCTTAG
mRNA sequence
ATGGGGGGTTCGTCTTTGTTTTGCCAGGGTAGGAAGAGTCTGCAGTCGATTCTATGGTTTTTAGGGATGCGGTTTTGCGACGTACGAATTGAGGGTTCCGAAGGGATTCT
TGGCTCCGTACAGAGGCGAACAATACCATCTCTCAGGGAGATGGGAATCGGTGCAATGCTTGACATGCTTGTTGAGGAGGATTCTGCACCAGTTGGTCTTGATAGAGATC
ATATTCAATTTGTTGAATCCTCGGAGGAATGGACCAAGTTCAGAGATGACTTGGCAGTAATGGTAGGGGATCTAAATACGTATGGTCGAAGGTGGAGGACGCTAAGTTGG
TGGAAGCCCTATTGTATTTGGTGGAGACCAGTTAGAGGTCCGACAATAGGACATTTCAACCAGGATACCGTGCTCGAGTGGGCACTAAACCAGAACACCATTGAGTGCAA
GAGTCATCCTAGTGCGAAGGGAATGTGGAATAAGTCATTCCCCCATTACGATGACCTCTCCACCGTATTTGGGAAAGACAAAGCAATAGGACAATCAAGTGAGGCCCTGC
ACGTGATGGCAACGAATGCATTCCGAGAATTTGAAGATGAGGCGGCTTAG
Protein sequence
MGGSSLFCQGRKSLQSILWFLGMRFCDVRIEGSEGILGSVQRRTIPSLREMGIGAMLDMLVEEDSAPVGLDRDHIQFVESSEEWTKFRDDLAVMVGDLNTYGRRWRTLSW
WKPYCIWWRPVRGPTIGHFNQDTVLEWALNQNTIECKSHPSAKGMWNKSFPHYDDLSTVFGKDKAIGQSSEALHVMATNAFREFEDEAA
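
The protein sequence corresponds to the CDS read codon by codon. As a rough sketch of that relationship, assuming the standard genetic code, the snippet below translates the first nine and the last six codons of the CDS listed above. The names `CODON_TABLE` and `translate` are illustrative helpers, not part of any CuGenDB tooling.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons and final six codons of the CDS sequence above.
cds_start = "ATGGGGGGTTCGTCTTTGTTTTGCCAG"
cds_end = "GAAGATGAGGCGGCTTAG"

print(translate(cds_start))  # MGGSSLFCQ, the start of the protein sequence
print(translate(cds_end))    # EDEAA, the end of the protein sequence
```

Passing the full CDS, joined into a single string without line breaks, to `translate` should give the complete protein under the same assumption.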