; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg034979 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg034979
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold3:27095114..27097278
RNA-Seq ExpressionSpg034979
SyntenySpg034979
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013723.1 hypothetical protein SDJN02_23890, partial [Cucurbita argyrosperma subsp. argyrosperma]2.9e-5748.25Show/hide
Query:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT
        M   +++   PLL+ATSL  + +K+ADV+F  +  S+IASH SPRF ATLQ+S   FT YSVDH +S++ SL SFHDA+L    F S++ H+LE+  Q+ 
Subjt:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT

Query:  LTFKTSTRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTIVG
        L ++  +     PPL   LPL P Q E L ++ +G FF + S+ LR+++ ELP+  D+ V V  T ++VKF IASKEI +TK         +   C IVG
Subjt:  LTFKTSTRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTIVG

Query:  YEGV-ETQFQIVLQPLSFFLLLSYKAERLWFYRTTNSHNVISLPAFGMYAQYVIYFP
        YEG  ET+  I  +P+ FFL  +YKA R+WFY+TTNS +VIS+PAFGMY QYV+YFP
Subjt:  YEGV-ETQFQIVLQPLSFFLLLSYKAERLWFYRTTNSHNVISLPAFGMYAQYVIYFP

XP_008464344.1 PREDICTED: uncharacterized protein LOC103502250 [Cucumis melo]6.9e-5147.94Show/hide
Query:  MLKFKVENVYPLLEATSLFAEFA-KEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQI
        M   K++N  PLL+ATS  A+ +   AD+KF P  F IIASH SPRF ATLQ+S  +FT +SVD++HS++ SL SFHDA+L    F S+T HLL+  NQ+
Subjt:  MLKFKVENVYPLLEATSLFAEFA-KEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQI

Query:  TLTFKTSTRLWRAPPLQHHLPLLPSQEE-------RLDKIVFGMFFVIKSEDLRRVLIELPILQ-DNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLV
         L F T +      PL H L L P Q E        LD+     +F++KS+ LRR++ +LPI Q D+ + V +T+S+VKF IASKEIILT          
Subjt:  TLTFKTSTRLWRAPPLQHHLPLLPSQEE-------RLDKIVFGMFFVIKSEDLRRVLIELPILQ-DNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLV

Query:  QDRQCTIVGY-EGVETQFQIVLQPLSFFLLLSYKAERLWFYRT-TNSHNVISLPAFGMYAQYVIYFP
        + R C I G+ E VETQFQI+L P+ FFL  +YKA R+WFY+T  N++ ++ +PA+G++ QYVIYFP
Subjt:  QDRQCTIVGY-EGVETQFQIVLQPLSFFLLLSYKAERLWFYRT-TNSHNVISLPAFGMYAQYVIYFP

XP_023006010.1 uncharacterized protein LOC111498887 [Cucurbita maxima]7.7e-5048.45Show/hide
Query:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT
        M   ++ +  PL EATSL A+ + EAD+KF    FS+I S+PS RF AT QISH FF  YSVD NHS+R SL+SF+DA+     F S+T H  ET +++ 
Subjt:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT

Query:  LTFKTS--TRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTI
        L F++S  T+L     +   L L PSQEE L +I    FF I S+D R ++  LP   +N + V+LT S+VKFC AS+E ILTK         +  +C I
Subjt:  LTFKTS--TRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTI

Query:  VGYEG-VETQFQIVLQPLSFFLLLSYKAERLWFYRTTNSHNVISLPAFGMYAQYVIYF
        +GYEG  E  FQI L P  FF  LSY A R+WFY+T +S  VI +PAFG+ AQYVIYF
Subjt:  VGYEG-VETQFQIVLQPLSFFLLLSYKAERLWFYRTTNSHNVISLPAFGMYAQYVIYF

XP_031744160.1 uncharacterized protein LOC116404808 [Cucumis sativus]1.4e-5147.47Show/hide
Query:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT
        M   +++N  P   ATS  A  A+EADVKF P  FSI  S+  PRF A L ++++ F  Y VD++H++R SL SFHDALL      S+T HLL  +NQ+ 
Subjt:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT

Query:  LTFKTSTRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTIVG
        L F++S+    AP ++H L L PSQEE L +I +  FF I S+ LRRV+  LPI   + +CVT T SQVKF IASKEI+LTK         ++ +C IVG
Subjt:  LTFKTSTRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTIVG

Query:  YEG-VETQFQIVLQPLSFFLLLSYKAERLWFYRTTNSHNVISLPAFGMYAQYVIYFP
        YEG  ET+  I L P+ FFL  ++   R+WFY+TT  H  + +P+FG Y+QYVI FP
Subjt:  YEG-VETQFQIVLQPLSFFLLLSYKAERLWFYRTTNSHNVISLPAFGMYAQYVIYFP

XP_038875055.1 uncharacterized protein LOC120067580 [Benincasa hispida]6.5e-5750.19Show/hide
Query:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT
        M   K+ N  PLL+ATS  A+ +  ADVKF P  F +IA +PSPRF ATLQ+S   FT YSVDH H+++  L SFHDA+L    F S+T HLLE  NQ+ 
Subjt:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT

Query:  LTFKTSTRLWRAPPLQHHLPLLPSQ---EERLDKIVFGMFFVIKSEDLRRVLIELPILQDNP-VCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQC
        L F+T +     PPL H L   P Q        ++  G FF++KSE LRR++ ELPI QD+  VCV +T SQ+KF IASKEI+L           + + C
Subjt:  LTFKTSTRLWRAPPLQHHLPLLPSQ---EERLDKIVFGMFFVIKSEDLRRVLIELPILQDNP-VCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQC

Query:  TIVGY-EGVETQFQIVLQPLSFFLLLSYKAERLWFYRT-TNSHNVISLPAFGMYAQYVIYFPP
         IVG+ E VETQFQI+L+P+ FFL  +YKA ++WFY+T  NS++V+++PAFG+  QYVIYFPP
Subjt:  TIVGY-EGVETQFQIVLQPLSFFLLLSYKAERLWFYRT-TNSHNVISLPAFGMYAQYVIYFPP

TrEMBL top hitse value%identityAlignment
A0A1S3C8J1 uncharacterized protein LOC1034980101.6e-4849.12Show/hide
Query:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT
        M   ++E   PL++ATSL A+ AK+ADVKF P    II S+ SP+F ATLQ+S   FT +SVDHN S++ SL+ FHDA+L    F S+T HLL+T NQ+ 
Subjt:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT

Query:  LTFKTSTRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTIVG
        L F+T +     PPL H L L P Q E L ++ +G FF + S +LRR++ ELP+   + V VT+T SQVKF I SKEIILTK         +   C IVG
Subjt:  LTFKTSTRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTIVG

Query:  YEG-VETQFQIVLQPLSFFLLLSYKAER
        YEG VET+ Q+VL+P+ FFL  +Y+A +
Subjt:  YEG-VETQFQIVLQPLSFFLLLSYKAER

A0A1S3CL88 uncharacterized protein LOC1035022503.4e-5147.94Show/hide
Query:  MLKFKVENVYPLLEATSLFAEFA-KEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQI
        M   K++N  PLL+ATS  A+ +   AD+KF P  F IIASH SPRF ATLQ+S  +FT +SVD++HS++ SL SFHDA+L    F S+T HLL+  NQ+
Subjt:  MLKFKVENVYPLLEATSLFAEFA-KEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQI

Query:  TLTFKTSTRLWRAPPLQHHLPLLPSQEE-------RLDKIVFGMFFVIKSEDLRRVLIELPILQ-DNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLV
         L F T +      PL H L L P Q E        LD+     +F++KS+ LRR++ +LPI Q D+ + V +T+S+VKF IASKEIILT          
Subjt:  TLTFKTSTRLWRAPPLQHHLPLLPSQEE-------RLDKIVFGMFFVIKSEDLRRVLIELPILQ-DNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLV

Query:  QDRQCTIVGY-EGVETQFQIVLQPLSFFLLLSYKAERLWFYRT-TNSHNVISLPAFGMYAQYVIYFP
        + R C I G+ E VETQFQI+L P+ FFL  +YKA R+WFY+T  N++ ++ +PA+G++ QYVIYFP
Subjt:  QDRQCTIVGY-EGVETQFQIVLQPLSFFLLLSYKAERLWFYRT-TNSHNVISLPAFGMYAQYVIYFP

A0A6J1H2Z8 uncharacterized protein LOC1114600116.3e-5048.06Show/hide
Query:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT
        M   ++ +  PL+EATS+ A+ + EAD+KF    FS+I S+PS RF AT QISH FF  Y VD NHS+R SL+SF++A+ A   F S+T H  ET +++ 
Subjt:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT

Query:  LTFKTS--TRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTI
        L F++S  TR+     +   L L PSQEE L +I    FF I S+D R ++  LP   +N + V+LT S+VKFC AS+E ILTK         +  +C I
Subjt:  LTFKTS--TRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTI

Query:  VGYEG-VETQFQIVLQPLSFFLLLSYKAERLWFYRTTNSHNVISLPAFGMYAQYVIYF
        VGYEG  E  FQI L P  FF  LSY A R+WFY+T +S  VI +PAFG+ AQYVIYF
Subjt:  VGYEG-VETQFQIVLQPLSFFLLLSYKAERLWFYRTTNSHNVISLPAFGMYAQYVIYF

A0A6J1KIW5 uncharacterized protein LOC1114948742.7e-4843.02Show/hide
Query:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT
        M   +++N  PL++ TS  A+ A+E+D+ F P +  +  S PSPRF ATLQI H  FT YSV+ +H +R SL S HDALL      ++T HLLE  N + 
Subjt:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT

Query:  LTFKTSTRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTIVG
        L F+T       P L+H   LLP QE+ +++I +     + S DLR+V+ ELP+   + VCVT+T S+V+F IAS+E+I  K         +  +C I+G
Subjt:  LTFKTSTRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTIVG

Query:  YEG-VETQFQIVLQPLSFFLLLSYKAERLWFYRT-TNSHNVISLPAFGMYAQYVIYFP
        ++G   T+F+IVL P+ FFL L+Y  + +WF++T TN+H+V+  P F ++AQYVIYFP
Subjt:  YEG-VETQFQIVLQPLSFFLLLSYKAERLWFYRT-TNSHNVISLPAFGMYAQYVIYFP

A0A6J1KZ05 uncharacterized protein LOC1114988873.7e-5048.45Show/hide
Query:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT
        M   ++ +  PL EATSL A+ + EAD+KF    FS+I S+PS RF AT QISH FF  YSVD NHS+R SL+SF+DA+     F S+T H  ET +++ 
Subjt:  MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQIT

Query:  LTFKTS--TRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTI
        L F++S  T+L     +   L L PSQEE L +I    FF I S+D R ++  LP   +N + V+LT S+VKFC AS+E ILTK         +  +C I
Subjt:  LTFKTS--TRLWRAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTI

Query:  VGYEG-VETQFQIVLQPLSFFLLLSYKAERLWFYRTTNSHNVISLPAFGMYAQYVIYF
        +GYEG  E  FQI L P  FF  LSY A R+WFY+T +S  VI +PAFG+ AQYVIYF
Subjt:  VGYEG-VETQFQIVLQPLSFFLLLSYKAERLWFYRTTNSHNVISLPAFGMYAQYVIYF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAAATTCAAGGTGGAAAACGTCTATCCTCTCCTGGAAGCAACCTCCCTATTCGCTGAATTTGCAAAGGAAGCCGACGTCAAATTCAAACCGTGGGCATTCTCCAT
CATTGCCTCACACCCTTCCCCTCGCTTCCATGCAACCCTCCAAATCTCTCACCATTTCTTCACCAAATATTCCGTCGATCACAATCACAGCGCCAGGACTTCCCTTCGGT
CCTTCCACGACGCTCTGTTGGCCGACCGAGATTTCGATTCCATCACATTCCATCTTCTTGAAACCCTAAACCAAATCACCCTCACCTTCAAGACTTCCACACGTTTGTGG
CGCGCGCCGCCGTTGCAACATCATTTGCCGTTGTTGCCTTCGCAAGAGGAGCGCTTAGATAAAATTGTATTCGGAATGTTCTTCGTAATCAAATCGGAAGATCTAAGGCG
CGTTCTAATAGAACTGCCTATCCTCCAAGATAATCCTGTTTGTGTTACTCTAACGGATTCACAAGTCAAATTCTGTATTGCATCCAAAGAGATTATTCTTACCAAAAAGT
TGTCGGCTAATGTTGTGCTTGTGCAGGATCGACAATGTACAATTGTAGGGTACGAGGGAGTTGAAACTCAGTTCCAAATCGTTCTTCAACCTTTGTCGTTTTTCCTTCTT
TTATCATATAAAGCGGAGAGGTTATGGTTTTATAGGACAACTAATTCTCATAATGTAATTAGTCTTCCCGCTTTTGGAATGTATGCTCAATACGTGATCTATTTTCCCCC
AAATGTGACATGCGGGAAAATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGAAATTCAAGGTGGAAAACGTCTATCCTCTCCTGGAAGCAACCTCCCTATTCGCTGAATTTGCAAAGGAAGCCGACGTCAAATTCAAACCGTGGGCATTCTCCAT
CATTGCCTCACACCCTTCCCCTCGCTTCCATGCAACCCTCCAAATCTCTCACCATTTCTTCACCAAATATTCCGTCGATCACAATCACAGCGCCAGGACTTCCCTTCGGT
CCTTCCACGACGCTCTGTTGGCCGACCGAGATTTCGATTCCATCACATTCCATCTTCTTGAAACCCTAAACCAAATCACCCTCACCTTCAAGACTTCCACACGTTTGTGG
CGCGCGCCGCCGTTGCAACATCATTTGCCGTTGTTGCCTTCGCAAGAGGAGCGCTTAGATAAAATTGTATTCGGAATGTTCTTCGTAATCAAATCGGAAGATCTAAGGCG
CGTTCTAATAGAACTGCCTATCCTCCAAGATAATCCTGTTTGTGTTACTCTAACGGATTCACAAGTCAAATTCTGTATTGCATCCAAAGAGATTATTCTTACCAAAAAGT
TGTCGGCTAATGTTGTGCTTGTGCAGGATCGACAATGTACAATTGTAGGGTACGAGGGAGTTGAAACTCAGTTCCAAATCGTTCTTCAACCTTTGTCGTTTTTCCTTCTT
TTATCATATAAAGCGGAGAGGTTATGGTTTTATAGGACAACTAATTCTCATAATGTAATTAGTCTTCCCGCTTTTGGAATGTATGCTCAATACGTGATCTATTTTCCCCC
AAATGTGACATGCGGGAAAATCTAG
Protein sequenceShow/hide protein sequence
MLKFKVENVYPLLEATSLFAEFAKEADVKFKPWAFSIIASHPSPRFHATLQISHHFFTKYSVDHNHSARTSLRSFHDALLADRDFDSITFHLLETLNQITLTFKTSTRLW
RAPPLQHHLPLLPSQEERLDKIVFGMFFVIKSEDLRRVLIELPILQDNPVCVTLTDSQVKFCIASKEIILTKKLSANVVLVQDRQCTIVGYEGVETQFQIVLQPLSFFLL
LSYKAERLWFYRTTNSHNVISLPAFGMYAQYVIYFPPNVTCGKI