CuGenDBv2

Lcy03g005290 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy03g005290
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr03:28991154..28991591
RNA-Seq Expression: Lcy03g005290
Synteny: Lcy03g005290
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA
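
The genome location above can be turned into a feature length with a few lines of Python. This is a minimal sketch: the helper name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the page does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr03:28991154..28991591")
span = end - start + 1  # assumption: 1-based, inclusive coordinates
print(chrom, span)  # Chr03 438
```

Under that assumption the locus spans 438 bp.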


Homology
GenBank top hits | e-value | % identity
KGN65684.1 hypothetical protein Csa_019689 [Cucumis sativus] | 1.3e-38 | 57.93
Query:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI
        +YNS+  EV  Q++   NAKD+WEA    FGV+SRAEED+LRQ FQ +RK +  M DYLR+MK++ADNLGQA+S +  R L+SQVLLGLDE YN V+ +I
Subjt:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI

Query:  QGRMDISWSEMQAELLVFEKRLELQNSQKSVNSFSHNALVNMANS
        QG+ +ISW +MQ++LL+FEKRL+ QNSQK++ +   NA +NMA S
Subjt:  QGRMDISWSEMQAELLVFEKRLELQNSQKSVNSFSHNALVNMANS

XP_022148963.1 uncharacterized protein LOC111017501 [Momordica charantia] | 5.7e-39 | 70.07
Query:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI
        +YNSM  EVATQVM  ENA DLW AIQ LFGVQS+AEEDYLRQVFQQ+RK SLKM D+LRVMKSHADNLGQA S V TR+L+SQVLLGLDEEYN VVA I
Subjt:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI

Query:  QGRMDISWSEMQAELLVFEKRLELQNSQKSVNSFSHN
        QG+  ISW EMQAE        + QN Q S   F++N
Subjt:  QGRMDISWSEMQAELLVFEKRLELQNSQKSVNSFSHN

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia] | 1.8e-37 | 64.62
Query:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI
        +YNSM ++VA QVM    +++LW A+Q LFGVQSRAE DYL+QVFQQ+ K SL+M +YL++MKSHADNL  A S VS R+LVSQVL GLDEEYN +V  +
Subjt:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI

Query:  QGRMDISWSEMQAELLVFEKRLELQNSQKS
        QG++++SWSEM AELL +EKRLE QNS KS
Subjt:  QGRMDISWSEMQAELLVFEKRLELQNSQKS

XP_038905161.1 uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida] | 4.0e-40 | 63.27
Query:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI
        +YNSM  EVA QVM CE AKDLW +I  LFGVQSR EEDYLR VFQ +RK +LKM +YL+ MK + DNL QA S +  R LVSQVLLGLDEEYN++VAMI
Subjt:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI

Query:  QGRMDISWSEMQAELLVFEKRLELQNSQKSVNSFSH--NALVNMANS
        QGR+D+SW +MQ+ELL++E+RLE Q++QK+   F+   NA VNM N+
Subjt:  QGRMDISWSEMQAELLVFEKRLELQNSQKSVNSFSH--NALVNMANS

XP_038905164.1 uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida] | 4.0e-40 | 63.27
Query:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI
        +YNSM  EVA QVM CE AKDLW +I  LFGVQSR EEDYLR VFQ +RK +LKM +YL+ MK + DNL QA S +  R LVSQVLLGLDEEYN++VAMI
Subjt:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI

Query:  QGRMDISWSEMQAELLVFEKRLELQNSQKSVNSFSH--NALVNMANS
        QGR+D+SW +MQ+ELL++E+RLE Q++QK+   F+   NA VNM N+
Subjt:  QGRMDISWSEMQAELLVFEKRLELQNSQKSVNSFSH--NALVNMANS

TrEMBL top hits | e-value | % identity
A0A0A0LXB7 Uncharacterized protein | 6.2e-39 | 57.93
Query:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI
        +YNS+  EV  Q++   NAKD+WEA    FGV+SRAEED+LRQ FQ +RK +  M DYLR+MK++ADNLGQA+S +  R L+SQVLLGLDE YN V+ +I
Subjt:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI

Query:  QGRMDISWSEMQAELLVFEKRLELQNSQKSVNSFSHNALVNMANS
        QG+ +ISW +MQ++LL+FEKRL+ QNSQK++ +   NA +NMA S
Subjt:  QGRMDISWSEMQAELLVFEKRLELQNSQKSVNSFSHNALVNMANS

A0A5A7SIT7 Uncharacterized protein | 3.9e-33 | 54.79
Query:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI
        +YNSM  +VA Q+M   N +DLW+A Q  FGVQSRAEED+LRQ+ Q +RK + KM +YL VMK++ DNLGQ  S V  R L+SQVLLGLDE YN V+ +I
Subjt:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI

Query:  QGRMDISWSEMQAELLVFEKRLELQNSQ---KSVNSFSHNALVNMA
        QG+ DISW +MQ++LL+FEK L+ QN+Q   K   + + +  +NMA
Subjt:  QGRMDISWSEMQAELLVFEKRLELQNSQ---KSVNSFSHNALVNMA

A0A5D3E3L7 Uncharacterized protein | 1.3e-33 | 62.1
Query:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI
        +YNSM+ +VA Q+M    AKDLWEAIQ LFG++SRAEE +LR  FQ +R+ + KM DYLR+MK +ADNLGQA S V  R L+SQVLLGLDE YN V A+I
Subjt:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI

Query:  QGRMDISWSEMQAELLVFEKRLEL
        QG+ DISW +MQ+ELL+FE  +E+
Subjt:  QGRMDISWSEMQAELLVFEKRLEL

A0A6J1D5J0 uncharacterized protein LOC111017501 | 2.8e-39 | 70.07
Query:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI
        +YNSM  EVATQVM  ENA DLW AIQ LFGVQS+AEEDYLRQVFQQ+RK SLKM D+LRVMKSHADNLGQA S V TR+L+SQVLLGLDEEYN VVA I
Subjt:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI

Query:  QGRMDISWSEMQAELLVFEKRLELQNSQKSVNSFSHN
        QG+  ISW EMQAE        + QN Q S   F++N
Subjt:  QGRMDISWSEMQAELLVFEKRLELQNSQKSVNSFSHN

A0A6J1DCW4 uncharacterized protein LOC111019598 | 8.9e-38 | 64.62
Query:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI
        +YNSM ++VA QVM    +++LW A+Q LFGVQSRAE DYL+QVFQQ+ K SL+M +YL++MKSHADNL  A S VS R+LVSQVL GLDEEYN +V  +
Subjt:  MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMI

Query:  QGRMDISWSEMQAELLVFEKRLELQNSQKS
        QG++++SWSEM AELL +EKRLE QNS KS
Subjt:  QGRMDISWSEMQAELLVFEKRLELQNSQKS

SwissProt top hits | e-value | % identity
No hits found
Arabidopsis top hits | e-value | % identity
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 2.9e-04 | 28.43
Query:  AKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMIQGRMDISWSEMQAELLVF
        ++D+W  I+  F     A    L    +      +++ADY R MK  AD+L      V+ RNLV  VL GL+ ++++++ +I+ R      +  A +L  
Subjt:  AKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMIQGRMDISWSEMQAELLVF

Query:  EK
        E+
Subjt:  EK

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 1.8e-06 | 29.92
Query:  AKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMIQGRMDI-SWSEMQAELLV
        A+DLW +++ LF     A         + +  + L + +Y + +KS +D L    S +S R LV  +L GL E+Y+ ++ +I+ +    S++E ++ LL+
Subjt:  AKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMIQGRMDI-SWSEMQAELLV

Query:  FEKRLELQNSQKSVNSFSHNALVNMAN
         E RL    S KS +S SH    +++N
Subjt:  FEKRLELQNSQKSVNSFSHNALVNMAN
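
The % identity values reported for the hits can be related to the unlabeled middle row of each alignment. In BLAST-style output that row shows a letter where query and subject are identical, `+` for a conservative substitution, and a space otherwise. The sketch below counts the letters in one such row; the function name `percent_identity` is hypothetical, and the input is only the final 30-column segment of the XP_022151683.1 alignment, so the result is not expected to equal the 64.62 reported for the full alignment.

```python
def percent_identity(midline: str) -> float:
    """Percentage of alignment columns whose midline character is a letter.

    Letters mark identical residues; '+' and spaces mark non-identical columns.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)


# Midline of the last segment of the XP_022151683.1 alignment (30 columns).
segment = "QG++++SWSEM AELL +EKRLE QNS KS"
print(round(percent_identity(segment), 2))  # 70.0
```

To get a figure for a whole hit, the midline rows of all segments would have to be concatenated first, with leading indentation removed but internal spaces preserved.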


Sequences
CDS sequence
ATGTACAACTCGATGATCTCAGAAGTGGCGACTCAAGTAATGGACTGTGAAAATGCAAAGGACCTCTGGGAAGCTATTCAAGGACTGTTTGGTGTACAATCGAGAGCCGA
AGAGGACTACTTACGCCAGGTATTCCAACAATCTCGTAAAAATAGCTTAAAAATGGCTGATTATTTGCGTGTGATGAAAAGCCATGCAGATAACCTAGGTCAGGCTAAGA
GTCATGTATCTACAAGAAATCTTGTTTCACAAGTGTTGCTAGGGCTTGACGAGGAATATAATTCTGTGGTAGCCATGATACAAGGACGGATGGATATCTCTTGGTCTGAG
ATGCAAGCGGAATTGTTAGTGTTCGAAAAACGTTTGGAGCTACAAAACTCACAGAAGAGTGTCAATTCATTTAGCCACAATGCCTTAGTGAATATGGCTAACAGTTGA
mRNA sequence
ATGTACAACTCGATGATCTCAGAAGTGGCGACTCAAGTAATGGACTGTGAAAATGCAAAGGACCTCTGGGAAGCTATTCAAGGACTGTTTGGTGTACAATCGAGAGCCGA
AGAGGACTACTTACGCCAGGTATTCCAACAATCTCGTAAAAATAGCTTAAAAATGGCTGATTATTTGCGTGTGATGAAAAGCCATGCAGATAACCTAGGTCAGGCTAAGA
GTCATGTATCTACAAGAAATCTTGTTTCACAAGTGTTGCTAGGGCTTGACGAGGAATATAATTCTGTGGTAGCCATGATACAAGGACGGATGGATATCTCTTGGTCTGAG
ATGCAAGCGGAATTGTTAGTGTTCGAAAAACGTTTGGAGCTACAAAACTCACAGAAGAGTGTCAATTCATTTAGCCACAATGCCTTAGTGAATATGGCTAACAGTTGA
Protein sequence
MYNSMISEVATQVMDCENAKDLWEAIQGLFGVQSRAEEDYLRQVFQQSRKNSLKMADYLRVMKSHADNLGQAKSHVSTRNLVSQVLLGLDEEYNSVVAMIQGRMDISWSE
MQAELLVFEKRLELQNSQKSVNSFSHNALVNMANS
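
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state. The names `CODON_TABLE` and `translate` are illustrative only, and the input is just the last line of the CDS as displayed above, which happens to start on a codon boundary.

```python
from itertools import product

# Standard genetic code, amino acids listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Last displayed line of the CDS (108 nt, ending in the TGA stop codon).
cds_tail = (
    "ATGCAAGCGGAATTGTTAGTGTTCGAAAAACGTTTGGAGCTACAAAACTCACAGAAGAGT"
    "GTCAATTCATTTAGCCACAATGCCTTAGTGAATATGGCTAACAGTTGA"
)
print(translate(cds_tail))  # MQAELLVFEKRLELQNSQKSVNSFSHNALVNMANS
```

The output matches the last line of the protein sequence. Translating the full CDS works the same way once its four displayed lines are joined.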