; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy07g007770 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy07g007770
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr07:35578198..35578804
RNA-Seq ExpressionLcy07g007770
SyntenyLcy07g007770
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]8.1e-4260.71Show/hide
Query:  DAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKPTNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVWD
        DAQLNP+ +HHS   T+ +V QPLTGA NY SWSRAM+MA+SG+NK GFI G I+KP++  L  AW  NNDI+ASWI+NSVSKEI  SI+Y GS+K +WD
Subjt:  DAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKPTNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVWD

Query:  ELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK
        EL  RFK+S     +YQL+KE +T  QG L+IE Y+T+LK
Subjt:  ELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK

XP_022142771.1 uncharacterized protein LOC111012810 [Momordica charantia]6.4e-3164.81Show/hide
Query:  IDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKPTNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVW
        I++QLNP+L+HHS A T++LV Q L GA NY SWSR+M++ALSGKNK GF+DGTI+KP    LA AWK  NDII SWI+NSVSKEI  S VY+GS K +W
Subjt:  IDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKPTNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVW

Query:  DELADRFK
        DEL +RF+
Subjt:  DELADRFK

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia]1.6e-4264.54Show/hide
Query:  IDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKPTNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVW
        I++QLNP+L+HHS A T++LV Q L GA NY SW R+M++ALSGKNK GFIDGTI+KP N  L  AWK NNDII SWIINSVSKEI  SI+Y+GS K++W
Subjt:  IDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKPTNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVW

Query:  DELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK
        DEL +RF++S  +  ++QL+KEL+TT QGTLSIE Y+T+LK
Subjt:  DELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK

XP_022888913.1 uncharacterized protein LOC111404319 [Olea europaea var. sylvestris]1.0e-2847.44Show/hide
Query:  MANKEVSDGDVSLNIDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKP-TNAKLAKAWKFNNDIIASWIINSVSKE
        M+N  + DG          +P+ LHHS +   VLV+QPL G  NY SWSRAM +ALS KNK  FI+G+I KP  N  L  AW  NN+++ SWI+NSVSKE
Subjt:  MANKEVSDGDVSLNIDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKP-TNAKLAKAWKFNNDIIASWIINSVSKE

Query:  ITTSIVYSGSVKNVWDELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK
        I+ SI+Y+ SV+ +WD+L +R+++   +  ++QL++EL+   QG  S+ VYFT+LK
Subjt:  ITTSIVYSGSVKNVWDELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK

XP_022891830.1 uncharacterized protein LOC111406673 [Olea europaea var. sylvestris]3.2e-3052.11Show/hide
Query:  IDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKP-TNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNV
        ID   +P+ LHHS +   +LV+QPLTG  NY SWSRAM++ALS KNK GFIDG+I +P +N  L  +W  NN+I+ SWI+NSVSK+IT SI+YS S   +
Subjt:  IDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKP-TNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNV

Query:  WDELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK
        W +L +RF++      ++QL++EL+  TQG +S  VYFT+LK
Subjt:  WDELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK

TrEMBL top hitse value%identityAlignment
A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 83.9e-4260.71Show/hide
Query:  DAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKPTNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVWD
        DAQLNP+ +HHS   T+ +V QPLTGA NY SWSRAM+MA+SG+NK GFI G I+KP++  L  AW  NNDI+ASWI+NSVSKEI  SI+Y GS+K +WD
Subjt:  DAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKPTNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVWD

Query:  ELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK
        EL  RFK+S     +YQL+KE +T  QG L+IE Y+T+LK
Subjt:  ELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK

A0A5J5BKC2 Uncharacterized protein1.2e-2747.55Show/hide
Query:  IDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKP--TNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKN
        I+   NP+ LHHS +L  +LV+Q LTG  NY +WSRAM++ALS KNK GF+DG+I +P  T   L  +W  NN+I+ SWI+NSVSKEI+ SI+++ S + 
Subjt:  IDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKP--TNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKN

Query:  VWDELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK
        +W +L DRF++      ++QL++EL+   Q   S+ +YFT+LK
Subjt:  VWDELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK

A0A6J1CN69 uncharacterized protein LOC1110128103.1e-3164.81Show/hide
Query:  IDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKPTNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVW
        I++QLNP+L+HHS A T++LV Q L GA NY SWSR+M++ALSGKNK GF+DGTI+KP    LA AWK  NDII SWI+NSVSKEI  S VY+GS K +W
Subjt:  IDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKPTNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVW

Query:  DELADRFK
        DEL +RF+
Subjt:  DELADRFK

A0A6J1CXR2 uncharacterized protein LOC1110152397.9e-4364.54Show/hide
Query:  IDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKPTNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVW
        I++QLNP+L+HHS A T++LV Q L GA NY SW R+M++ALSGKNK GFIDGTI+KP N  L  AWK NNDII SWIINSVSKEI  SI+Y+GS K++W
Subjt:  IDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKPTNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVW

Query:  DELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK
        DEL +RF++S  +  ++QL+KEL+TT QGTLSIE Y+T+LK
Subjt:  DELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK

A0A7J0FKC9 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein2.1e-2746.79Show/hide
Query:  NKEVSDGDVS-LNIDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKP--TNAKLAKAWKFNNDIIASWIINSVSKE
        N E S   ++ L  D   +P+ LHHS     VLV+Q LTG  NY SW+RAM++ALS KNK GFIDG+I KP   +  L  +W  NN+++ SWI+NSVSKE
Subjt:  NKEVSDGDVS-LNIDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKP--TNAKLAKAWKFNNDIIASWIINSVSKE

Query:  ITTSIVYSGSVKNVWDELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK
        I+ SI++S S   +W +L DRF++S     ++QL++EL+   Q    + VYFT+LK
Subjt:  ITTSIVYSGSVKNVWDELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).6.4e-1333.33Show/hide
Query:  NYVSWSRAMMMALSGKNKDGFIDGTIEKPTN-AKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVWDELADRFKESTLSIEVYQLQKELITTTQ
        NYV+W       L    K GFIDGT+ KP   + L + W+  N ++  W++NS++ ++  S++Y+ +   +W++L  R     + +++YQL++ L T  Q
Subjt:  NYVSWSRAMMMALSGKNKDGFIDGTIEKPTN-AKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSVKNVWDELADRFKESTLSIEVYQLQKELITTTQ

Query:  GTLSIEVYFTRLKK
        G  S+E YF +L K
Subjt:  GTLSIEVYFTRLKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAACAAAGAAGTTTCTGATGGCGATGTTTCGTTAAACATTGATGCACAATTAAATCCATTTCTTCTGCATCACTCTTTTGCTTTGACGTCTGTTCTTGTTAATCA
GCCATTGACTGGTGCATATAACTATGTTTCATGGAGCCGAGCAATGATGATGGCTCTTTCTGGTAAAAACAAGGATGGATTCATTGATGGCACCATTGAGAAACCTACCA
ATGCAAAATTGGCCAAGGCTTGGAAGTTCAACAACGATATTATCGCTTCGTGGATTATTAATTCGGTTTCAAAAGAAATCACGACCAGCATCGTCTATTCCGGTTCTGTA
AAAAATGTATGGGATGAACTTGCTGACCGATTCAAGGAAAGTACTTTGTCCATTGAAGTTTATCAATTGCAAAAGGAACTGATTACAACAACTCAAGGTACTTTGTCCAT
TGAAGTTTATTTTACCAGATTGAAAAAAAATTTGGCAAAATTTGAATGA
mRNA sequenceShow/hide mRNA sequence
AGAGCATTAATTTTTTTCTCCTTCTTTCTGCGCATTATCTTCGATTTCGCTTCTTCTTGCTCGAATTTCCTTTCCAATGGCGAACAAAGAAGTTTCTGATGGCGATGTTT
CGTTAAACATTGATGCACAATTAAATCCATTTCTTCTGCATCACTCTTTTGCTTTGACGTCTGTTCTTGTTAATCAGCCATTGACTGGTGCATATAACTATGTTTCATGG
AGCCGAGCAATGATGATGGCTCTTTCTGGTAAAAACAAGGATGGATTCATTGATGGCACCATTGAGAAACCTACCAATGCAAAATTGGCCAAGGCTTGGAAGTTCAACAA
CGATATTATCGCTTCGTGGATTATTAATTCGGTTTCAAAAGAAATCACGACCAGCATCGTCTATTCCGGTTCTGTAAAAAATGTATGGGATGAACTTGCTGACCGATTCA
AGGAAAGTACTTTGTCCATTGAAGTTTATCAATTGCAAAAGGAACTGATTACAACAACTCAAGGTACTTTGTCCATTGAAGTTTATTTTACCAGATTGAAAAAAAATTTG
GCAAAATTTGAATGATTTTTGACCTAGCCTTGACTGCAATTGTGGAAGTATAAAGGA
Protein sequenceShow/hide protein sequence
MANKEVSDGDVSLNIDAQLNPFLLHHSFALTSVLVNQPLTGAYNYVSWSRAMMMALSGKNKDGFIDGTIEKPTNAKLAKAWKFNNDIIASWIINSVSKEITTSIVYSGSV
KNVWDELADRFKESTLSIEVYQLQKELITTTQGTLSIEVYFTRLKKNLAKFE