; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G022080 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G022080
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionYqgFc domain-containing protein
Genome locationchr05:28653660..28659211
RNA-Seq ExpressionLsi05G022080
SyntenyLsi05G022080
Gene Ontology termsGO:0000967 - rRNA 5'-end processing (biological process)
InterPro domainsIPR005227 - Putative pre-16S rRNA nuclease
IPR006641 - YqgF/RNase H-like domain
IPR012337 - Ribonuclease H-like superfamily
IPR037027 - YqgF/RNase H-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582728.1 hypothetical protein SDJN03_22730, partial [Cucurbita argyrosperma subsp. sororia]5.1e-7479.49Show/hide
Query:  MRAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR
        M A  FQPFPLQSH L S  IFC PP PSSL  LPPSP LICKC+  ISS+ELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR
Subjt:  MRAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR

Query:  GQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL
        GQKLE KLIEIAEQE                 EADEFIIGLPKS DGKETP SNKIRSIAGRVAA+AAERGWRVYLHDEHGTTAEAESHMI + L
Subjt:  GQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL

XP_022979278.1 uncharacterized protein LOC111479048 [Cucurbita maxima]1.4e-7480Show/hide
Query:  MRAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR
        M A  FQPFPLQSH L S  IFC PP PSSL  LPPSP LICKC+  ISS+ELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR
Subjt:  MRAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR

Query:  GQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL
        GQKLE KLIEIAEQE                 EADEFIIGLPKS DGKETPQSNKIRSIAGRVAA+AAERGWRVYLHDEHGTTAEAESHMI + L
Subjt:  GQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL

XP_023528266.1 uncharacterized protein LOC111791236 [Cucurbita pepo subsp. pepo]8.8e-7479.49Show/hide
Query:  MRAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR
        M A  FQPFP QSH L S  IFC PP PSSL  LPPSP LICKC+  ISS+ELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR
Subjt:  MRAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR

Query:  GQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL
        GQKLE KLIEIAEQE                 EADEFIIGLPKS DGKETPQSNKIRSIAGRVAA+AAERGWRVYLHDEHGTTAEAESHMI + L
Subjt:  GQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL

XP_038893103.1 putative pre-16S rRNA nuclease isoform X1 [Benincasa hispida]6.3e-8084.02Show/hide
Query:  RAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELRG
        R Q FQPFPLQSH +  PTIFCLP  P SLAPLPPSPTLICKCQK ISSI+LPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPL VLELRG
Subjt:  RAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELRG

Query:  QKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL
        QKLEVKLIEIAEQE                 EADEFIIGLPKSCDGKETPQSNKIRSIAGRVA RAAERGWRVYLHDEHGTTAEAESHMISK L
Subjt:  QKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL

XP_038893121.1 putative pre-16S rRNA nuclease isoform X3 [Benincasa hispida]6.3e-8084.02Show/hide
Query:  RAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELRG
        R Q FQPFPLQSH +  PTIFCLP  P SLAPLPPSPTLICKCQK ISSI+LPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPL VLELRG
Subjt:  RAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELRG

Query:  QKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL
        QKLEVKLIEIAEQE                 EADEFIIGLPKSCDGKETPQSNKIRSIAGRVA RAAERGWRVYLHDEHGTTAEAESHMISK L
Subjt:  QKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL

TrEMBL top hitse value%identityAlignment
A0A0A0L3L5 YqgFc domain-containing protein8.9e-6474.74Show/hide
Query:  QSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKC--QKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELRG
        Q+FQ FPLQ+H L       LPP        P SP LI K     PISSIELPPNALRRKLDP WRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELRG
Subjt:  QSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKC--QKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELRG

Query:  QKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL
        QKLE KLIEIAEQE                 EADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYL+DEHGTTAEAESHMIS+ L
Subjt:  QKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL

A0A6J1CS97 uncharacterized protein LOC111014097 isoform X37.5e-6370.94Show/hide
Query:  MRAQSFQPFPLQS-----HNLQSPTIFCLPPAPSSLAPLPPSPTLICKC---QKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTR
        M +Q FQPFP QS       LQS TI  LP   +   P+     L C C    + ISSIELPPNALRRKLDP WRGGFSLGVDLGTSRTGLALSKGFSTR
Subjt:  MRAQSFQPFPLQS-----HNLQSPTIFCLPPAPSSLAPLPPSPTLICKC---QKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTR

Query:  PLTVLELRGQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMIS
        PLTVLELRG KLEVKLIEIAEQE                 EADEFIIGLPKS DGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTT+EAE+HMI 
Subjt:  PLTVLELRGQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMIS

Query:  KQL
        K L
Subjt:  KQL

A0A6J1CTB8 uncharacterized protein LOC111014097 isoform X17.5e-6370.94Show/hide
Query:  MRAQSFQPFPLQS-----HNLQSPTIFCLPPAPSSLAPLPPSPTLICKC---QKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTR
        M +Q FQPFP QS       LQS TI  LP   +   P+     L C C    + ISSIELPPNALRRKLDP WRGGFSLGVDLGTSRTGLALSKGFSTR
Subjt:  MRAQSFQPFPLQS-----HNLQSPTIFCLPPAPSSLAPLPPSPTLICKC---QKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTR

Query:  PLTVLELRGQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMIS
        PLTVLELRG KLEVKLIEIAEQE                 EADEFIIGLPKS DGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTT+EAE+HMI 
Subjt:  PLTVLELRGQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMIS

Query:  KQL
        K L
Subjt:  KQL

A0A6J1EA74 uncharacterized protein LOC1114321857.2e-7479.49Show/hide
Query:  MRAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR
        M A  FQPFPLQS  L S  IFC PP PSSL  LPPSP LICKC+  ISS+ELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR
Subjt:  MRAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR

Query:  GQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL
        GQKLE KLIEIAEQE                 EADEFIIGLPKS DGKETPQSNKIRSIAGRVAA+AAERGWRVYLHDEHGTTAEAESHMI + L
Subjt:  GQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL

A0A6J1IVR0 uncharacterized protein LOC1114790486.6e-7580Show/hide
Query:  MRAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR
        M A  FQPFPLQSH L S  IFC PP PSSL  LPPSP LICKC+  ISS+ELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR
Subjt:  MRAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELR

Query:  GQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL
        GQKLE KLIEIAEQE                 EADEFIIGLPKS DGKETPQSNKIRSIAGRVAA+AAERGWRVYLHDEHGTTAEAESHMI + L
Subjt:  GQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMISKQL

SwissProt top hitse value%identityAlignment
A1SJC8 Putative pre-16S rRNA nuclease5.6e-0736Show/hide
Query:  RGGFSLGVDLGTSRTGLALS--KGFSTRPL-TVLELRGQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRV
        R G  +G+D G +R G+A S   GF   P+ TV   +G    +  I  AE++       T+L          E ++GLP+S  G+E P + K+R  AGR+
Subjt:  RGGFSLGVDLGTSRTGLALS--KGFSTRPL-TVLELRGQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRV

Query:  AARAAERGWRVYLHDEHGTTAEAES
        AAR A     V L DE  TT  AE+
Subjt:  AARAAERGWRVYLHDEHGTTAEAES

Arabidopsis top hitse value%identityAlignment
AT1G12244.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.2e-4665.71Show/hide
Query:  ELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELRGQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETP
        E+PPNA+RRK+D  WRGGFSLGVDLG SRTG+A+SKG++ +PLTVL+ RGQKLE +L+EIAE+E                 EADEFIIGLP+S DGKET 
Subjt:  ELPPNALRRKLDPQWRGGFSLGVDLGTSRTGLALSKGFSTRPLTVLELRGQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETP

Query:  QSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMI
        QSNKIRS+AGR+A +AAERGWRVY+ DEHGTT+EA   MI
Subjt:  QSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAESHMI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATATTTTGGCGCGGCCGCGCGGGTTGTGATGTGCGAAACAGCGGCACCGTGACTGTGAGTGCGGAGCTTCTTCTTCTATCGTGTTGGTTGGAAATAATGCGTGCGCA
AAGTTTCCAACCCTTTCCGCTGCAAAGCCACAATCTTCAGTCTCCGACGATTTTCTGTCTTCCACCAGCCCCTTCCTCTCTTGCTCCTCTGCCTCCATCTCCAACCCTAA
TTTGCAAATGCCAGAAGCCAATTTCCTCCATTGAACTTCCTCCCAACGCCCTTCGCCGCAAGCTCGACCCTCAGTGGAGAGGAGGTTTCAGTCTAGGGGTCGACCTCGGA
ACCTCTCGCACTGGACTTGCTCTTAGTAAAGGCTTCTCCACTCGTCCTCTTACCGTTCTGGAGTTGCGAGGACAAAAGCTTGAGGTTAAGCTTATTGAGATTGCTGAACA
GGAAGTATACATCTCTCCCTCTCTGACGATCTTGCGAATGAAATGTTATGAACATGAGGCTGATGAATTTATTATTGGACTTCCTAAATCATGCGATGGAAAAGAGACGC
CTCAGTCAAACAAAATTCGAAGTATTGCCGGAAGGGTGGCAGCCCGGGCAGCTGAAAGGGGCTGGAGAGTTTACTTGCATGATGAACACGGGACAACAGCAGAAGCGGAA
AGCCATATGATTTCCAAACAATTGGTTGCTAATTGCTATGCTAGTTGCGAACCATTGTTGAATCTCCTTGGCTTGGGCTGGAGAATGCCTCACTGTCTTTGGTGCACTCT
TCTAATCAACTGGAGTTCACCATTTCCTATTATAAAAAGAAAAGGCGAATTTGACAAAGGCAAACAAAATCTAATTGGTGGCATAACAACCCTATTAGTGGTCGTGGTCA
CAGTCTTCTACCGAAGCTCAAAGATTCTCCATCTTTTCAAAAGCACAATGAAATTAGAGCCGGGCCTTTTAAGTCCAAGCACAACTTATTTCAATCACATCATGGCTGTT
AAGCTTAGGAAAAGCACCATCGTTGGGATCCTTGGTAAATATAGGTTCAACATTTTAAATGACAAAGAACTAGAAGAGGACATCGTTGATTTTTATAAACAGCTATATAC
CAAGAAAGCCCTATTACTATCGAAGGGTGCATCTCCTCCCCTATTTCTCCATCATCATCAACGACAAGTCACAAGGCTTGAGGAAAGATCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGATATTTTGGCGCGGCCGCGCGGGTTGTGATGTGCGAAACAGCGGCACCGTGACTGTGAGTGCGGAGCTTCTTCTTCTATCGTGTTGGTTGGAAATAATGCGTGCGCA
AAGTTTCCAACCCTTTCCGCTGCAAAGCCACAATCTTCAGTCTCCGACGATTTTCTGTCTTCCACCAGCCCCTTCCTCTCTTGCTCCTCTGCCTCCATCTCCAACCCTAA
TTTGCAAATGCCAGAAGCCAATTTCCTCCATTGAACTTCCTCCCAACGCCCTTCGCCGCAAGCTCGACCCTCAGTGGAGAGGAGGTTTCAGTCTAGGGGTCGACCTCGGA
ACCTCTCGCACTGGACTTGCTCTTAGTAAAGGCTTCTCCACTCGTCCTCTTACCGTTCTGGAGTTGCGAGGACAAAAGCTTGAGGTTAAGCTTATTGAGATTGCTGAACA
GGAAGTATACATCTCTCCCTCTCTGACGATCTTGCGAATGAAATGTTATGAACATGAGGCTGATGAATTTATTATTGGACTTCCTAAATCATGCGATGGAAAAGAGACGC
CTCAGTCAAACAAAATTCGAAGTATTGCCGGAAGGGTGGCAGCCCGGGCAGCTGAAAGGGGCTGGAGAGTTTACTTGCATGATGAACACGGGACAACAGCAGAAGCGGAA
AGCCATATGATTTCCAAACAATTGGTTGCTAATTGCTATGCTAGTTGCGAACCATTGTTGAATCTCCTTGGCTTGGGCTGGAGAATGCCTCACTGTCTTTGGTGCACTCT
TCTAATCAACTGGAGTTCACCATTTCCTATTATAAAAAGAAAAGGCGAATTTGACAAAGGCAAACAAAATCTAATTGGTGGCATAACAACCCTATTAGTGGTCGTGGTCA
CAGTCTTCTACCGAAGCTCAAAGATTCTCCATCTTTTCAAAAGCACAATGAAATTAGAGCCGGGCCTTTTAAGTCCAAGCACAACTTATTTCAATCACATCATGGCTGTT
AAGCTTAGGAAAAGCACCATCGTTGGGATCCTTGGTAAATATAGGTTCAACATTTTAAATGACAAAGAACTAGAAGAGGACATCGTTGATTTTTATAAACAGCTATATAC
CAAGAAAGCCCTATTACTATCGAAGGGTGCATCTCCTCCCCTATTTCTCCATCATCATCAACGACAAGTCACAAGGCTTGAGGAAAGATCCTAG
Protein sequenceShow/hide protein sequence
MIFWRGRAGCDVRNSGTVTVSAELLLLSCWLEIMRAQSFQPFPLQSHNLQSPTIFCLPPAPSSLAPLPPSPTLICKCQKPISSIELPPNALRRKLDPQWRGGFSLGVDLG
TSRTGLALSKGFSTRPLTVLELRGQKLEVKLIEIAEQEVYISPSLTILRMKCYEHEADEFIIGLPKSCDGKETPQSNKIRSIAGRVAARAAERGWRVYLHDEHGTTAEAE
SHMISKQLVANCYASCEPLLNLLGLGWRMPHCLWCTLLINWSSPFPIIKRKGEFDKGKQNLIGGITTLLVVVVTVFYRSSKILHLFKSTMKLEPGLLSPSTTYFNHIMAV
KLRKSTIVGILGKYRFNILNDKELEEDIVDFYKQLYTKKALLLSKGASPPLFLHHHQRQVTRLEERS