; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005856 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005856
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr6:32182349..32190904
RNA-Seq ExpressionLag0005856
SyntenyLag0005856
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037539.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]5.3e-1491.3Show/hide
Query:  MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASI
        ML RYSMQNSK GLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYAS+
Subjt:  MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASI

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa]9.0e-1470.31Show/hide
Query:  VKSKKQMMIAL--------MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS
        ++ +K   +AL        +L RYSMQNSK+GLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS
Subjt:  VKSKKQMMIAL--------MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS

KAA0043583.1 putative Integrase core domain [Cucumis melo var. makuwa]6.9e-1469.23Show/hide
Query:  VKSKKQMMIAL--------MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASI
        ++ +K   +AL        ML RYSMQNSK+GLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYAS+
Subjt:  VKSKKQMMIAL--------MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASI

KAA0064276.1 gag/pol protein [Cucumis melo var. makuwa]6.9e-1493.33Show/hide
Query:  MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS
        ML RYSMQNSK+GLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYAS
Subjt:  MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS

TYK11959.1 gag/pol protein [Cucumis melo var. makuwa]6.9e-1493.33Show/hide
Query:  MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS
        ML RYSMQNSK+GLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYAS
Subjt:  MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS

TrEMBL top hitse value%identityAlignment
A0A5A7T8C2 Putative gag-pol polyprotein2.6e-1491.3Show/hide
Query:  MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASI
        ML RYSMQNSK GLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYAS+
Subjt:  MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASI

A0A5A7TJH9 Putative Integrase core domain3.3e-1469.23Show/hide
Query:  VKSKKQMMIAL--------MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASI
        ++ +K   +AL        ML RYSMQNSK+GLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYAS+
Subjt:  VKSKKQMMIAL--------MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASI

A0A5A7TZD0 Gag/pol protein4.4e-1470.31Show/hide
Query:  VKSKKQMMIAL--------MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS
        ++ +K   +AL        +L RYSMQNSK+GLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS
Subjt:  VKSKKQMMIAL--------MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS

A0A5A7VD16 Gag/pol protein3.3e-1493.33Show/hide
Query:  MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS
        ML RYSMQNSK+GLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYAS
Subjt:  MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS

A0A5D3CNV3 Gag/pol protein3.3e-1493.33Show/hide
Query:  MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS
        ML RYSMQNSK+GLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYAS
Subjt:  MLARYSMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS

SwissProt top hitse value%identityAlignment
B9DHG0 Tetratricopeptide repeat domain-containing protein PYG7, chloroplastic1.1e-1148Show/hide
Query:  DEAKSAVYNALGVSYVRVDKLDKRITQFETAVKL---------HLGSFY--------------------LNNKLVGPRRDALKERMEMYKG-VPVKSKKQ
        D+  + VYNALGVSYVR DKLDK I QFE AVKL         +LG  Y                     NNK+  PRRDALK+R+++YKG V VKSKK+
Subjt:  DEAKSAVYNALGVSYVRVDKLDKRITQFETAVKL---------HLGSFY--------------------LNNKLVGPRRDALKERMEMYKG-VPVKSKKQ

C0PEY7 Tetratricopeptide repeat domain-containing protein PYG7, chloroplastic9.1e-0944.09Show/hide
Query:  VYNALGVSYVRVDKLDKRITQFETAVKL---------HLGSFY--------------------LNNKLVGPRRDALKERMEMYKGVPVKSKKQ
        VYNALGVSY R +KLDK I QF+ AV+L         +LG  Y                     NNK+  PR D L+ R+ MYKGVPVKS+K+
Subjt:  VYNALGVSYVRVDKLDKRITQFETAVKL---------HLGSFY--------------------LNNKLVGPRRDALKERMEMYKGVPVKSKKQ

C0Z274 Vacuolar fusion protein CCZ1 homolog B1.2e-0553.57Show/hide
Query:  ASLKIMKVEENLSKGCGGKNAYHVGGHHYLLLGGLNFLLSVEHSPSIPMGSKVSTI
        AS KI+KVEE LSKG GG+NAYHV G+ YLL+        +E S + P G KV+T+
Subjt:  ASLKIMKVEENLSKGCGGKNAYHVGGHHYLLLGGLNFLLSVEHSPSIPMGSKVSTI

F4I2S4 Vacuolar fusion protein CCZ1 homolog A1.6e-0540.54Show/hide
Query:  IGRDAAIEVLVYPCIELQASLKIMKVEENLSKGCGGKNAYHVGGHHYLLLGGLNFLLSVEHSPSIPMGSKVSTI
        +  D +I  +    IE  ASL+I+K+EEN+S+G GG+NAYH+ G+ YL++         + S S P G KV+T+
Subjt:  IGRDAAIEVLVYPCIELQASLKIMKVEENLSKGCGGKNAYHVGGHHYLLLGGLNFLLSVEHSPSIPMGSKVSTI

Arabidopsis top hitse value%identityAlignment
AT1G16020.1 Protein of unknown function (DUF1712)1.1e-0640.54Show/hide
Query:  IGRDAAIEVLVYPCIELQASLKIMKVEENLSKGCGGKNAYHVGGHHYLLLGGLNFLLSVEHSPSIPMGSKVSTI
        +  D +I  +    IE  ASL+I+K+EEN+S+G GG+NAYH+ G+ YL++         + S S P G KV+T+
Subjt:  IGRDAAIEVLVYPCIELQASLKIMKVEENLSKGCGGKNAYHVGGHHYLLLGGLNFLLSVEHSPSIPMGSKVSTI

AT1G22700.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.1e-1348Show/hide
Query:  DEAKSAVYNALGVSYVRVDKLDKRITQFETAVKL---------HLGSFY--------------------LNNKLVGPRRDALKERMEMYKG-VPVKSKKQ
        D+  + VYNALGVSYVR DKLDK I QFE AVKL         +LG  Y                     NNK+  PRRDALK+R+++YKG V VKSKK+
Subjt:  DEAKSAVYNALGVSYVRVDKLDKRITQFETAVKL---------HLGSFY--------------------LNNKLVGPRRDALKERMEMYKG-VPVKSKKQ

AT1G22700.2 Tetratricopeptide repeat (TPR)-like superfamily protein8.1e-1348Show/hide
Query:  DEAKSAVYNALGVSYVRVDKLDKRITQFETAVKL---------HLGSFY--------------------LNNKLVGPRRDALKERMEMYKG-VPVKSKKQ
        D+  + VYNALGVSYVR DKLDK I QFE AVKL         +LG  Y                     NNK+  PRRDALK+R+++YKG V VKSKK+
Subjt:  DEAKSAVYNALGVSYVRVDKLDKRITQFETAVKL---------HLGSFY--------------------LNNKLVGPRRDALKERMEMYKG-VPVKSKKQ

AT1G22700.3 Tetratricopeptide repeat (TPR)-like superfamily protein8.1e-1348Show/hide
Query:  DEAKSAVYNALGVSYVRVDKLDKRITQFETAVKL---------HLGSFY--------------------LNNKLVGPRRDALKERMEMYKG-VPVKSKKQ
        D+  + VYNALGVSYVR DKLDK I QFE AVKL         +LG  Y                     NNK+  PRRDALK+R+++YKG V VKSKK+
Subjt:  DEAKSAVYNALGVSYVRVDKLDKRITQFETAVKL---------HLGSFY--------------------LNNKLVGPRRDALKERMEMYKG-VPVKSKKQ

AT1G80910.1 Protein of unknown function (DUF1712)8.7e-0753.57Show/hide
Query:  ASLKIMKVEENLSKGCGGKNAYHVGGHHYLLLGGLNFLLSVEHSPSIPMGSKVSTI
        AS KI+KVEE LSKG GG+NAYHV G+ YLL+        +E S + P G KV+T+
Subjt:  ASLKIMKVEENLSKGCGGKNAYHVGGHHYLLLGGLNFLLSVEHSPSIPMGSKVSTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAGTTGTAACTCTCTCTCCACAGGAGCCAACTCAACAATCCAACTCAAAGGTGAGCAGTCCTACTAATTTTTTCAAGGAATGTGGAAGTGTACAGATGTTGACCAT
AGGGCGAGATGCTGCCATTGAAGTTCTGGTTTATCCTTGCATAGAGCTTCAAGCTTCCCTAAAGATCATGAAAGTTGAAGAAAATTTGTCGAAAGGTTGTGGTGGAAAAA
ATGCTTATCATGTTGGTGGCCATCATTATTTACTGCTTGGTGGGTTAAATTTCTTGTTATCAGTGGAGCATTCTCCATCTATTCCTATGGGTTCAAAGGTTTCCACAATT
ATCCTAGGATTAGAAATCAGAAAGTGGCTTGTAACACAAATTACTTTCACCCACCAGGTGAAGCCTAAGCCTGGAGTTCTAAGATTGATGGATGAGGCAAAGTCTGCTGT
TTACAATGCGCTTGGAGTTAGTTATGTGCGTGTCGATAAGCTTGACAAGAGAATTACCCAGTTTGAAACTGCCGTGAAGCTGCACCTTGGTTCTTTTTATCTGAACAACA
AGCTTGTTGGGCCGAGAAGAGATGCATTGAAGGAGCGAATGGAGATGTACAAAGGAGTTCCTGTGAAATCTAAAAAACAGATGATGATAGCCCTGATGTTGGCTCGATAT
TCGATGCAGAACTCCAAGAGGGGCTTGTTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGAACAGTCTCCTAAGACACCTCAAGAAGTTGAGGACATGAGACGGATTCC
CTACGCCTCTATAAAAAATGTTATTAAAGACCAATCATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCAGTTGTAACTCTCTCTCCACAGGAGCCAACTCAACAATCCAACTCAAAGGTGAGCAGTCCTACTAATTTTTTCAAGGAATGTGGAAGTGTACAGATGTTGACCAT
AGGGCGAGATGCTGCCATTGAAGTTCTGGTTTATCCTTGCATAGAGCTTCAAGCTTCCCTAAAGATCATGAAAGTTGAAGAAAATTTGTCGAAAGGTTGTGGTGGAAAAA
ATGCTTATCATGTTGGTGGCCATCATTATTTACTGCTTGGTGGGTTAAATTTCTTGTTATCAGTGGAGCATTCTCCATCTATTCCTATGGGTTCAAAGGTTTCCACAATT
ATCCTAGGATTAGAAATCAGAAAGTGGCTTGTAACACAAATTACTTTCACCCACCAGGTGAAGCCTAAGCCTGGAGTTCTAAGATTGATGGATGAGGCAAAGTCTGCTGT
TTACAATGCGCTTGGAGTTAGTTATGTGCGTGTCGATAAGCTTGACAAGAGAATTACCCAGTTTGAAACTGCCGTGAAGCTGCACCTTGGTTCTTTTTATCTGAACAACA
AGCTTGTTGGGCCGAGAAGAGATGCATTGAAGGAGCGAATGGAGATGTACAAAGGAGTTCCTGTGAAATCTAAAAAACAGATGATGATAGCCCTGATGTTGGCTCGATAT
TCGATGCAGAACTCCAAGAGGGGCTTGTTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGAACAGTCTCCTAAGACACCTCAAGAAGTTGAGGACATGAGACGGATTCC
CTACGCCTCTATAAAAAATGTTATTAAAGACCAATCATAG
Protein sequenceShow/hide protein sequence
MPVVTLSPQEPTQQSNSKVSSPTNFFKECGSVQMLTIGRDAAIEVLVYPCIELQASLKIMKVEENLSKGCGGKNAYHVGGHHYLLLGGLNFLLSVEHSPSIPMGSKVSTI
ILGLEIRKWLVTQITFTHQVKPKPGVLRLMDEAKSAVYNALGVSYVRVDKLDKRITQFETAVKLHLGSFYLNNKLVGPRRDALKERMEMYKGVPVKSKKQMMIALMLARY
SMQNSKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASIKNVIKDQS