; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg003960 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg003960
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationscaffold4:1194814..1204444
RNA-Seq ExpressionSpg003960
SyntenySpg003960
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]3.5e-1659.52Show/hide
Query:  ASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSAS---SESSTSSDSPTITP
        +S+N ++D++SP FLLSNICNLVSIRL+STDF+LW F+L++ LKAHKLFGF+DGS   P+QFL+S+S   S+ +T++  P I P
Subjt:  ASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSAS---SESSTSSDSPTITP

XP_022152751.1 uncharacterized protein LOC111020396 isoform X1 [Momordica charantia]1.0e-1557.65Show/hide
Query:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT
        MAS  +E D+NSP FLLSNICNL+SIRL+ST+F+LW F+L++ LKAHKLF F+DGS+  P +F+++++ ESS+S DS +   P T
Subjt:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT

XP_022152752.1 uncharacterized protein LOC111020396 isoform X2 [Momordica charantia]1.0e-1557.65Show/hide
Query:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT
        MAS  +E D+NSP FLLSNICNL+SIRL+ST+F+LW F+L++ LKAHKLF F+DGS+  P +F+++++ ESS+S DS +   P T
Subjt:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT

XP_022152753.1 uncharacterized protein LOC111020396 isoform X3 [Momordica charantia]1.0e-1557.65Show/hide
Query:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT
        MAS  +E D+NSP FLLSNICNL+SIRL+ST+F+LW F+L++ LKAHKLF F+DGS+  P +F+++++ ESS+S DS +   P T
Subjt:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT

XP_022152754.1 uncharacterized protein LOC111020396 isoform X4 [Momordica charantia]1.0e-1557.65Show/hide
Query:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT
        MAS  +E D+NSP FLLSNICNL+SIRL+ST+F+LW F+L++ LKAHKLF F+DGS+  P +F+++++ ESS+S DS +   P T
Subjt:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT

TrEMBL top hitse value%identityAlignment
A0A6J1D9L6 uncharacterized protein LOC1110188921.7e-1659.52Show/hide
Query:  ASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSAS---SESSTSSDSPTITP
        +S+N ++D++SP FLLSNICNLVSIRL+STDF+LW F+L++ LKAHKLFGF+DGS   P+QFL+S+S   S+ +T++  P I P
Subjt:  ASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSAS---SESSTSSDSPTITP

A0A6J1DFQ7 uncharacterized protein LOC111020396 isoform X35.0e-1657.65Show/hide
Query:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT
        MAS  +E D+NSP FLLSNICNL+SIRL+ST+F+LW F+L++ LKAHKLF F+DGS+  P +F+++++ ESS+S DS +   P T
Subjt:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT

A0A6J1DGZ1 uncharacterized protein LOC111020396 isoform X45.0e-1657.65Show/hide
Query:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT
        MAS  +E D+NSP FLLSNICNL+SIRL+ST+F+LW F+L++ LKAHKLF F+DGS+  P +F+++++ ESS+S DS +   P T
Subjt:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT

A0A6J1DH48 uncharacterized protein LOC111020396 isoform X25.0e-1657.65Show/hide
Query:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT
        MAS  +E D+NSP FLLSNICNL+SIRL+ST+F+LW F+L++ LKAHKLF F+DGS+  P +F+++++ ESS+S DS +   P T
Subjt:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT

A0A6J1DIP4 uncharacterized protein LOC111020396 isoform X15.0e-1657.65Show/hide
Query:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT
        MAS  +E D+NSP FLLSNICNL+SIRL+ST+F+LW F+L++ LKAHKLF F+DGS+  P +F+++++ ESS+S DS +   P T
Subjt:  MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTTCTAATCTCGAGGAGGATGTCAATTCACCAAACTTCTTGCTATCGAATATCTGTAATCTTGTATCGATCCGCCTTAATTCAACTGACTTTGTTCTCTGGAA
CTTTGAATTATCTTCCAATCTCAAGGCTCATAAACTATTTGGATTTGTTGATGGATCCAATCCTACACCTGCACAATTTCTTTCCTCCGCTTCGTCTGAGTCCTCAACTT
CTAGTGATTCTCCGACTATTACTCCACCTCAAACTCGGAACAATCCAAAGAGTATCCAGGCTCTGAGGGACCTTCTCATTTGCTTATCGAGAAACATATCTACAGTGAGG
GGAGTGCAACTACTAGGCTATAGTAGAGTGACCTGCTTCTCAAACCCAAAAGCTCAAAACTCTTCAAAGCTTGAAATCCAAGCTCATTTCCTTCCAAATTCCATCAGTAG
AGGATCCCACAATCCGTTCCAAGGCCTGAGAATAGCAATGAAGACATTTGTGGTGGTTCGTGATCAATTAGTGAGAAGAAACAACCAAGCAATCGGTCCAATTTTGAGCT
ACACAAAGGAGGCAACTGGGCTTTCTCCTTCTACTGTGTTTGTTAATTCTTTTCCTTTCTGTGGAAGTGATTTGCCTCATCCCACTATTTACTCATCTGGAGAGACAGAT
TCAACTTTTGCTTCTTTCTCTTATGTCTCTAGTAGTCGGGTCAAGAGGAGTAGTCAATTGTTTAGTGGAGGGTTAGTAGTGGGAGACTCTTTATTCTCAAGTAATCCTAC
ACAAGGTTCTTCATTTAATGGGGGTGTCATTGTTACTCCCCTAGAGGTGGTCAAAGAAGTGATTTTAATCTCTATCTCCTTTGATATTCCTGGCTTTGGGATGGGTTGGC
CTCGTTTGGTGAATGGGAAGTTCACTTGGGCTAATAATAGGGTTGCAGTCGTATTGGCAAATGTTTGTCTTTGTGAGATGCTTAAGAAGTTCTTCCTCATTCGTTGTTTC
GAGAGAAAGGGAAGTGTTTATGGTAGGCATGGATTTGTGTTGTCCTATGGGATTTGTGGGTTGGAGGTGGACTTAAGGGAAGAACGCTTAGCCCTTAAAATGGAGTTTAT
GGAGTTGGTTTCTTTCACAAAGTGGTGGGCGAATTTGGTGGGCTGTTGTGAGGTTGGTTCCTTTCCTTCATCACTCAGGCTTCCTCTTGGTCATAACCCAAGAACTATCA
TCTTTTGTGATCCTACCATAGAAAAGGGGATTGAGGAGGGGAGAGGTACACATGAGTTGGGAGACAGTGGGAAAGCCTATGAACCTTTGGGGTTAGAGTTCAGGAAGTTA
GGGACTCGTAACAGAGCCTTGCTGCCTTTGAAGGTTATTCATGAAAGGATTAATACTTTTGATAGGCTTACAAGAAAGATGCATTCATTGATTGGATCGTTTTGCTGCAT
CCTTTGTTGGAAGGCGAAGGAAGATCTCGATCACTTGCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTTCTAATCTCGAGGAGGATGTCAATTCACCAAACTTCTTGCTATCGAATATCTGTAATCTTGTATCGATCCGCCTTAATTCAACTGACTTTGTTCTCTGGAA
CTTTGAATTATCTTCCAATCTCAAGGCTCATAAACTATTTGGATTTGTTGATGGATCCAATCCTACACCTGCACAATTTCTTTCCTCCGCTTCGTCTGAGTCCTCAACTT
CTAGTGATTCTCCGACTATTACTCCACCTCAAACTCGGAACAATCCAAAGAGTATCCAGGCTCTGAGGGACCTTCTCATTTGCTTATCGAGAAACATATCTACAGTGAGG
GGAGTGCAACTACTAGGCTATAGTAGAGTGACCTGCTTCTCAAACCCAAAAGCTCAAAACTCTTCAAAGCTTGAAATCCAAGCTCATTTCCTTCCAAATTCCATCAGTAG
AGGATCCCACAATCCGTTCCAAGGCCTGAGAATAGCAATGAAGACATTTGTGGTGGTTCGTGATCAATTAGTGAGAAGAAACAACCAAGCAATCGGTCCAATTTTGAGCT
ACACAAAGGAGGCAACTGGGCTTTCTCCTTCTACTGTGTTTGTTAATTCTTTTCCTTTCTGTGGAAGTGATTTGCCTCATCCCACTATTTACTCATCTGGAGAGACAGAT
TCAACTTTTGCTTCTTTCTCTTATGTCTCTAGTAGTCGGGTCAAGAGGAGTAGTCAATTGTTTAGTGGAGGGTTAGTAGTGGGAGACTCTTTATTCTCAAGTAATCCTAC
ACAAGGTTCTTCATTTAATGGGGGTGTCATTGTTACTCCCCTAGAGGTGGTCAAAGAAGTGATTTTAATCTCTATCTCCTTTGATATTCCTGGCTTTGGGATGGGTTGGC
CTCGTTTGGTGAATGGGAAGTTCACTTGGGCTAATAATAGGGTTGCAGTCGTATTGGCAAATGTTTGTCTTTGTGAGATGCTTAAGAAGTTCTTCCTCATTCGTTGTTTC
GAGAGAAAGGGAAGTGTTTATGGTAGGCATGGATTTGTGTTGTCCTATGGGATTTGTGGGTTGGAGGTGGACTTAAGGGAAGAACGCTTAGCCCTTAAAATGGAGTTTAT
GGAGTTGGTTTCTTTCACAAAGTGGTGGGCGAATTTGGTGGGCTGTTGTGAGGTTGGTTCCTTTCCTTCATCACTCAGGCTTCCTCTTGGTCATAACCCAAGAACTATCA
TCTTTTGTGATCCTACCATAGAAAAGGGGATTGAGGAGGGGAGAGGTACACATGAGTTGGGAGACAGTGGGAAAGCCTATGAACCTTTGGGGTTAGAGTTCAGGAAGTTA
GGGACTCGTAACAGAGCCTTGCTGCCTTTGAAGGTTATTCATGAAAGGATTAATACTTTTGATAGGCTTACAAGAAAGATGCATTCATTGATTGGATCGTTTTGCTGCAT
CCTTTGTTGGAAGGCGAAGGAAGATCTCGATCACTTGCTTTAA
Protein sequenceShow/hide protein sequence
MASSNLEEDVNSPNFLLSNICNLVSIRLNSTDFVLWNFELSSNLKAHKLFGFVDGSNPTPAQFLSSASSESSTSSDSPTITPPQTRNNPKSIQALRDLLICLSRNISTVR
GVQLLGYSRVTCFSNPKAQNSSKLEIQAHFLPNSISRGSHNPFQGLRIAMKTFVVVRDQLVRRNNQAIGPILSYTKEATGLSPSTVFVNSFPFCGSDLPHPTIYSSGETD
STFASFSYVSSSRVKRSSQLFSGGLVVGDSLFSSNPTQGSSFNGGVIVTPLEVVKEVILISISFDIPGFGMGWPRLVNGKFTWANNRVAVVLANVCLCEMLKKFFLIRCF
ERKGSVYGRHGFVLSYGICGLEVDLREERLALKMEFMELVSFTKWWANLVGCCEVGSFPSSLRLPLGHNPRTIIFCDPTIEKGIEEGRGTHELGDSGKAYEPLGLEFRKL
GTRNRALLPLKVIHERINTFDRLTRKMHSLIGSFCCILCWKAKEDLDHLL