CuGenDBv2

Tan0007143 (gene) of Snake gourd v1 genome

Gene ID: Tan0007143
Organism: Trichosanthes anguina (Snake gourd v1)
Description: rRNA intron-encoded homing endonuclease
Genome location: LG09:36983273..36984013
RNA-Seq Expression: Tan0007143
Synteny: Tan0007143
Gene Ontology terms:
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0016020 - membrane (cellular component)
GO:0004519 - endonuclease activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAE8637441.1 hypothetical protein CSA_023571 [Cucumis sativus]; e value: 1.0e-31; % identity: 57.95
Query:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDV-GDSLPATGRRSRNKVSV
        MGGGAW FLVGGAICLVNSVNER+L+LLTSY  IIAIVGLQRGIPSKRESSARVD  +P + T R S     L+     G+V G      GRRSRNKVSV
Subjt:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDV-GDSLPATGRRSRNKVSV

Query:  GEPAERSLSMPK--HQTTHERVYKPFVSGGSTR------APLVPKP-----NLGIGRAKELKRIHPVPCPGLGVRG
        GEPAE SLSMPK  +        KP  SG   R      A L+P P     N G+GRAKEL+   P P  GLG  G
Subjt:  GEPAERSLSMPK--HQTTHERVYKPFVSGGSTR------APLVPKP-----NLGIGRAKELKRIHPVPCPGLGVRG

KAF1855785.1 hypothetical protein Lal_00045053 [Lupinus albus]; e value: 2.0e-27; % identity: 80.9
Query:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDVGDSLPAT
        MGGGAW FLVGGAICLVNSVNER+L+LLTSY  IIAIVGLQRGIPSKRESSARVD         RRSYRLNG VKCSD GDVG SLPAT
Subjt:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDVGDSLPAT

KAF1856291.1 hypothetical protein Lal_00046676 [Lupinus albus]; e value: 8.4e-42; % identity: 69.54
Query:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDVGDSLPAT-GRRSRNKVSV
        MGGGAW FLVGGAICLVNSVNER+L+LLTSY  IIAIVGLQRGIPSKRESSARVD    FVHTARRSYRLNG VKCSD GDVG SLPAT GRRSRNKVSV
Subjt:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDVGDSLPAT-GRRSRNKVSV

Query:  GEPAERSLSMPKHQTTHERVY------------KPFVSGGSTRAPLVPKPN
        GEPAE SLS P  Q     ++             P VSGG+T   +   PN
Subjt:  GEPAERSLSMPKHQTTHERVY------------KPFVSGGSTRAPLVPKPN

KAF1856293.1 hypothetical protein Lal_00046678 [Lupinus albus]; e value: 1.1e-30; % identity: 54.34
Query:  MGGGAWSFLVGGAICLVNSVNERELTLLTSYIIAIVGLQRGIPSKR------ESSARVD-------------------------TSLPFVHTARRSYRLN
        MGGGAW FLVGGAICLVNSVNER+L+LLTSY+     L+   P  R      ES A  D                         TSLPFVHTARRSYRLN
Subjt:  MGGGAWSFLVGGAICLVNSVNERELTLLTSYIIAIVGLQRGIPSKR------ESSARVD-------------------------TSLPFVHTARRSYRLN

Query:  GLVKCSDRGDVGDSLPAT-GRRSRNKVSVGEPAERSLSMPKHQTTHE-----RVYKPFVSGGSTRAPLVPKPN
        G VKCSD GDVG SLPAT GRRSRNKVSVGEPAE SLS P  Q             P VSGG+T   +   PN
Subjt:  GLVKCSDRGDVGDSLPAT-GRRSRNKVSVGEPAERSLSMPKHQTTHE-----RVYKPFVSGGSTRAPLVPKPN

KAF1891678.1 hypothetical protein Lal_00049250 [Lupinus albus]; e value: 5.6e-38; % identity: 69.44
Query:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDVGDSLPAT-GRRSRNKVSV
        MGGGAW FLVGGAICLVNSVNER+L+LLTSY  IIAIVGLQRGIPSKRESSARVD         RRSYRLNG VKCSD GDVG SLPAT GRRSRNKVSV
Subjt:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDVGDSLPAT-GRRSRNKVSV

Query:  GEPAERSLSMPKHQTTHE-----RVYKPFVSGGSTRAPLVPKPN
        GEPAE SLS P  Q             P VSGG+T   +   PN
Subjt:  GEPAERSLSMPKHQTTHE-----RVYKPFVSGGSTRAPLVPKPN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LPG1 Uncharacterized protein; e value: 1.3e-32; % identity: 59.09
Query:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDV-GDSLPATGRRSRNKVSV
        MGGGAW FLVGGAICLVNSVNER+L+LLTSY  IIAIVGLQRGIPSKRESSARVD  +P + T R S     L+     G+V G      GRRSRNKVSV
Subjt:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDV-GDSLPATGRRSRNKVSV

Query:  GEPAERSLSMPK--HQTTHERVYKPFVSGGSTR------APLVPKP-----NLGIGRAKELKRIHPVPCPGLGVRG
        GEPAE SLSMPK  +        KP  SG   R      A L+P P     N G+GRAKEL+   P P P LGVRG
Subjt:  GEPAERSLSMPK--HQTTHERVYKPFVSGGSTR------APLVPKP-----NLGIGRAKELKRIHPVPCPGLGVRG

A0A6A5L557 Uncharacterized protein; e value: 4.1e-42; % identity: 69.54
Query:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDVGDSLPAT-GRRSRNKVSV
        MGGGAW FLVGGAICLVNSVNER+L+LLTSY  IIAIVGLQRGIPSKRESSARVD    FVHTARRSYRLNG VKCSD GDVG SLPAT GRRSRNKVSV
Subjt:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDVGDSLPAT-GRRSRNKVSV

Query:  GEPAERSLSMPKHQTTHERVY------------KPFVSGGSTRAPLVPKPN
        GEPAE SLS P  Q     ++             P VSGG+T   +   PN
Subjt:  GEPAERSLSMPKHQTTHERVY------------KPFVSGGSTRAPLVPKPN

A0A6A5L8M3 Uncharacterized protein; e value: 9.7e-28; % identity: 80.9
Query:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDVGDSLPAT
        MGGGAW FLVGGAICLVNSVNER+L+LLTSY  IIAIVGLQRGIPSKRESSARVD         RRSYRLNG VKCSD GDVG SLPAT
Subjt:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDVGDSLPAT

A0A6A5L9L7 Uncharacterized protein; e value: 5.5e-31; % identity: 54.34
Query:  MGGGAWSFLVGGAICLVNSVNERELTLLTSYIIAIVGLQRGIPSKR------ESSARVD-------------------------TSLPFVHTARRSYRLN
        MGGGAW FLVGGAICLVNSVNER+L+LLTSY+     L+   P  R      ES A  D                         TSLPFVHTARRSYRLN
Subjt:  MGGGAWSFLVGGAICLVNSVNERELTLLTSYIIAIVGLQRGIPSKR------ESSARVD-------------------------TSLPFVHTARRSYRLN

Query:  GLVKCSDRGDVGDSLPAT-GRRSRNKVSVGEPAERSLSMPKHQTTHE-----RVYKPFVSGGSTRAPLVPKPN
        G VKCSD GDVG SLPAT GRRSRNKVSVGEPAE SLS P  Q             P VSGG+T   +   PN
Subjt:  GLVKCSDRGDVGDSLPAT-GRRSRNKVSVGEPAERSLSMPKHQTTHE-----RVYKPFVSGGSTRAPLVPKPN

A0A6A5P378 Uncharacterized protein; e value: 2.7e-38; % identity: 69.44
Query:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDVGDSLPAT-GRRSRNKVSV
        MGGGAW FLVGGAICLVNSVNER+L+LLTSY  IIAIVGLQRGIPSKRESSARVD         RRSYRLNG VKCSD GDVG SLPAT GRRSRNKVSV
Subjt:  MGGGAWSFLVGGAICLVNSVNERELTLLTSY--IIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDVGDSLPAT-GRRSRNKVSV

Query:  GEPAERSLSMPKHQTTHE-----RVYKPFVSGGSTRAPLVPKPN
        GEPAE SLS P  Q             P VSGG+T   +   PN
Subjt:  GEPAERSLSMPKHQTTHE-----RVYKPFVSGGSTRAPLVPKPN

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGGTGGTGGTGCATGGTCGTTCTTAGTTGGTGGAGCGATTTGTCTGGTTAATTCCGTTAACGAACGAGAACTCACCCTGCTAACTAGCTATATCATTGCAATTGTTGG
TCTTCAACGAGGAATTCCTAGTAAGCGCGAGTCATCAGCTCGCGTTGACACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTCCTACCGATTGAATGGTCTGGTGAAGT
GTTCGGATCGCGGCGACGTGGGCGATTCGCTGCCTGCGACAGGAAGGAGAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAAGATCATTGTCAATGCCTAAACAT
CAAACGACCCACGAACGCGTTTACAAACCTTTTGTGTCGGGGGGGAGCACTCGTGCCCCCCTGGTGCCTAAACCAAACCTCGGCATAGGTCGTGCCAAGGAACTCAAACG
AATTCACCCCGTCCCTTGCCCCGGTCTCGGCGTGCGGGGTGTCATGGTCGCACCTCTTAGAGGGTCTTGA
mRNA sequence
ATGGGTGGTGGTGCATGGTCGTTCTTAGTTGGTGGAGCGATTTGTCTGGTTAATTCCGTTAACGAACGAGAACTCACCCTGCTAACTAGCTATATCATTGCAATTGTTGG
TCTTCAACGAGGAATTCCTAGTAAGCGCGAGTCATCAGCTCGCGTTGACACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTCCTACCGATTGAATGGTCTGGTGAAGT
GTTCGGATCGCGGCGACGTGGGCGATTCGCTGCCTGCGACAGGAAGGAGAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAAGATCATTGTCAATGCCTAAACAT
CAAACGACCCACGAACGCGTTTACAAACCTTTTGTGTCGGGGGGGAGCACTCGTGCCCCCCTGGTGCCTAAACCAAACCTCGGCATAGGTCGTGCCAAGGAACTCAAACG
AATTCACCCCGTCCCTTGCCCCGGTCTCGGCGTGCGGGGTGTCATGGTCGCACCTCTTAGAGGGTCTTGA
Protein sequence
MGGGAWSFLVGGAICLVNSVNERELTLLTSYIIAIVGLQRGIPSKRESSARVDTSLPFVHTARRSYRLNGLVKCSDRGDVGDSLPATGRRSRNKVSVGEPAERSLSMPKH
QTTHERVYKPFVSGGSTRAPLVPKPNLGIGRAKELKRIHPVPCPGLGVRGVMVAPLRGS
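
The CDS and protein fields above are related by translation. The following is a minimal Python sketch of that relationship, assuming the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly. The function name `translate` and the table-building approach are illustrative choices, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# over the bases T, C, A, G. Assumption: the record uses this table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted in.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 48 nucleotides of the CDS sequence listed in this record.
cds_prefix = "ATGGGTGGTGGTGCATGGTCGTTCTTAGTTGGTGGAGCGATTTGTCTG"
print(translate(cds_prefix))  # MGGGAWSFLVGGAICL
```

Passing the full CDS (with its line breaks) to `translate` should give a sequence to compare against the Protein sequence field. The sketch raises `KeyError` on ambiguous bases such as `N`, which do not occur in this CDS.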