; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012442 (gene) of Snake gourd v1 genome

Gene IDTan0012442
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF4218 domain-containing protein
Genome locationLG07:56578603..56589258
RNA-Seq ExpressionTan0012442
SyntenyTan0012442
Gene Ontology termsNA
InterPro domainsIPR004242 - Transposon, En/Spm-like
IPR013904 - Transcriptional regulatory protein RXT2, N-terminal
IPR016024 - Armadillo-type fold
IPR029480 - Transposase-associated domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW51304.1 hypothetical protein CK203_075478 [Vitis vinifera]2.6e-5153.33Show/hide
Query:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ
        +LWTINDFPAYGNL+GWSTKGYKACPV  +                  F PL                                        GK  N + 
Subjt:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ

Query:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK
         KRKR+  ELNW K+SIFFEL YWS LK++H +DVMH+EKN+CD+V+GTLLNI GKTKDTNKAR DL  +NIRKELHLQ  GNKLVKPHA YTLTV+ERK
Subjt:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK

Query:  KFCSFLKTIK
        +FC FLK++K
Subjt:  KFCSFLKTIK

RVW74856.1 hypothetical protein CK203_053854 [Vitis vinifera]2.6e-5153.33Show/hide
Query:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ
        +LWTINDFPAYGNL+GWSTKGYKACPV  +                  F PL                                        GK  N + 
Subjt:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ

Query:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK
         KRKR+  ELNW K+SIFFEL YWS LK++H +DVMH+EKN+CD+V+GTLLNI GKTKDTNKAR DL  +NIRKELHLQ  GNKLVKPHA YTLTV+ERK
Subjt:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK

Query:  KFCSFLKTIK
        +FC FLK++K
Subjt:  KFCSFLKTIK

RVX01357.1 hypothetical protein CK203_031234 [Vitis vinifera]9.6e-5460.44Show/hide
Query:  LLWTINDFPAYGNLSGWSTKGYKACPV-SIDFPPLGKTVNILQW-----------------------------KRKRLHNELNWVKRSIFFELPYWSSLK
        +LWTINDFPAYGNL+GWSTKGYKACPV + D   LG  V  + W                             KRKR+  ELNW K+SIFFEL YWS LK
Subjt:  LLWTINDFPAYGNLSGWSTKGYKACPV-SIDFPPLGKTVNILQW-----------------------------KRKRLHNELNWVKRSIFFELPYWSSLK

Query:  IQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERKKFCSFLKTIK
        ++H +DVMH+EKN+CD+V+GTLLNI GKTKDTNKAR DL  +NIRKELHLQ  GNKLVKPHA YTLTV+ RK+FC FLK++K
Subjt:  IQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERKKFCSFLKTIK

RVX21623.1 hypothetical protein CK203_002253 [Vitis vinifera]2.6e-5153.33Show/hide
Query:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ
        +LWTINDFPAYGNL+GWSTKGYKACPV  +                  F PL                                        GK  N + 
Subjt:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ

Query:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK
         KRKR+  ELNW K+SIFFEL YWS LK++H +DVMH+EKN+CD+V+GTLLNI GKTKDTNKAR DL  +NIRKELHLQ  GNKLVKPHA YTLTV+ERK
Subjt:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK

Query:  KFCSFLKTIK
        +FC FLK++K
Subjt:  KFCSFLKTIK

XP_038895319.1 uncharacterized protein LOC120083572 isoform X1 [Benincasa hispida]8.1e-5346.31Show/hide
Query:  QVRGASRGLGLARIIEATGDRVRVSWSLEQGKPVGSVASLFNSEIGILTRAFVPLKYATKYDIPNEVFTNIIERLLNKFDVDISEDHIKNYIVYEIGTRY
        +VRGASRG+ L +   AT  R++V+W+  QGKP+G +ASLFN EIG+L R F+PLKY  + DIPNE++  + E+LLN+FDVDIS+ HIK YI YEIG R+
Subjt:  QVRGASRGLGLARIIEATGDRVRVSWSLEQGKPVGSVASLFNSEIGILTRAFVPLKYATKYDIPNEVFTNIIERLLNKFDVDISEDHIKNYIVYEIGTRY

Query:  KDYRSRLYQYYRKLGDPRKARERPHKDVAPEDWIMLCDRWETTEWKK-------------------------------REDGSYLSPIEIFHQTHWSVAK
        KDYR  LY++Y+K  DP +AR  P+K    +DW +LCDRWE++ WK+                               +EDG+Y+S IE+F++TH S +K
Subjt:  KDYRSRLYQYYRKLGDPRKARERPHKDVAPEDWIMLCDRWETTEWKK-------------------------------REDGSYLSPIEIFHQTHWSVAK

Query:  GWTDTAASEAHAKMVALAEEQATSSTPMTDDEIVASVLGTRPSY
        GW D AA EA+  M+ L + +       TD+EI+  VLG R SY
Subjt:  GWTDTAASEAHAKMVALAEEQATSSTPMTDDEIVASVLGTRPSY

TrEMBL top hitse value%identityAlignment
A0A438D9E3 Uncharacterized protein1.3e-5153.33Show/hide
Query:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ
        +LWTINDFPAYGNL+GWSTKGYKACPV  +                  F PL                                        GK  N + 
Subjt:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ

Query:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK
         KRKR+  ELNW K+SIFFEL YWS LK++H +DVMH+EKN+CD+V+GTLLNI GKTKDTNKAR DL  +NIRKELHLQ  GNKLVKPHA YTLTV+ERK
Subjt:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK

Query:  KFCSFLKTIK
        +FC FLK++K
Subjt:  KFCSFLKTIK

A0A438EUD1 Uncharacterized protein1.3e-5153.33Show/hide
Query:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ
        +LWTINDFPAYGNL+GWSTKGYKACPV  +                  F PL                                        GK  N + 
Subjt:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ

Query:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK
         KRKR+  ELNW K+SIFFEL YWS LK++H +DVMH+EKN+CD+V+GTLLNI GKTKDTNKAR DL  +NIRKELHLQ  GNKLVKPHA YTLTV+ERK
Subjt:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK

Query:  KFCSFLKTIK
        +FC FLK++K
Subjt:  KFCSFLKTIK

A0A438H145 Uncharacterized protein1.3e-5153.33Show/hide
Query:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ
        +LWTINDFPAYGNL+GWSTKGYKACPV  +                  F PL                                        GK  N + 
Subjt:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ

Query:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK
         KRKR+  ELNW K+SIFFEL YWS LK++H +DVMH+EKN+CD+V+GTLLNI GKTKDTNKAR DL  +NIRKELHLQ  GNKLVKPHA YTLTV+ERK
Subjt:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK

Query:  KFCSFLKTIK
        +FC FLK++K
Subjt:  KFCSFLKTIK

A0A438IXD8 DUF4218 domain-containing protein4.6e-5460.44Show/hide
Query:  LLWTINDFPAYGNLSGWSTKGYKACPV-SIDFPPLGKTVNILQW-----------------------------KRKRLHNELNWVKRSIFFELPYWSSLK
        +LWTINDFPAYGNL+GWSTKGYKACPV + D   LG  V  + W                             KRKR+  ELNW K+SIFFEL YWS LK
Subjt:  LLWTINDFPAYGNLSGWSTKGYKACPV-SIDFPPLGKTVNILQW-----------------------------KRKRLHNELNWVKRSIFFELPYWSSLK

Query:  IQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERKKFCSFLKTIK
        ++H +DVMH+EKN+CD+V+GTLLNI GKTKDTNKAR DL  +NIRKELHLQ  GNKLVKPHA YTLTV+ RK+FC FLK++K
Subjt:  IQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERKKFCSFLKTIK

A0A438KKA2 Uncharacterized protein1.3e-5153.33Show/hide
Query:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ
        +LWTINDFPAYGNL+GWSTKGYKACPV  +                  F PL                                        GK  N + 
Subjt:  LLWTINDFPAYGNLSGWSTKGYKACPVSID------------------FPPL----------------------------------------GKTVNILQ

Query:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK
         KRKR+  ELNW K+SIFFEL YWS LK++H +DVMH+EKN+CD+V+GTLLNI GKTKDTNKAR DL  +NIRKELHLQ  GNKLVKPHA YTLTV+ERK
Subjt:  WKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRVGNKLVKPHASYTLTVDERK

Query:  KFCSFLKTIK
        +FC FLK++K
Subjt:  KFCSFLKTIK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTGTCGATGTCCGACCCTGCGACAAACGTCTGTTTTCGTCGCATCCCCATGCGACGGAAGAGAACTTTTGTCGCAGTCTTTGCGACGCAACTCTCGTTTCGTCGC
AAGGGGTCATGCGACGAATGACATAATTTCGTCGCATAGCCTCGAGGATAGAACATTGTGCAAGGAATATATTGATGGAATAAAATCATTTATTACTGTGGCAAAAGATC
ATGTTGATAAGAGAGGGTTTACACGTTGTCCATGTAAAAAGTGTCAGAATATCCTAATGAAACTACCATCCCTGTTGTGGACAATTAATGACTTTCCAGCCTATGGTAAT
TTGTCAGGATGGAGCACTAAAGGATATAAGGCATGTCCGGTTTCTATCGATTTTCCACCACTTGGAAAAACTGTAAATATATTGCAATGGAAAAGGAAGCGACTCCATAA
TGAATTAAATTGGGTGAAAAGAAGTATTTTTTTTGAGTTACCATATTGGTCAAGTCTTAAGATCCAACACAAGCTCGATGTCATGCATATTGAGAAAAATATTTGTGATA
ATGTCTTGGGTACATTATTAAATATTGAGGGAAAAACAAAAGACACTAACAAAGCGCGTAAGGATTTAATGACTTTGAATATTCGTAAAGAACTACATCTTCAACGTGTT
GGAAACAAATTGGTGAAGCCACATGCAAGTTACACATTAACAGTTGATGAAAGGAAAAAGTTTTGTAGTTTTCTTAAAACAATCAAAGTTGATGCGAGTCCAACAATGGT
TTGTGAAAGCAATGAATTGAGTGAGAAGGGGATAAATTTTGTAGAACTAGTAGAGTGTGAGGAAGACGATGAAGAAGATGAGGAAGATGAGGAGGATGATGAGGAAGATG
ATGAGGATGATGAGGAAGATGATGATGATGAGGAAGATGATGAAGATCGGCATCCTTTTAATAGTAGTCAGGTCCGTGGTGCTTCACGTGGATTGGGTCTGGCGAGAATT
ATTGAGGCCACTGGGGATAGAGTGCGCGTTTCATGGAGTTTAGAACAAGGCAAACCGGTTGGAAGTGTTGCTAGCCTCTTTAATAGTGAAATTGGAATTCTGACGAGGGC
GTTTGTCCCACTGAAGTATGCAACTAAGTATGACATTCCAAATGAAGTTTTCACAAACATAATTGAGCGGTTGCTGAATAAATTCGATGTAGACATCTCGGAGGACCATA
TTAAGAATTATATTGTTTACGAGATTGGAACTCGATATAAGGACTACAGATCGAGGTTATATCAGTACTACAGAAAATTGGGTGACCCAAGGAAAGCTCGTGAACGTCCA
CACAAGGATGTAGCACCTGAAGATTGGATTATGTTATGTGATAGATGGGAGACTACTGAATGGAAGAAAAGGGAAGATGGTTCTTATTTGAGTCCCATTGAGATCTTCCA
CCAAACTCATTGGTCTGTTGCAAAGGGATGGACGGACACTGCAGCAAGTGAAGCACATGCAAAGATGGTAGCACTTGCAGAAGAGCAAGCCACGTCGAGTACACCAATGA
CTGACGACGAAATTGTGGCTAGCGTTCTTGGAACACGACCATCCTATGCTCGTCTTACACAAACACAAGAAATGTTAGAGGCTCAACGCCAAGAAAATGAAAAATTGGAG
CAGCGAATGGAGCAAAGAATAGAACAACGAATTGAAGAACGAATGGAGATGCAACGCAGTGACGACGAACATACATCTATCGTCGCAACATGCGACGAAATTTCTGTTTT
TGTCGCAAATCGCTTTCGAGATCGGCTTCACGACGAAACTTGCGACGTCAAATATTGCGTCCCTAATGGTCTGTTGCGCCAAATTATGCGGTTTAGCGACGTAATTATTT
CACAACATCCAACAAACAAGTATATACATCAAAAAACACATTCATTCATAGACAACATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACTGTCGATGTCCGACCCTGCGACAAACGTCTGTTTTCGTCGCATCCCCATGCGACGGAAGAGAACTTTTGTCGCAGTCTTTGCGACGCAACTCTCGTTTCGTCGC
AAGGGGTCATGCGACGAATGACATAATTTCGTCGCATAGCCTCGAGGATAGAACATTGTGCAAGGAATATATTGATGGAATAAAATCATTTATTACTGTGGCAAAAGATC
ATGTTGATAAGAGAGGGTTTACACGTTGTCCATGTAAAAAGTGTCAGAATATCCTAATGAAACTACCATCCCTGTTGTGGACAATTAATGACTTTCCAGCCTATGGTAAT
TTGTCAGGATGGAGCACTAAAGGATATAAGGCATGTCCGGTTTCTATCGATTTTCCACCACTTGGAAAAACTGTAAATATATTGCAATGGAAAAGGAAGCGACTCCATAA
TGAATTAAATTGGGTGAAAAGAAGTATTTTTTTTGAGTTACCATATTGGTCAAGTCTTAAGATCCAACACAAGCTCGATGTCATGCATATTGAGAAAAATATTTGTGATA
ATGTCTTGGGTACATTATTAAATATTGAGGGAAAAACAAAAGACACTAACAAAGCGCGTAAGGATTTAATGACTTTGAATATTCGTAAAGAACTACATCTTCAACGTGTT
GGAAACAAATTGGTGAAGCCACATGCAAGTTACACATTAACAGTTGATGAAAGGAAAAAGTTTTGTAGTTTTCTTAAAACAATCAAAGTTGATGCGAGTCCAACAATGGT
TTGTGAAAGCAATGAATTGAGTGAGAAGGGGATAAATTTTGTAGAACTAGTAGAGTGTGAGGAAGACGATGAAGAAGATGAGGAAGATGAGGAGGATGATGAGGAAGATG
ATGAGGATGATGAGGAAGATGATGATGATGAGGAAGATGATGAAGATCGGCATCCTTTTAATAGTAGTCAGGTCCGTGGTGCTTCACGTGGATTGGGTCTGGCGAGAATT
ATTGAGGCCACTGGGGATAGAGTGCGCGTTTCATGGAGTTTAGAACAAGGCAAACCGGTTGGAAGTGTTGCTAGCCTCTTTAATAGTGAAATTGGAATTCTGACGAGGGC
GTTTGTCCCACTGAAGTATGCAACTAAGTATGACATTCCAAATGAAGTTTTCACAAACATAATTGAGCGGTTGCTGAATAAATTCGATGTAGACATCTCGGAGGACCATA
TTAAGAATTATATTGTTTACGAGATTGGAACTCGATATAAGGACTACAGATCGAGGTTATATCAGTACTACAGAAAATTGGGTGACCCAAGGAAAGCTCGTGAACGTCCA
CACAAGGATGTAGCACCTGAAGATTGGATTATGTTATGTGATAGATGGGAGACTACTGAATGGAAGAAAAGGGAAGATGGTTCTTATTTGAGTCCCATTGAGATCTTCCA
CCAAACTCATTGGTCTGTTGCAAAGGGATGGACGGACACTGCAGCAAGTGAAGCACATGCAAAGATGGTAGCACTTGCAGAAGAGCAAGCCACGTCGAGTACACCAATGA
CTGACGACGAAATTGTGGCTAGCGTTCTTGGAACACGACCATCCTATGCTCGTCTTACACAAACACAAGAAATGTTAGAGGCTCAACGCCAAGAAAATGAAAAATTGGAG
CAGCGAATGGAGCAAAGAATAGAACAACGAATTGAAGAACGAATGGAGATGCAACGCAGTGACGACGAACATACATCTATCGTCGCAACATGCGACGAAATTTCTGTTTT
TGTCGCAAATCGCTTTCGAGATCGGCTTCACGACGAAACTTGCGACGTCAAATATTGCGTCCCTAATGGTCTGTTGCGCCAAATTATGCGGTTTAGCGACGTAATTATTT
CACAACATCCAACAAACAAGTATATACATCAAAAAACACATTCATTCATAGACAACATCTAA
Protein sequenceShow/hide protein sequence
MDCRCPTLRQTSVFVASPCDGRELLSQSLRRNSRFVARGHATNDIISSHSLEDRTLCKEYIDGIKSFITVAKDHVDKRGFTRCPCKKCQNILMKLPSLLWTINDFPAYGN
LSGWSTKGYKACPVSIDFPPLGKTVNILQWKRKRLHNELNWVKRSIFFELPYWSSLKIQHKLDVMHIEKNICDNVLGTLLNIEGKTKDTNKARKDLMTLNIRKELHLQRV
GNKLVKPHASYTLTVDERKKFCSFLKTIKVDASPTMVCESNELSEKGINFVELVECEEDDEEDEEDEEDDEEDDEDDEEDDDDEEDDEDRHPFNSSQVRGASRGLGLARI
IEATGDRVRVSWSLEQGKPVGSVASLFNSEIGILTRAFVPLKYATKYDIPNEVFTNIIERLLNKFDVDISEDHIKNYIVYEIGTRYKDYRSRLYQYYRKLGDPRKARERP
HKDVAPEDWIMLCDRWETTEWKKREDGSYLSPIEIFHQTHWSVAKGWTDTAASEAHAKMVALAEEQATSSTPMTDDEIVASVLGTRPSYARLTQTQEMLEAQRQENEKLE
QRMEQRIEQRIEERMEMQRSDDEHTSIVATCDEISVFVANRFRDRLHDETCDVKYCVPNGLLRQIMRFSDVIISQHPTNKYIHQKTHSFIDNI