; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010066 (gene) of Snake gourd v1 genome

Gene IDTan0010066
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG11:55904496..55936248
RNA-Seq ExpressionTan0010066
SyntenyTan0010066
Gene Ontology termsNA
InterPro domainsIPR039495 - TATA box-binding protein-associated factor RNA polymerase I subunit A-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600392.1 hypothetical protein SDJN03_05625, partial [Cucurbita argyrosperma subsp. sororia]2.8e-4475Show/hide
Query:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT
        LLL+GGRVDKAL++MENICRD+NATL       L EHFD SN +LLSTCYE+ILKKDPTCCHSL KLVHMH+N   NYSLE L+EMIALHLDGTC E DT
Subjt:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT

Query:  WRELAMCFLKLFQLEEDRIS-TCSIGTG
        WREL MCFLKL Q+EEDR+S  CSIG+G
Subjt:  WRELAMCFLKLFQLEEDRIS-TCSIGTG

KAG7031054.1 hypothetical protein SDJN02_05093 [Cucurbita argyrosperma subsp. argyrosperma]2.8e-4475Show/hide
Query:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT
        LLL+GGRVDKAL++MENICRD+NATL       L EHFD SN +LLSTCYE+ILKKDPTCCHSL KLVHMH+N   NYSLE L+EMIALHLDGTC E DT
Subjt:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT

Query:  WRELAMCFLKLFQLEEDRIS-TCSIGTG
        WREL MCFLKL Q+EEDR+S  CSIG+G
Subjt:  WRELAMCFLKLFQLEEDRIS-TCSIGTG

XP_022941583.1 uncharacterized protein LOC111446895 [Cucurbita moschata]5.7e-4575.78Show/hide
Query:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT
        LLL+GGRVDKAL++MENICRD+NATL       L EHFD SN +LLSTCYE+ILKKDPTCCHSL KLVHMH+N   NYSLE L+EMIALHLDGTC E DT
Subjt:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT

Query:  WRELAMCFLKLFQLEEDRIS-TCSIGTG
        WRELAMCFLKL Q+EEDR+S  CSIG+G
Subjt:  WRELAMCFLKLFQLEEDRIS-TCSIGTG

XP_022979683.1 uncharacterized protein LOC111479331 [Cucurbita maxima]1.0e-4174.22Show/hide
Query:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT
        LLL+GGRVDKAL++MENIC D+NATL       L EHFD SN VLLSTCYE+ILKKDPTCCHSL KLV MH+N   NYSLE L+EMIALHLDGT  E DT
Subjt:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT

Query:  WRELAMCFLKLFQLEEDRIS-TCSIGTG
        WRELAMCFLKL Q+EEDR+S  CSIG+G
Subjt:  WRELAMCFLKLFQLEEDRIS-TCSIGTG

XP_023536647.1 uncharacterized protein LOC111797853 [Cucurbita pepo subsp. pepo]5.7e-4575.78Show/hide
Query:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT
        LLL+GGRVDKAL++MENICRD+NATL       L EHFD SN +LLSTCYE+ILKKDPTCCHSL KLVHMH+N   NYSLE L+EMIALHLDGTC E DT
Subjt:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT

Query:  WRELAMCFLKLFQLEEDRIS-TCSIGTG
        WRELAMCFLKL Q+EEDR+S  CSIG+G
Subjt:  WRELAMCFLKLFQLEEDRIS-TCSIGTG

TrEMBL top hitse value%identityAlignment
A0A0A0KXN5 Uncharacterized protein4.5e-4069.53Show/hide
Query:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT
        LLL+GGR+DKAL++ME  C D+NA L       L EHFDRSN+VLLSTCYEQ LKKDPTCCHS+ KLV MH+N   NY+LE L+EMIALHLDGT  E DT
Subjt:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT

Query:  WRELAMCFLKLFQLEEDRIS-TCSIGTG
        WRELA+CFL+L Q EEDR+S  CSIGTG
Subjt:  WRELAMCFLKLFQLEEDRIS-TCSIGTG

A0A1S3BS63 uncharacterized protein LOC1034929167.7e-4070.31Show/hide
Query:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT
        LLL+GGR+DKAL++ME  C D+NA L       L EHFDRSN+VLLSTCYEQ LKKDPTC HS+ KLV MH+N   NY+LE L+EMIALHLDGT  E DT
Subjt:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT

Query:  WRELAMCFLKLFQLEEDRIST-CSIGTG
        WRELA+CFLKL Q EEDR+ST CSIGTG
Subjt:  WRELAMCFLKLFQLEEDRIST-CSIGTG

A0A6J1CPA4 uncharacterized protein LOC1110129191.4e-4172.87Show/hide
Query:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHL-DGTCVECD
        LLL+GGRVDKAL ++E IC D+NA L       L EHFDRSNDVLLS+CYEQILKKDPTCCHSL KLV MH+N   NY+LE L+EMI LHL DGTCVE D
Subjt:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHL-DGTCVECD

Query:  TWRELAMCFLKLFQLEEDRIST-CSIGTG
         WRELA+CFLKL Q EEDR+ST CSIGTG
Subjt:  TWRELAMCFLKLFQLEEDRIST-CSIGTG

A0A6J1FSH8 uncharacterized protein LOC1114468952.7e-4575.78Show/hide
Query:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT
        LLL+GGRVDKAL++MENICRD+NATL       L EHFD SN +LLSTCYE+ILKKDPTCCHSL KLVHMH+N   NYSLE L+EMIALHLDGTC E DT
Subjt:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT

Query:  WRELAMCFLKLFQLEEDRIS-TCSIGTG
        WRELAMCFLKL Q+EEDR+S  CSIG+G
Subjt:  WRELAMCFLKLFQLEEDRIS-TCSIGTG

A0A6J1IWZ8 uncharacterized protein LOC1114793314.8e-4274.22Show/hide
Query:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT
        LLL+GGRVDKAL++MENIC D+NATL       L EHFD SN VLLSTCYE+ILKKDPTCCHSL KLV MH+N   NYSLE L+EMIALHLDGT  E DT
Subjt:  LLLVGGRVDKALNQMENICRDANATL-------LKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT

Query:  WRELAMCFLKLFQLEEDRIS-TCSIGTG
        WRELAMCFLKL Q+EEDR+S  CSIG+G
Subjt:  WRELAMCFLKLFQLEEDRIS-TCSIGTG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G53200.1 unknown protein5.7e-1941.73Show/hide
Query:  LLLVGGRVDKALNQMENICRDAN-------ATLLKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT
        +LL+GG VD+A+  +E +C   +         L+ E F R++D +L+ CYE ILK DP C  +L KL+ M       YS E L EMIALH++ +  E + 
Subjt:  LLLVGGRVDKALNQMENICRDAN-------ATLLKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT

Query:  WRELAMCFLKLFQ-LEEDRISTCSIGT
        W+ELA CF   F+ L+EDR+S C  G+
Subjt:  WRELAMCFLKLFQ-LEEDRISTCSIGT

AT1G53200.2 unknown protein5.7e-1941.73Show/hide
Query:  LLLVGGRVDKALNQMENICRDAN-------ATLLKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT
        +LL+GG VD+A+  +E +C   +         L+ E F R++D +L+ CYE ILK DP C  +L KL+ M       YS E L EMIALH++ +  E + 
Subjt:  LLLVGGRVDKALNQMENICRDAN-------ATLLKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYLMEMIALHLDGTCVECDT

Query:  WRELAMCFLKLFQ-LEEDRISTCSIGT
        W+ELA CF   F+ L+EDR+S C  G+
Subjt:  WRELAMCFLKLFQ-LEEDRISTCSIGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAATCCAACGCGCATCACTTCCTCAAAATTTCAAGGCACAATTGGAGCCGTGTTTGTAGAGGCTTTTGCCTTCTATCTTCGCTCCCTTTTTGGACTGTTGTTGTT
GGTTGGAGGTCGAGTTGACAAAGCACTCAATCAAATGGAAAATATCTGTCGTGATGCCAATGCAACACTTCTCAAAGAACATTTTGATCGTAGTAACGATGTCTTGCTTT
CAACTTGTTATGAGCAAATATTGAAGAAGGATCCAACATGTTGTCATTCACTGCTAAAACTAGTTCACATGCATAAAAACACGGTATGCAATTACAGTCTTGAATATCTG
ATGGAAATGATAGCTTTGCATTTAGATGGTACATGTGTGGAATGTGATACATGGAGAGAGTTGGCTATGTGTTTTCTGAAACTTTTTCAATTGGAAGAGGATAGAATATC
AACATGTTCAATTGGGACAGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAATCCAACGCGCATCACTTCCTCAAAATTTCAAGGCACAATTGGAGCCGTGTTTGTAGAGGCTTTTGCCTTCTATCTTCGCTCCCTTTTTGGACTGTTGTTGTT
GGTTGGAGGTCGAGTTGACAAAGCACTCAATCAAATGGAAAATATCTGTCGTGATGCCAATGCAACACTTCTCAAAGAACATTTTGATCGTAGTAACGATGTCTTGCTTT
CAACTTGTTATGAGCAAATATTGAAGAAGGATCCAACATGTTGTCATTCACTGCTAAAACTAGTTCACATGCATAAAAACACGGTATGCAATTACAGTCTTGAATATCTG
ATGGAAATGATAGCTTTGCATTTAGATGGTACATGTGTGGAATGTGATACATGGAGAGAGTTGGCTATGTGTTTTCTGAAACTTTTTCAATTGGAAGAGGATAGAATATC
AACATGTTCAATTGGGACAGGTTGA
Protein sequenceShow/hide protein sequence
MGNPTRITSSKFQGTIGAVFVEAFAFYLRSLFGLLLLVGGRVDKALNQMENICRDANATLLKEHFDRSNDVLLSTCYEQILKKDPTCCHSLLKLVHMHKNTVCNYSLEYL
MEMIALHLDGTCVECDTWRELAMCFLKLFQLEEDRISTCSIGTG