; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017026 (gene) of Snake gourd v1 genome

Gene IDTan0017026
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionFlagellin N-methylase
Genome locationLG03:66600121..66602205
RNA-Seq ExpressionTan0017026
SyntenyTan0017026
Gene Ontology termsNA
InterPro domainsIPR005358 - Putative zinc- or iron-chelating domain containing protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008447323.1 PREDICTED: uncharacterized protein LOC103489795 [Cucumis melo]1.5e-8087.27Show/hide
Query:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK
        M   VAPPQ+TVTAAR+PQ+I TKD++T QGR+ NVGFGGKRKEQLWQC+EGCGACCKLAKG SFASPEEIFQN SD ELYKSLIG DGWCIHYEK+TRK
Subjt:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK

Query:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSESM
        CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGF SKELENFNKA+QSSES+
Subjt:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSESM

XP_022143803.1 uncharacterized protein LOC111013629 isoform X1 [Momordica charantia]2.6e-8088.82Show/hide
Query:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK
        M + VAPP++TVTAARRPQQI TKDN+T +GRSINVGFGGKRKEQLWQCVEGCGACCKLA GPSFA+PEEIF+N SD ELYKSLIGADGWCIHYEKSTRK
Subjt:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK

Query:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQS
        CSIYADRPYFCRVESPVFEKLYGIKENKFNK ACSSCRDTIKA+YGF SKELENFNKA+QS
Subjt:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQS

XP_022972574.1 uncharacterized protein LOC111471122 [Cucurbita maxima]7.6e-8087.2Show/hide
Query:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK
        ML +VAPPQ+TV AARR +Q+ TK NETRQGR  N GF GKRKE+LWQCVEGCGACCKLAKGPSFASPEEIFQNPSD ELYKSLIGADGWCIHYEKS+RK
Subjt:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK

Query:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSES
        CSIYADRPYFCRVESPVFEKLYGIKENKFNK ACSSCRDTIK IYGFQSKELE FNKA+Q SES
Subjt:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSES

XP_023518453.1 uncharacterized protein LOC111781942 [Cucurbita pepo subsp. pepo]4.4e-8087.8Show/hide
Query:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK
        ML +VAPPQ+TV AARR +QI TKD ET+QGR  N GF GKRKE+LWQCVEGCGACCKLAKGPSFASPEEIFQNPSD ELYKSLIGADGWCIHYEKS+RK
Subjt:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK

Query:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSES
        CSIYADRPYFCRVESPVFEKLYGIKENKFNK ACSSCRDTIKAIYGFQSKELE FNKA+Q SES
Subjt:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSES

XP_038883478.1 uncharacterized protein LOC120074431 [Benincasa hispida]1.6e-8289.7Show/hide
Query:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK
        M + VAPPQ+++TAARRPQQITTKD +  QGR+INVGFG KRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQN SD ELYKSLIGADGWCIHYEKSTRK
Subjt:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK

Query:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSESM
        CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGF SKELENFNKA+QSSES+
Subjt:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSESM

TrEMBL top hitse value%identityAlignment
A0A1S3BH68 uncharacterized protein LOC1034897957.4e-8187.27Show/hide
Query:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK
        M   VAPPQ+TVTAAR+PQ+I TKD++T QGR+ NVGFGGKRKEQLWQC+EGCGACCKLAKG SFASPEEIFQN SD ELYKSLIG DGWCIHYEK+TRK
Subjt:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK

Query:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSESM
        CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGF SKELENFNKA+QSSES+
Subjt:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSESM

A0A5A7TWI1 Flagellin N-methylase7.4e-8187.27Show/hide
Query:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK
        M   VAPPQ+TVTAAR+PQ+I TKD++T QGR+ NVGFGGKRKEQLWQC+EGCGACCKLAKG SFASPEEIFQN SD ELYKSLIG DGWCIHYEK+TRK
Subjt:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK

Query:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSESM
        CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGF SKELENFNKA+QSSES+
Subjt:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSESM

A0A6J1CQF1 uncharacterized protein LOC111013629 isoform X11.3e-8088.82Show/hide
Query:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK
        M + VAPP++TVTAARRPQQI TKDN+T +GRSINVGFGGKRKEQLWQCVEGCGACCKLA GPSFA+PEEIF+N SD ELYKSLIGADGWCIHYEKSTRK
Subjt:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK

Query:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQS
        CSIYADRPYFCRVESPVFEKLYGIKENKFNK ACSSCRDTIKA+YGF SKELENFNKA+QS
Subjt:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQS

A0A6J1HIY7 uncharacterized protein LOC1114633798.2e-8087.2Show/hide
Query:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK
        ML +VAPPQ+TV AARR +QI TK+ ET+QGR  N GF GKRKE+LWQCVEGCGACCKLAKGPSFASPEEIFQNPSD ELYKSLIGADGWCIHYEKS+RK
Subjt:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK

Query:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSES
        CSIYADRPYFCRVESPVFEKLYGIKENKFNK ACSSCRDTIKAIYGFQSKELE FNKA+Q SES
Subjt:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSES

A0A6J1I6C5 uncharacterized protein LOC1114711223.7e-8087.2Show/hide
Query:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK
        ML +VAPPQ+TV AARR +Q+ TK NETRQGR  N GF GKRKE+LWQCVEGCGACCKLAKGPSFASPEEIFQNPSD ELYKSLIGADGWCIHYEKS+RK
Subjt:  MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRK

Query:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSES
        CSIYADRPYFCRVESPVFEKLYGIKENKFNK ACSSCRDTIK IYGFQSKELE FNKA+Q SES
Subjt:  CSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSES

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G02710.1 unknown protein4.6e-5158.18Show/hide
Query:  APPQITVTAARRPQQIT------TKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTR
        AP   T+ +A R  Q++       K    R   S +   GG  KE  W+CVEGCGACCK+AK  SFA+P+EIF NP D ELY+S+IG DGWC++Y+K+TR
Subjt:  APPQITVTAARRPQQIT------TKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTR

Query:  KCSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSES
        KCSIYADRPYFCRVE  VF+ LYGI+E KFNK A S C DTIK IYG  SKEL++FN+AI+S+ S
Subjt:  KCSIYADRPYFCRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCGGGTGGTGGCTCCGCCGCAAATCACCGTGACTGCCGCACGCCGGCCTCAGCAGATTACGACGAAGGATAATGAGACTAGACAAGGCCGGAGCATCAATGTAGG
GTTTGGGGGCAAAAGAAAGGAACAATTATGGCAGTGCGTGGAGGGATGCGGCGCCTGCTGCAAGCTCGCCAAGGGGCCCTCCTTCGCCTCGCCGGAGGAAATCTTCCAGA
ATCCTTCCGATGCCGAGCTCTATAAAAGCTTGATTGGCGCCGATGGATGGTGCATTCACTACGAGAAGAGCACACGTAAATGCTCCATTTACGCCGATCGCCCATATTTT
TGCCGCGTAGAATCTCCTGTATTTGAAAAATTATATGGAATCAAAGAAAACAAGTTCAACAAGGCCGCTTGCAGTAGCTGCAGGGACACTATAAAAGCAATCTATGGCTT
CCAGTCCAAGGAATTGGAAAATTTCAACAAGGCAATTCAGAGCTCCGAGTCTATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAATTAAAAAAAAAAAACTCCATTATAATAGGCTATTTCTGGTTTTTTTAAACGCTTATCCAAAATCCAAAGAATGCTCCGGGTGGTGGCTCCGCCGCAAATCACCG
TGACTGCCGCACGCCGGCCTCAGCAGATTACGACGAAGGATAATGAGACTAGACAAGGCCGGAGCATCAATGTAGGGTTTGGGGGCAAAAGAAAGGAACAATTATGGCAG
TGCGTGGAGGGATGCGGCGCCTGCTGCAAGCTCGCCAAGGGGCCCTCCTTCGCCTCGCCGGAGGAAATCTTCCAGAATCCTTCCGATGCCGAGCTCTATAAAAGCTTGAT
TGGCGCCGATGGATGGTGCATTCACTACGAGAAGAGCACACGTAAATGCTCCATTTACGCCGATCGCCCATATTTTTGCCGCGTAGAATCTCCTGTATTTGAAAAATTAT
ATGGAATCAAAGAAAACAAGTTCAACAAGGCCGCTTGCAGTAGCTGCAGGGACACTATAAAAGCAATCTATGGCTTCCAGTCCAAGGAATTGGAAAATTTCAACAAGGCA
ATTCAGAGCTCCGAGTCTATGTAGAAAGCTCGAAGGTCATATATCTTTCTTATATATAGCTTATCAAGTATTATTGTACCATGCTGATTTGTTGACTATAAATATATATC
ATCATTCTCTGTAGTCTCCCTGCTAATCAAAACCTTCCTTCTAGAGCCATGAATAGATTGATTGGAAACTGTTGTCTAGACAATTATATTCTTTTGTTTTTTAGATAAAA
AAAATATGTAAAAG
Protein sequenceShow/hide protein sequence
MLRVVAPPQITVTAARRPQQITTKDNETRQGRSINVGFGGKRKEQLWQCVEGCGACCKLAKGPSFASPEEIFQNPSDAELYKSLIGADGWCIHYEKSTRKCSIYADRPYF
CRVESPVFEKLYGIKENKFNKAACSSCRDTIKAIYGFQSKELENFNKAIQSSESM