; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003374 (gene) of Snake gourd v1 genome

Gene IDTan0003374
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationLG06:69679882..69680600
RNA-Seq ExpressionTan0003374
SyntenyTan0003374
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458682.1 PREDICTED: uncharacterized protein LOC103498010 [Cucumis melo]2.5e-3143.65Show/hide
Query:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI
        MFL+RL+   PL  ATSL +Q+ K+  +KF+ L L II  N S  F A + +   LFTN+SV H   S VS+  FH+A+L+G + SS+T+HLL+T N+++
Subjt:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI

Query:  LRFESSGFTL-QLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE
        LRFE+    +  L HEL LSP +   L ++    + ++ +++ +RI+     +H   +SVTVT SQVKFS+ S+EI+LTKE
Subjt:  LRFESSGFTL-QLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE

XP_016903187.1 PREDICTED: uncharacterized protein LOC103502263 [Cucumis melo]7.5e-3648.65Show/hide
Query:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI
        MFL+RL++  PLF ATS  +QI +E  +KF+ L  SIIA N S  F A + M H  F NY V + H S +S++SFH+ALL+G  S S+T+HLL  + +LI
Subjt:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI

Query:  LRFESSGFTLQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKEVCIFI
        LRFESS    ++ HEL+L+P++   L E+   K+ SID++D +R++     +H   I VT T SQVKFS+AS+EIVLTKE  +FI
Subjt:  LRFESSGFTLQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKEVCIFI

XP_022958857.1 uncharacterized protein LOC111460011 [Cucurbita moschata]2.0e-2844.2Show/hide
Query:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI
        MFL+RL +  PL  ATS+ +QI  E  LKFS    S+I   PS+ F A   + H  F NY V   H S VS+ SF+NA+  G   SS+T+H  ET +R++
Subjt:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI

Query:  LRFESSGFT-LQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE
        L+FESS  T +Q+   L LSP++   L +I  D++ SI +QDF+ I+T   ++ +  I V++T S+VKF  ASEE +LTKE
Subjt:  LRFESSGFT-LQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE

XP_023548334.1 uncharacterized protein LOC111807002 [Cucurbita pepo subsp. pepo]8.9e-2945.3Show/hide
Query:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI
        MFL+RL +  PL  ATS+ +QI  E  LKFS    S+I   PS  F A   + H  F NY V   H S VS+ SF+NA+  G   SS+T+H  ET +R++
Subjt:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI

Query:  LRFESSGFT-LQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE
        L+FESS  T +Q+   L LSP++   L +I  D++ SI +QDF+ IVT   ++ +  I V++T SQVKF  ASEE +LTKE
Subjt:  LRFESSGFT-LQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE

XP_031744160.1 uncharacterized protein LOC116404808 [Cucumis sativus]3.2e-3447.06Show/hide
Query:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI
        MFL+RL+N  P FHATS  + I +E  +KF+ L  SI   N    F A + M +  F NY V + H S +S++SFH+ALL+G  S S+T+HLL  +N++I
Subjt:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI

Query:  LRFESSGFTLQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTK--EVCIFI
        LRFESS    Q+RHEL+L P++   L EI   K+ SID++  +R++     +H   I VT T SQVKFS+AS+EIVLTK  E CI +
Subjt:  LRFESSGFTLQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTK--EVCIFI

TrEMBL top hitse value%identityAlignment
A0A1S3C8J1 uncharacterized protein LOC1034980101.2e-3143.65Show/hide
Query:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI
        MFL+RL+   PL  ATSL +Q+ K+  +KF+ L L II  N S  F A + +   LFTN+SV H   S VS+  FH+A+L+G + SS+T+HLL+T N+++
Subjt:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI

Query:  LRFESSGFTL-QLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE
        LRFE+    +  L HEL LSP +   L ++    + ++ +++ +RI+     +H   +SVTVT SQVKFS+ S+EI+LTKE
Subjt:  LRFESSGFTL-QLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE

A0A1S4E4N8 uncharacterized protein LOC1035022633.6e-3648.65Show/hide
Query:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI
        MFL+RL++  PLF ATS  +QI +E  +KF+ L  SIIA N S  F A + M H  F NY V + H S +S++SFH+ALL+G  S S+T+HLL  + +LI
Subjt:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI

Query:  LRFESSGFTLQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKEVCIFI
        LRFESS    ++ HEL+L+P++   L E+   K+ SID++D +R++     +H   I VT T SQVKFS+AS+EIVLTKE  +FI
Subjt:  LRFESSGFTLQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKEVCIFI

A0A6J1CUU8 uncharacterized protein LOC1110149882.8e-2846.11Show/hide
Query:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI
        MFLIRLQ +APLF A    ++I     +KFS     II    S PF A + M    FT+++V   H S + +DS H+ L++G+   ++T HLLE  NRL+
Subjt:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI

Query:  LRFESSGFTLQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE
        LRFE+S    + R EL LSP+E   + EI     VSI + +F+ IVT  SAY +  I  T+T SQVKFSVA+EEI+LTKE
Subjt:  LRFESSGFTLQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE

A0A6J1H2Z8 uncharacterized protein LOC1114600119.6e-2944.2Show/hide
Query:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI
        MFL+RL +  PL  ATS+ +QI  E  LKFS    S+I   PS+ F A   + H  F NY V   H S VS+ SF+NA+  G   SS+T+H  ET +R++
Subjt:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI

Query:  LRFESSGFT-LQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE
        L+FESS  T +Q+   L LSP++   L +I  D++ SI +QDF+ I+T   ++ +  I V++T S+VKF  ASEE +LTKE
Subjt:  LRFESSGFT-LQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE

A0A6J1KZ05 uncharacterized protein LOC1114988871.3e-2844.75Show/hide
Query:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI
        MFL+RL +  PL  ATSL +QI  E  LKFS    S+I   PS  F A   + H  F NYSV   H S VS+ SF++A+ +G   SS+T+H  ET +R++
Subjt:  MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLI

Query:  LRFESSGFT-LQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE
        L+FESS  T L++   L LSP++   L +I  D++ SI +QDF+ I+T   ++ +  I V++T S+VKF  ASEE +LTKE
Subjt:  LRFESSGFT-LQLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTTGATCAGGCTTCAAAATCTTGCTCCTCTTTTTCACGCAACATCTTTATTCAGTCAAATTGTTAAGGAAGTGAAACTAAAATTCTCGCGATTAACTCTCTCCAT
AATTGCTCAAAATCCATCGTATCCGTTTCAAGCAGTCATGTTCATGCCACATCTTTTATTCACAAACTATTCTGTTCATCACACTCACATTTCAAGTGTTTCTATTGATT
CCTTTCACAATGCTTTGTTGGAAGGCCAAAATTCTTCTTCCATCACTCTCCATCTTCTCGAAACCCTCAATCGCTTAATCCTCCGATTTGAATCTTCAGGGTTTACGTTA
CAACTCCGTCATGAATTGACATTGTCGCCCACTGAAAACGTGGTTCTTGCTGAAATTTCTAGGGACAAATATGTCTCTATTGATGCACAAGATTTTAAACGCATTGTTAC
TGCATTTTCTGCCTACCATGACCAAATAATTTCTGTTACTGTAACGCCTTCTCAAGTTAAGTTCTCTGTTGCATCTGAGGAGATTGTTCTTACCAAAGAGGTATGTATTT
TCATATAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCTTGATCAGGCTTCAAAATCTTGCTCCTCTTTTTCACGCAACATCTTTATTCAGTCAAATTGTTAAGGAAGTGAAACTAAAATTCTCGCGATTAACTCTCTCCAT
AATTGCTCAAAATCCATCGTATCCGTTTCAAGCAGTCATGTTCATGCCACATCTTTTATTCACAAACTATTCTGTTCATCACACTCACATTTCAAGTGTTTCTATTGATT
CCTTTCACAATGCTTTGTTGGAAGGCCAAAATTCTTCTTCCATCACTCTCCATCTTCTCGAAACCCTCAATCGCTTAATCCTCCGATTTGAATCTTCAGGGTTTACGTTA
CAACTCCGTCATGAATTGACATTGTCGCCCACTGAAAACGTGGTTCTTGCTGAAATTTCTAGGGACAAATATGTCTCTATTGATGCACAAGATTTTAAACGCATTGTTAC
TGCATTTTCTGCCTACCATGACCAAATAATTTCTGTTACTGTAACGCCTTCTCAAGTTAAGTTCTCTGTTGCATCTGAGGAGATTGTTCTTACCAAAGAGGTATGTATTT
TCATATAG
Protein sequenceShow/hide protein sequence
MFLIRLQNLAPLFHATSLFSQIVKEVKLKFSRLTLSIIAQNPSYPFQAVMFMPHLLFTNYSVHHTHISSVSIDSFHNALLEGQNSSSITLHLLETLNRLILRFESSGFTL
QLRHELTLSPTENVVLAEISRDKYVSIDAQDFKRIVTAFSAYHDQIISVTVTPSQVKFSVASEEIVLTKEVCIFI