; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013547 (gene) of Snake gourd v1 genome

Gene IDTan0013547
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationLG06:69648246..69649266
RNA-Seq ExpressionTan0013547
SyntenyTan0013547
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458682.1 PREDICTED: uncharacterized protein LOC103498010 [Cucumis melo]9.5e-3144.75Show/hide
Query:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS
        MFL++LE   PL  ATSL +Q+A   D+KF+ L   II  + SP+F A + +  +LF N+SVDH+  S VSLQ FH A+L+   FSS+TI L +  + + 
Subjt:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS

Query:  LSFDT-SRYVPRLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE
        L F+T S  VP L HEL LSP +  + G++     F++ + E +RII  L  +   T+ +T+T SQVKFS+ S+EIIL+KE
Subjt:  LSFDT-SRYVPRLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE

XP_016903187.1 PREDICTED: uncharacterized protein LOC103502263 [Cucumis melo]7.5e-3646.11Show/hide
Query:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS
        MFL++L++  PL+ ATS  +QIA + D+KF+ L FSIIA + SPRF A + M H  F NY VD+ H S +SL++FH A+L+     S+TI L      + 
Subjt:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS

Query:  LSFDTSRYVPRLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE
        L F++S + P++ HEL+L+P++  D GE+  AK FSID+++ +R+I  L  +   +IC+T T SQVKFS+AS+EI+L+KE
Subjt:  LSFDTSRYVPRLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE

XP_023006010.1 uncharacterized protein LOC111498887 [Cucurbita maxima]6.2e-3044.2Show/hide
Query:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS
        MFL++L +  PL  ATSL +QI+N+ DLKFS   FS+I  +PS RF A   + H+ FANYSVD +H S VSLQ+F+ A+ +   FSS+TI   E  S + 
Subjt:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS

Query:  LSFDTSRYVPRLGHE-LTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE
        L F++S +     H  L LSP++  + G+I + + FSI +++F+ II  L ++ +++I +++T S+VKF  ASEE IL+KE
Subjt:  LSFDTSRYVPRLGHE-LTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE

XP_031744160.1 uncharacterized protein LOC116404808 [Cucumis sativus]9.2e-3444.44Show/hide
Query:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS
        MFL++L+N  P +HATS  + IA + D+KF+ L FSI   +  PRF A + M +  F NY VD+ H S +SL++FH A+L+     S+TI L    + + 
Subjt:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS

Query:  LSFDTSRYVPRLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE
        L F++S + P++ HEL+L P++  D GEI  AK FSID++  +R+I  L  +   +IC+T T SQVKFS+AS+EI+L+KE
Subjt:  LSFDTSRYVPRLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE

XP_038875055.1 uncharacterized protein LOC120067580 [Benincasa hispida]5.1e-3245.41Show/hide
Query:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS
        MFL+KL N  PL  ATS  +QI+N  D+KF+ L F +IA +PSPRF A + +  + F NYSVDH H S V L++FH AIL+   F+S+TI L E+ + + 
Subjt:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS

Query:  LSFDT-SRYVPRLGHELTLSPTENVD---FGEISNAKSFSIDTEEFKRIIIALSNY-DDHTICITITHSQVKFSVASEEIILSKE
        L F T S  +P L HELT SP +  D    G++   K F + +E  +RII  L  + DD  +C+ +T SQ+KFS+AS+EI+L  +
Subjt:  LSFDT-SRYVPRLGHELTLSPTENVD---FGEISNAKSFSIDTEEFKRIIIALSNY-DDHTICITITHSQVKFSVASEEIILSKE

TrEMBL top hitse value%identityAlignment
A0A1S3C8J1 uncharacterized protein LOC1034980104.6e-3144.75Show/hide
Query:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS
        MFL++LE   PL  ATSL +Q+A   D+KF+ L   II  + SP+F A + +  +LF N+SVDH+  S VSLQ FH A+L+   FSS+TI L +  + + 
Subjt:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS

Query:  LSFDT-SRYVPRLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE
        L F+T S  VP L HEL LSP +  + G++     F++ + E +RII  L  +   T+ +T+T SQVKFS+ S+EIIL+KE
Subjt:  LSFDT-SRYVPRLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE

A0A1S4E4N8 uncharacterized protein LOC1035022633.6e-3646.11Show/hide
Query:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS
        MFL++L++  PL+ ATS  +QIA + D+KF+ L FSIIA + SPRF A + M H  F NY VD+ H S +SL++FH A+L+     S+TI L      + 
Subjt:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS

Query:  LSFDTSRYVPRLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE
        L F++S + P++ HEL+L+P++  D GE+  AK FSID+++ +R+I  L  +   +IC+T T SQVKFS+AS+EI+L+KE
Subjt:  LSFDTSRYVPRLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE

A0A6J1CUU8 uncharacterized protein LOC1110149883.9e-3042.78Show/hide
Query:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS
        MFLI+L+ +APL+ A    ++IA + D+KFS   F II    SP F A + M  + F +++VD +H S + L + H  +++ + + ++T  L E  + + 
Subjt:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS

Query:  LSFDTSRYVPRLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE
        L F+ SR +PR   EL LSP+E  D GEI      SI ++EF+ I+  LS Y +H IC T+T SQVKFSVA+EEIIL+KE
Subjt:  LSFDTSRYVPRLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE

A0A6J1H2Z8 uncharacterized protein LOC1114600114.3e-2943.09Show/hide
Query:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS
        MFL++L +  PL  ATS+ +QI+N+ DLKFS   FS+I  +PS RF A   + H+ FANY VD +H S VSLQ+F+ A+     FSS+TI   E  S + 
Subjt:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS

Query:  LSFDTSRYVPRLGHE-LTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE
        L F++S +     H  L LSP++  + G+I + + FSI +++F+ II  L ++ +++I +++T S+VKF  ASEE IL+KE
Subjt:  LSFDTSRYVPRLGHE-LTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE

A0A6J1KZ05 uncharacterized protein LOC1114988873.0e-3044.2Show/hide
Query:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS
        MFL++L +  PL  ATSL +QI+N+ DLKFS   FS+I  +PS RF A   + H+ FANYSVD +H S VSLQ+F+ A+ +   FSS+TI   E  S + 
Subjt:  MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCIS

Query:  LSFDTSRYVPRLGHE-LTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE
        L F++S +     H  L LSP++  + G+I + + FSI +++F+ II  L ++ +++I +++T S+VKF  ASEE IL+KE
Subjt:  LSFDTSRYVPRLGHE-LTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTTGATCAAGCTCGAGAACCTTGCTCCTCTTTATCATGCAACCTCCCTATTCTCTCAAATTGCTAACAAAGTCGACCTGAAATTCTCGCGGTTGGCGTTCTCGAT
CATTGCTCGGCACCCGTCCCCTCGGTTCGATGCAGTTATGTTCATGATGCATCAATTATTTGCCAACTATTCTGTCGATCATCATCACATTTCAACTGTTTCCCTCCAAA
ACTTCCACAAGGCTATATTGGAAAGCCAAAAGTTTTCTTCACTGACCATCCAGCTTGCGGAACAAGCAAGTTGCATAAGCCTTTCATTTGACACTTCAAGGTATGTGCCA
AGACTCGGCCATGAATTGACATTGTCACCCACAGAAAACGTGGATTTTGGTGAAATCTCCAATGCAAAATCTTTTTCAATTGACACAGAAGAGTTTAAACGCATTATAAT
AGCACTATCTAACTACGATGATCATACAATTTGTATTACTATAACCCATTCACAAGTCAAGTTCTCTGTTGCATCTGAGGAGATAATTCTTAGCAAAGAGGTATATGTTC
ACAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCTTGATCAAGCTCGAGAACCTTGCTCCTCTTTATCATGCAACCTCCCTATTCTCTCAAATTGCTAACAAAGTCGACCTGAAATTCTCGCGGTTGGCGTTCTCGAT
CATTGCTCGGCACCCGTCCCCTCGGTTCGATGCAGTTATGTTCATGATGCATCAATTATTTGCCAACTATTCTGTCGATCATCATCACATTTCAACTGTTTCCCTCCAAA
ACTTCCACAAGGCTATATTGGAAAGCCAAAAGTTTTCTTCACTGACCATCCAGCTTGCGGAACAAGCAAGTTGCATAAGCCTTTCATTTGACACTTCAAGGTATGTGCCA
AGACTCGGCCATGAATTGACATTGTCACCCACAGAAAACGTGGATTTTGGTGAAATCTCCAATGCAAAATCTTTTTCAATTGACACAGAAGAGTTTAAACGCATTATAAT
AGCACTATCTAACTACGATGATCATACAATTTGTATTACTATAACCCATTCACAAGTCAAGTTCTCTGTTGCATCTGAGGAGATAATTCTTAGCAAAGAGGTATATGTTC
ACAAATAA
Protein sequenceShow/hide protein sequence
MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCISLSFDTSRYVP
RLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKEVYVHK