; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018611 (gene) of Snake gourd v1 genome

Gene IDTan0018611
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationLG06:68562369..68564288
RNA-Seq ExpressionTan0018611
SyntenyTan0018611
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575236.1 hypothetical protein SDJN03_25875, partial [Cucurbita argyrosperma subsp. sororia]1.8e-2440.22Show/hide
Query:  MFSFILENIDGLVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRS
        MF FIL+    L E   +++ + +I  LKFSME  SI    SP +  +I++Q+ PP+F+ Y C +   SWINI  + P + +L R  F +L FS   P  
Subjt:  MFSFILENIDGLVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRS

Query:  ASLNFSGPNILARTLHFPMHPFDNSKVDVPEFDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFSSCRQEIIFT
        A+L F GP+ L R   F ++P + S+ D+  +D+STFV++ S++FIN++T F  + +V V LTSS V FS  R + I T
Subjt:  ASLNFSGPNILARTLHFPMHPFDNSKVDVPEFDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFSSCRQEIIFT

TYK02997.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.7e-2041.45Show/hide
Query:  MFSFILENIDGLVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFF-EQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPR
        MF FILE I   V+   +++GL  + +LKFS EMFSI +  +PS    I+LQ+ PPFF +QY C     SWI I    P + +++R  F +L FS     
Subjt:  MFSFILENIDGLVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFF-EQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPR

Query:  SASLNFSGPNILARTLHFPMHPFDNS-KVDVPEFDWSTFVSITSQDFINVVT
         A L F   +   R + FPM+  D S      + DW TFVS +SQ+FIN+VT
Subjt:  SASLNFSGPNILARTLHFPMHPFDNS-KVDVPEFDWSTFVSITSQDFINVVT

XP_022959414.1 uncharacterized protein LOC111460397 [Cucurbita moschata]4.9e-2235.85Show/hide
Query:  LVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRSASLNFSGPNIL
        LV+   +++  D++ D+KF  EMF+I +   P+   +I+L + P FF++Y+C     SW  + +L P +I+++ + F +L F+V  P SA L F  PN L
Subjt:  LVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRSASLNFSGPNIL

Query:  ARTLHFPMHPFDNSKVDVPEFDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFS
        +  + F + P     ++V +FD+S+FVS+ S++F+N+VT ++ F++V V +TS+ V+FS
Subjt:  ARTLHFPMHPFDNSKVDVPEFDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFS

XP_023006644.1 uncharacterized protein LOC111499308 [Cucurbita maxima]1.4e-2435.88Show/hide
Query:  MFSFILENIDGLVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRS
        +F F+L  +  LV+   +++  D++ D+KFS EMF+I +   P+   +I+L + P FF++Y+C     SW  + +L P +I+++ + F +L F+V  P S
Subjt:  MFSFILENIDGLVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRS

Query:  ASLNFSGPNILARTLHFPMHPFDNSKVDVPEFDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFS
        A L F  PN L+  + F + P     ++V +FD+S+FVS+ S++F+N+VT ++ F++V V +TS+ V+FS
Subjt:  ASLNFSGPNILARTLHFPMHPFDNSKVDVPEFDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFS

XP_023549342.1 uncharacterized protein LOC111807724 [Cucurbita pepo subsp. pepo]5.8e-2336.48Show/hide
Query:  LVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRSASLNFSGPNIL
        LV+   +++  D++ D+KFS EMF+I +   P+   +I+L + P FF++Y+C     SW  + +L P +I+++ + F +L F+V  P SA L F  PN L
Subjt:  LVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRSASLNFSGPNIL

Query:  ARTLHFPMHPFDNSKVDVPEFDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFS
        +  + F + P     ++V +FD+S+FVS+ S++F+N+VT ++ F++V V++TS+ V+FS
Subjt:  ARTLHFPMHPFDNSKVDVPEFDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFS

TrEMBL top hitse value%identityAlignment
A0A5D3BU35 LINE-1 retrotransposable element ORF2 protein1.3e-2041.45Show/hide
Query:  MFSFILENIDGLVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFF-EQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPR
        MF FILE I   V+   +++GL  + +LKFS EMFSI +  +PS    I+LQ+ PPFF +QY C     SWI I    P + +++R  F +L FS     
Subjt:  MFSFILENIDGLVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFF-EQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPR

Query:  SASLNFSGPNILARTLHFPMHPFDNS-KVDVPEFDWSTFVSITSQDFINVVT
         A L F   +   R + FPM+  D S      + DW TFVS +SQ+FIN+VT
Subjt:  SASLNFSGPNILARTLHFPMHPFDNS-KVDVPEFDWSTFVSITSQDFINVVT

A0A6J1H678 uncharacterized protein LOC1114603982.5e-1940.54Show/hide
Query:  MEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRSASLNFSGPNILARTLHFPMHPFDNSKVDVPE
        ME  SI    SP +  +I++Q+ PP+F+ Y C +   SWINI  + P + +L R  F +L FS   P  A+L F GP+ L R   F ++P + S+ D+  
Subjt:  MEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRSASLNFSGPNILARTLHFPMHPFDNSKVDVPE

Query:  FDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFSSCRQEIIFT
        +D+STFV++ S++FIN++T F  + +V V LTSS V FS  R + I T
Subjt:  FDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFSSCRQEIIFT

A0A6J1H801 uncharacterized protein LOC1114603972.4e-2235.85Show/hide
Query:  LVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRSASLNFSGPNIL
        LV+   +++  D++ D+KF  EMF+I +   P+   +I+L + P FF++Y+C     SW  + +L P +I+++ + F +L F+V  P SA L F  PN L
Subjt:  LVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRSASLNFSGPNIL

Query:  ARTLHFPMHPFDNSKVDVPEFDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFS
        +  + F + P     ++V +FD+S+FVS+ S++F+N+VT ++ F++V V +TS+ V+FS
Subjt:  ARTLHFPMHPFDNSKVDVPEFDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFS

A0A6J1KYD4 uncharacterized protein LOC1114993191.1e-1940.97Show/hide
Query:  SITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRSASLNFSGPNILARTLHFPMHPFDNSKVDVPEFDWS
        SI    SP +  +I++Q+ PP+F++Y C +   SWINI  + P + +L R  F +L FS   P  A+L F GP+ L R  +F ++P D S+ D+  +D+S
Subjt:  SITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRSASLNFSGPNILARTLHFPMHPFDNSKVDVPEFDWS

Query:  TFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFSSCRQEIIFT
        TFV++ S++FIN++T F  + +V V LTSS V FS  R + I T
Subjt:  TFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFSSCRQEIIFT

A0A6J1L2R2 uncharacterized protein LOC1114993086.7e-2535.88Show/hide
Query:  MFSFILENIDGLVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRS
        +F F+L  +  LV+   +++  D++ D+KFS EMF+I +   P+   +I+L + P FF++Y+C     SW  + +L P +I+++ + F +L F+V  P S
Subjt:  MFSFILENIDGLVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRS

Query:  ASLNFSGPNILARTLHFPMHPFDNSKVDVPEFDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFS
        A L F  PN L+  + F + P     ++V +FD+S+FVS+ S++F+N+VT ++ F++V V +TS+ V+FS
Subjt:  ASLNFSGPNILARTLHFPMHPFDNSKVDVPEFDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAGCTTCATTCTTGAAAACATCGATGGGCTGGTGGAAAACACGTGCATCATGAGCGGGCTTGATAATATATTGGATCTTAAATTCTCAATGGAAATGTTCTCTAT
AACGTCCAGATTGTCTCCTTCCTCTCCTTTCATGATATCTCTTCAAATGCTTCCTCCATTCTTCGAACAATATTTGTGCTATAACTACACATCTTCATGGATTAACATTG
GCGATCTTTTGCCATTTCTCATCAACTTGAAAAGGAACTCTTTCTTTACTTTAGTCTTCTCCGTTCAAATTCCACGAAGTGCTTCTCTCAATTTTTCTGGTCCAAATATC
CTTGCCCGGACGCTTCATTTTCCTATGCATCCTTTCGATAACTCCAAGGTTGATGTCCCCGAATTTGATTGGTCAACTTTTGTCTCCATCACCTCACAAGACTTCATTAA
CGTTGTCACTTTCTTCTATAATTTTGAATGGGTCCGTGTTGTTCTAACCAGTTCTGATGTCATGTTCTCTTCTTGTAGACAAGAGATCATTTTCACCATTCCACTTGCTG
AACTACGTATGGCCGAACATATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCAGCTTCATTCTTGAAAACATCGATGGGCTGGTGGAAAACACGTGCATCATGAGCGGGCTTGATAATATATTGGATCTTAAATTCTCAATGGAAATGTTCTCTAT
AACGTCCAGATTGTCTCCTTCCTCTCCTTTCATGATATCTCTTCAAATGCTTCCTCCATTCTTCGAACAATATTTGTGCTATAACTACACATCTTCATGGATTAACATTG
GCGATCTTTTGCCATTTCTCATCAACTTGAAAAGGAACTCTTTCTTTACTTTAGTCTTCTCCGTTCAAATTCCACGAAGTGCTTCTCTCAATTTTTCTGGTCCAAATATC
CTTGCCCGGACGCTTCATTTTCCTATGCATCCTTTCGATAACTCCAAGGTTGATGTCCCCGAATTTGATTGGTCAACTTTTGTCTCCATCACCTCACAAGACTTCATTAA
CGTTGTCACTTTCTTCTATAATTTTGAATGGGTCCGTGTTGTTCTAACCAGTTCTGATGTCATGTTCTCTTCTTGTAGACAAGAGATCATTTTCACCATTCCACTTGCTG
AACTACGTATGGCCGAACATATTTAG
Protein sequenceShow/hide protein sequence
MFSFILENIDGLVENTCIMSGLDNILDLKFSMEMFSITSRLSPSSPFMISLQMLPPFFEQYLCYNYTSSWINIGDLLPFLINLKRNSFFTLVFSVQIPRSASLNFSGPNI
LARTLHFPMHPFDNSKVDVPEFDWSTFVSITSQDFINVVTFFYNFEWVRVVLTSSDVMFSSCRQEIIFTIPLAELRMAEHI