; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004272 (gene) of Snake gourd v1 genome

Gene IDTan0004272
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationLG02:1992117..1993142
RNA-Seq ExpressionTan0004272
SyntenyTan0004272
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575236.1 hypothetical protein SDJN03_25875, partial [Cucurbita argyrosperma subsp. sororia]9.1e-3738.31Show/hide
Query:  MIGFVLNDIKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPTDFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVD
        M  F+L+   +LAEA+ VIA+V +   LKFS E +S+M     S  T  I+ +Q+   +F  Y CD  H SWI +  ++P +S L ++G+SS +F+    
Subjt:  MIGFVLNDIKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPTDFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVD

Query:  DEADFKFEHPNGGSRGTSLYLPPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRF
        + A+  F  P+          P   S+ +  +DY+TF+T++S++F+ I+T F    YVLVT+TS+QV FS        ++T E+G CIIGG+     I++
Subjt:  DEADFKFEHPNGGSRGTSLYLPPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRF

Query:  IIRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYFP
        II    M TF N+     R+WLF S  ++KGV+ A LG H+RF+ YFP
Subjt:  IIRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYFP

XP_022959414.1 uncharacterized protein LOC111460397 [Cucurbita moschata]8.8e-4038.59Show/hide
Query:  IKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPT-DFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVDDEADFKF
        ++ L +A SV+A+ ++  D+KF  EM ++MA    S P    ++ L ++  FF +Y C++   SW ++ +L+P +  +E+SG++S TFTV   + A+ KF
Subjt:  IKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPT-DFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVDDEADFKF

Query:  EHPNGGSRGTSLYL-PPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFIIRSTR
        + PNG S      L P  D   V +FD+++F++++S++F+ I+T++H   YV VTVTST+V+FS A +L TI LT E+GEC+IGG+     + FII  + 
Subjt:  EHPNGGSRGTSLYL-PPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFIIRSTR

Query:  MDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF
        ++ F +MA  + R+WL+ S++++KGV++  LG + RF+ YF
Subjt:  MDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF

XP_023006644.1 uncharacterized protein LOC111499308 [Cucurbita maxima]2.5e-4239.84Show/hide
Query:  FVLNDIKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPT-DFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVDDE
        FVLN ++ L +A SV+A+ ++  D+KFS EM ++MA    S P    ++ L ++  FF +Y C++   SW ++ +L+P +  +E+SG++S +FTV   + 
Subjt:  FVLNDIKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPT-DFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVDDE

Query:  ADFKFEHPNGGSRGTSLYL-PPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFI
        A+ KF+ PNG S      L P  D   V +FD+++F+++DS++F+ I+T++H   YV VTVTST+V+FS A +L TI LT E+GEC+IGG+     + FI
Subjt:  ADFKFEHPNGGSRGTSLYL-PPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFI

Query:  IRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF
        I  + ++ F +MA  + R+WL+ S+ ++KGV++  LG + RF+ YF
Subjt:  IRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF

XP_023549342.1 uncharacterized protein LOC111807724 [Cucurbita pepo subsp. pepo]1.2e-3938.59Show/hide
Query:  IKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPT-DFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVDDEADFKF
        ++ L +A SV+A+ ++  D+KFS EM ++MA    S P    ++ L ++  FF +Y C++   SW ++ +L+P +  +E+SG++S TFTV   + A+ KF
Subjt:  IKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPT-DFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVDDEADFKF

Query:  EHPNGGSRGTSLYL-PPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFIIRSTR
        + PNG S      L P  D   V +FD+++F++++S++F+ I+T++H   YV V VTST+V+FS A +L TI LT E+GEC+IGG+     + FII  + 
Subjt:  EHPNGGSRGTSLYL-PPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFIIRSTR

Query:  MDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF
        ++ F +MA  + R+WL+ S+ ++KGV++  LG + RF+ YF
Subjt:  MDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF

XP_038876014.1 uncharacterized protein LOC120068348 [Benincasa hispida]9.1e-3738.65Show/hide
Query:  MIGFVLNDIKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPTDFILDLQMFSHFFSKYEC-DQPHCSWIYLTDLYPEISLLEQSGYSSFTFTV--
        +  FV+ND++ L  A + +  V+   D+ FSPEM+ +MA    SI     + +Q++  FF  Y C +    SW YLT ++P    L  SGY+SFTF++  
Subjt:  MIGFVLNDIKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPTDFILDLQMFSHFFSKYEC-DQPHCSWIYLTDLYPEISLLEQSGYSSFTFTV--

Query:  RVDDEADFKFEHPNGGSRGTSLYL-PPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPET
          ++ A  KFE PNG        L       +V EFD + F+++DSQ+F  I+ ++H   YV VT+TST+V FS A +  TI LTP++G+C+IGGV    
Subjt:  RVDDEADFKFEHPNGGSRGTSLYL-PPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPET

Query:  EIRFIIRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF
        +I+FII     + F ++A  A RIW F + NS+KGV+ A +G + R + +F
Subjt:  EIRFIIRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF

TrEMBL top hitse value%identityAlignment
A0A1S3CBB9 uncharacterized protein LOC1034987761.0e-3334.15Show/hide
Query:  FVLNDIKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPTDFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVD-DE
        FV+N+++ L +A++ +   +   D  FSPEM  +M     SI +   + LQ++  FF  Y C +   SW +  +++P    L+ SGY+SF+F++  + D 
Subjt:  FVLNDIKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPTDFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVD-DE

Query:  ADFKFEHPNGGSRGTSLYLP-PQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFI
        A  KF+ PNG    T+  L        + +FD + F++MDSQ+F  +++Q+H    V VT+TS +V+FS  SI+   +L  +NG+CIIGG+    +++FI
Subjt:  ADFKFEHPNGGSRGTSLYLP-PQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFI

Query:  IRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF
        +     + F+++A    R+W F   NS+KG++ A LG +SR +  F
Subjt:  IRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF

A0A5D3DMK7 Uncharacterized protein1.9e-3538.21Show/hide
Query:  LNDIKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPTDFILDLQMFSHFFSKYECDQ-PHCSWIYLTDLYPEISLLEQSGYSSFTFTVR--VDDE
        +ND+K L + +  +  V+   D  FSP+M  +MA    SI + F   +++   FF  +  D+     W  LT L+P    L +SGY+SFTF++     + 
Subjt:  LNDIKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPTDFILDLQMFSHFFSKYECDQ-PHCSWIYLTDLYPEISLLEQSGYSSFTFTVR--VDDE

Query:  ADFKFEHPNGGSRGTSLYLPP-QDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFI
        A F+FE PNG  R  +  L P      + E D + F+TMDSQ+F  I++++H   YV V +T+ +V FS A I  TI +TP++G+CIIGG+ P  E++FI
Subjt:  ADFKFEHPNGGSRGTSLYLPP-QDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFI

Query:  IRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF
        I  ++ D F + A  + RIWLF  +NS+KG++ A LG H R + +F
Subjt:  IRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF

A0A6J1H801 uncharacterized protein LOC1114603974.3e-4038.59Show/hide
Query:  IKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPT-DFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVDDEADFKF
        ++ L +A SV+A+ ++  D+KF  EM ++MA    S P    ++ L ++  FF +Y C++   SW ++ +L+P +  +E+SG++S TFTV   + A+ KF
Subjt:  IKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPT-DFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVDDEADFKF

Query:  EHPNGGSRGTSLYL-PPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFIIRSTR
        + PNG S      L P  D   V +FD+++F++++S++F+ I+T++H   YV VTVTST+V+FS A +L TI LT E+GEC+IGG+     + FII  + 
Subjt:  EHPNGGSRGTSLYL-PPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFIIRSTR

Query:  MDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF
        ++ F +MA  + R+WL+ S++++KGV++  LG + RF+ YF
Subjt:  MDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF

A0A6J1KYD4 uncharacterized protein LOC1114993196.6e-3337.8Show/hide
Query:  MEPTSIPTDFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVDDEADFKFEHPNGGSRGTSLYLPPQDSKHVTEFDYTTFIT
        M   S  T  I+ +Q+   +F KY C+  H SWI +  ++P +S L ++G+SS +F+    + A+  FE P+          P   S+ +  +DY+TF+T
Subjt:  MEPTSIPTDFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVDDEADFKFEHPNGGSRGTSLYLPPQDSKHVTEFDYTTFIT

Query:  MDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFIIRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGF
        ++S++F+ I+T F   RYVLVT+TS+QV FS     N  ++T E+G C+IGG+     I+++I    M TF N+     R+WLF S  ++KGV+ A LG 
Subjt:  MDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFIIRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGF

Query:  HSRFLVYFP
        H+RF+ YFP
Subjt:  HSRFLVYFP

A0A6J1L2R2 uncharacterized protein LOC1114993081.2e-4239.84Show/hide
Query:  FVLNDIKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPT-DFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVDDE
        FVLN ++ L +A SV+A+ ++  D+KFS EM ++MA    S P    ++ L ++  FF +Y C++   SW ++ +L+P +  +E+SG++S +FTV   + 
Subjt:  FVLNDIKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPT-DFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVDDE

Query:  ADFKFEHPNGGSRGTSLYL-PPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFI
        A+ KF+ PNG S      L P  D   V +FD+++F+++DS++F+ I+T++H   YV VTVTST+V+FS A +L TI LT E+GEC+IGG+     + FI
Subjt:  ADFKFEHPNGGSRGTSLYL-PPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFI

Query:  IRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF
        I  + ++ F +MA  + R+WL+ S+ ++KGV++  LG + RF+ YF
Subjt:  IRSTRMDTFSNMALLAGRIWLFNSINSSKGVMVAVLGFHSRFLVYF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGGTTTCGTTCTGAATGATATCAAGAAGCTGGCGGAAGCGATGTCGGTGATCGCCACCGTGAACAACGAAGTGGATCTAAAATTCTCGCCGGAGATGGTGTCGTT
AATGGCGATGGAACCCACAAGTATTCCCACTGATTTCATTTTAGACCTCCAAATGTTTTCGCACTTCTTTAGCAAATATGAGTGCGATCAACCTCACTGTTCATGGATTT
ACCTCACCGACCTTTATCCCGAGATATCCCTTTTGGAACAATCGGGCTATTCTTCTTTTACCTTTACTGTCCGAGTCGATGATGAAGCCGACTTCAAATTTGAACATCCA
AATGGGGGTTCTCGGGGGACTAGTTTGTACTTGCCTCCTCAAGATTCGAAACACGTCACTGAATTTGATTACACAACTTTTATCACCATGGACTCCCAGCAGTTCATGCC
CATTCTCACCCAATTTCATTCCCGTCGTTACGTTCTTGTTACTGTAACCAGTACACAAGTCGTGTTCTCTAATGCATCCATACTGAATACTATTCTTCTTACTCCTGAGA
ATGGAGAATGTATTATTGGAGGGGTTCCACCTGAAACTGAAATTCGTTTTATAATAAGGTCTACACGCATGGATACTTTCTCTAATATGGCACTTCTAGCTGGAAGAATA
TGGTTATTCAACTCAATTAATTCTTCCAAAGGTGTAATGGTTGCTGTTTTAGGATTCCATTCGAGATTTTTGGTCTATTTTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATCGGTTTCGTTCTGAATGATATCAAGAAGCTGGCGGAAGCGATGTCGGTGATCGCCACCGTGAACAACGAAGTGGATCTAAAATTCTCGCCGGAGATGGTGTCGTT
AATGGCGATGGAACCCACAAGTATTCCCACTGATTTCATTTTAGACCTCCAAATGTTTTCGCACTTCTTTAGCAAATATGAGTGCGATCAACCTCACTGTTCATGGATTT
ACCTCACCGACCTTTATCCCGAGATATCCCTTTTGGAACAATCGGGCTATTCTTCTTTTACCTTTACTGTCCGAGTCGATGATGAAGCCGACTTCAAATTTGAACATCCA
AATGGGGGTTCTCGGGGGACTAGTTTGTACTTGCCTCCTCAAGATTCGAAACACGTCACTGAATTTGATTACACAACTTTTATCACCATGGACTCCCAGCAGTTCATGCC
CATTCTCACCCAATTTCATTCCCGTCGTTACGTTCTTGTTACTGTAACCAGTACACAAGTCGTGTTCTCTAATGCATCCATACTGAATACTATTCTTCTTACTCCTGAGA
ATGGAGAATGTATTATTGGAGGGGTTCCACCTGAAACTGAAATTCGTTTTATAATAAGGTCTACACGCATGGATACTTTCTCTAATATGGCACTTCTAGCTGGAAGAATA
TGGTTATTCAACTCAATTAATTCTTCCAAAGGTGTAATGGTTGCTGTTTTAGGATTCCATTCGAGATTTTTGGTCTATTTTCCTTAG
Protein sequenceShow/hide protein sequence
MIGFVLNDIKKLAEAMSVIATVNNEVDLKFSPEMVSLMAMEPTSIPTDFILDLQMFSHFFSKYECDQPHCSWIYLTDLYPEISLLEQSGYSSFTFTVRVDDEADFKFEHP
NGGSRGTSLYLPPQDSKHVTEFDYTTFITMDSQQFMPILTQFHSRRYVLVTVTSTQVVFSNASILNTILLTPENGECIIGGVPPETEIRFIIRSTRMDTFSNMALLAGRI
WLFNSINSSKGVMVAVLGFHSRFLVYFP