CuGenDBv2

Tan0016521 (gene) of Snake gourd v1 genome

Gene ID: Tan0016521
Organism: Trichosanthes anguina (Snake gourd v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: LG08:4949243..4951494
RNA-Seq Expression: Tan0016521
Synteny: Tan0016521
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG7013723.1 hypothetical protein SDJN02_23890, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 1.2e-52; identity: 49.19%
Query:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEA---
        M  V+L+ F  L + TSLL QI+++AD+RFT    S+IA  SH S  FVATLQ+  R FTNYSVDH +S+ +SL+SFH A++DG  F S++IH+LE+   
Subjt:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEA---

Query:  ISLTFQS-SRDLPPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ
        + L +++ S + PPLH EL  S  Q E LGQ++ GKFFT+ SK   +II E  +F ++ V V  T+++VKFS  SKEI +TKE G C  IVGYEGE ET+
Subjt:  ISLTFQS-SRDLPPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ

Query:  FQIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYF
          I   P  FFLN   K + + FYKT ++++VIS+    +Y QYV+YF
Subjt:  FQIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYF

XP_008458682.1 PREDICTED: uncharacterized protein LOC103498010 [Cucumis melo]; e value: 4.8e-54; identity: 55.25%
Query:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-
        M  VRLE F  L + TSLL Q+A++AD++FTP    II   S+ S  FVATLQL  R FTN+SVDHN S+ +SLQ FH A++DG  F S+TIHLL+  + 
Subjt:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-

Query:  --LTFQS-SRDLPPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ
          L F++ S D+PPLHHEL  S  Q E+LGQ++ G FFT+ S++  RII E  +F  ++V VT+T SQVKFS  SKEIILTKEGG C  IVGYEGEVET+
Subjt:  --LTFQS-SRDLPPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ

Query:  FQIVLHPTSFFLNLANKGH
         Q+VL P  FFLN   + +
Subjt:  FQIVLHPTSFFLNLANKGH

XP_022958857.1 uncharacterized protein LOC111460011 [Cucurbita moschata]; e value: 3.6e-54; identity: 53.41%
Query:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-
        M  VRL  F  L   TS+L QI+ EADL+F+ S+FS+I   S+ SH FVAT Q+  RFF NY VD NHS+ +SLQSF+ A+  G  F S+TIH  E  S 
Subjt:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-

Query:  --LTFQSSRDL-PPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ
          L F+SS      +H  L  S SQ E+LGQIQ  +FF+I+S+DF  IIT    F NNS+FV+LT+S+VKF   S+E ILTKEGG+C +IVGYEG+ E  
Subjt:  --LTFQSSRDL-PPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ

Query:  FQIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYFS
        FQI L+P  FF NL+   + I FYKT D+R VI I    L AQYVIYFS
Subjt:  FQIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYFS

XP_023006010.1 uncharacterized protein LOC111498887 [Cucurbita maxima]; e value: 1.9e-55; identity: 53.82%
Query:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-
        M  VRL  F  L   TSLL QI+ EADL+F+ S+FS+I   S+ S  FVAT Q+  RFF NYSVD NHS+ +SLQSF+ A+ DG  F S+TIH  E  S 
Subjt:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-

Query:  --LTFQSSRDLP-PLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ
          L F+SS      +H  L  S SQ E+LGQIQ  +FF+I+S+DF  IIT    F NNS+FV+LT+S+VKF   S+E ILTKEGG+C  I+GYEGE E  
Subjt:  --LTFQSSRDLP-PLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ

Query:  FQIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYFS
        FQI L+P  FF NL+   + I FYKT D+R VI +    L AQYVIYFS
Subjt:  FQIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYFS

XP_023548334.1 uncharacterized protein LOC111807002 [Cucurbita pepo subsp. pepo]; e value: 9.0e-53; identity: 52.61%
Query:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-
        M  VRL  F  L   TS+L QI+ EADL+F+ S+FS+I   S+ S  FVAT Q+  RFF NY VD NHS+ +SLQSF+ A+  G  F S+TIH  E  S 
Subjt:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-

Query:  --LTFQSSRDL-PPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ
          L F+SS      +H  L  S SQ E+LGQIQ  +FF+I+S+DF  I+T    F N+S+FV+LT+SQVKF   S+E ILTKEGG+C +IVGYEG+ E  
Subjt:  --LTFQSSRDL-PPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ

Query:  FQIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYFS
        FQI L+P  FF NL+   + I FYKT D+R VI I    L AQYVIYFS
Subjt:  FQIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYFS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C8J1 uncharacterized protein LOC103498010; e value: 2.3e-54; identity: 55.25%
Query:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-
        M  VRLE F  L + TSLL Q+A++AD++FTP    II   S+ S  FVATLQL  R FTN+SVDHN S+ +SLQ FH A++DG  F S+TIHLL+  + 
Subjt:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-

Query:  --LTFQS-SRDLPPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ
          L F++ S D+PPLHHEL  S  Q E+LGQ++ G FFT+ S++  RII E  +F  ++V VT+T SQVKFS  SKEIILTKEGG C  IVGYEGEVET+
Subjt:  --LTFQS-SRDLPPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ

Query:  FQIVLHPTSFFLNLANKGH
         Q+VL P  FFLN   + +
Subjt:  FQIVLHPTSFFLNLANKGH

A0A1S3CL88 uncharacterized protein LOC103502250; e value: 2.6e-50; identity: 50.2%
Query:  MLSVRLESFHVLENGTSLLGQIA-EEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS
        M  V+L++F  L + TS L QI+ + ADL+FTPS+F IIA  SH S  F+ATLQL P++FT +SVD++HS+ +SL+SFH A++DG  F S+TIHLL+  +
Subjt:  MLSVRLESFHVLENGTSLLGQIA-EEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS

Query:  ---LTFQS-SRDLPPLHHELIPSLSQGED--LGQ--IQRGKFFTILSKDFTRIITEFSIFENNSVF-VTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYE
           L F + S ++ PLHHEL  S  Q ED  +GQ  +   K+F + SK   RII +  IF+N+S+  V +TNS+VKFS  SKEIILT EG  C  I G+E
Subjt:  ---LTFQS-SRDLPPLHHELIPSLSQGED--LGQ--IQRGKFFTILSKDFTRIITEFSIFENNSVF-VTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYE

Query:  GEVETQFQIVLHPTSFFLNLANKGHWIQFYKT-NDARTVISISILELYAQYVIYF
         EVETQFQI+L P  FFLN   K + + FYKT N+A T++ +    ++ QYVIYF
Subjt:  GEVETQFQIVLHPTSFFLNLANKGHWIQFYKT-NDARTVISISILELYAQYVIYF

A0A6J1CUU8 uncharacterized protein LOC111014988; e value: 4.8e-44; identity: 44.13%
Query:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLE---A
        M  +RL+    L +    L +IA  AD++F+P++F II S+   S PF+A LQ+ P FFT+++VD NH++ I L S H  LMDG  + ++T HLLE    
Subjt:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLE---A

Query:  ISLTFQSSRDLPPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQF
        + L F++SR+LP    EL  S S+ ED+G+I  G   +I S +F  I+T+ S + N+ +  TLT+SQVKFS  ++EIILTKEGGQCL+I    G VET+F
Subjt:  ISLTFQSSRDLPPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQF

Query:  QIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYF
        +  LHPTSFF +LA+K   +   K+ D+R +I I    L   +++YF
Subjt:  QIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYF

A0A6J1H2Z8 uncharacterized protein LOC111460011; e value: 1.8e-54; identity: 53.41%
Query:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-
        M  VRL  F  L   TS+L QI+ EADL+F+ S+FS+I   S+ SH FVAT Q+  RFF NY VD NHS+ +SLQSF+ A+  G  F S+TIH  E  S 
Subjt:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-

Query:  --LTFQSSRDL-PPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ
          L F+SS      +H  L  S SQ E+LGQIQ  +FF+I+S+DF  IIT    F NNS+FV+LT+S+VKF   S+E ILTKEGG+C +IVGYEG+ E  
Subjt:  --LTFQSSRDL-PPLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ

Query:  FQIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYFS
        FQI L+P  FF NL+   + I FYKT D+R VI I    L AQYVIYFS
Subjt:  FQIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYFS

A0A6J1KZ05 uncharacterized protein LOC111498887; e value: 9.4e-56; identity: 53.82%
Query:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-
        M  VRL  F  L   TSLL QI+ EADL+F+ S+FS+I   S+ S  FVAT Q+  RFF NYSVD NHS+ +SLQSF+ A+ DG  F S+TIH  E  S 
Subjt:  MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAIS-

Query:  --LTFQSSRDLP-PLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ
          L F+SS      +H  L  S SQ E+LGQIQ  +FF+I+S+DF  IIT    F NNS+FV+LT+S+VKF   S+E ILTKEGG+C  I+GYEGE E  
Subjt:  --LTFQSSRDLP-PLHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQ

Query:  FQIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYFS
        FQI L+P  FF NL+   + I FYKT D+R VI +    L AQYVIYFS
Subjt:  FQIVLHPTSFFLNLANKGHWIQFYKTNDARTVISISILELYAQYVIYFS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTTGTCAGTGAGGCTTGAGAGCTTTCACGTTCTTGAGAATGGAACCTCCCTATTGGGTCAAATTGCTGAGGAAGCCGACCTGAGATTCACACCGTCAGAGTTCTCCAT
AATTGCCTCAAAGTCTCACCATTCACATCCCTTCGTTGCAACGCTGCAATTGAGGCCTCGATTCTTCACCAACTATTCTGTCGATCACAATCACAGTGCCACAATTTCCC
TTCAATCTTTCCACCAAGCTTTGATGGATGGCGCTAGATTTTGTTCATTGACCATCCATCTTCTCGAAGCCATATCCCTTACATTTCAAAGTTCAAGGGATCTCCCACCG
TTGCATCATGAATTGATACCGTCGCTTTCCCAAGGGGAGGATTTAGGCCAAATTCAACGAGGGAAATTTTTCACAATTCTTTCTAAAGATTTTACACGAATTATAACCGA
ATTTTCTATCTTTGAAAATAATTCAGTTTTTGTTACTCTCACGAATTCACAAGTCAAGTTCTCGACTCCATCTAAAGAGATTATTCTTACCAAAGAGGGTGGACAATGCC
TAATGATAGTAGGTTATGAAGGAGAGGTTGAAACTCAATTCCAAATCGTTCTCCATCCGACGTCGTTTTTCCTTAATTTGGCAAATAAAGGGCATTGGATACAGTTTTAT
AAGACAAATGATGCTCGTACTGTAATCAGTATCTCAATTCTTGAATTGTATGCTCAATATGTGATCTATTTTTCTTAA
mRNA sequence
ATGTTGTCAGTGAGGCTTGAGAGCTTTCACGTTCTTGAGAATGGAACCTCCCTATTGGGTCAAATTGCTGAGGAAGCCGACCTGAGATTCACACCGTCAGAGTTCTCCAT
AATTGCCTCAAAGTCTCACCATTCACATCCCTTCGTTGCAACGCTGCAATTGAGGCCTCGATTCTTCACCAACTATTCTGTCGATCACAATCACAGTGCCACAATTTCCC
TTCAATCTTTCCACCAAGCTTTGATGGATGGCGCTAGATTTTGTTCATTGACCATCCATCTTCTCGAAGCCATATCCCTTACATTTCAAAGTTCAAGGGATCTCCCACCG
TTGCATCATGAATTGATACCGTCGCTTTCCCAAGGGGAGGATTTAGGCCAAATTCAACGAGGGAAATTTTTCACAATTCTTTCTAAAGATTTTACACGAATTATAACCGA
ATTTTCTATCTTTGAAAATAATTCAGTTTTTGTTACTCTCACGAATTCACAAGTCAAGTTCTCGACTCCATCTAAAGAGATTATTCTTACCAAAGAGGGTGGACAATGCC
TAATGATAGTAGGTTATGAAGGAGAGGTTGAAACTCAATTCCAAATCGTTCTCCATCCGACGTCGTTTTTCCTTAATTTGGCAAATAAAGGGCATTGGATACAGTTTTAT
AAGACAAATGATGCTCGTACTGTAATCAGTATCTCAATTCTTGAATTGTATGCTCAATATGTGATCTATTTTTCTTAA
Protein sequence
MLSVRLESFHVLENGTSLLGQIAEEADLRFTPSEFSIIASKSHHSHPFVATLQLRPRFFTNYSVDHNHSATISLQSFHQALMDGARFCSLTIHLLEAISLTFQSSRDLPP
LHHELIPSLSQGEDLGQIQRGKFFTILSKDFTRIITEFSIFENNSVFVTLTNSQVKFSTPSKEIILTKEGGQCLMIVGYEGEVETQFQIVLHPTSFFLNLANKGHWIQFY
KTNDARTVISISILELYAQYVIYFS