; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016788 (gene) of Snake gourd v1 genome

Gene IDTan0016788
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotransposon protein
Genome locationLG01:88109492..88110061
RNA-Seq ExpressionTan0016788
SyntenyTan0016788
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038875070.1 uncharacterized protein LOC120067596 [Benincasa hispida]1.2e-3253.1Show/hide
Query:  MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPME
        +L EK+P   L QNTI+CKVR+LKKQYN ++EMLS   S F WNEEFKCV+ E+E+F+ WV+SH N KGM NK F  YDDL+ VF KDRA G  +E P  
Subjt:  MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPME

Query:  MASSAVEQMEEEIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDTP
        MA++A  + E+EIRLGSQD    E R  E+     + +D++ + P
Subjt:  MASSAVEQMEEEIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDTP

XP_038880837.1 uncharacterized protein LOC120072528 [Benincasa hispida]4.1e-3362.18Show/hide
Query:  MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPME
        +L EK+P   L QNTIECKVR+LKKQYN ++EMLS   SGFGWNEEFKCV+ E+E+ D WV+SH NAK M NK F  YDDL+ VF KDR  G  +E P  
Subjt:  MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPME

Query:  MASSAVEQMEEEIRLGSQD
        MA++A  + E+EIRLGSQD
Subjt:  MASSAVEQMEEEIRLGSQD

XP_038889264.1 uncharacterized protein At2g29880-like [Benincasa hispida]1.2e-3260.17Show/hide
Query:  MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPME
        +  EK+P+  +  +TIECKVR LK+QY  I EMLSNAC+GFGWN+EFKCV+ EKEVFD WV SH N KG+R+KPFP  D+L+ VF KDRAT  G++TP +
Subjt:  MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPME

Query:  MASSAVEQMEEEIRLGSQ
         AS+  E + E+IRL SQ
Subjt:  MASSAVEQMEEEIRLGSQ

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida]7.6e-3258.59Show/hide
Query:  EKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMAS
        EK+    L QNTIECKVR+LKKQ N ++EMLS   SGF WNEEFKCV+ E+E+FD WV+SH NAKGM NKPFP YDDL+ VF K +A G  +E P  M +
Subjt:  EKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMAS

Query:  SAVEQMEEEIRLGSQDFMGVEQRTMENL
        +A  + E+EIRLGSQD    E   M  L
Subjt:  SAVEQMEEEIRLGSQDFMGVEQRTMENL

XP_038902479.1 uncharacterized protein At2g29880-like [Benincasa hispida]8.7e-3653.66Show/hide
Query:  MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPME
        +L EK+P   L QNTIECKVR+LKKQYN ++EMLS   SGF WNEEFKCV+ E+E+FD WV SH NAK M NKPFP YDD + VF KDR  G  +E P  
Subjt:  MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPME

Query:  MASSAVEQMEEEIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDTPTSMRNTSGMSSRCIGSKRK
        MA++A  + E+EIRLGSQD    E R  E+    D  +++  +  T   +    SSR  GSKRK
Subjt:  MASSAVEQMEEEIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDTPTSMRNTSGMSSRCIGSKRK

TrEMBL top hitse value%identityAlignment
A0A1S3B4L3 uncharacterized protein LOC1034859536.1e-2741.67Show/hide
Query:  MLAEKLPNSCL-EQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPM
        M+AEKLP + + E +TI+C V++LKK Y+ IAEM   +CSGFGWNEEF+C+  E+++FD+W+KSH  AKG+ +K FP YDDL++VF KDRATG  +ET  
Subjt:  MLAEKLPNSCL-EQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPM

Query:  EMASSAVEQMEEEIRLGSQDFMGVEQRTMENLRIGDIGEDDL----PDTPTSMRNTSGMSSRCIGSKR
         + S+      + I LG      +     + +    +  D++        +  RN S +S R  GS+R
Subjt:  EMASSAVEQMEEEIRLGSQDFMGVEQRTMENLRIGDIGEDDL----PDTPTSMRNTSGMSSRCIGSKR

A0A5A7TC56 Retrotransposon protein3.0e-2643.53Show/hide
Query:  MLAEKLPNSCLEQNT-IECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPM
        M+AEKLP   +   T I+C+++TLK+ +  IAEM   ACSGFGWN+E KC+  EKE+FD WV+SH  AKG+ NKPFP YD+L +VF +DRATG  AET  
Subjt:  MLAEKLPNSCLEQNT-IECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPM

Query:  EMASSAVEQMEEEIRL--GSQDFMGVEQRTMENLRIGDIGEDDL----PDTPTSMRNTSGMSSRCIGSKR
        ++ S+      +   +  G++DF  V  + +      DI +DD+    P   +  R  S  S R  GS+R
Subjt:  EMASSAVEQMEEEIRL--GSQDFMGVEQRTMENLRIGDIGEDDL----PDTPTSMRNTSGMSSRCIGSKR

A0A5A7U0H7 Retrotransposon protein6.1e-2741.67Show/hide
Query:  MLAEKLPNSCL-EQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPM
        M+AEKLP + + E +TI+C V++LKK Y+ IAEM   +CSGFGWNEEF+C+  E+++FD+W+KSH  AKG+ +K FP YDDL++VF KDRATG  +ET  
Subjt:  MLAEKLPNSCL-EQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPM

Query:  EMASSAVEQMEEEIRLGSQDFMGVEQRTMENLRIGDIGEDDL----PDTPTSMRNTSGMSSRCIGSKR
         + S+      + I LG      +     + +    +  D++        +  RN S +S R  GS+R
Subjt:  EMASSAVEQMEEEIRLGSQDFMGVEQRTMENLRIGDIGEDDL----PDTPTSMRNTSGMSSRCIGSKR

A0A5D3CWL2 Retrotransposon protein4.0e-2643.53Show/hide
Query:  MLAEKLPNSCLEQNT-IECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPM
        M+AEKLP   +   T I+C+++TLK+ +  IAEM   ACSGFGWN+E KC+  EKE+FD WV+SH  AKG+ NKPFP YD+L +VF +DRATG  AET  
Subjt:  MLAEKLPNSCLEQNT-IECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPM

Query:  EMASSAVEQMEEEIRL--GSQDFMGVEQRTMENLRIGDIGEDDL----PDTPTSMRNTSGMSSRCIGSKR
        ++ S+      +   +  G++DF  V  + +      DI +DD+    P   +  R  S  S R  GS+R
Subjt:  EMASSAVEQMEEEIRL--GSQDFMGVEQRTMENLRIGDIGEDDL----PDTPTSMRNTSGMSSRCIGSKR

A0A5D3D9Q6 Retrotransposon protein3.0e-2641.56Show/hide
Query:  MLAEKLPNSCLEQNTI-ECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPM
        M+AEKLP   +   T+ +C+++TLK+ +  IAEM   ACSGFGWN+E KC+  EKE+FD WV+SH  AKG+ NKPFP YD+L +VF++DRATG  A+T  
Subjt:  MLAEKLPNSCLEQNTI-ECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPM

Query:  EMASSAVEQMEE-EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDTPTSMRNTS
        ++ S+  ++ +  ++R G++DF  V + +    + G   + D+     ++  T+
Subjt:  EMASSAVEQMEE-EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDTPTSMRNTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G02210.1 unknown protein1.7e-0527.4Show/hide
Query:  SCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDL
        S  + + ++ + ++L++Q+N I  +L +   GF W+ E + V  +  V+  ++K+H +A+    +P P Y DL
Subjt:  SCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDL

AT4G02210.2 unknown protein1.7e-0527.4Show/hide
Query:  SCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDL
        S  + + ++ + ++L++Q+N I  +L +   GF W+ E + V  +  V+  ++K+H +A+    +P P Y DL
Subjt:  SCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAGCTGAGAAATTACCAAATTCATGCCTAGAACAAAACACAATCGAATGCAAGGTCAGAACTCTAAAAAAACAATACAATACTATTGCAGAGATGCTTAGTAATGC
ATGCAGTGGCTTCGGCTGGAACGAAGAGTTCAAGTGTGTTGAGGAAGAGAAGGAGGTGTTCGATGCATGGGTTAAGAGCCATACAAACGCAAAGGGGATGAGGAATAAGC
CATTTCCGCAATATGATGACCTCGCATTTGTGTTCGAAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCCAATGGAAATGGCATCTAGCGCTGTAGAACAAATGGAG
GAGGAGATTCGTTTGGGATCACAAGACTTCATGGGAGTGGAACAACGAACAATGGAGAATCTAAGAATTGGTGACATAGGGGAAGATGACTTGCCAGACACTCCTACTAG
CATGCGTAATACATCTGGCATGTCTTCTAGATGTATTGGGAGCAAAAGAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAGCTGAGAAATTACCAAATTCATGCCTAGAACAAAACACAATCGAATGCAAGGTCAGAACTCTAAAAAAACAATACAATACTATTGCAGAGATGCTTAGTAATGC
ATGCAGTGGCTTCGGCTGGAACGAAGAGTTCAAGTGTGTTGAGGAAGAGAAGGAGGTGTTCGATGCATGGGTTAAGAGCCATACAAACGCAAAGGGGATGAGGAATAAGC
CATTTCCGCAATATGATGACCTCGCATTTGTGTTCGAAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCCAATGGAAATGGCATCTAGCGCTGTAGAACAAATGGAG
GAGGAGATTCGTTTGGGATCACAAGACTTCATGGGAGTGGAACAACGAACAATGGAGAATCTAAGAATTGGTGACATAGGGGAAGATGACTTGCCAGACACTCCTACTAG
CATGCGTAATACATCTGGCATGTCTTCTAGATGTATTGGGAGCAAAAGAAAATGA
Protein sequenceShow/hide protein sequence
MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMASSAVEQME
EEIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDTPTSMRNTSGMSSRCIGSKRK