; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010988 (gene) of Snake gourd v1 genome

Gene IDTan0010988
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotransposable element Tf2
Genome locationLG04:15170957..15176856
RNA-Seq ExpressionTan0010988
SyntenyTan0010988
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025901.1 hypothetical protein E6C27_scaffold34G001890 [Cucumis melo var. makuwa]6.6e-2035.78Show/hide
Query:  TKLENERGK--ASSSKGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTPPPRPSTSLMRPTAVTPMRPSVSPTTPSSSQQGSS
        +K+ + RG+  ++SSKG  P+SSSN  P+PMSADQYAM+LGFT                                                         
Subjt:  TKLENERGK--ASSSKGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTPPPRPSTSLMRPTAVTPMRPSVSPTTPSSSQQGSS

Query:  ILQPLLKPSILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD-
                            +KY+F HA IK +K+WF +NGYLQD+ + KN +FLN KSKLLA LAQ TT+AD Q +L    + ++SS   +S ++ED+ 
Subjt:  ILQPLLKPSILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD-

Query:  ----EYDINDPFLDSQPM
            EYD+++PFLDSQPM
Subjt:  ----EYDINDPFLDSQPM

KAA0034823.1 hypothetical protein E6C27_scaffold213G00570 [Cucumis melo var. makuwa]3.5e-2155.05Show/hide
Query:  ILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD-----EYDIN
        +L K L+IK W+KYNF HA IK +K+WFA+N YLQD+ ++KN EFLN KSKLL ALAQ TT+AD Q +L+   + +SSS   +S ++ED+     EYD++
Subjt:  ILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD-----EYDIN

Query:  DPFLDSQPM
        D  LDSQPM
Subjt:  DPFLDSQPM

KAA0049709.1 hypothetical protein E6C27_scaffold76G00530 [Cucumis melo var. makuwa]7.3e-1941.31Show/hide
Query:  ENERGK--ASSSKGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTPPPRPSTSLMRPT-AVTPMRPSVSPTTPSSSQQGSSIL
        EN RG+  ++SSKG  P+SSSN  P+PMSADQYAMDLGFT V R R+RS+SI+I    ES T PP+PST+L+RP+  V  MR   SP +PSS+++ S+  
Subjt:  ENERGK--ASSSKGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTPPPRPSTSLMRPT-AVTPMRPSVSPTTPSSSQQGSSIL

Query:  QPLLKPSILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD---
                                               Y Q ++  K       +SKLLAALAQ TT+AD Q +L+   + +SSS   +S ++ED+   
Subjt:  QPLLKPSILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD---

Query:  --EYDINDPFLDS
          EYD++DP+LDS
Subjt:  --EYDINDPFLDS

KAA0059031.1 polyprotein [Cucumis melo var. makuwa]7.3e-1950.83Show/hide
Query:  PPRPSTSLMRPTAV-TPMRPSVSPTTPSSSQQGSSILQPLLKP------------SILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAE
        PPRPS +L+R + +   MRP  S ++   S   S+     + P             IL K L+IKWW KYNF HA IK +K+WFADNGYLQD+ ++KN E
Subjt:  PPRPSTSLMRPTAV-TPMRPSVSPTTPSSSQQGSSILQPLLKP------------SILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAE

Query:  FLNSKSKLLAALAQTTTEAD
        FLN KSKLLAALAQ TT+AD
Subjt:  FLNSKSKLLAALAQTTTEAD

TYJ98361.1 hypothetical protein E5676_scaffold232G00950 [Cucumis melo var. makuwa]2.1e-2643.87Show/hide
Query:  RGKASSSKGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTPPPRPSTSLMRPTA-VTPMRPSVSPTTPSSSQQGSSILQPLLK
        R  ++SSKG  P+SSSN  P+PMS DQYAMDLG+T V + R++S+ I IR  MES T PPRPS +L+ P   V  MR S SP++   S    +     + 
Subjt:  RGKASSSKGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTPPPRPSTSLMRPTA-VTPMRPSVSPTTPSSSQQGSSILQPLLK

Query:  P--SILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD-----E
        P    +P++    ++ K    +  I E++  +  NG LQD+ ++KNA+FLN K K LAAL Q T +AD Q +L+   + +SSS P  S ++ED+     E
Subjt:  P--SILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD-----E

Query:  YDINDPFLDSQP
        YD++DPFLDSQP
Subjt:  YDINDPFLDSQP

TrEMBL top hitse value%identityAlignment
A0A5A7SNT5 Uncharacterized protein3.2e-2035.78Show/hide
Query:  TKLENERGK--ASSSKGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTPPPRPSTSLMRPTAVTPMRPSVSPTTPSSSQQGSS
        +K+ + RG+  ++SSKG  P+SSSN  P+PMSADQYAM+LGFT                                                         
Subjt:  TKLENERGK--ASSSKGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTPPPRPSTSLMRPTAVTPMRPSVSPTTPSSSQQGSS

Query:  ILQPLLKPSILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD-
                            +KY+F HA IK +K+WF +NGYLQD+ + KN +FLN KSKLLA LAQ TT+AD Q +L    + ++SS   +S ++ED+ 
Subjt:  ILQPLLKPSILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD-

Query:  ----EYDINDPFLDSQPM
            EYD+++PFLDSQPM
Subjt:  ----EYDINDPFLDSQPM

A0A5A7SWC0 Uncharacterized protein1.7e-2155.05Show/hide
Query:  ILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD-----EYDIN
        +L K L+IK W+KYNF HA IK +K+WFA+N YLQD+ ++KN EFLN KSKLL ALAQ TT+AD Q +L+   + +SSS   +S ++ED+     EYD++
Subjt:  ILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD-----EYDIN

Query:  DPFLDSQPM
        D  LDSQPM
Subjt:  DPFLDSQPM

A0A5A7U8A0 Uncharacterized protein3.5e-1941.31Show/hide
Query:  ENERGK--ASSSKGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTPPPRPSTSLMRPT-AVTPMRPSVSPTTPSSSQQGSSIL
        EN RG+  ++SSKG  P+SSSN  P+PMSADQYAMDLGFT V R R+RS+SI+I    ES T PP+PST+L+RP+  V  MR   SP +PSS+++ S+  
Subjt:  ENERGK--ASSSKGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTPPPRPSTSLMRPT-AVTPMRPSVSPTTPSSSQQGSSIL

Query:  QPLLKPSILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD---
                                               Y Q ++  K       +SKLLAALAQ TT+AD Q +L+   + +SSS   +S ++ED+   
Subjt:  QPLLKPSILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD---

Query:  --EYDINDPFLDS
          EYD++DP+LDS
Subjt:  --EYDINDPFLDS

A0A5A7UZX0 Polyprotein3.5e-1950.83Show/hide
Query:  PPRPSTSLMRPTAV-TPMRPSVSPTTPSSSQQGSSILQPLLKP------------SILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAE
        PPRPS +L+R + +   MRP  S ++   S   S+     + P             IL K L+IKWW KYNF HA IK +K+WFADNGYLQD+ ++KN E
Subjt:  PPRPSTSLMRPTAV-TPMRPSVSPTTPSSSQQGSSILQPLLKP------------SILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAE

Query:  FLNSKSKLLAALAQTTTEAD
        FLN KSKLLAALAQ TT+AD
Subjt:  FLNSKSKLLAALAQTTTEAD

A0A5D3BI61 Uncharacterized protein1.0e-2643.87Show/hide
Query:  RGKASSSKGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTPPPRPSTSLMRPTA-VTPMRPSVSPTTPSSSQQGSSILQPLLK
        R  ++SSKG  P+SSSN  P+PMS DQYAMDLG+T V + R++S+ I IR  MES T PPRPS +L+ P   V  MR S SP++   S    +     + 
Subjt:  RGKASSSKGKSPTSSSN-IPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTPPPRPSTSLMRPTA-VTPMRPSVSPTTPSSSQQGSSILQPLLK

Query:  P--SILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD-----E
        P    +P++    ++ K    +  I E++  +  NG LQD+ ++KNA+FLN K K LAAL Q T +AD Q +L+   + +SSS P  S ++ED+     E
Subjt:  P--SILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSIASSSSPQNSEVEEDD-----E

Query:  YDINDPFLDSQP
        YD++DPFLDSQP
Subjt:  YDINDPFLDSQP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGTTGTTGAAATACAATTCTCAGAAGAAACAGAGGTTACTAAAGTCAAGGAATTCATGTCCTCAAGACCAAGTACCTCTCGAACTTCTGACCTCGAAAGGAATGCC
TTTTTCCAAAAATATGATGAAAACCAAAGATCTGAAATCAGGTGATCGCCACTATATCTTAAATCATTCTAGTACTGAAAAAAGAGTACAAACACTCAAGGTAGAAGGTC
AATTACCTCAAATAGTTGAAATAGAAGAAAAAATAGACGATGCATTTACGAGTCCAATAACCCGCCATGAAATCCATAATTTACAACAAACAACGGTCCAAAAATCCATA
AATATTATAGAGAGCAAACAAAAACAAATTCAATTCTTAAAGGAAGAAATCTCATACAAGAAAATAGATGAAGAATTAAAAAGAAAATCAATACAAGACAAAATATCTAA
TTTCAAAACAAAATTAGAGAATGAGCGAGGCAAAGCCTCATCTTCAAAGGGCAAAAGCCCAACCTCTTCTTCAAACATTCCAGCTCCCATGAGCGCCGATCAATACGCAA
TGGATTTGGGTTTCACTCAGGTGAATCGCCCAAGAACCAGAAGCGCATCGATTCAAATAAGAGATTCAATGGAGTCATTAACTCCACCACCGAGGCCATCAACCTCCCTC
ATGCGGCCTACTGCCGTTACACCAATGAGACCTTCCGTCTCACCTACTACTCCATCTTCGTCTCAACAGGGATCTTCAATCCTACAACCTTTGTTGAAGCCGTCAATCCT
CCCAAAAGTTTTACAAATCAAATGGTGGGACAAGTACAACTTTTCTCACGCAGGAATCAAAGAAGTGAAGCAATGGTTTGCCGATAATGGCTACCTCCAAGACCTGTCAA
AGAAGAAAAATGCAGAGTTCCTCAATTCCAAATCAAAGTTGCTAGCTGCCTTAGCACAAACAACAACAGAGGCAGATTTACAAACAATTTTGAATCAAGTTACTTCAATA
GCATCTTCGTCTTCTCCACAAAACTCTGAAGTGGAAGAAGATGATGAATATGATATCAATGATCCGTTTTTAGATTCCCAACCAATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGTTGTTGAAATACAATTCTCAGAAGAAACAGAGGTTACTAAAGTCAAGGAATTCATGTCCTCAAGACCAAGTACCTCTCGAACTTCTGACCTCGAAAGGAATGCC
TTTTTCCAAAAATATGATGAAAACCAAAGATCTGAAATCAGGTGATCGCCACTATATCTTAAATCATTCTAGTACTGAAAAAAGAGTACAAACACTCAAGGTAGAAGGTC
AATTACCTCAAATAGTTGAAATAGAAGAAAAAATAGACGATGCATTTACGAGTCCAATAACCCGCCATGAAATCCATAATTTACAACAAACAACGGTCCAAAAATCCATA
AATATTATAGAGAGCAAACAAAAACAAATTCAATTCTTAAAGGAAGAAATCTCATACAAGAAAATAGATGAAGAATTAAAAAGAAAATCAATACAAGACAAAATATCTAA
TTTCAAAACAAAATTAGAGAATGAGCGAGGCAAAGCCTCATCTTCAAAGGGCAAAAGCCCAACCTCTTCTTCAAACATTCCAGCTCCCATGAGCGCCGATCAATACGCAA
TGGATTTGGGTTTCACTCAGGTGAATCGCCCAAGAACCAGAAGCGCATCGATTCAAATAAGAGATTCAATGGAGTCATTAACTCCACCACCGAGGCCATCAACCTCCCTC
ATGCGGCCTACTGCCGTTACACCAATGAGACCTTCCGTCTCACCTACTACTCCATCTTCGTCTCAACAGGGATCTTCAATCCTACAACCTTTGTTGAAGCCGTCAATCCT
CCCAAAAGTTTTACAAATCAAATGGTGGGACAAGTACAACTTTTCTCACGCAGGAATCAAAGAAGTGAAGCAATGGTTTGCCGATAATGGCTACCTCCAAGACCTGTCAA
AGAAGAAAAATGCAGAGTTCCTCAATTCCAAATCAAAGTTGCTAGCTGCCTTAGCACAAACAACAACAGAGGCAGATTTACAAACAATTTTGAATCAAGTTACTTCAATA
GCATCTTCGTCTTCTCCACAAAACTCTGAAGTGGAAGAAGATGATGAATATGATATCAATGATCCGTTTTTAGATTCCCAACCAATGTAA
Protein sequenceShow/hide protein sequence
MRLLKYNSQKKQRLLKSRNSCPQDQVPLELLTSKGMPFSKNMMKTKDLKSGDRHYILNHSSTEKRVQTLKVEGQLPQIVEIEEKIDDAFTSPITRHEIHNLQQTTVQKSI
NIIESKQKQIQFLKEEISYKKIDEELKRKSIQDKISNFKTKLENERGKASSSKGKSPTSSSNIPAPMSADQYAMDLGFTQVNRPRTRSASIQIRDSMESLTPPPRPSTSL
MRPTAVTPMRPSVSPTTPSSSQQGSSILQPLLKPSILPKVLQIKWWDKYNFSHAGIKEVKQWFADNGYLQDLSKKKNAEFLNSKSKLLAALAQTTTEADLQTILNQVTSI
ASSSSPQNSEVEEDDEYDINDPFLDSQPM