; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020924 (gene) of Snake gourd v1 genome

Gene IDTan0020924
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPlant transposase
Genome locationLG05:72647009..72648094
RNA-Seq ExpressionTan0020924
SyntenyTan0020924
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031738359.1 uncharacterized protein LOC101217008 isoform X1 [Cucumis sativus]7.4e-6852.48Show/hide
Query:  DKEGDHVTDKLCIDQSQDYSTVGEKSLNAAENCDTTHM---------TQRLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGATKMKAIAVEEHR
        +KEGD++T+K  IDQSQ+    G K+ N  +N D+TH          +   D    +PL++++S +   +I+ SEQ ++     CR  T+M A A+EE  
Subjt:  DKEGDHVTDKLCIDQSQDYSTVGEKSLNAAENCDTTHM---------TQRLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGATKMKAIAVEEHR

Query:  KVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQ----MKEDGKRKYIFQKMGRLWRAGKSRIVSHIKNASNDDELVKL
        KVDI FNE+GQPIGE S+G+SSFLG LVREVVPVTL DWR LSTR KE+LW S+Q    MKED +RKYIFQKMGRLWRAGKSRIVS I++ S  +ELVK+
Subjt:  KVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQ----MKEDGKRKYIFQKMGRLWRAGKSRIVSHIKNASNDDELVKL

Query:  KPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV
        KP+NI+SMHDWM+F KEKKSA FK                              KK C  SSS++RVA+ AKAHRKKDGNPV
Subjt:  KPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV

XP_038904085.1 uncharacterized protein LOC120090469 isoform X1 [Benincasa hispida]4.4e-6853.74Show/hide
Query:  MCDKEGDHVTDKLCIDQSQD-YSTVGEKSLNAAENCDTTHMT------------------QRLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGA
        M +KEGD+ T+KL +DQSQD     G+KS N  +N + TH T                  Q+L S GQ    +++S +V  +I+ASE  ++   KKCRG 
Subjt:  MCDKEGDHVTDKLCIDQSQD-YSTVGEKSLNAAENCDTTHMT------------------QRLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGA

Query:  TKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQ----MKEDGKRKYIFQKMGRLWRAGKSRIVSHI
        TKM   A+EE  KVDITFNE+GQPIGE S+G+SSFLG LVRE VPVTL DWR LST  KEILWTSIQ    +KED +RKY+FQKMGRLWRAGKSRIVS I
Subjt:  TKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQ----MKEDGKRKYIFQKMGRLWRAGKSRIVSHI

Query:  KNASNDDELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV
        ++ S ++ELVK+KP+NIQSMHDWMDF KEKKSA FK                              KKSCS SS+ TRV L AK HRKK GN V
Subjt:  KNASNDDELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV

XP_038904087.1 uncharacterized protein LOC120090469 isoform X2 [Benincasa hispida]4.4e-6853.74Show/hide
Query:  MCDKEGDHVTDKLCIDQSQD-YSTVGEKSLNAAENCDTTHMT------------------QRLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGA
        M +KEGD+ T+KL +DQSQD     G+KS N  +N + TH T                  Q+L S GQ    +++S +V  +I+ASE  ++   KKCRG 
Subjt:  MCDKEGDHVTDKLCIDQSQD-YSTVGEKSLNAAENCDTTHMT------------------QRLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGA

Query:  TKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQ----MKEDGKRKYIFQKMGRLWRAGKSRIVSHI
        TKM   A+EE  KVDITFNE+GQPIGE S+G+SSFLG LVRE VPVTL DWR LST  KEILWTSIQ    +KED +RKY+FQKMGRLWRAGKSRIVS I
Subjt:  TKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQ----MKEDGKRKYIFQKMGRLWRAGKSRIVSHI

Query:  KNASNDDELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV
        ++ S ++ELVK+KP+NIQSMHDWMDF KEKKSA FK                              KKSCS SS+ TRV L AK HRKK GN V
Subjt:  KNASNDDELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV

XP_038904088.1 uncharacterized protein LOC120090469 isoform X3 [Benincasa hispida]4.4e-6853.74Show/hide
Query:  MCDKEGDHVTDKLCIDQSQD-YSTVGEKSLNAAENCDTTHMT------------------QRLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGA
        M +KEGD+ T+KL +DQSQD     G+KS N  +N + TH T                  Q+L S GQ    +++S +V  +I+ASE  ++   KKCRG 
Subjt:  MCDKEGDHVTDKLCIDQSQD-YSTVGEKSLNAAENCDTTHMT------------------QRLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGA

Query:  TKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQ----MKEDGKRKYIFQKMGRLWRAGKSRIVSHI
        TKM   A+EE  KVDITFNE+GQPIGE S+G+SSFLG LVRE VPVTL DWR LST  KEILWTSIQ    +KED +RKY+FQKMGRLWRAGKSRIVS I
Subjt:  TKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQ----MKEDGKRKYIFQKMGRLWRAGKSRIVSHI

Query:  KNASNDDELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV
        ++ S ++ELVK+KP+NIQSMHDWMDF KEKKSA FK                              KKSCS SS+ TRV L AK HRKK GN V
Subjt:  KNASNDDELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV

XP_038904090.1 uncharacterized protein LOC120090469 isoform X5 [Benincasa hispida]4.4e-6853.74Show/hide
Query:  MCDKEGDHVTDKLCIDQSQD-YSTVGEKSLNAAENCDTTHMT------------------QRLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGA
        M +KEGD+ T+KL +DQSQD     G+KS N  +N + TH T                  Q+L S GQ    +++S +V  +I+ASE  ++   KKCRG 
Subjt:  MCDKEGDHVTDKLCIDQSQD-YSTVGEKSLNAAENCDTTHMT------------------QRLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGA

Query:  TKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQ----MKEDGKRKYIFQKMGRLWRAGKSRIVSHI
        TKM   A+EE  KVDITFNE+GQPIGE S+G+SSFLG LVRE VPVTL DWR LST  KEILWTSIQ    +KED +RKY+FQKMGRLWRAGKSRIVS I
Subjt:  TKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQ----MKEDGKRKYIFQKMGRLWRAGKSRIVSHI

Query:  KNASNDDELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV
        ++ S ++ELVK+KP+NIQSMHDWMDF KEKKSA FK                              KKSCS SS+ TRV L AK HRKK GN V
Subjt:  KNASNDDELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV

TrEMBL top hitse value%identityAlignment
A0A0A0L5I2 Transposase_23 domain-containing protein6.4e-7358.73Show/hide
Query:  DKEGDHVTDKLCIDQSQDYSTVGEKSLNAAENCDTTHM---------TQRLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGATKMKAIAVEEHR
        +KEGD++T+K  IDQSQ+    G K+ N  +N D+TH          +   D    +PL++++S +   +I+ SEQ ++     CR  T+M A A+EE  
Subjt:  DKEGDHVTDKLCIDQSQDYSTVGEKSLNAAENCDTTHM---------TQRLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGATKMKAIAVEEHR

Query:  KVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQ----MKEDGKRKYIFQKMGRLWRAGKSRIVSHIKNASNDDELVKL
        KVDI FNE+GQPIGE S+G+SSFLG LVREVVPVTL DWR LSTR KE+LW S+Q    MKED +RKYIFQKMGRLWRAGKSRIVS I++ S  +ELVK+
Subjt:  KVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQ----MKEDGKRKYIFQKMGRLWRAGKSRIVSHIKNASNDDELVKL

Query:  KPTNIQSMHDWMDFGKEKKSARFKKKSCSGSSSMTRVALCAKAHRKKDGNPV
        KP+NI+SMHDWM+F KEKKSA FKKK C  SSS++RVA+ AKAHRKKDGNPV
Subjt:  KPTNIQSMHDWMDFGKEKKSARFKKKSCSGSSSMTRVALCAKAHRKKDGNPV

A0A1S4DZ32 uncharacterized protein LOC103493280 isoform X66.2e-6853.66Show/hide
Query:  EGDHVTDKLCIDQSQDYSTV-GEKSLNAAENCDTTHMTQ---------------RLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGATKMKAIA
        EGD++T+KL +DQSQD   V G +  N  +N D+TH T                +L S GQ    +++S +V   I+ SE  ++   KK RG TKMK IA
Subjt:  EGDHVTDKLCIDQSQDYSTV-GEKSLNAAENCDTTHMTQ---------------RLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGATKMKAIA

Query:  VEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQM----KEDGKRKYIFQKMGRLWRAGKSRIVSHIKNASNDD
        +EE  KVDITF+++GQPIGE S+G+SSFLG+LVRE+VPVTL DWR LSTR KEILWTSIQ+    KED +RK IF+KMGRLWRAGKSRIVS I++ S ++
Subjt:  VEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQM----KEDGKRKYIFQKMGRLWRAGKSRIVSHIKNASNDD

Query:  ELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV
        ELVK+KP+NIQSMHDWMDF KEKKSA FK                              +KSC  SSS+TR+AL AKAHRKKD NPV
Subjt:  ELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV

A0A1S4DZ36 uncharacterized protein LOC103493280 isoform X16.2e-6853.66Show/hide
Query:  EGDHVTDKLCIDQSQDYSTV-GEKSLNAAENCDTTHMTQ---------------RLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGATKMKAIA
        EGD++T+KL +DQSQD   V G +  N  +N D+TH T                +L S GQ    +++S +V   I+ SE  ++   KK RG TKMK IA
Subjt:  EGDHVTDKLCIDQSQDYSTV-GEKSLNAAENCDTTHMTQ---------------RLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGATKMKAIA

Query:  VEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQM----KEDGKRKYIFQKMGRLWRAGKSRIVSHIKNASNDD
        +EE  KVDITF+++GQPIGE S+G+SSFLG+LVRE+VPVTL DWR LSTR KEILWTSIQ+    KED +RK IF+KMGRLWRAGKSRIVS I++ S ++
Subjt:  VEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQM----KEDGKRKYIFQKMGRLWRAGKSRIVSHIKNASNDD

Query:  ELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV
        ELVK+KP+NIQSMHDWMDF KEKKSA FK                              +KSC  SSS+TR+AL AKAHRKKD NPV
Subjt:  ELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV

A0A1S4DZ41 uncharacterized protein LOC103493280 isoform X56.2e-6853.66Show/hide
Query:  EGDHVTDKLCIDQSQDYSTV-GEKSLNAAENCDTTHMTQ---------------RLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGATKMKAIA
        EGD++T+KL +DQSQD   V G +  N  +N D+TH T                +L S GQ    +++S +V   I+ SE  ++   KK RG TKMK IA
Subjt:  EGDHVTDKLCIDQSQDYSTV-GEKSLNAAENCDTTHMTQ---------------RLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGATKMKAIA

Query:  VEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQM----KEDGKRKYIFQKMGRLWRAGKSRIVSHIKNASNDD
        +EE  KVDITF+++GQPIGE S+G+SSFLG+LVRE+VPVTL DWR LSTR KEILWTSIQ+    KED +RK IF+KMGRLWRAGKSRIVS I++ S ++
Subjt:  VEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQM----KEDGKRKYIFQKMGRLWRAGKSRIVSHIKNASNDD

Query:  ELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV
        ELVK+KP+NIQSMHDWMDF KEKKSA FK                              +KSC  SSS+TR+AL AKAHRKKD NPV
Subjt:  ELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV

A0A5D3D4T6 Plant transposase6.2e-6853.66Show/hide
Query:  EGDHVTDKLCIDQSQDYSTV-GEKSLNAAENCDTTHMTQ---------------RLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGATKMKAIA
        EGD++T+KL +DQSQD   V G +  N  +N D+TH T                +L S GQ    +++S +V   I+ SE  ++   KK RG TKMK IA
Subjt:  EGDHVTDKLCIDQSQDYSTV-GEKSLNAAENCDTTHMTQ---------------RLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGATKMKAIA

Query:  VEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQM----KEDGKRKYIFQKMGRLWRAGKSRIVSHIKNASNDD
        +EE  KVDITF+++GQPIGE S+G+SSFLG+LVRE+VPVTL DWR LSTR KEILWTSIQ+    KED +RK IF+KMGRLWRAGKSRIVS I++ S ++
Subjt:  VEEHRKVDITFNEYGQPIGEDSVGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQM----KEDGKRKYIFQKMGRLWRAGKSRIVSHIKNASNDD

Query:  ELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV
        ELVK+KP+NIQSMHDWMDF KEKKSA FK                              +KSC  SSS+TR+AL AKAHRKKD NPV
Subjt:  ELVKLKPTNIQSMHDWMDFGKEKKSARFK------------------------------KKSCSGSSSMTRVALCAKAHRKKDGNPV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGATAAGGAAGGAGATCATGTTACTGATAAGTTGTGTATTGACCAATCTCAAGATTACTCAACAGTTGGAGAAAAGTCATTAAATGCTGCAGAGAATTGTGATAC
TACACACATGACTCAAAGGTTAGATTCTATCGGTCAAGCTCCACTAAACATAGAAAAATCTTCGGATGTAGAAGTCAATATCGATGCATCTGAACAAATTTTAAAACATC
CCTCCAAGAAATGTAGAGGAGCTACAAAAATGAAAGCTATTGCAGTTGAGGAACATAGAAAAGTAGATATAACATTCAATGAGTATGGACAACCGATTGGAGAGGATTCA
GTTGGGATGTCTTCATTTTTGGGTTCACTCGTGAGAGAGGTAGTGCCTGTGACTTTACAAGATTGGAGGAATTTGTCTACCCGATTGAAGGAAATTTTATGGACTTCAAT
TCAAATGAAGGAAGATGGGAAAAGAAAGTATATTTTTCAAAAGATGGGTAGATTATGGAGGGCAGGTAAATCTCGAATTGTGTCACATATTAAAAATGCCTCCAATGATG
ATGAGCTTGTTAAATTGAAGCCAACCAATATACAATCTATGCACGATTGGATGGACTTTGGGAAAGAAAAGAAGAGTGCAAGGTTCAAGAAAAAGAGTTGTTCAGGTTCA
TCTTCGATGACAAGGGTTGCGTTATGCGCAAAAGCACATAGGAAAAAGGATGGGAATCCTGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGTGATAAGGAAGGAGATCATGTTACTGATAAGTTGTGTATTGACCAATCTCAAGATTACTCAACAGTTGGAGAAAAGTCATTAAATGCTGCAGAGAATTGTGATAC
TACACACATGACTCAAAGGTTAGATTCTATCGGTCAAGCTCCACTAAACATAGAAAAATCTTCGGATGTAGAAGTCAATATCGATGCATCTGAACAAATTTTAAAACATC
CCTCCAAGAAATGTAGAGGAGCTACAAAAATGAAAGCTATTGCAGTTGAGGAACATAGAAAAGTAGATATAACATTCAATGAGTATGGACAACCGATTGGAGAGGATTCA
GTTGGGATGTCTTCATTTTTGGGTTCACTCGTGAGAGAGGTAGTGCCTGTGACTTTACAAGATTGGAGGAATTTGTCTACCCGATTGAAGGAAATTTTATGGACTTCAAT
TCAAATGAAGGAAGATGGGAAAAGAAAGTATATTTTTCAAAAGATGGGTAGATTATGGAGGGCAGGTAAATCTCGAATTGTGTCACATATTAAAAATGCCTCCAATGATG
ATGAGCTTGTTAAATTGAAGCCAACCAATATACAATCTATGCACGATTGGATGGACTTTGGGAAAGAAAAGAAGAGTGCAAGGTTCAAGAAAAAGAGTTGTTCAGGTTCA
TCTTCGATGACAAGGGTTGCGTTATGCGCAAAAGCACATAGGAAAAAGGATGGGAATCCTGTTTAA
Protein sequenceShow/hide protein sequence
MCDKEGDHVTDKLCIDQSQDYSTVGEKSLNAAENCDTTHMTQRLDSIGQAPLNIEKSSDVEVNIDASEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDS
VGMSSFLGSLVREVVPVTLQDWRNLSTRLKEILWTSIQMKEDGKRKYIFQKMGRLWRAGKSRIVSHIKNASNDDELVKLKPTNIQSMHDWMDFGKEKKSARFKKKSCSGS
SSMTRVALCAKAHRKKDGNPV