; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020939 (gene) of Snake gourd v1 genome

Gene IDTan0020939
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCACTA en-spm transposon protein
Genome locationLG05:68946770..68947881
RNA-Seq ExpressionTan0020939
SyntenyTan0020939
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB1223933.1 hypothetical protein CJ030_MR2G000632 [Morella rubra]1.5e-1231.4Show/hide
Query:  ATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLLD------------KSTK
        A++S S + RG +RG++L + + +   K+SV     +   V S  ++ +++I +L R  +P++     D+P EV   I+DR+LD            +S +
Subjt:  ATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLLD------------KSTK

Query:  NKASGSKLPFNHCAGTKSFLTHREEKAKMVALAKEQANSSTPMNDEEVVATVLGTRLPYVKGMGYGLKPPPS
        NK + SKL  NH AG++SF   R    +M A  ++         + EV + +LG +  YV+G+G  +KPPPS
Subjt:  NKASGSKLPFNHCAGTKSFLTHREEKAKMVALAKEQANSSTPMNDEEVVATVLGTRLPYVKGMGYGLKPPPS

XP_020252357.1 uncharacterized protein LOC109829729 [Asparagus officinalis]1.6e-1127.69Show/hide
Query:  EATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLLDKSTKNKASGSKLPFN
        E     + +VRG +RG+   ++      K+ VT + + D+  G    +F +EI    R + PLN      I      N+I R+  +S KN  + SKL ++
Subjt:  EATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLLDKSTKNKASGSKLPFN

Query:  HCAGTKSFLTHREE------------------------------KAKMVALAKEQANSSTP-----MNDEEVVATVLGTRLPYVKGMGYGLKPPP
        H AG+KSF++H+E+                              K K   + K +A+++ P     M DE++ A VL  +  Y+KG G+  +PPP
Subjt:  HCAGTKSFLTHREE------------------------------KAKMVALAKEQANSSTP-----MNDEEVVATVLGTRLPYVKGMGYGLKPPP

XP_038895319.1 uncharacterized protein LOC120083572 isoform X1 [Benincasa hispida]3.2e-1527.27Show/hide
Query:  HDEDDHVEMNEATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLL------
        HD+  +       +    +VRG SRG+ L +   AT  ++ VTW+  Q K +G + SLFN EI +L R F+PL Y  + DIPNE++  + ++LL      
Subjt:  HDEDDHVEMNEATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLL------

Query:  ----------------------------------------------------------------DKSTKNKASGSKLPFNHCAGTKSFLTHREEKAK---
                                                                        +KS +NK S SK+ FNHC G+KSFL+ R +K K   
Subjt:  ----------------------------------------------------------------DKSTKNKASGSKLPFNHCAGTKSFLTHREEKAK---

Query:  -------------------MVALAKEQANSSTPM--------NDEEVVATVLGTRLPYVKGMGYGLKPPPSQKKS
                            V  A ++A  +  M         DEE++  VLG R  Y+ G GYG KPP  ++ S
Subjt:  -------------------MVALAKEQANSSTPM--------NDEEVVATVLGTRLPYVKGMGYGLKPPPSQKKS

XP_038895320.1 uncharacterized protein LOC120083572 isoform X2 [Benincasa hispida]2.3e-1027.92Show/hide
Query:  HDEDDHVEMNEATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLL------
        HD+  +       +    +VRG SRG+ L +   AT  ++ VTW+  Q K +G + SLFN EI +L R F+PL Y  + DIPNE++  + ++LL      
Subjt:  HDEDDHVEMNEATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLL------

Query:  ----------------------------------------------------------------DKSTKNKASGSKLPFNHCAGTKSFLTHREEKAK
                                                                        +KS +NK S SK+ FNHC G+KSFL+ R +K K
Subjt:  ----------------------------------------------------------------DKSTKNKASGSKLPFNHCAGTKSFLTHREEKAK

XP_038895321.1 uncharacterized protein LOC120083572 isoform X3 [Benincasa hispida]2.3e-1027.92Show/hide
Query:  HDEDDHVEMNEATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLL------
        HD+  +       +    +VRG SRG+ L +   AT  ++ VTW+  Q K +G + SLFN EI +L R F+PL Y  + DIPNE++  + ++LL      
Subjt:  HDEDDHVEMNEATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLL------

Query:  ----------------------------------------------------------------DKSTKNKASGSKLPFNHCAGTKSFLTHREEKAK
                                                                        +KS +NK S SK+ FNHC G+KSFL+ R +K K
Subjt:  ----------------------------------------------------------------DKSTKNKASGSKLPFNHCAGTKSFLTHREEKAK

TrEMBL top hitse value%identityAlignment
A0A6A1WGC8 Uncharacterized protein7.1e-1331.4Show/hide
Query:  ATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLLD------------KSTK
        A++S S + RG +RG++L + + +   K+SV     +   V S  ++ +++I +L R  +P++     D+P EV   I+DR+LD            +S +
Subjt:  ATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLLD------------KSTK

Query:  NKASGSKLPFNHCAGTKSFLTHREEKAKMVALAKEQANSSTPMNDEEVVATVLGTRLPYVKGMGYGLKPPPS
        NK + SKL  NH AG++SF   R    +M A  ++         + EV + +LG +  YV+G+G  +KPPPS
Subjt:  NKASGSKLPFNHCAGTKSFLTHREEKAKMVALAKEQANSSTPMNDEEVVATVLGTRLPYVKGMGYGLKPPPS

A0A6A1WGQ9 Receptor-like protein 123.1e-0826.7Show/hide
Query:  ATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLLD------------KSTK
        A++S S + RG +RG++L + + +   K+SV     +   V S  ++ +++I +L R  +P++     D+P EV   I+DR+LD            +S +
Subjt:  ATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLLD------------KSTK

Query:  NKASGSKLPFNHCAGTKSFLT----------------------------------HREEKAKMVALAKEQANSSTPMNDEEVVATVLGTRLPYVKGMGYG
        NK + SKL  NH AG++SF                                     RE   +M A  ++         + EV + +LG +  YV+G+G  
Subjt:  NKASGSKLPFNHCAGTKSFLT----------------------------------HREEKAKMVALAKEQANSSTPMNDEEVVATVLGTRLPYVKGMGYG

Query:  LKPPPS
        +KPPPS
Subjt:  LKPPPS

A0A6J1CQT5 uncharacterized protein LOC1110134769.6e-1041.58Show/hide
Query:  DIPNEVFTNIIDRL-----LDKSTKNKASGSKLPFNHCAGTKSFLTHREEKAKMVALAKEQANSSTPMNDEEVVATVLGTRLPYVKGMGYGLKPPPSQKK
        DI  E +  + D+L      +KS KNK + SKL FNH  G K F  HRE+  +M+ L    A+      +EE++ TVLG R  YV GMGYG KP  ++  
Subjt:  DIPNEVFTNIIDRL-----LDKSTKNKASGSKLPFNHCAGTKSFLTHREEKAKMVALAKEQANSSTPMNDEEVVATVLGTRLPYVKGMGYGLKPPPSQKK

Query:  S
        S
Subjt:  S

A0A6J1D6S9 uncharacterized protein LOC1110174619.6e-1041.58Show/hide
Query:  DIPNEVFTNIIDRL-----LDKSTKNKASGSKLPFNHCAGTKSFLTHREEKAKMVALAKEQANSSTPMNDEEVVATVLGTRLPYVKGMGYGLKPPPSQKK
        DI  E +  + D+L      +KS KNK + SKL FNH  G K F  HRE+  +M+ L    A+      +EE++ TVLG R  YV GMGYG KP  ++  
Subjt:  DIPNEVFTNIIDRL-----LDKSTKNKASGSKLPFNHCAGTKSFLTHREEKAKMVALAKEQANSSTPMNDEEVVATVLGTRLPYVKGMGYGLKPPPSQKK

Query:  S
        S
Subjt:  S

A0A6J1DLF1 uncharacterized protein LOC1110211382.1e-0942.55Show/hide
Query:  DIPNEVFTNIIDRL-----LDKSTKNKASGSKLPFNHCAGTKSFLTHREEKAKMVALAKEQANSSTPMNDEEVVATVLGTRLPYVKGMGYGLKP
        DI  E +  + DRL      +KS KNK + SKL FNH    K F  HRE+  +M+ L    A+      +EE++ TVLG R  Y+ GMGYG KP
Subjt:  DIPNEVFTNIIDRL-----LDKSTKNKASGSKLPFNHCAGTKSFLTHREEKAKMVALAKEQANSSTPMNDEEVVATVLGTRLPYVKGMGYGLKP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGATGAGGACGATCATGTAGAAATGAATGAAGCAACATCCTCTAATAGTAACCAGGTTCGTGGGTTTTCACGTGGACTTGCCCTGACTAGGATTATTAAGGCCAC
AGGGGATAAAGTAAGTGTTACATGGAGTCTACAACAAGACAAATCAGTTGGAAGTGTTGTTAGTCTCTTTAACAGTGAAATTAGAATTTTGACGAGGACGTTTGTCCCCT
TGAACTATGCAACTAAGTATGACATTCCAAATGAAGTTTTTACCAACATAATAGATCGATTGCTGGACAAGTCAACGAAGAACAAGGCTAGTGGAAGCAAACTCCCTTTC
AATCATTGTGCTGGAACAAAATCATTTCTTACTCATAGAGAAGAAAAGGCAAAAATGGTGGCACTAGCAAAAGAGCAAGCCAACTCTAGTACACCAATGAATGACGAAGA
AGTTGTGGCTACCGTTCTTGGAACACGATTGCCCTATGTTAAAGGAATGGGATATGGACTAAAACCACCACCATCTCAGAAGAAATCATACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATGATGAGGACGATCATGTAGAAATGAATGAAGCAACATCCTCTAATAGTAACCAGGTTCGTGGGTTTTCACGTGGACTTGCCCTGACTAGGATTATTAAGGCCAC
AGGGGATAAAGTAAGTGTTACATGGAGTCTACAACAAGACAAATCAGTTGGAAGTGTTGTTAGTCTCTTTAACAGTGAAATTAGAATTTTGACGAGGACGTTTGTCCCCT
TGAACTATGCAACTAAGTATGACATTCCAAATGAAGTTTTTACCAACATAATAGATCGATTGCTGGACAAGTCAACGAAGAACAAGGCTAGTGGAAGCAAACTCCCTTTC
AATCATTGTGCTGGAACAAAATCATTTCTTACTCATAGAGAAGAAAAGGCAAAAATGGTGGCACTAGCAAAAGAGCAAGCCAACTCTAGTACACCAATGAATGACGAAGA
AGTTGTGGCTACCGTTCTTGGAACACGATTGCCCTATGTTAAAGGAATGGGATATGGACTAAAACCACCACCATCTCAGAAGAAATCATACTGA
Protein sequenceShow/hide protein sequence
MHDEDDHVEMNEATSSNSNQVRGFSRGLALTRIIKATGDKVSVTWSLQQDKSVGSVVSLFNSEIRILTRTFVPLNYATKYDIPNEVFTNIIDRLLDKSTKNKASGSKLPF
NHCAGTKSFLTHREEKAKMVALAKEQANSSTPMNDEEVVATVLGTRLPYVKGMGYGLKPPPSQKKSY