CuGenDBv2

Tan0013467 (gene) of Snake gourd v1 genome

Gene ID: Tan0013467
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Integrase catalytic domain-containing protein
Genome location: LG06:8406637..8407372
RNA-Seq Expression: Tan0013467
Synteny: Tan0013467
Gene Ontology terms: NA
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal
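
The genome location in this record follows a SEQID:START..END pattern. As a minimal sketch, the snippet below splits such a string and computes the length of the span. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool, and the length calculation assumes 1-based coordinates that include both endpoints, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into sequence ID, start and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span in bases.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end - start + 1


seqid, start, end = parse_location("LG06:8406637..8407372")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the genomic span can be compared with the length of the CDS listed under Sequences; a span longer than the CDS would be consistent with intronic sequence, though the record itself does not describe the gene structure.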


Homology
GenBank top hits (e value, % identity, alignment)
GFY98609.1 haloacid dehalogenase-like hydrolase (HAD) superfamily protein [Actinidia rufa]; e value 1.3e-31; % identity 40.93
Query:  HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGE--NLLSAWQCNNDDIIISWIINSISKEITTSLVYTGSCY-------
        + L  S  P +VLV+Q  L   NY+SW++AM +AL  KNKL F+   I KPEG   NLL++W   N++++ISWI+NS+SKEI+ S++++ S         
Subjt:  HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGE--NLLSAWQCNNDDIIISWIINSISKEITTSLVYTGSCY-------

Query:  ---------------------------VEAYYAKITTIWKSLMEYRPP---DECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLINR
                                   V  Y+ K+ TIW+ L  YRP      CT GG+K    +   E+IM FLM L+ SF+ IR  +LLM+ LP IN+
Subjt:  ---------------------------VEAYYAKITTIWKSLMEYRPP---DECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLINR

Query:  VFSLVIQEEHQRSIG
        VFSL+ QEEHQR IG
Subjt:  VFSLVIQEEHQRSIG

KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]; e value 3.5e-45; % identity 46.26
Query:  EAQLN-HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGENLLSAWQCNNDDIIISWIINSISKEITTSLVYTGS-----
        +AQLN + +  S+ PT  +VTQPL  A NY+SWS+AM +A+ G+NK  F+T  I+KP    LL AW CNN DI+ SWI+NS+SKEI  S++Y GS     
Subjt:  EAQLN-HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGENLLSAWQCNNDDIIISWIINSISKEITTSLVYTGS-----

Query:  -----------------------------CYVEAYYAKITTIWKSLMEYRPPDECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLIN
                                       +E YY K+ TIW++L EYR  ++CT GGLK F+D+L SE+IM FLMGLN S++++RA ILLM  LP IN
Subjt:  -----------------------------CYVEAYYAKITTIWKSLMEYRPPDECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLIN

Query:  RVFSLVIQEEHQRSIGKTAMSVETFSL
         VFSL+IQEE QRS G     ++  +L
Subjt:  RVFSLVIQEEHQRSIGKTAMSVETFSL

KAA8536734.1 hypothetical protein F0562_029212 [Nyssa sinensis]; e value 4.9e-31; % identity 41.51
Query:  HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEG--ENLLSAWQCNNDDIIISWIINSISKEITTSLVYTG----------
        + L  S +P  VLV+Q  L   NY++WS+AM +AL  KNKL FV  FI +P+G   NLL +W   N++I+ISWI+NSISKEI+ S+++            
Subjt:  HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEG--ENLLSAWQCNNDDIIISWIINSISKEITTSLVYTG----------

Query:  ------------------------SCYVEAYYAKITTIWKSLMEYRPP---DECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLINR
                                   V  Y+ K+ TIW+ L  YRP     +C  GG+K   DY  +E+IM FLMGL+ SFS +   +LLM+S+P INR
Subjt:  ------------------------SCYVEAYYAKITTIWKSLMEYRPP---DECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLINR

Query:  VFSLVIQEEHQR
        VFSL++QEE QR
Subjt:  VFSLVIQEEHQR

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia]; e value 8.3e-47; % identity 49.31
Query:  VEAQLN-HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGENLLSAWQCNNDDIIISWIINSISKEITTSLVYTGSC---
        +E+QLN +L+  S APT +LVTQ LL ASNY+SW ++M +AL GKNK+ F+   IKKP G NLL+AW+CNN DII SWIINS+SKEI  S++YTGS    
Subjt:  VEAQLN-HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGENLLSAWQCNNDDIIISWIINSISKEITTSLVYTGSC---

Query:  -------------------------------YVEAYYAKITTIWKSLMEYRPPDECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLI
                                        +EAYY K+ T+W+ L +YRP  +CT  GLK   ++  SE++M FLMGLN S++ IRA ILLM+ +P +
Subjt:  -------------------------------YVEAYYAKITTIWKSLMEYRPPDECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLI

Query:  NRVFSLVIQEEHQRSIG
        N+VFSL+IQEE QR+IG
Subjt:  NRVFSLVIQEEHQRSIG

XP_022888913.1 uncharacterized protein LOC111404319 [Olea europaea var. sylvestris]; e value 8.0e-34; % identity 40.38
Query:  HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPE-GENLLSAWQCNNDDIIISWIINSISKEITTSLVYT------------
        + L  S +P +VLV+QPL+   NY+SWS+AM +AL  KNK+DF+   I KPE  ++LL+AW   N++++ISWI+NS+SKEI+ S++Y             
Subjt:  HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPE-GENLLSAWQCNNDDIIISWIINSISKEITTSLVYT------------

Query:  ----------------------GSCYVEAYYAKITTIWKSLMEYRP---PDECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLINRV
                              G   V  Y+ K+ TIW+ L  +RP     +C  GG K   DY + E++M FLMGLN SF+ +R  +L+M+ +P IN+ 
Subjt:  ----------------------GSCYVEAYYAKITTIWKSLMEYRP---PDECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLINRV

Query:  FSLVIQEEHQRSI
        F+L+ QEEHQRSI
Subjt:  FSLVIQEEHQRSI

TrEMBL top hits (e value, % identity, alignment)
A0A2N9HYD2 Integrase catalytic domain-containing protein; e value 2.4e-31; % identity 38.57
Query:  APTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGENL--LSAW-QCNNDDIIISWIINSISKEITTSLVYTGS---------------
        +P  +LV+QP L   NY +WS++M +AL  KNK+ F+   I  P+ ++L   + W +CN   ++ISW++NS+SKEI +S++Y  +               
Subjt:  APTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGENL--LSAW-QCNNDDIIISWIINSISKEITTSLVYTGS---------------

Query:  -------------------CYVEAYYAKITTIWKSLMEYRPPDECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLINRVFSLVIQEE
                           C V AY+ K+ ++W  L  YR    C+ G LK+ +D    E +M FLMGLN SF+++RA IL+M  LP IN+ FSLV+QEE
Subjt:  -------------------CYVEAYYAKITTIWKSLMEYRPPDECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLINRVFSLVIQEE

Query:  HQRSIGKTAM
         QRSIG TA+
Subjt:  HQRSIGKTAM

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 8; e value 1.7e-45; % identity 46.26
Query:  EAQLN-HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGENLLSAWQCNNDDIIISWIINSISKEITTSLVYTGS-----
        +AQLN + +  S+ PT  +VTQPL  A NY+SWS+AM +A+ G+NK  F+T  I+KP    LL AW CNN DI+ SWI+NS+SKEI  S++Y GS     
Subjt:  EAQLN-HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGENLLSAWQCNNDDIIISWIINSISKEITTSLVYTGS-----

Query:  -----------------------------CYVEAYYAKITTIWKSLMEYRPPDECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLIN
                                       +E YY K+ TIW++L EYR  ++CT GGLK F+D+L SE+IM FLMGLN S++++RA ILLM  LP IN
Subjt:  -----------------------------CYVEAYYAKITTIWKSLMEYRPPDECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLIN

Query:  RVFSLVIQEEHQRSIGKTAMSVETFSL
         VFSL+IQEE QRS G     ++  +L
Subjt:  RVFSLVIQEEHQRSIGKTAMSVETFSL

A0A5J5B2C5 Uncharacterized protein; e value 2.4e-31; % identity 41.51
Query:  HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEG--ENLLSAWQCNNDDIIISWIINSISKEITTSLVYTG----------
        + L  S +P  VLV+Q  L   NY++WS+AM +AL  KNKL FV  FI +P+G   NLL +W   N++I+ISWI+NSISKEI+ S+++            
Subjt:  HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEG--ENLLSAWQCNNDDIIISWIINSISKEITTSLVYTG----------

Query:  ------------------------SCYVEAYYAKITTIWKSLMEYRPP---DECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLINR
                                   V  Y+ K+ TIW+ L  YRP     +C  GG+K   DY  +E+IM FLMGL+ SFS +   +LLM+S+P INR
Subjt:  ------------------------SCYVEAYYAKITTIWKSLMEYRPP---DECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLINR

Query:  VFSLVIQEEHQR
        VFSL++QEE QR
Subjt:  VFSLVIQEEHQR

A0A6J1CXR2 uncharacterized protein LOC111015239; e value 4.0e-47; % identity 49.31
Query:  VEAQLN-HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGENLLSAWQCNNDDIIISWIINSISKEITTSLVYTGSC---
        +E+QLN +L+  S APT +LVTQ LL ASNY+SW ++M +AL GKNK+ F+   IKKP G NLL+AW+CNN DII SWIINS+SKEI  S++YTGS    
Subjt:  VEAQLN-HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGENLLSAWQCNNDDIIISWIINSISKEITTSLVYTGSC---

Query:  -------------------------------YVEAYYAKITTIWKSLMEYRPPDECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLI
                                        +EAYY K+ T+W+ L +YRP  +CT  GLK   ++  SE++M FLMGLN S++ IRA ILLM+ +P +
Subjt:  -------------------------------YVEAYYAKITTIWKSLMEYRPPDECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLI

Query:  NRVFSLVIQEEHQRSIG
        N+VFSL+IQEE QR+IG
Subjt:  NRVFSLVIQEEHQRSIG

A0A7J0FKC9 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein; e value 6.2e-32; % identity 40.93
Query:  HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGE--NLLSAWQCNNDDIIISWIINSISKEITTSLVYTGSCY-------
        + L  S  P +VLV+Q  L   NY+SW++AM +AL  KNKL F+   I KPEG   NLL++W   N++++ISWI+NS+SKEI+ S++++ S         
Subjt:  HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGE--NLLSAWQCNNDDIIISWIINSISKEITTSLVYTGSCY-------

Query:  ---------------------------VEAYYAKITTIWKSLMEYRPP---DECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLINR
                                   V  Y+ K+ TIW+ L  YRP      CT GG+K    +   E+IM FLM L+ SF+ IR  +LLM+ LP IN+
Subjt:  ---------------------------VEAYYAKITTIWKSLMEYRPP---DECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLINR

Query:  VFSLVIQEEHQRSIG
        VFSL+ QEEHQR IG
Subjt:  VFSLVIQEEHQRSIG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink); e value 1.5e-06; % identity 23.47
Query:  HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEG-ENLLSAW-QCNNDDIIISWIINSISKEITTSLVYT-----------
        +L P    P+   + +   D  NY +W       L    K  F+   + KP+    L   W QCN   +++ W++NS++ ++  S++Y            
Subjt:  HLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEG-ENLLSAW-QCNNDDIIISWIINSISKEITTSLVYT-----------

Query:  -----------------------GSCYVEAYYAKITTIWKSLMEYRPPDECTYGG-----LKVFLDYLNSEFIMIFLMG--LNGSFSSIRAHILLMNSLP
                               G   VE Y+ K++ +W  L EY P  EC  GG      K   +    E    FLMG  LN  F ++   I+     P
Subjt:  -----------------------GSCYVEAYYAKITTIWKSLMEYRPPDECTYGG-----LKVFLDYLNSEFIMIFLMG--LNGSFSSIRAHILLMNSLP

Query:  LINRVFSLVIQEE
         ++  F++V   E
Subjt:  LINRVFSLVIQEE


Sequences
CDS sequence
ATGACAGAGTCGACTGGTGTCGCTATGGATTTCTTGGATGTCGAGGCACAACTGAACCATCTCCTTCCTCAGTCGATTGCCCCGACTATTGTCCTCGTTACTCAACCACT
TCTCGATGCAAGCAATTACAGTTCTTGGAGCAAGGCGATGTTCCTGGCTCTGTTTGGAAAGAATAAATTGGACTTCGTTACTAGATTCATCAAGAAACCAGAAGGGGAAA
ATCTTCTTTCTGCTTGGCAGTGCAATAATGATGATATAATCATTTCTTGGATTATCAACTCAATCTCTAAGGAAATTACAACGAGTCTCGTTTACACAGGATCTTGTTAT
GTCGAAGCCTACTACGCGAAGATTACCACCATATGGAAAAGTCTCATGGAGTATAGGCCCCCCGATGAATGCACCTATGGAGGATTAAAGGTTTTTCTCGATTACTTAAA
CTCTGAATTCATCATGATTTTCTTGATGGGTTTGAATGGATCTTTTTCTTCAATTCGAGCTCATATATTGCTCATGAACTCCCTACCTCTCATAAACAGAGTATTTTCTT
TGGTAATACAAGAGGAACATCAGAGGTCCATCGGTAAAACTGCCATGTCAGTAGAGACTTTCAGTCTAATAGTTAATGTAGGGTAA
mRNA sequence
ATGACAGAGTCGACTGGTGTCGCTATGGATTTCTTGGATGTCGAGGCACAACTGAACCATCTCCTTCCTCAGTCGATTGCCCCGACTATTGTCCTCGTTACTCAACCACT
TCTCGATGCAAGCAATTACAGTTCTTGGAGCAAGGCGATGTTCCTGGCTCTGTTTGGAAAGAATAAATTGGACTTCGTTACTAGATTCATCAAGAAACCAGAAGGGGAAA
ATCTTCTTTCTGCTTGGCAGTGCAATAATGATGATATAATCATTTCTTGGATTATCAACTCAATCTCTAAGGAAATTACAACGAGTCTCGTTTACACAGGATCTTGTTAT
GTCGAAGCCTACTACGCGAAGATTACCACCATATGGAAAAGTCTCATGGAGTATAGGCCCCCCGATGAATGCACCTATGGAGGATTAAAGGTTTTTCTCGATTACTTAAA
CTCTGAATTCATCATGATTTTCTTGATGGGTTTGAATGGATCTTTTTCTTCAATTCGAGCTCATATATTGCTCATGAACTCCCTACCTCTCATAAACAGAGTATTTTCTT
TGGTAATACAAGAGGAACATCAGAGGTCCATCGGTAAAACTGCCATGTCAGTAGAGACTTTCAGTCTAATAGTTAATGTAGGGTAA
Protein sequence
MTESTGVAMDFLDVEAQLNHLLPQSIAPTIVLVTQPLLDASNYSSWSKAMFLALFGKNKLDFVTRFIKKPEGENLLSAWQCNNDDIIISWIINSISKEITTSLVYTGSCY
VEAYYAKITTIWKSLMEYRPPDECTYGGLKVFLDYLNSEFIMIFLMGLNGSFSSIRAHILLMNSLPLINRVFSLVIQEEHQRSIGKTAMSVETFSLIVNVG
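
The CDS and protein sequences in this record can be checked against each other by translating the CDS. The snippet below is a minimal sketch that assumes the standard genetic code (this record does not name a translation table) and uses a hypothetical helper, `translate`. For brevity it translates only the first 48 bases of the CDS listed here.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
# Assumption: the standard code applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 48 bases of the CDS sequence from this record.
cds_start = "ATGACAGAGTCGACTGGTGTCGCTATGGATTTCTTGGATGTCGAGGCA"
print(translate(cds_start))
```

Running the same function on the full CDS, with the line breaks removed, gives a sequence to compare with the protein sequence listed here.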