; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004494 (gene) of Snake gourd v1 genome

Gene IDTan0004494
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG04:80400474..80401232
RNA-Seq ExpressionTan0004494
SyntenyTan0004494
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO61345.1 reverse transcriptase [Corchorus capsularis]2.4e-4036.1Show/hide
Query:  LSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFYGAI
        L+  W+   LTEEEN    +   +  +       CL+GK+LS++ +NVE  +NVM  VW++   + +  IGEN+F F F + ++K R+    PW F  A+
Subjt:  LSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFYGAI

Query:  ILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQID-LPNLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEK
        ++ +  +     ++++    +FW Q H+L L  M E I R +G S G V +ID   +   W +F R R ++N+TKPLRRGM + A + G+ + I++R+EK
Subjt:  ILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQID-LPNLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEK

Query:  LPEFCFECGCVSHIQNQCDKEFETEENVEGPI---YGPWMK
        LP+FC+ CGC++H++N+C+K      + +G I   YGPW++
Subjt:  LPEFCFECGCVSHIQNQCDKEFETEENVEGPI---YGPWMK

OMP03234.1 hypothetical protein COLO4_10561 [Corchorus olitorius]6.6e-4334.17Show/hide
Query:  LSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFYGAI
        LSA W+   LT+EE     +   + A+     ++CL+GK+LS++ +N++  +NV+  VW++   + +  IGE ++ F F + ++K R+   GPW F  A+
Subjt:  LSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFYGAI

Query:  ILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQID-LPNLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEK
        ++ +  +  I  + ++     FW+Q+H+L L  M E + +A+G+S GEVI+ID   +   W +F R+R  +N+ KPLRRGM + A + G+ + +++R+EK
Subjt:  ILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQID-LPNLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEK

Query:  LPEFCFECGCVSHIQNQCDKE--FETEENVEGPIYGPWMK
        LP+FC+ CGC++H +N+C+K      +       YGPW++
Subjt:  LPEFCFECGCVSHIQNQCDKE--FETEENVEGPIYGPWMK

TXG68535.1 hypothetical protein EZV62_003470 [Acer yangbiense]3.4e-3932.79Show/hide
Query:  MNVNDLSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWF
        M  NDL+   + + + +E+N V  +   I      + + CLVGKVLS K +N E+FK V+  +W    ++ I+++G+N F F F     ++RI   GPW 
Subjt:  MNVNDLSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWF

Query:  FYGAIILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLPNLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITY
        F  ++I+ EK   +     L F+ A FWVQ+H++ + CM + + + L   IG V++I + +   W +F R+++ ++I+KPL+R +++K +     + +  
Subjt:  FYGAIILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLPNLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITY

Query:  RFEKLPEFCFECGCVSHIQNQCDKEFETEENVEGP--IYGPWMK
        ++E+LPEFC+ CG V H  N+C      +E +EGP   +G W++
Subjt:  RFEKLPEFCFECGCVSHIQNQCDKEFETEENVEGP--IYGPWMK

XP_018829381.1 uncharacterized protein LOC108997518 [Juglans regia]5.8e-3933.47Show/hide
Query:  LSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFYGAI
        +  +W+ ++LTEEE+ + ++ S    +     E  L+GKV S +LIN+E+ ++ M  +WR+ K      +  N+F  TF T   K+RI+    W F  A+
Subjt:  LSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFYGAI

Query:  ILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLPNLG-GWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEK
         L    + ++QP      H +FW+Q+H+L L CM E+  + +G+S+G+V+++D+   G GW +  R+R++V +TK + RG  I    +G+ +W+T+++EK
Subjt:  ILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLPNLG-GWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEK

Query:  LPEFCFECGCVSHIQNQC----DKEFETEENVEGPIYGPWMK
        LP+ CF CG + H +  C      + ET+EN     +GPW++
Subjt:  LPEFCFECGCVSHIQNQC----DKEFETEENVEGPIYGPWMK

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]5.6e-4231.17Show/hide
Query:  VNDLSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFY
        +++++  W+  K T +EN    I         +N ++C+V K+ + K I+ E+ ++VM++VWRVH     + +G N++   F+++ +K+R++++GPW F 
Subjt:  VNDLSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFY

Query:  GAIILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLPNLGGWS-RFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYR
         ++++        QP ++ F+   FW+Q+HN+   C+  ++   LG  +G+V +I+     GW+  F R+R+K++++KPLRRG+K+K  D G+ +W   R
Subjt:  GAIILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLPNLGGWS-RFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYR

Query:  FEKLPEFCFECGCVSHIQNQCDKEFETEENVEGPIYGPWMKVIEFKK
        +EKLP+FC+ECG + H   +C++  +         YG W++    KK
Subjt:  FEKLPEFCFECGCVSHIQNQCDKEFETEENVEGPIYGPWMKVIEFKK

TrEMBL top hitse value%identityAlignment
A0A1R3GTB5 Reverse transcriptase1.1e-4036.1Show/hide
Query:  LSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFYGAI
        L+  W+   LTEEEN    +   +  +       CL+GK+LS++ +NVE  +NVM  VW++   + +  IGEN+F F F + ++K R+    PW F  A+
Subjt:  LSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFYGAI

Query:  ILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQID-LPNLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEK
        ++ +  +     ++++    +FW Q H+L L  M E I R +G S G V +ID   +   W +F R R ++N+TKPLRRGM + A + G+ + I++R+EK
Subjt:  ILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQID-LPNLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEK

Query:  LPEFCFECGCVSHIQNQCDKEFETEENVEGPI---YGPWMK
        LP+FC+ CGC++H++N+C+K      + +G I   YGPW++
Subjt:  LPEFCFECGCVSHIQNQCDKEFETEENVEGPI---YGPWMK

A0A1R3K847 Uncharacterized protein3.2e-4334.17Show/hide
Query:  LSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFYGAI
        LSA W+   LT+EE     +   + A+     ++CL+GK+LS++ +N++  +NV+  VW++   + +  IGE ++ F F + ++K R+   GPW F  A+
Subjt:  LSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFYGAI

Query:  ILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQID-LPNLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEK
        ++ +  +  I  + ++     FW+Q+H+L L  M E + +A+G+S GEVI+ID   +   W +F R+R  +N+ KPLRRGM + A + G+ + +++R+EK
Subjt:  ILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQID-LPNLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEK

Query:  LPEFCFECGCVSHIQNQCDKE--FETEENVEGPIYGPWMK
        LP+FC+ CGC++H +N+C+K      +       YGPW++
Subjt:  LPEFCFECGCVSHIQNQCDKE--FETEENVEGPIYGPWMK

A0A2I4FCK6 uncharacterized protein LOC1089975182.8e-3933.47Show/hide
Query:  LSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFYGAI
        +  +W+ ++LTEEE+ + ++ S    +     E  L+GKV S +LIN+E+ ++ M  +WR+ K      +  N+F  TF T   K+RI+    W F  A+
Subjt:  LSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFYGAI

Query:  ILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLPNLG-GWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEK
         L    + ++QP      H +FW+Q+H+L L CM E+  + +G+S+G+V+++D+   G GW +  R+R++V +TK + RG  I    +G+ +W+T+++EK
Subjt:  ILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLPNLG-GWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEK

Query:  LPEFCFECGCVSHIQNQC----DKEFETEENVEGPIYGPWMK
        LP+ CF CG + H +  C      + ET+EN     +GPW++
Subjt:  LPEFCFECGCVSHIQNQC----DKEFETEENVEGPIYGPWMK

A0A5C7IHI0 CCHC-type domain-containing protein1.7e-3932.79Show/hide
Query:  MNVNDLSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWF
        M  NDL+   + + + +E+N V  +   I      + + CLVGKVLS K +N E+FK V+  +W    ++ I+++G+N F F F     ++RI   GPW 
Subjt:  MNVNDLSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWF

Query:  FYGAIILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLPNLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITY
        F  ++I+ EK   +     L F+ A FWVQ+H++ + CM + + + L   IG V++I + +   W +F R+++ ++I+KPL+R +++K +     + +  
Subjt:  FYGAIILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLPNLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITY

Query:  RFEKLPEFCFECGCVSHIQNQCDKEFETEENVEGP--IYGPWMK
        ++E+LPEFC+ CG V H  N+C      +E +EGP   +G W++
Subjt:  RFEKLPEFCFECGCVSHIQNQCDKEFETEENVEGP--IYGPWMK

A0A6J1D765 uncharacterized protein LOC1110179022.7e-4231.17Show/hide
Query:  VNDLSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFY
        +++++  W+  K T +EN    I         +N ++C+V K+ + K I+ E+ ++VM++VWRVH     + +G N++   F+++ +K+R++++GPW F 
Subjt:  VNDLSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFY

Query:  GAIILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLPNLGGWS-RFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYR
         ++++        QP ++ F+   FW+Q+HN+   C+  ++   LG  +G+V +I+     GW+  F R+R+K++++KPLRRG+K+K  D G+ +W   R
Subjt:  GAIILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLPNLGGWS-RFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYR

Query:  FEKLPEFCFECGCVSHIQNQCDKEFETEENVEGPIYGPWMKVIEFKK
        +EKLP+FC+ECG + H   +C++  +         YG W++    KK
Subjt:  FEKLPEFCFECGCVSHIQNQCDKEFETEENVEGPIYGPWMKVIEFKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding1.3e-0420.37Show/hide
Query:  RVHKDMPIDIIGE----NMFFFTFRTVMKKNRIITNGPWFFYGAIILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLP
        R  K++  +++G     +   F F++      I+  GPW F   + + ++   L    +  F    FW+Q+  + L  +   I+ ++G  +G  ++    
Subjt:  RVHKDMPIDIIGE----NMFFFTFRTVMKKNRIITNGPWFFYGAIILFEKTNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLP

Query:  NLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEKLPEFCFECGCVSHIQNQC
                T L   V++ K                    +++EKL  FC  CG +SH  ++C
Subjt:  NLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEKLPEFCFECGCVSHIQNQC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGTCAATGACTTATCAGCTAGATGGAAAGGAATGAAGCTCACTGAGGAGGAGAACACTGTGACAGATATACGATCCTTGATAAAAGCAAAGAAGCCGAACAACTA
TGAGGTATGTTTAGTGGGCAAAGTTCTATCACAAAAATTGATCAACGTAGAATCTTTCAAAAATGTTATGAGAAATGTATGGAGAGTCCATAAAGATATGCCTATAGATA
TAATTGGAGAAAACATGTTTTTCTTTACGTTTAGAACAGTGATGAAGAAAAATCGAATTATCACAAATGGTCCATGGTTTTTTTATGGAGCCATAATTCTGTTTGAAAAA
ACAAATACTTTGATTCAACCTCAAAACCTAAGATTCCATCATGCGACTTTTTGGGTTCAACTTCATAACCTTCTTCTATGTTGTATGGAGGAAGATATCCTTCGAGCATT
GGGAAATTCAATAGGAGAGGTCATTCAAATAGATCTTCCCAATCTAGGGGGATGGAGTAGATTCACACGACTGAGAATCAAGGTGAATATCACCAAACCCCTCCGGAGGG
GTATGAAAATCAAAGCAGAGGATGAAGGAGAAGGCCTGTGGATAACCTATAGATTTGAAAAACTACCAGAATTCTGTTTTGAGTGTGGATGTGTGAGCCATATACAAAAT
CAGTGTGATAAGGAATTTGAAACAGAAGAGAATGTTGAAGGACCTATATATGGCCCTTGGATGAAGGTAATAGAGTTCAAAAAGTTGAGTACGCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATGTCAATGACTTATCAGCTAGATGGAAAGGAATGAAGCTCACTGAGGAGGAGAACACTGTGACAGATATACGATCCTTGATAAAAGCAAAGAAGCCGAACAACTA
TGAGGTATGTTTAGTGGGCAAAGTTCTATCACAAAAATTGATCAACGTAGAATCTTTCAAAAATGTTATGAGAAATGTATGGAGAGTCCATAAAGATATGCCTATAGATA
TAATTGGAGAAAACATGTTTTTCTTTACGTTTAGAACAGTGATGAAGAAAAATCGAATTATCACAAATGGTCCATGGTTTTTTTATGGAGCCATAATTCTGTTTGAAAAA
ACAAATACTTTGATTCAACCTCAAAACCTAAGATTCCATCATGCGACTTTTTGGGTTCAACTTCATAACCTTCTTCTATGTTGTATGGAGGAAGATATCCTTCGAGCATT
GGGAAATTCAATAGGAGAGGTCATTCAAATAGATCTTCCCAATCTAGGGGGATGGAGTAGATTCACACGACTGAGAATCAAGGTGAATATCACCAAACCCCTCCGGAGGG
GTATGAAAATCAAAGCAGAGGATGAAGGAGAAGGCCTGTGGATAACCTATAGATTTGAAAAACTACCAGAATTCTGTTTTGAGTGTGGATGTGTGAGCCATATACAAAAT
CAGTGTGATAAGGAATTTGAAACAGAAGAGAATGTTGAAGGACCTATATATGGCCCTTGGATGAAGGTAATAGAGTTCAAAAAGTTGAGTACGCCTTAG
Protein sequenceShow/hide protein sequence
MNVNDLSARWKGMKLTEEENTVTDIRSLIKAKKPNNYEVCLVGKVLSQKLINVESFKNVMRNVWRVHKDMPIDIIGENMFFFTFRTVMKKNRIITNGPWFFYGAIILFEK
TNTLIQPQNLRFHHATFWVQLHNLLLCCMEEDILRALGNSIGEVIQIDLPNLGGWSRFTRLRIKVNITKPLRRGMKIKAEDEGEGLWITYRFEKLPEFCFECGCVSHIQN
QCDKEFETEENVEGPIYGPWMKVIEFKKLSTP