; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015234 (gene) of Snake gourd v1 genome

Gene IDTan0015234
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionNHL domain protein
Genome locationLG11:5354423..5355599
RNA-Seq ExpressionTan0015234
SyntenyTan0015234
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602448.1 hypothetical protein SDJN03_07681, partial [Cucurbita argyrosperma subsp. sororia]1.1e-7488.16Show/hide
Query:  MSVAQSPDLSSDENVFHGDD-LHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD
        MSV+QSPDLSSD N FH DD LHEAA+ASRGCC W+PCLRSNPS+SWW+RIRAADNDDEWWLRGWKRFR WSEIVAGPKWKTFIRQFHKNR+RQS +RYD
Subjt:  MSVAQSPDLSSDENVFHGDD-LHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD

Query:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI
        PLSYALNFDEGPA DDPFS+DFMRRDFS RFA+IPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI

XP_022961939.1 uncharacterized protein LOC111462557 [Cucurbita moschata]2.0e-7488.16Show/hide
Query:  MSVAQSPDLSSDENVFHGDD-LHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD
        MSV QSPDLSSD N FH DD LHEAA+ASRGCC W+PCLRSNPS+SWW+RIRAADNDDEWWLRGWKRFR WSEIVAGPKWKTFIRQFHKNR+RQS +RYD
Subjt:  MSVAQSPDLSSDENVFHGDD-LHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD

Query:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI
        PLSYALNFDEGPA DDPFS+DFMRRDFS RFA+IPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI

XP_023533538.1 uncharacterized protein LOC111795380 [Cucurbita pepo subsp. pepo]4.4e-7487.5Show/hide
Query:  MSVAQSPDLSSDENVFHGDD-LHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD
        MSV+QSPDLSSD N FH DD LHEAA+ASRGCC W+PCLRS+PS+SWW+RIRAADNDDEWWLRGWKRFR WSEIVAGPKWKTFIRQFHKNR+RQS +RYD
Subjt:  MSVAQSPDLSSDENVFHGDD-LHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD

Query:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI
        PLSYALNFDEGPA DDPFS+DFMRRDFS RFA+IPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI

XP_031741765.1 uncharacterized protein LOC116403956 [Cucumis sativus]3.9e-7586.84Show/hide
Query:  MSVAQSPDLSSDENVF-HGDDLHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD
        MS+A SPDLSSDEN F H DDLH+A +A+RGCC WIPCLRSN SQSWW+RIRAADNDDEWWL+GWKRFR+WSEIVAGPKWKTFIRQFHKNRNRQS FRYD
Subjt:  MSVAQSPDLSSDENVF-HGDDLHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD

Query:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI
        PLSY+LNFDEGPAHDDPF+DDF+RRDFS+RFAAIPASAKSSMDLGKD PSFI
Subjt:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI

XP_038889212.1 uncharacterized protein LOC120079097 [Benincasa hispida]2.2e-7888.74Show/hide
Query:  MSVAQSPDLSSDENVFHGDDLHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYDP
        MS+AQSPDLSSDEN FHGDDLH+AA+ SRGCC W+PCLRSNPSQSWW+RIRAADNDDEWWLRGWKRFR+WSEI+AGPKWKTFIRQF+KNRNRQS FRYDP
Subjt:  MSVAQSPDLSSDENVFHGDDLHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYDP

Query:  LSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI
        LSY+LNFDEGPAHDDPFSDD +RRDFS+RFAAIPASAKSSMDLGKDGP FI
Subjt:  LSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI

TrEMBL top hitse value%identityAlignment
A0A0A0KRY7 Uncharacterized protein1.9e-7586.84Show/hide
Query:  MSVAQSPDLSSDENVF-HGDDLHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD
        MS+A SPDLSSDEN F H DDLH+A +A+RGCC WIPCLRSN SQSWW+RIRAADNDDEWWL+GWKRFR+WSEIVAGPKWKTFIRQFHKNRNRQS FRYD
Subjt:  MSVAQSPDLSSDENVF-HGDDLHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD

Query:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI
        PLSY+LNFDEGPAHDDPF+DDF+RRDFS+RFAAIPASAKSSMDLGKD PSFI
Subjt:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI

A0A1S4E267 uncharacterized protein LOC1079916286.8e-7384.11Show/hide
Query:  MSVAQSPDLSSDENVFHGDDLHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYDP
        MS+AQSPD SSDEN FH +DLH+A +A+ GCC WIPCLRSN SQSWW+RIRAADNDDEWWLRGWKRFR+WSEIVAGPKWKTFIRQFHKNRNRQS FRYDP
Subjt:  MSVAQSPDLSSDENVFHGDDLHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYDP

Query:  LSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI
        LSY+LNFDEGPAH+D F+DDF+RRDFS+RFAAIPASAKSSMDL KD PS I
Subjt:  LSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI

A0A5A7T228 Putative NHL domain-containing protein3.7e-7184.93Show/hide
Query:  MSVAQSPDLSSDENVFHGDDLHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYDP
        MS+AQSPD SSDEN FH +DLH+A +A+ GCC WIPCLRSN SQSWW+RIRAADNDDEWWLRGWKRFR+WSEIVAGPKWKTFIRQFHKNRNRQS FRYDP
Subjt:  MSVAQSPDLSSDENVFHGDDLHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYDP

Query:  LSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKD
        LSY+LNFDEGPAH+D F+DDF+RRDFS+RFAAIPASAKSSMDL KD
Subjt:  LSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKD

A0A6J1HFG4 uncharacterized protein LOC1114625579.5e-7588.16Show/hide
Query:  MSVAQSPDLSSDENVFHGDD-LHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD
        MSV QSPDLSSD N FH DD LHEAA+ASRGCC W+PCLRSNPS+SWW+RIRAADNDDEWWLRGWKRFR WSEIVAGPKWKTFIRQFHKNR+RQS +RYD
Subjt:  MSVAQSPDLSSDENVFHGDD-LHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD

Query:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI
        PLSYALNFDEGPA DDPFS+DFMRRDFS RFA+IPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI

A0A6J1JTQ3 uncharacterized protein LOC1114874221.0e-7386.18Show/hide
Query:  MSVAQSPDLSSDENVFHGDD-LHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD
        MSV+QSPDLSSD N FH DD LHEAA+ASRGCC W+PCLRSNPS++WW+RIRAADNDDEWWLRGWKRFR WSEIVAGPKWKTFIRQFHKNR+RQS +RYD
Subjt:  MSVAQSPDLSSDENVFHGDD-LHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYD

Query:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI
        PLSYALNFD+GPA DDPF +DFMRRDFS RFA+IPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)9.8e-3242.37Show/hide
Query:  QSPDLSSDENVFHGDDLHEAAYASRGCCCWIPCLRSNPSQS-----WWKRIRAADN---DDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNR------
        QSP ++    V   DD+HEA +A RGCC  +PCL S+   +     WW+RI   D    D+ WW+RGW+R R+WSE+VAGP+WKT+IR+F ++       
Subjt:  QSPDLSSDENVFHGDDLHEAAYASRGCCCWIPCLRSNPSQS-----WWKRIRAADN---DDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNR------

Query:  ----------------NR---QSNFRYDPLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAA--IPASAKSSMDLGKD
                        NR   Q  FRYD LSY+LNFD+G      F D+F  RD+S RFAA  +P S K S+D   D
Subjt:  ----------------NR---QSNFRYDPLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAA--IPASAKSSMDLGKD

AT3G48020.1 unknown protein6.6e-2041.53Show/hide
Query:  CLRSNPSQSWWKRI-RAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNR------QSNFRYDPLSYALNFDEGPAHDDPFSDDFMRRDFSSR
        C  +    SWW+RI R    +  WW+R + + R+WSEIVAGP+WKTFIR+F+++  R         FRYDP+SY L+F++    DD  +     R FS R
Subjt:  CLRSNPSQSWWKRI-RAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNR------QSNFRYDPLSYALNFDEGPAHDDPFSDDFMRRDFSSR

Query:  FAAIP-ASAKSSMDLGKD
        +A++P AS KS   +  D
Subjt:  FAAIP-ASAKSSMDLGKD

AT5G14890.1 NHL domain-containing protein3.4e-3248Show/hide
Query:  DDLHEAAYASRGCCCWIPCLRSN----PSQS-WWKRIRAADN---DDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHK------------NRNRQSNFRY
        D++HEA +A RGCC  +PCL S+    P+ S WW+RIR  D    D+ WW+ GW + R+WSEIVAGPKWKTFIR+F +            NR    +FRY
Subjt:  DDLHEAAYASRGCCCWIPCLRSN----PSQS-WWKRIRAADN---DDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHK------------NRNRQSNFRY

Query:  DPLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAA--IPASAKSSMDLGKD
        D  SY+LNFD+G      F D+F  RD+S RFAA  +P S K S+D   D
Subjt:  DPLSYALNFDEGPAHDDPFSDDFMRRDFSSRFAA--IPASAKSSMDLGKD

AT5G25240.1 unknown protein4.9e-0734.94Show/hide
Query:  SQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQF---HKNRNRQSNFRYDPLSYALNFDEGPAHDDPFSDDFM
        S+  W      +    W     K  ++ SE +AGPKWK FIR F    K   R  +F YD  +Y+LNFD+G    D   + F+
Subjt:  SQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQF---HKNRNRQSNFRYDPLSYALNFDEGPAHDDPFSDDFM

AT5G62865.1 unknown protein2.5e-1941.73Show/hide
Query:  CCCWIPCLRSNPS----QSWWKRIRAAD---------NDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNR------QSNFRYDPLSYALNFDEGP
        CCC+    RS  S     S W RIR  D         ++  WW+R   + R+WSEIVAGP+WKTFIR+F+++  R         F+YDPLSY+LNFD   
Subjt:  CCCWIPCLRSNPS----QSWWKRIRAAD---------NDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNR------QSNFRYDPLSYALNFDEGP

Query:  AHDDPFSDDFM----RRDFSSRFAAIP
          DD   D+++     R FS+RFA++P
Subjt:  AHDDPFSDDFM----RRDFSSRFAAIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGTTGCTCAATCTCCCGACCTATCTTCCGACGAAAATGTCTTCCACGGCGACGATCTTCACGAGGCCGCCTACGCCAGCCGCGGCTGCTGCTGCTGGATCCCTTG
CCTGAGGTCCAATCCCTCGCAGTCCTGGTGGAAGCGGATTAGGGCAGCCGATAACGACGACGAGTGGTGGCTCCGAGGCTGGAAGAGGTTCCGTCAGTGGTCCGAAATCG
TCGCCGGGCCTAAATGGAAGACCTTCATTCGTCAATTCCACAAGAATCGCAATCGCCAATCCAATTTCCGCTACGATCCTCTCAGTTACGCTCTCAATTTTGATGAAGGT
CCAGCCCACGACGATCCGTTCAGCGACGACTTTATGCGCCGCGATTTCTCCTCTCGATTCGCCGCCATTCCGGCCTCTGCCAAGTCGTCCATGGACCTCGGAAAGGATGG
GCCGTCCTTCATTTGA
mRNA sequenceShow/hide mRNA sequence
TGGAAAACAAAAATGCCATTTTTTTTTAACTAAAAAAAAAAAAAAAAGATGATGGAGAAATTTGGACCAGGAAATGGAAGATTTACAGAGAGAGAAAAAAACATTTGGCG
AATGGGATAAGAATTAATTTTCCAGCGTGTTTTCCGTTTCAATTTTCTCCGATACGGCCGCTCTCATATATTAATCTCAGACGAACATCATCTTCTTCCCTCTCCAAAAA
AAAAAAAAAGAAAAGAAAAAAGTTCACCGATTCAATTTCAAACAGAGAGAATAGAGAAGAGAAGAGAGAATCTCTGTGTTAATTGATTTGCAGAAAAGTTTGAAAAAACT
TTTCTGCAAATCAATTCCAACAGATCGACATACCTCTCTCTCTCTCTCTCTCTCTAAATGTCTGTTGCTCAATCTCCCGACCTATCTTCCGACGAAAATGTCTTCCACGG
CGACGATCTTCACGAGGCCGCCTACGCCAGCCGCGGCTGCTGCTGCTGGATCCCTTGCCTGAGGTCCAATCCCTCGCAGTCCTGGTGGAAGCGGATTAGGGCAGCCGATA
ACGACGACGAGTGGTGGCTCCGAGGCTGGAAGAGGTTCCGTCAGTGGTCCGAAATCGTCGCCGGGCCTAAATGGAAGACCTTCATTCGTCAATTCCACAAGAATCGCAAT
CGCCAATCCAATTTCCGCTACGATCCTCTCAGTTACGCTCTCAATTTTGATGAAGGTCCAGCCCACGACGATCCGTTCAGCGACGACTTTATGCGCCGCGATTTCTCCTC
TCGATTCGCCGCCATTCCGGCCTCTGCCAAGTCGTCCATGGACCTCGGAAAGGATGGGCCGTCCTTCATTTGACGCCGGCGCGCCGGCTGAACAGATGGATGGATGTATC
GTGTGGCTCTGGAATCTCGGTGTGCTCGGCTTCCTCTTGCCGGAGCCGTCGTGCGCGGTGGTCGTCGGAGGCGATTCCGGGGGAGGAGGAGCGGAGGGGGATTAGCATTT
TGGGGACGACGGGGATTGTGAATGGTGGGGGTAATTTGTGGCCAGCGTTAAAAACTATAAACTCGGTGATTCCATTTTTCTTTTCTTTTCTTTTTTTTTTTTCATTCTGT
TTTTTAAGATTATTTTTTTATTTGTTTATAAAAATATATTTTTTTTCAAATGAAATTATAATGATTTTGTGAAATTT
Protein sequenceShow/hide protein sequence
MSVAQSPDLSSDENVFHGDDLHEAAYASRGCCCWIPCLRSNPSQSWWKRIRAADNDDEWWLRGWKRFRQWSEIVAGPKWKTFIRQFHKNRNRQSNFRYDPLSYALNFDEG
PAHDDPFSDDFMRRDFSSRFAAIPASAKSSMDLGKDGPSFI