CuGenDBv2

Tan0003984 (gene) of Snake gourd v1 genome

Gene ID: Tan0003984
Organism: Trichosanthes anguina (Snake gourd v1)
Description: NHL domain protein
Genome location: LG05:70411248..70412284
RNA-Seq Expression: Tan0003984
Synteny: Tan0003984
Gene Ontology terms: NA
InterPro domains: NA
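The genome location above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record; the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG05:70411248..70412284")
print(seqid, start, end, span_length(start, end))  # LG05 70411248 70412284 1037
```

Under that assumption the span is 1037 bp long.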


Homology
GenBank top hits | e value | % identity | Alignment
KAG6584232.1 hypothetical protein SDJN03_20164, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.7e-76 | % identity: 85.8
Query:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ES GEIDGADALLASRRY C CFPCFG  RSAS+ELSWWERAKAKAK+TKFDG+DHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSSASV
        NRPAAVK GKFQYDPISYALNFDEG+NGDVDFEG+EY+GGFQ+FS RF+AV A  KSS A+V
Subjt:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSSASV

XP_022923865.1 uncharacterized protein LOC111431456 [Cucurbita moschata] | e value: 7.1e-78 | % identity: 87.04
Query:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ES GEIDGADALLASRRY C CFPCFG  RSAS+ELSWWERAKAKAK+TKFDG+DHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSSASV
        NRPAAVKLGKFQYDPISYALNFDEG+NGDVDFEG+EY+GGFQNFS RF+AV A  KSS A+V
Subjt:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSSASV

XP_023000665.1 uncharacterized protein LOC111495035 [Cucurbita maxima] | e value: 4.1e-78 | % identity: 87.04
Query:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ES GEIDGADALLASRRYSC CFPCFG  RSAS+ELSWWERAKAKAK+TKFDG+DHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSSASV
        NRPAAVKLGKFQYDPISYALNFDEG+NGDV+FEG+EY+GGFQNFS RF+AV A  KSS A+V
Subjt:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSSASV

XP_023519821.1 uncharacterized protein LOC111783153 [Cucurbita pepo subsp. pepo] | e value: 2.4e-78 | % identity: 87.65
Query:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ES GEIDGADALLASRRYSC CFPCFG  RSAS+ELSWWERAKAKAK+TKFDG+DHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSSASV
        NRPAAVKLGKFQYDPISYALNFDEG+NGDVDFEG+EY+GGFQNFS RF+AV A  KSS A+V
Subjt:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSSASV

XP_038894780.1 uncharacterized protein LOC120083201 [Benincasa hispida] | e value: 9.2e-78 | % identity: 86.96
Query:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKE ES GEIDGADALLAS+RYSC CFPCFGPRRSAS+E+SWWER K KAK+TKFDG+DHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYN-GGFQNFSGRFAAVQAPVKSSSA
        NRPA VKLGKFQYDPISYALNFD+G+NGDVDF+G+EY+ GGFQNFS RFAAV  PVKSS A
Subjt:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYN-GGFQNFSGRFAAVQAPVKSSSA

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LU30 Uncharacterized protein | e value: 6.2e-72 | % identity: 82.84
Query:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDD-HWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFN
        MGDRDGAFKE ES   IDGADALL S+RYSC CFPCFGP RS S+ELSWWER K KAK+TKFD +D HWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFN
Subjt:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDD-HWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFN

Query:  RNRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYN--GGFQNFSGRFAAV-QAPVK-SSSASVTG
        RNRPA VKLGKFQYDPISYALNFDEG+NGDVDF+G+EYN  GGFQNFS RFAA+  APVK SSSA+V G
Subjt:  RNRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYN--GGFQNFSGRFAAV-QAPVK-SSSASVTG

A0A5A7ULQ3 Uncharacterized protein | e value: 1.8e-71 | % identity: 84.34
Query:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKE ES   IDGADALLAS+RYSC CFPCFGP RS S+ELSWWER K KAK+TKFDG+DHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPAAVKLGKFQYDPISYALNFDEG-NNGDVDFEG-EEY--NGGFQNFSGRFAAV-QAPVKSSSAS
        NRPA VKLGKFQYDPISYALNFDEG NNGDVDFEG  EY   GGFQNFS RFAAV  AP+KSSS++
Subjt:  NRPAAVKLGKFQYDPISYALNFDEG-NNGDVDFEG-EEY--NGGFQNFSGRFAAV-QAPVKSSSAS

A0A6J1ED51 uncharacterized protein LOC111431456 | e value: 3.4e-78 | % identity: 87.04
Query:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ES GEIDGADALLASRRY C CFPCFG  RSAS+ELSWWERAKAKAK+TKFDG+DHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSSASV
        NRPAAVKLGKFQYDPISYALNFDEG+NGDVDFEG+EY+GGFQNFS RF+AV A  KSS A+V
Subjt:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSSASV

A0A6J1GSC5 uncharacterized protein LOC111457052 | e value: 1.3e-72 | % identity: 82.72
Query:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKT---TKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRR
        MGDRDGAFKE ESTGEI GADA L S RY+CLCFPCFGPRRS S+E+SWWERAKA A+       DG+DHWWTGG+RS+KKLREWSEIVAGPRWKTFIRR
Subjt:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKT---TKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRR

Query:  FNRNRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSS
        FNRNRPAAVKLGKFQYDPISYALNFDEGNNGDVDFE EE NGGF+NFS RFAAV APVKS++
Subjt:  FNRNRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSS

A0A6J1KJ01 uncharacterized protein LOC111495035 | e value: 2.0e-78 | % identity: 87.04
Query:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE ES GEIDGADALLASRRYSC CFPCFG  RSAS+ELSWWERAKAKAK+TKFDG+DHWW+GGIRSLKKLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSSASV
        NRPAAVKLGKFQYDPISYALNFDEG+NGDV+FEG+EY+GGFQNFS RF+AV A  KSS A+V
Subjt:  NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSSASV

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) | e value: 9.3e-20 | % identity: 34.91
Query:  EIDGADAL---LASRRYSCLCFPCFGPRRSASEELS-WWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNR---------
        E+D  D +   L ++R  C   PC    + ++   S WW+R        K + D+ WW   IR  +++REWSE+VAGPRWKT+IRRF R+          
Subjt:  EIDGADAL---LASRRYSCLCFPCFGPRRSASEELS-WWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNR---------

Query:  -------------PAAVKLGKFQYDPISYALNFDEGN-NGDVDFEGEEYNGGFQNFSGRFAAVQAPVKS
                       +   GKF+YD +SY+LNFD+GN  G  D E       ++++S RFAA   PV +
Subjt:  -------------PAAVKLGKFQYDPISYALNFDEGN-NGDVDFEGEEYNGGFQNFSGRFAAVQAPVKS

AT3G48020.1 unknown protein | e value: 5.1e-18 | % identity: 40.8
Query:  SASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPAAV---KLGKFQYDPISYALNFDEGNNGDVDFEGEEYN
        S + + SWW+R            +  WW   +R+  K+REWSEIVAGPRWKTFIRRFNR+           KF+YDP+SY L+F++ +  D D  G    
Subjt:  SASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPAAV---KLGKFQYDPISYALNFDEGNNGDVDFEGEEYN

Query:  GGFQNFSGRFAAVQAPVKSSSASVT
        GG ++FS R+A+V      S A ++
Subjt:  GGFQNFSGRFAAVQAPVKSSSASVT

AT5G14890.1 NHL domain-containing protein | e value: 7.2e-20 | % identity: 37.65
Query:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELS-WWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFN
        MG       E ++T E+  A   + ++R  C   PC G  + +    S WW+R +      K + D+ WW  G     K+REWSEIVAGP+WKTFIRRF 
Subjt:  MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELS-WWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFN

Query:  R------------NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKS
        R            NRP  V    F+YD  SY+LNFD+G      FE E     ++++S RFAA   PV +
Subjt:  R------------NRPAAVKLGKFQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKS

AT5G25240.1 unknown protein | e value: 1.3e-08 | % identity: 47.37
Query:  GIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPAAVKLGKFQYDPISYALNFDEGNNG
        G   LK L+E SE +AGP+WK FIR F+  R    +   F YD  +Y+LNFD+G +G
Subjt:  GIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPAAVKLGKFQYDPISYALNFDEGNNG

AT5G62865.1 unknown protein | e value: 1.1e-20 | % identity: 47.14
Query:  CLCFPCFGPRRSASE-ELSWWERAKAKAKTTKFDGD----DHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPAAVKLG---KFQYDPISYALNF
        C CFP F   RS++    S W R +     +   GD      WW   IR+  K+REWSEIVAGPRWKTFIRRFNR+           KFQYDP+SY+LNF
Subjt:  CLCFPCFGPRRSASE-ELSWWERAKAKAKTTKFDGD----DHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPAAVKLG---KFQYDPISYALNF

Query:  DEGNNGDVDFEGEEY--NGGFQNFSGRFAAVQAPVKSSSA
        D+      D E +EY   GG ++FS RFA+V  PV S  A
Subjt:  DEGNNGDVDFEGEEY--NGGFQNFSGRFAAVQAPVKSSSA
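The % identity values in the hit tables can be cross-checked against the unlabelled middle line of each alignment. The sketch below assumes the usual BLAST-style convention, which this page does not state explicitly: a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical. The example uses the AT5G25240.1 alignment from the Arabidopsis hits.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percentage of alignment columns whose midline character is a letter.

    The aligned length is passed separately because trailing spaces on the
    midline may have been stripped when the page was copied.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# Query line and middle line of the AT5G25240.1 alignment, copied from the record.
query = "GIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPAAVKLGKFQYDPISYALNFDEGNNG"
midline = "G   LK L+E SE +AGP+WK FIR F+  R    +   F YD  +Y+LNFD+G +G"

print(percent_identity(midline, len(query)))
```

The record lists 47.37 for this hit, so the printed value can be compared with that figure directly. Alignments that span two blocks would need the middle lines and lengths of both blocks combined before dividing.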


Sequences
CDS sequence
ATGGGGGACCGTGACGGAGCTTTCAAAGAATTTGAATCCACCGGCGAAATCGACGGCGCCGATGCTCTTTTGGCTTCTAGACGATATAGTTGTTTGTGCTTCCCTTGCTT
TGGACCTCGCCGGTCGGCTTCCGAGGAGCTCTCTTGGTGGGAACGGGCGAAGGCGAAGGCCAAAACGACGAAGTTCGACGGCGACGATCACTGGTGGACCGGCGGAATCA
GATCCCTCAAGAAGCTTCGTGAATGGTCCGAGATCGTTGCCGGTCCTAGATGGAAGACCTTCATTCGGCGCTTCAACCGGAACCGGCCCGCCGCCGTGAAGCTTGGGAAA
TTCCAGTACGATCCCATCAGTTACGCTTTGAATTTCGACGAGGGCAATAACGGTGATGTGGATTTCGAAGGGGAGGAATACAACGGTGGGTTTCAAAACTTCTCCGGCCG
GTTTGCTGCCGTGCAGGCGCCGGTGAAGTCTTCGTCGGCTTCGGTGACTGGATAG
mRNA sequence
TTTTTTTCCCAATAATAAATAATTAATTACTCTTTAGTGCTCTCGCTCCTTGTCACGGCTCTGTAGCTTTCGGATAATCCGAGTCCAACTACCGGCTAACAAACACTGGG
CCCCACCACTTTACCGGCAGTCTCCGGTCGCCCACTGTTTCTTTGTTTCTTTCAACCCGTTGGCTCTATATACGGGAGCCTAATCGGACTCTGAATCCCTAATCTCACTC
TTTGCGGCCATGGGGGACCGTGACGGAGCTTTCAAAGAATTTGAATCCACCGGCGAAATCGACGGCGCCGATGCTCTTTTGGCTTCTAGACGATATAGTTGTTTGTGCTT
CCCTTGCTTTGGACCTCGCCGGTCGGCTTCCGAGGAGCTCTCTTGGTGGGAACGGGCGAAGGCGAAGGCCAAAACGACGAAGTTCGACGGCGACGATCACTGGTGGACCG
GCGGAATCAGATCCCTCAAGAAGCTTCGTGAATGGTCCGAGATCGTTGCCGGTCCTAGATGGAAGACCTTCATTCGGCGCTTCAACCGGAACCGGCCCGCCGCCGTGAAG
CTTGGGAAATTCCAGTACGATCCCATCAGTTACGCTTTGAATTTCGACGAGGGCAATAACGGTGATGTGGATTTCGAAGGGGAGGAATACAACGGTGGGTTTCAAAACTT
CTCCGGCCGGTTTGCTGCCGTGCAGGCGCCGGTGAAGTCTTCGTCGGCTTCGGTGACTGGATAGGGCGCTGCGGCGGCGGCAGCGGTCTTTTTTATTTTTCTGGTGAAGT
TGAACCGGTGGTGGGATTATAACGCATCTTTTTTCAACTGATGAGCTGGACGTTGACCGTTTGCAATGAATTAGGAGGCGCCAATGGCGGCGGTACACATGTTCACTAAA
TAGTTTACTTTTTCGTGGGGGCTACGCTTACGTGTAGCAAAAAAATTGGTGTTTATATTCTTATAGTTGATATGATTTTGTTTATTTTATTCAATTAAATTTGTATAATT
CAAACAGATTTTTATTTATATAAATTCTTATAGCGTGAAAAATATAT
Protein sequence
MGDRDGAFKEFESTGEIDGADALLASRRYSCLCFPCFGPRRSASEELSWWERAKAKAKTTKFDGDDHWWTGGIRSLKKLREWSEIVAGPRWKTFIRRFNRNRPAAVKLGK
FQYDPISYALNFDEGNNGDVDFEGEEYNGGFQNFSGRFAAVQAPVKSSSASVTG
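The protein sequence can be re-derived from the CDS as a consistency check. This is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the record does not say which table was used, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons and last 4 codons of the CDS listed above.
print(translate("ATGGGGGACCGTGACGGAGCTTTCAAAGAATTTGAATCC"))  # MGDRDGAFKEFES
print(translate("GTGACTGGATAG"))  # VTG
```

Passing the full CDS, with its line breaks, to `translate` should give a string that can be compared with the protein sequence above once the protein's own line breaks are removed.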