; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016647 (gene) of Snake gourd v1 genome

Gene IDTan0016647
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionNHL domain-containing protein
Genome locationLG09:72244488..72244997
RNA-Seq ExpressionTan0016647
SyntenyTan0016647
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587481.1 hypothetical protein SDJN03_16046, partial [Cucurbita argyrosperma subsp. sororia]4.0e-7686.31Show/hide
Query:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA
        MA   AEE PPCEADKSNSLLPTT CGC RLSCF SRRT AGGPSWWERIRAS+VH+EGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSG +A
Subjt:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA

Query:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTV
        AARPGKFQYDPLSYAMNFDE SGQIGELDDDIDDFSGFRNFSARYASIP  +KT  GTTV  K+  TV
Subjt:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTV

KAG7021468.1 putative alpha,alpha-trehalose-phosphate synthase [UDP-forming] 10 [Cucurbita argyrosperma subsp. argyrosperma]2.3e-7687.12Show/hide
Query:  AEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSAAARPG
        +EEPP CEADKSNSLLPTT CGC RLSCF SRRT AGGPSWWERIRAS+VH+EGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSG +AAARPG
Subjt:  AEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSAAARPG

Query:  KFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTV
        KFQYDPLSYAMNFDE SGQIGELDDDIDDFSGFRNFSARYASIP  +KT  GTTV  K+  TV
Subjt:  KFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTV

XP_022921694.1 uncharacterized protein LOC111429865 [Cucurbita moschata]1.2e-7582.84Show/hide
Query:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA
        MA  DAEEPPPCE D SNSLLPTTRCGCFRLSC   RR  A GPSWWERIRASQVHSEGRWWARGVRV+LK+REWSEIVAGPRWKTFIRRFNRNR+GG+ 
Subjt:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA

Query:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTVA
        A RPGKFQYDPLSYAMNFDE S QIGELDDD DDF+G+RNFSARYASIPAP+KTGG T    K++STVA
Subjt:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTVA

XP_022928159.1 uncharacterized protein LOC111435061 [Cucurbita moschata]2.1e-7786.31Show/hide
Query:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA
        MA   AEEPP CEADKSNSLLPTT CGC RLSCF SRRT AGGPSWWERIRAS+VH+EGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSG +A
Subjt:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA

Query:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTV
        AARPGKFQYDPLSYAMNFDE SGQIGELDDDIDDFSGFRNFSARYASIP  +KT  GTTV  K+  TV
Subjt:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTV

XP_023003975.1 uncharacterized protein LOC111497423 [Cucurbita maxima]2.1e-7786.31Show/hide
Query:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA
        MA   AEEPP CEADKSNSLLPTT C C RLSCF SRRT AGGPSWWERIRAS+VH+EGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSG +A
Subjt:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA

Query:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTV
        AARPGKFQYDPLSYAMNFDE SGQIGELDDDIDDFSGFRNFSARYASIP  +KT GGTTV  K+  TV
Subjt:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTV

TrEMBL top hitse value%identityAlignment
A0A0A0LUB2 Uncharacterized protein1.2e-7077.22Show/hide
Query:  MARTDAEEPPP-CEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS---
        MA TD EE  P C+ADKSNSLLPT RCGCFRLSCF SRR    GPSWWERIRASQVH+EGRWW RGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS   
Subjt:  MARTDAEEPPP-CEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS---

Query:  -------GGSAAARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTVA
               GG +  R GKFQYDPLSYAMNFDE S QIGELDDDIDDF+G+RNFSARYASIPAP+KTGG      KNVS VA
Subjt:  -------GGSAAARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTVA

A0A6J1E188 uncharacterized protein LOC1114298655.6e-7682.84Show/hide
Query:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA
        MA  DAEEPPPCE D SNSLLPTTRCGCFRLSC   RR  A GPSWWERIRASQVHSEGRWWARGVRV+LK+REWSEIVAGPRWKTFIRRFNRNR+GG+ 
Subjt:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA

Query:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTVA
        A RPGKFQYDPLSYAMNFDE S QIGELDDD DDF+G+RNFSARYASIPAP+KTGG T    K++STVA
Subjt:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTVA

A0A6J1EK25 uncharacterized protein LOC1114350611.0e-7786.31Show/hide
Query:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA
        MA   AEEPP CEADKSNSLLPTT CGC RLSCF SRRT AGGPSWWERIRAS+VH+EGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSG +A
Subjt:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA

Query:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTV
        AARPGKFQYDPLSYAMNFDE SGQIGELDDDIDDFSGFRNFSARYASIP  +KT  GTTV  K+  TV
Subjt:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTV

A0A6J1JH52 uncharacterized protein LOC1114856665.3e-7481.03Show/hide
Query:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS----
        MA  DAEEPPP EAD SNSLLPTTRCGCFRLSC  SRR  A GPSWWERIR SQVHSEGRWWARGVRV+LK+REWSEIVAGPRWKTFIRRFNRNR+    
Subjt:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRS----

Query:  -GGSAAARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTVA
         GG+AA RPGKFQYDPLSYAMNFDE S QIGELDDD DDF+G+RNFSARYASIPAP+KTGG T    K++STVA
Subjt:  -GGSAAARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTVA

A0A6J1KTA4 uncharacterized protein LOC1114974231.0e-7786.31Show/hide
Query:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA
        MA   AEEPP CEADKSNSLLPTT C C RLSCF SRRT AGGPSWWERIRAS+VH+EGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSG +A
Subjt:  MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSA

Query:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTV
        AARPGKFQYDPLSYAMNFDE SGQIGELDDDIDDFSGFRNFSARYASIP  +KT GGTTV  K+  TV
Subjt:  AARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)4.9e-2437.79Show/hide
Query:  EEPPPCEADKSNSL---LPTTRCGCFRLSCFVSRR-TPAGGPSWWERI-RASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNR--------
        + PP  E D ++ +   L   R  CF + C  S + +  GG  WW+RI    ++  + RWW RG R   ++REWSE+VAGPRWKT+IRRF R        
Subjt:  EEPPPCEADKSNSL---LPTTRCGCFRLSCFVSRR-TPAGGPSWWERI-RASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNR--------

Query:  ----NRSGG-------SAAARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKT
            N SGG       + ++  GKF+YD LSY++NFD+ + Q G  DD+      +R++S R+A+   P+ T
Subjt:  ----NRSGG-------SAAARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKT

AT3G48020.1 unknown protein1.3e-2453.33Show/hide
Query:  SWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSAAARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSAR
        SWW+RI  +  H E RWW   VR  LK+REWSEIVAGPRWKTFIRRFNR+   G       KF+YDP+SY ++F++        DDD     G R+FS R
Subjt:  SWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSAAARPGKFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSAR

Query:  YASIP
        YAS+P
Subjt:  YASIP

AT5G14890.1 NHL domain-containing protein7.9e-2239.6Show/hide
Query:  DKSNSLLPTTRCGCFRLSCFVSRRTPAG--GPSWWERIR-ASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNR------SGGSAAARPG
        D+ +  +   R  CF L C  S + P+G  G  WW+RIR   ++  + RWW  G    +K+REWSEIVAGP+WKTFIRRF RN        GG       
Subjt:  DKSNSLLPTTRCGCFRLSCFVSRRTPAG--GPSWWERIR-ASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNR------SGGSAAARPG

Query:  KFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKT
         F+YD  SY++NFD+   Q G  +D+      +R++S R+A+   P+ T
Subjt:  KFQYDPLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKT

AT5G25240.1 unknown protein1.6e-0636.84Show/hide
Query:  CGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSAAARPGKFQYDPLSYAMNFDE
        CGC     F   R   G      R   S    E R    G   L  L+E SE +AGP+WK FIR F+   SG     R   F YD  +Y++NFD+
Subjt:  CGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSAAARPGKFQYDPLSYAMNFDE

AT5G62865.1 unknown protein1.7e-2450.37Show/hide
Query:  RCGCFRLSCFVSRRTPAGGPSWWERIRA--SQVHS-----EGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSAAARPGKFQYDPLSYAMN
        RC CF  S   SR + A G S W RIR      HS     E RWW   +R  LK+REWSEIVAGPRWKTFIRRFNR+   G       KFQYDPLSY++N
Subjt:  RCGCFRLSCFVSRRTPAGGPSWWERIRA--SQVHS-----EGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSAAARPGKFQYDPLSYAMN

Query:  FDESSGQIGELDDDIDDF---SGFRNFSARYASIP
        FD+        DD+ D++    G R+FS R+AS+P
Subjt:  FDESSGQIGELDDDIDDF---SGFRNFSARYASIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCGTACAGATGCAGAAGAGCCGCCTCCTTGTGAAGCGGATAAATCCAATTCTCTGTTGCCGACGACCAGGTGTGGTTGTTTCCGGCTATCGTGCTTCGTATCTCG
CCGAACGCCCGCCGGCGGGCCGTCCTGGTGGGAGCGGATTAGGGCATCTCAGGTTCACAGCGAGGGGCGCTGGTGGGCGCGAGGCGTAAGGGTTCTTTTGAAGCTCCGAG
AGTGGTCGGAGATCGTCGCTGGACCGAGATGGAAGACTTTCATTCGCCGATTTAATCGAAATCGGAGTGGTGGTAGTGCTGCTGCTAGGCCTGGGAAATTTCAATACGAT
CCATTGAGTTACGCTATGAATTTCGACGAAAGTTCGGGGCAGATAGGGGAATTGGACGACGATATCGACGATTTCAGTGGGTTTCGGAACTTTTCCGCTCGTTACGCTTC
GATTCCGGCGCCGATGAAGACTGGAGGAGGAACCACTGTTTTCGAGAAGAATGTTTCCACCGTCGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCGTACAGATGCAGAAGAGCCGCCTCCTTGTGAAGCGGATAAATCCAATTCTCTGTTGCCGACGACCAGGTGTGGTTGTTTCCGGCTATCGTGCTTCGTATCTCG
CCGAACGCCCGCCGGCGGGCCGTCCTGGTGGGAGCGGATTAGGGCATCTCAGGTTCACAGCGAGGGGCGCTGGTGGGCGCGAGGCGTAAGGGTTCTTTTGAAGCTCCGAG
AGTGGTCGGAGATCGTCGCTGGACCGAGATGGAAGACTTTCATTCGCCGATTTAATCGAAATCGGAGTGGTGGTAGTGCTGCTGCTAGGCCTGGGAAATTTCAATACGAT
CCATTGAGTTACGCTATGAATTTCGACGAAAGTTCGGGGCAGATAGGGGAATTGGACGACGATATCGACGATTTCAGTGGGTTTCGGAACTTTTCCGCTCGTTACGCTTC
GATTCCGGCGCCGATGAAGACTGGAGGAGGAACCACTGTTTTCGAGAAGAATGTTTCCACCGTCGCGTAA
Protein sequenceShow/hide protein sequence
MARTDAEEPPPCEADKSNSLLPTTRCGCFRLSCFVSRRTPAGGPSWWERIRASQVHSEGRWWARGVRVLLKLREWSEIVAGPRWKTFIRRFNRNRSGGSAAARPGKFQYD
PLSYAMNFDESSGQIGELDDDIDDFSGFRNFSARYASIPAPMKTGGGTTVFEKNVSTVA