CuGenDBv2

Tan0004641 (gene) of Snake gourd v1 genome

Gene ID: Tan0004641
Organism: Trichosanthes anguina (Snake gourd v1)
Description: BEST Arabidopsis thaliana protein match is: NHL domain-containing protein.
Genome location: LG09:67137682..67140159
RNA-Seq Expression: Tan0004641
Synteny: Tan0004641
Gene Ontology terms: NA
InterPro domains: NA
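
The genome location above has the form `sequence:start..end`. The following is a minimal sketch for parsing it and computing the span. The function names `parse_location` and `span_length` are hypothetical, and the code assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re

def parse_location(location):
    """Split 'LG09:67137682..67140159' into (sequence name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive (not stated on the page).
    return abs(end - start) + 1

name, start, end = parse_location("LG09:67137682..67140159")
print(name, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 2478 bp on LG09.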


Homology
GenBank top hits (hit; e-value; % identity; alignment)
KAG6587852.1 hypothetical protein SDJN03_16417, partial [Cucurbita argyrosperma subsp. sororia]; e-value 1.6e-73; identity 91.22%
Query:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQ-QGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRPPPTPYSPLN+EQE LHD DEAVPSNGC CF+LFGFGFNRNGNYE+GNLLQQ +GRE+E WMV+KLKKLKEVSEMVAGPKWKNFIRKMGGYLK
Subjt:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQ-QGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEDDRPPIGFSTRFAVPLASREQH
        GKKQRNRFQYDPESYALNFDGG DGEDDRPPIGFS+RFAVPLASREQH
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEDDRPPIGFSTRFAVPLASREQH

XP_011648475.1 uncharacterized protein LOC105434480 [Cucumis sativus]; e-value 1.4e-64; identity 82.43%
Query:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQ-QGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRPP TPYSPL  +Q+DL DID+++ SNGCGCF+LFGFG NRN NYE GNLLQQ QGRE+ESWMV++LKK++EVSEMVAGPKWKNFIRKMGGYLK
Subjt:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQ-QGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEDD--RPPIGFSTRFAVPLASRE
        GKK+RNRFQYDPESYALNFDGGFDGE+D   PPIGFS+RFAVPLASRE
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEDD--RPPIGFSTRFAVPLASRE

XP_022927019.1 uncharacterized protein LOC111433973 [Cucurbita moschata]; e-value 7.2e-74; identity 91.22%
Query:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQQ-GREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRPPPTPYSPLN+EQE LHD DEA+PSNGC CF+LFGFGFNRNGNYE+GNLLQQQ GRE+E WMV+KLKKLKEVSEMVAGPKWKNFIRKMGGYLK
Subjt:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQQ-GREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEDDRPPIGFSTRFAVPLASREQH
        GKKQRNRFQYDPESYALNFDGG DGEDDRPPIGFS+RFAVPLASREQH
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEDDRPPIGFSTRFAVPLASREQH

XP_023003195.1 uncharacterized protein LOC111496880 [Cucurbita maxima]; e-value 9.5e-74; identity 91.22%
Query:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQQ-GREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRPPPTPYSPLN+EQE LHD DEAVPSNGC CF+LFGFGFNRNGNYE+GNLLQQQ GRE+E WMV+KLKKLKEVSEMVAGPKWKNFIRKMGGYLK
Subjt:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQQ-GREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEDDRPPIGFSTRFAVPLASREQH
        G+KQRNRFQYDPESYALNFDGG DGEDDRPPIGFS+RFAVPLASREQH
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEDDRPPIGFSTRFAVPLASREQH

XP_038876901.1 uncharacterized protein LOC120069255 [Benincasa hispida]; e-value 1.5e-71; identity 88.67%
Query:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQ-QGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRPP TPYSPLN++Q+DL D+DEAVPSNGCGCFRLFGFGFNRNGNYE  NLLQQ QGRE+ESWMVRKLKK+KEVSEMVAGPKWKNF+RKMGGYLK
Subjt:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQ-QGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEDD--RPPIGFSTRFAVPLASREQH
        GKKQRNRFQYDPESYALNFDGGFDGE+D   PPIGFS+RFAVPLASREQH
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEDD--RPPIGFSTRFAVPLASREQH

TrEMBL top hits (hit; e-value; % identity; alignment)
A0A0A0LTG0 Uncharacterized protein; e-value 6.6e-65; identity 82.43%
Query:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQ-QGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRPP TPYSPL  +Q+DL DID+++ SNGCGCF+LFGFG NRN NYE GNLLQQ QGRE+ESWMV++LKK++EVSEMVAGPKWKNFIRKMGGYLK
Subjt:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQ-QGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEDD--RPPIGFSTRFAVPLASRE
        GKK+RNRFQYDPESYALNFDGGFDGE+D   PPIGFS+RFAVPLASRE
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEDD--RPPIGFSTRFAVPLASRE

A0A1S3B8N4 uncharacterized protein LOC103487383; e-value 1.1e-64; identity 83.89%
Query:  MATHQTRPPPTPYSPLN-QEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQ-QGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYL
        MA+HQTRPP TPYSPLN Q+Q+DL DID+++ SNGCGCF+LFGFG NRN NYE  NLLQQ QGRE+ESWMV+KLKK+KEVSEMVAGPKWKNFIRKMGGYL
Subjt:  MATHQTRPPPTPYSPLN-QEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQ-QGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYL

Query:  KGKKQRNRFQYDPESYALNFDGGFDGEDD--RPPIGFSTRFAVPLASRE
        KGKKQRNRFQYDPESYALNFDGGFDGE+D   PPIGFS+RFAVPLASRE
Subjt:  KGKKQRNRFQYDPESYALNFDGGFDGEDD--RPPIGFSTRFAVPLASRE

A0A5A7U6Q0 Uncharacterized protein; e-value 1.1e-64; identity 83.89%
Query:  MATHQTRPPPTPYSPLN-QEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQ-QGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYL
        MA+HQTRPP TPYSPLN Q+Q+DL DID+++ SNGCGCF+LFGFG NRN NYE  NLLQQ QGRE+ESWMV+KLKK+KEVSEMVAGPKWKNFIRKMGGYL
Subjt:  MATHQTRPPPTPYSPLN-QEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQ-QGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYL

Query:  KGKKQRNRFQYDPESYALNFDGGFDGEDD--RPPIGFSTRFAVPLASRE
        KGKKQRNRFQYDPESYALNFDGGFDGE+D   PPIGFS+RFAVPLASRE
Subjt:  KGKKQRNRFQYDPESYALNFDGGFDGEDD--RPPIGFSTRFAVPLASRE

A0A6J1EGU0 uncharacterized protein LOC111433973; e-value 3.5e-74; identity 91.22%
Query:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQQ-GREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRPPPTPYSPLN+EQE LHD DEA+PSNGC CF+LFGFGFNRNGNYE+GNLLQQQ GRE+E WMV+KLKKLKEVSEMVAGPKWKNFIRKMGGYLK
Subjt:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQQ-GREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEDDRPPIGFSTRFAVPLASREQH
        GKKQRNRFQYDPESYALNFDGG DGEDDRPPIGFS+RFAVPLASREQH
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEDDRPPIGFSTRFAVPLASREQH

A0A6J1KR34 uncharacterized protein LOC111496880; e-value 4.6e-74; identity 91.22%
Query:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQQ-GREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRPPPTPYSPLN+EQE LHD DEAVPSNGC CF+LFGFGFNRNGNYE+GNLLQQQ GRE+E WMV+KLKKLKEVSEMVAGPKWKNFIRKMGGYLK
Subjt:  MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQQ-GREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEDDRPPIGFSTRFAVPLASREQH
        G+KQRNRFQYDPESYALNFDGG DGEDDRPPIGFS+RFAVPLASREQH
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEDDRPPIGFSTRFAVPLASREQH

SwissProt top hits (hit; e-value; % identity; alignment)
No hits found
Arabidopsis top hits (hit; e-value; % identity; alignment)
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1); e-value 8.7e-09; identity 36.27%
Query:  DESWMVRKLKKLKEVSEMVAGPKWKNFIRKMG----------------GYLKGKKQRNR------FQYDPESYALNFDGGFDG---EDDRPPIGFSTRFA
        DE W +R  ++++E SE+VAGP+WK +IR+ G                G   G    NR      F+YD  SY+LNFD G      +D+ P   +S RFA
Subjt:  DESWMVRKLKKLKEVSEMVAGPKWKNFIRKMG----------------GYLKGKKQRNR------FQYDPESYALNFDGGFDG---EDDRPPIGFSTRFA

Query:  VP
         P
Subjt:  VP

AT3G48020.1 unknown protein; e-value 7.9e-10; identity 37.89%
Query:  LQQQGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMG-GYLKGK--KQRNRFQYDPESYALNFDGGFDGEDDRPPIG----FSTRFA-VPLAS
        + +   ++  W VR   K++E SE+VAGP+WK FIR+      +G+     ++F+YDP SY L+F+     +DD   +G    FS R+A VP+AS
Subjt:  LQQQGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMG-GYLKGK--KQRNRFQYDPESYALNFDGGFDGEDDRPPIG----FSTRFA-VPLAS

AT5G14890.1 NHL domain-containing protein; e-value 2.7e-10; identity 43.82%
Query:  DESWMVRKLKKLKEVSEMVAGPKWKNFIRKMG------GYLKG---KKQRNRFQYDPESYALNFDGGFDG---EDDRPPIGFSTRFAVP
        DE W V    K++E SE+VAGPKWK FIR+ G      G + G   + +   F+YD  SY+LNFD G      ED+ P   +S RFA P
Subjt:  DESWMVRKLKKLKEVSEMVAGPKWKNFIRKMG------GYLKG---KKQRNRFQYDPESYALNFDGGFDG---EDDRPPIGFSTRFAVP

AT5G25240.1 unknown protein; e-value 1.9e-16; identity 41.46%
Query:  DIDEAVPSNGCGCFRLFGFGFNRNGNYED------GNLLQQQGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLKGKKQRNRFQYDPESYALNF
        D +E     GCG FR F F   R G+ E          LQ++ R   +W   KLK LKE+SE +AGPKWKNFIR      K  ++   F YD ++Y+LNF
Subjt:  DIDEAVPSNGCGCFRLFGFGFNRNGNYED------GNLLQQQGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLKGKKQRNRFQYDPESYALNF

Query:  DGGFDGEDDRPPIGFSTRFAVPL
        D G DG+D  P      RF  P+
Subjt:  DGGFDGEDDRPPIGFSTRFAVPL

AT5G62865.1 unknown protein; e-value 2.1e-10; identity 42%
Query:  EDGNLLQQQGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLKGKKQ---RNRFQYDPESYALNFDGGFDGEDDRPPIG----FSTRFA-VPLAS
        +D N     G ++  W +R   K++E SE+VAGP+WK FIR+     +  +      +FQYDP SY+LNFD   D ED+   +G    FSTRFA VP+ S
Subjt:  EDGNLLQQQGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLKGKKQ---RNRFQYDPESYALNFDGGFDGEDDRPPIG----FSTRFA-VPLAS
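
In the BLAST-style alignments above, the unlabeled middle line repeats a residue where the two sequences are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. The sketch below counts identities from such a midline. The function name `identity_stats` is hypothetical, and the example uses only the second block of the KAG6587852.1 alignment, so its value is a per-block figure rather than the whole-alignment % identity listed with the hit.

```python
def identity_stats(midline, width=None):
    """Count identical and positive columns in a BLAST-style midline.

    A letter marks an identical residue, '+' a conservative substitution,
    and a space a mismatch or gap. `width` is the number of aligned
    columns; it defaults to len(midline) and is useful when trailing
    spaces were stripped from the midline.
    """
    columns = len(midline) if width is None else width
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, columns

# Second block of the KAG6587852.1 alignment (query and midline).
query   = "GKKQRNRFQYDPESYALNFDGGFDGEDDRPPIGFSTRFAVPLASREQH"
midline = "GKKQRNRFQYDPESYALNFDGG DGEDDRPPIGFS+RFAVPLASREQH"

identical, positives, columns = identity_stats(midline, width=len(query))
print(f"{identical}/{columns} identical = {100 * identical / columns:.2f}%")
```

To approximate the reported % identity for a hit, the counts from all of its blocks would be summed before dividing.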


Sequences
CDS sequence
ATGGCGACCCATCAAACCAGACCGCCCCCCACGCCTTATTCACCTCTAAACCAGGAACAAGAAGACCTTCACGACATCGACGAGGCCGTTCCCTCAAATGGGTGCGGCTG
TTTCCGGCTATTCGGGTTCGGATTCAATCGGAATGGCAATTACGAAGACGGAAATCTTTTGCAGCAACAGGGTCGGGAAGACGAATCTTGGATGGTGAGGAAATTGAAGA
AGTTGAAGGAGGTTTCAGAAATGGTGGCTGGACCCAAATGGAAGAATTTCATTAGAAAAATGGGTGGGTATTTGAAGGGGAAAAAACAGAGGAACAGATTTCAGTATGAC
CCTGAAAGCTATGCTCTGAATTTCGATGGCGGGTTTGATGGAGAAGACGATCGTCCGCCAATTGGCTTCTCTACGAGGTTTGCTGTGCCTTTGGCTTCTAGGGAACAACA
TTGA
mRNA sequence
GGGTTTTGTTTTCACAAAGTTTCCCTTCTCTCGACGTCATATAAACTCATCTTTAGACGCATTCAAGGGCAAATCAGAAGAGAATGAAAACGGAAGAAACGACCCATTTT
CCCAATTCTCGCCCAAATCTAAACCCTCCAATTCTGCGTCATTTCGATTCACCTTTATATCCCTTCAATGGCTTCCACAAACCCCATAACTGAACAAAACCCTCCATCCC
CCAAATCCCACTTTACTTAAAACCTCTGCCCTCTCTCTCCCATGGCGACCCATCAAACCAGACCGCCCCCCACGCCTTATTCACCTCTAAACCAGGAACAAGAAGACCTT
CACGACATCGACGAGGCCGTTCCCTCAAATGGGTGCGGCTGTTTCCGGCTATTCGGGTTCGGATTCAATCGGAATGGCAATTACGAAGACGGAAATCTTTTGCAGCAACA
GGGTCGGGAAGACGAATCTTGGATGGTGAGGAAATTGAAGAAGTTGAAGGAGGTTTCAGAAATGGTGGCTGGACCCAAATGGAAGAATTTCATTAGAAAAATGGGTGGGT
ATTTGAAGGGGAAAAAACAGAGGAACAGATTTCAGTATGACCCTGAAAGCTATGCTCTGAATTTCGATGGCGGGTTTGATGGAGAAGACGATCGTCCGCCAATTGGCTTC
TCTACGAGGTTTGCTGTGCCTTTGGCTTCTAGGGAACAACATTGAAGGAATTGATTGATTGACTGTGATTGTTCACGGAAATAATTGGGGAATTGTTTGGATTGATTCAT
ATCATATACTGTTTGTTAGTGTGTATATATGGAGAAATTTGGATCATTGGTTCATTCGTTCTTCTTAATTGATTTGCTTGGTTTCGTTCTAAGAGAATTTGTGCTTGAAA
ATTCTACAAATTGAAAGACTATCTTGCAATTTCTTGTTGTTTTGTCCACAAGTGAGTCTAAAAACTTTATATCAACTATTAAGTGAGGTAATCAGAGAAGCAATAGCAGA
GGTTTTGTTTCTAGATTTTGCAGTAATAGCATTGTTGTTACTGTTTAGTGATTTTTGGGCACAGATATCCAAGTCATTCTCCAAGCCATCTTGAAATAATTCCAGCATAG
TCCGAGACCATGGATGCAAGAAGACACAATACAACTTTGCTTTTGGATTTGTGAAGGGAGGCAAGAAGCTTTATCTATGCCACACCTTGAGATGAAGTTTGGGAGGCTCG
AACTCTCACACAATGGAGTATTGTGCATCGCTATAAAAGGAAAAAAAAAATGTGTGCATCCTAGGATGTTGAAGTGGATTGCACAAGATACAGGTGCATTGTGTCACAAC
CATACTAGTCGAGTGTAAGTTGTGACCCCACGAGTACCATGGTACGATCGAGGAGGATAGTGCATCATCCAACACATCAAAATAGGTAAGCATGAATACTAAGTTGAGGG
CAACCAAGATTGGGGACAACCGTGTCGTCAAAGGGGTCGATGTATGGGCATATGTTCTCTCGCTAAATTGACGTGCAAGGGTGTTAGACCTAATAATCTTCATAATATAC
TAAAGGGGTAGCCTGATGTCAATTATCCACCATCCTTGGGCTCGAGGGCCTAAGTGAGTCACCTCGTGGCACGTTAGCGTGTCATGGCGTGGGCAAGCGTTATAAGACGA
GTGTCTTGGACGTTTTCCTTTGGAATGAGTTTTCAAGGGACAGTGTCGAGTGACACATGTGAACCGTCATTGCGCCCCTTCCCATGGCCTGGAGGTTTTGATATAGGGCT
TGCTAACAACCAGCAGTTCGAGAATGTCTCGACAAAGCCAACTCATGAGAAAGATCGAGAGCATGTTCACAGACGGGCGACGTGACGTGTACGACTAACAGACGGGCGAC
GTGTACGACTAACAGACGGGCGACGTGTACGACTAAAGTTAAGTTCTGCTGCATATTTATTTGATGTTTGGTTGAGGATGAAGAACCTCGCCTAAGGTTCGATGAAGCGT
TGCGATCATAGAGATGGGATCATGGTTAATATGTACGATGCCTCAATATGTATGATGTATTAGACAAAGTTCAAAGGTGGAGTGTGATACCTCTATGCCTCAATATGTAC
GATGTATTAGCTCTGCCAATGCTCGCATGTTGATGGGTGTATTCCATCAATTACTTAAAATCACTCCCTTTATAAAAGAACAATACATATTGTTGTTTGTACATCATTTT
AGCCGACTTCGAAGCAATCAAAATATTCACTAAAAGTCTTTTATCTCTTGATAAATTTGATCTTTTGCTTGAAAAGTAACAGCTTGACACAAGTTTGCAGAAAATTCAAG
TCAAAATTGTTTGGCTAGGGAGATCGCCAGAAATAGGGTGGAAATTGTCAAAATTGTTTTTGTTCATACCCTCATATTCTCAGAAGTTATGAAGTTGTTTAGACCACTGA
GTGAGTTATAATAATAGTTTAAGTCATAATAATCTGTGGGTTATAATAATATGTAGAG
Protein sequence
MATHQTRPPPTPYSPLNQEQEDLHDIDEAVPSNGCGCFRLFGFGFNRNGNYEDGNLLQQQGREDESWMVRKLKKLKEVSEMVAGPKWKNFIRKMGGYLKGKKQRNRFQYD
PESYALNFDGGFDGEDDRPPIGFSTRFAVPLASREQH
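
The protein sequence above can be checked against the CDS by translating it. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene. The `translate` helper is hypothetical and is shown on short fragments copied from the start and end of the CDS, not on the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS to protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCGACCCATCAAACCAGA"))  # first seven codons of the CDS
print(translate("GAACAACATTGA"))           # last four codons, ending in TGA
```

The two fragments translate to `MATHQTR` and `EQH`, which match the start and end of the protein sequence listed above.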