CuGenDBv2

Tan0010601 (gene) of Snake gourd v1 genome

Gene ID: Tan0010601
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Transmembrane protein
Genome location: LG06:21729349..21731514
RNA-Seq Expression: Tan0010601
Synteny: Tan0010601
Gene Ontology terms:
GO:0070072 - vacuolar proton-transporting V-type ATPase complex assembly (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
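
The genome location above uses a `seqid:start..end` notation. A minimal Python sketch for splitting it into its parts and deriving the span of the gene follows. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `GenomeLocation` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'LG06:21729349..21731514'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        # Normalise so that start <= end regardless of input order.
        start, end = end, start
    return GenomeLocation(seqid, start, end)


loc = parse_location("LG06:21729349..21731514")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 2166 bp on LG06.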


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6592966.1 hypothetical protein SDJN03_12442, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 1.4e-69; % identity: 80.11)
Query:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKKV
        ME+EQRL+LI LAI RLL+ENKNK+SSDRS++ D DENGDRTLL DLLSQIESLKEG ESEELASALD LK+KAKNS  +D  DD E SREDI+KELKKV
Subjt:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKKV

Query:  KKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIENSSKQQHDEASVLLPLQIPELPYVDLQ
        K+QNVLT CL+SVMIVVTVVWQLSEVS++LNVKD+ISHPFR LGSLISGALKRPKTI+ENS KQ+HDEAS+L  LQIPELP++DLQ
Subjt:  KKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIENSSKQQHDEASVLLPLQIPELPYVDLQ

KAG7025378.1 hypothetical protein SDJN02_11873 [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 7.5e-68; % identity: 80.11)
Query:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKKV
        ME+EQRLELI LAI RLL+ENKNK+SSDRS++ D DENGDRTLL DLLSQIESLKEG ESEELASALD LK+KAKNS   D  DD E SREDI+KELKKV
Subjt:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKKV

Query:  KKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIENSSKQQHDEASVLLPLQIPELPYVDLQ
        K+QNVLT CL+SVMIVVTVVWQLSEVS++LNVKD+ISHPFR LGSLISGALKRPKTI+ENS KQ+HDEAS+L  LQIPELP++ LQ
Subjt:  KKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIENSSKQQHDEASVLLPLQIPELPYVDLQ

XP_004140639.1 uncharacterized protein LOC101222559 [Cucumis sativus] (e-value: 6.4e-67; % identity: 77.25)
Query:  EEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKKVK
        EEEQRLELIS AIRRLL +NKNKKSSDRS VDDGDENG+ +LLRDLLSQIESLKEG ESEEL SALD LKTK ++S+KE+IVDD ECSRED++KELKK+K
Subjt:  EEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKKVK

Query:  KQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIEN----SSKQQHDEASVLLPLQIPELPYVDLQ
        +QN+LTHCLLSVMIV+TVVWQLSEVS+ILNVKDKISHPFRSLG+ ISG  +RPKTI++N    SSKQ HDE S+L PL+I +LP V LQ
Subjt:  KQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIEN----SSKQQHDEASVLLPLQIPELPYVDLQ

XP_023004225.1 uncharacterized protein LOC111497620 [Cucurbita maxima] (e-value: 4.9e-67; % identity: 75.25)
Query:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESL------------KEGGESEELASALDALKTKAKNSVKEDIVDDG--
        ME+EQRLELI LAI RLL+ENKNK+SSDRS++ D DENGDRTLL DLLSQIESL            KEG ESEELASALD LK K+KNSVKE+I DD   
Subjt:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESL------------KEGGESEELASALDALKTKAKNSVKEDIVDDG--

Query:  --ECSREDIIKELKKVKKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIENSSKQQHDEASVLLPLQIPELPYVD
          E S+EDI+KELKKVK+QNVLT CL+SVMIVVTVVWQLSEVS++LNVKD+ISHPFR LGSLISGALKRPKTIIENS KQ+HDEAS+L PLQIPELP+V 
Subjt:  --ECSREDIIKELKKVKKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIENSSKQQHDEASVLLPLQIPELPYVD

Query:  LQ
        L+
Subjt:  LQ

XP_038874574.1 uncharacterized protein LOC120067172 [Benincasa hispida] (e-value: 1.2e-70; % identity: 81.68)
Query:  EEEQRLELISLAIRRLLEE--NKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKK
        EEEQRLELISLAIRRLLEE  NKNKKSSDRS +DDGDEN +RTLLRDLLSQIESLKEG ESEE AS LD LKTK ++SVKE+IVDD ECSREDI+KELKK
Subjt:  EEEQRLELISLAIRRLLEE--NKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKK

Query:  VKKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIE----NSSKQQHDEASVLLPLQIPELPYVDLQ
        +K+QN+LTHCLLSVMI+VTVVWQLSEVS+ILNVKDKISHPF+SLG+LISG LKRPKTI++    NSSKQ HDEASVL PL+IPELP+V LQ
Subjt:  VKKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIE----NSSKQQHDEASVLLPLQIPELPYVDLQ

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0K9P4 Uncharacterized protein (e-value: 3.1e-67; % identity: 77.25)
Query:  EEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKKVK
        EEEQRLELIS AIRRLL +NKNKKSSDRS VDDGDENG+ +LLRDLLSQIESLKEG ESEEL SALD LKTK ++S+KE+IVDD ECSRED++KELKK+K
Subjt:  EEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKKVK

Query:  KQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIEN----SSKQQHDEASVLLPLQIPELPYVDLQ
        +QN+LTHCLLSVMIV+TVVWQLSEVS+ILNVKDKISHPFRSLG+ ISG  +RPKTI++N    SSKQ HDE S+L PL+I +LP V LQ
Subjt:  KQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIEN----SSKQQHDEASVLLPLQIPELPYVDLQ

A0A1S3CAM6 uncharacterized protein LOC103498843 (e-value: 3.1e-67; % identity: 75.79)
Query:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKKV
        MEEEQRLELIS AI+RLLE+NKNKKSSDRS VDDGDENG+ +LLRDLLSQIESLKEG ES+E  SALD LKTK ++ +KE+IVDD ECSREDI+KELKK+
Subjt:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKKV

Query:  KKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIE----NSSKQQHDEASVLLPLQIPELPYVDLQ
        K+QN+LTHCLLSVMIV+T+VWQLSEVS+ILNVKDKISHPFRSLG+ ISG  KRPKTI++    NSS+Q +DE S+L PL+IPELP++ LQ
Subjt:  KKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIE----NSSKQQHDEASVLLPLQIPELPYVDLQ

A0A5A7T973 Uncharacterized protein (e-value: 3.1e-67; % identity: 75.79)
Query:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKKV
        MEEEQRLELIS AI+RLLE+NKNKKSSDRS VDDGDENG+ +LLRDLLSQIESLKEG ES+E  SALD LKTK ++ +KE+IVDD ECSREDI+KELKK+
Subjt:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKKV

Query:  KKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIE----NSSKQQHDEASVLLPLQIPELPYVDLQ
        K+QN+LTHCLLSVMIV+T+VWQLSEVS+ILNVKDKISHPFRSLG+ ISG  KRPKTI++    NSS+Q +DE S+L PL+IPELP++ LQ
Subjt:  KKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIE----NSSKQQHDEASVLLPLQIPELPYVDLQ

A0A6J1H7B2 uncharacterized protein LOC111460826 (e-value: 2.2e-65; % identity: 77.37)
Query:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESL----KEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKE
        ME+EQRLELI LAI RLL+ENKNK+SSDRS++ D DE GDRTLL DLLSQIESL    KEG ESEELASALD LK+KAKNS   D  DD + SREDI+KE
Subjt:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESL----KEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKE

Query:  LKKVKKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIENSSKQQHDEASVLLPLQIPELPYVDLQ
        LKKVK+QNVLT CL+SVMIVVTVVWQLSEVS++LNVKD+ISHPFR LGSLISGALKRPKTI+ENS KQ+HDEAS+L  LQIPELP++ LQ
Subjt:  LKKVKKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIENSSKQQHDEASVLLPLQIPELPYVDLQ

A0A6J1KTZ8 uncharacterized protein LOC111497620 (e-value: 2.4e-67; % identity: 75.25)
Query:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESL------------KEGGESEELASALDALKTKAKNSVKEDIVDDG--
        ME+EQRLELI LAI RLL+ENKNK+SSDRS++ D DENGDRTLL DLLSQIESL            KEG ESEELASALD LK K+KNSVKE+I DD   
Subjt:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESL------------KEGGESEELASALDALKTKAKNSVKEDIVDDG--

Query:  --ECSREDIIKELKKVKKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIENSSKQQHDEASVLLPLQIPELPYVD
          E S+EDI+KELKKVK+QNVLT CL+SVMIVVTVVWQLSEVS++LNVKD+ISHPFR LGSLISGALKRPKTIIENS KQ+HDEAS+L PLQIPELP+V 
Subjt:  --ECSREDIIKELKKVKKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIENSSKQQHDEASVLLPLQIPELPYVD

Query:  LQ
        L+
Subjt:  LQ

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G27300.1 unknown protein (e-value: 6.0e-23; % identity: 37.56)
Query:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIES--------LKEGGESEELASALDALKTKAKNSVKEDIVDDGECSRED
        ME  + +E +S AI +LL E + +++S  +F++D D   D+  L  L+SQ+ES        +  G E E+   + D+  +K K+  +  +    E S E+
Subjt:  MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIES--------LKEGGESEELASALDALKTKAKNSVKEDIVDDGECSRED

Query:  IIKELKKVKKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIEN--------SSKQQHDEASVLLP-LQIPEL
        I K++KKVKKQN +TH LLS  I++T+VWQLSE S+I  +KD+ISHP RS+G +++G  K     I+N        + +  H   S   P LQ+PEL
Subjt:  IIKELKKVKKQNVLTHCLLSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIEN--------SSKQQHDEASVLLP-LQIPEL
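
The alignment blocks above follow the usual BLAST-style layout: a query line, a middle line, and a subject line. In that layout the middle line carries a letter where the two residues are identical, `+` where the substitution is conservative, and a space otherwise. The following Python sketch counts those symbols for one alignment segment. The function name `alignment_stats` is hypothetical, and the denominator (full aligned length, gaps included) is an assumption, so the result is not guaranteed to reproduce the % identity values reported on this page.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Summarise one BLAST-style alignment segment from its middle line.

    Assumption: `query` and `midline` are the aligned strings with the
    'Query:' prefix and leading padding already removed, so that they have
    the same length and line up column by column.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A letter on the middle line marks an identical residue pair.
    identical = sum(1 for symbol in midline if symbol.isalpha())
    # '+' marks a conservative (positively scoring) substitution.
    positives = identical + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "percent_identity": 100.0 * identical / length if length else 0.0,
    }


# First ten columns of the first GenBank alignment on this page.
stats = alignment_stats("MEEEQRLELI", "ME+EQRL+LI")
print(stats)
```

For a multi-line alignment, the per-line query and middle-line strings would be concatenated before calling the function.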


Sequences
CDS sequence
ATGGAGGAAGAGCAACGCCTGGAGCTGATCAGCCTCGCCATTAGGAGGCTGCTTGAAGAGAACAAGAACAAGAAATCTTCTGATCGGAGCTTCGTCGACGATGGCGATGA
GAACGGGGACCGTACTCTCCTCCGTGATTTGCTCTCTCAGATAGAGTCACTGAAAGAAGGAGGAGAATCAGAGGAGTTAGCTTCTGCATTAGATGCTTTGAAAACAAAAG
CTAAGAACTCCGTTAAAGAGGACATTGTTGATGATGGTGAATGTAGTAGAGAAGACATTATTAAGGAACTCAAGAAGGTAAAGAAGCAAAACGTTTTAACTCACTGTCTT
CTCTCAGTGATGATTGTGGTGACTGTTGTTTGGCAACTTTCTGAGGTCTCCCTTATCTTGAATGTAAAAGATAAAATCAGCCATCCCTTTAGATCCTTGGGCAGTTTGAT
CTCAGGAGCGCTCAAACGCCCTAAAACCATTATTGAGAATTCATCCAAACAACAACATGATGAAGCTTCAGTGCTTCTTCCTCTCCAAATTCCAGAACTTCCTTATGTGG
ATTTGCAATAA
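
A CDS such as the one above can be checked against the listed protein by translating it codon by codon. The sketch below is a minimal Python example that assumes the standard nuclear genetic code; the page does not say which translation table was used. `translate` and `CODON_TABLE` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third position varying fastest), as in the NCBI table 1 layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases are reported as 'X'.
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the CDS shown above.
print(translate("ATGGAGGAAGAGCAACGCCTGGAGCTGATC"))  # MEEEQRLELI
```

Passing the full CDS (with its line breaks joined) should give a string that can be compared directly with the protein sequence listed on this page.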
mRNA sequence
CAGAAATTAAACCAAAACTCTAAATCATGGGACTAAAATGGAATTTAGAATGATGAAGACAAACCACTCACTACATGCCACGTCAATGTCGATGCTACGTGTCACTCTCA
CAGTCGCGGTTAGCCATGGCGGCCTCCATACTTCTGAAAGCGTCCATCAATGGCGGAACCGATACTGCAAGAGCGCCCTCACCTCTTCCAGAGAAATCTCAAGTGAAACA
GATCGATGGAGGAAGAGCAACGCCTGGAGCTGATCAGCCTCGCCATTAGGAGGCTGCTTGAAGAGAACAAGAACAAGAAATCTTCTGATCGGAGCTTCGTCGACGATGGC
GATGAGAACGGGGACCGTACTCTCCTCCGTGATTTGCTCTCTCAGATAGAGTCACTGAAAGAAGGAGGAGAATCAGAGGAGTTAGCTTCTGCATTAGATGCTTTGAAAAC
AAAAGCTAAGAACTCCGTTAAAGAGGACATTGTTGATGATGGTGAATGTAGTAGAGAAGACATTATTAAGGAACTCAAGAAGGTAAAGAAGCAAAACGTTTTAACTCACT
GTCTTCTCTCAGTGATGATTGTGGTGACTGTTGTTTGGCAACTTTCTGAGGTCTCCCTTATCTTGAATGTAAAAGATAAAATCAGCCATCCCTTTAGATCCTTGGGCAGT
TTGATCTCAGGAGCGCTCAAACGCCCTAAAACCATTATTGAGAATTCATCCAAACAACAACATGATGAAGCTTCAGTGCTTCTTCCTCTCCAAATTCCAGAACTTCCTTA
TGTGGATTTGCAATAATTTTGGATTTGAGTCCTTAACCAAGAAAGGGAGATGGGGGACAGAGAGTGAGGTTTTTAATGGTGATGATGAAGAAACAGGAAGAGATTTTTCT
TGGAGGAATCCCCACATTATGTAATTATTAAATTGCTTTGGTTTTGTGAAATTTTTGCTTTGTAATTTAAGAGTCAGAGATGATTTTTTGAGAAATAAAATACATAGTAT
TAGAATGAAAGAGCCTTTTGATCATGGCCATTAGAATGAAAGAGCCTTTTGATCATGGCCCTTTTGTCTCCTCGCCTCTTGGGAAAATTAGTGCTTTTAGTTTTGAGGTC
TTATGTTTGTTCTCTTTTATCTGAAAAAGTAGCTTTGGAAGTCTGTGTTAAAGGAATAACAATTTCAAATCATTAATATATATGATGTGATTGACATCTGTGGGCAATAA
CAGATGGTACTATTAGAAATAACTCATGAAATATCTTGTGTTCAAACTCACCGAATATTTTATAGTCCAAATATAATTAAAGCCACGTGTTATAGAACTATTCATAAAAG
ATCAACTTTTTAAACATGCACTTTAAATTATTTATGAGTATATTTTTTGTACTTGTGAAGATGTATATACTTCTTTAAAAAAAGACAAAAAGATGTATATATAATGATGT
GGATTTTTTGTTCGAAAGTTGTGACTTGTACATGTCATTATTGACTAGTTATATTATAGATAATAGGTCACTCAGCATGGTAGGAAGATACTTGAATAGATGTATTGGAA
TCTAATGTAGAATTTGTAAAAAAAAAAACCTCCCACGTTTAAGTTAGACTTAGAAGTGTTGTTTAAACAAAGTTCTATGATATATAAATTCAACGTGTCAGGTGATCTTG
TTTTCTGTTAACTATAAGATAGGTATGATAATCGCTAACATGAGCATAACTTAATGATTAAGACACCTCTTTCTTTTTTTGAAGTCGGCTTGACATATGTTATTATCGAC
TAGTTATAGATAATAGATAAGGTGAG
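
Because the mRNA sequence contains the CDS as a contiguous substring, the flanking untranslated regions can be recovered with a plain string search. The following Python sketch illustrates this; `split_utrs` is a hypothetical helper, and it assumes the CDS occurs in the mRNA exactly as listed (same strand, no edits).

```python
def split_utrs(mrna: str, cds: str) -> tuple[str, str]:
    """Return (five_prime_utr, three_prime_utr) around `cds` within `mrna`."""
    # Remove line breaks and normalise case before searching.
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[end:]


# Toy example; substitute the full mRNA and CDS strings from this page.
five_prime, three_prime = split_utrs("CCATGAAATAAGG", "ATGAAATAA")
print(len(five_prime), len(three_prime))
```

`str.find` returns the first match only, which is sufficient when the CDS occurs once in the transcript.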
Protein sequence
MEEEQRLELISLAIRRLLEENKNKKSSDRSFVDDGDENGDRTLLRDLLSQIESLKEGGESEELASALDALKTKAKNSVKEDIVDDGECSREDIIKELKKVKKQNVLTHCL
LSVMIVVTVVWQLSEVSLILNVKDKISHPFRSLGSLISGALKRPKTIIENSSKQQHDEASVLLPLQIPELPYVDLQ