CuGenDBv2

Tan0021460 (gene) of Snake gourd v1 genome

Gene ID: Tan0021460
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein TonB like
Genome location: LG07:71611080..71614512
RNA-Seq Expression: Tan0021460
Synteny: Tan0021460
Gene Ontology terms:
  GO:0070072 - vacuolar proton-transporting V-type ATPase complex assembly (biological process)
  GO:0005789 - endoplasmic reticulum membrane (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
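
The genome location field above uses a `sequence:start..end` layout. A minimal sketch of splitting it into parts is shown here; the helper name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'sequence:start..end' string into (sequence, start, end).

    Hypothetical helper for illustration; not part of any database API.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("LG07:71611080..71614512")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 3433 bases on LG07, which is longer than the mRNA sequence listed further down.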


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004138504.1 uncharacterized protein LOC101222938 [Cucumis sativus] (e-value: 6.3e-92; identity: 90.78%)
Query:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND   PSGLV+S+TEAIRSFLTSASIDS+LS+ELRQIA D  SQ NIPYK LRAIW+AT+SSTRPDL  LLAGSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD + SSSSRS
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS

Query:  KLKKSQ
        KLKKSQ
Subjt:  KLKKSQ

XP_008458236.1 PREDICTED: uncharacterized protein LOC103497719 [Cucumis melo] (e-value: 1.1e-91; identity: 90.29%)
Query:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND   PSGLV+S+TEAIRSFLTSASIDS+LS+ELRQIA D +SQ NIPYK LRAIW+AT+SSTRPDL  LLAGSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYL+GYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD + SSSSRS
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS

Query:  KLKKSQ
        KLKKSQ
Subjt:  KLKKSQ

XP_022959366.1 uncharacterized protein LOC111460360 [Cucurbita moschata] (e-value: 2.8e-92; identity: 88.83%)
Query:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND+ HPSGLVI++TEAIRSFLTSASIDSR+SEELRQ+A + +SQ N+PYKPLRAIW+AT+SSTRPDL  LL GSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIR+S+YD + SSSSRS
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS

Query:  KLKKSQ
        KLKK+Q
Subjt:  KLKKSQ

XP_023548168.1 uncharacterized protein LOC111806888 [Cucurbita pepo subsp. pepo] (e-value: 3.7e-92; identity: 88.35%)
Query:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND+ HPSGLVI++TEAIRSFLTSASIDSR+SEELRQ+A + +SQ N+PYKPLRAIW+AT+SSTRPDL  LL GSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS
        DVAERKAYQELVKDIAPKKP+DEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIR+S+YD + SSSSRS
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS

Query:  KLKKSQ
        KLKK+Q
Subjt:  KLKKSQ

XP_038876067.1 uncharacterized protein LOC120068388 [Benincasa hispida] (e-value: 5.7e-93; identity: 90.29%)
Query:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND THPSGLV+SNTEAIRSFLTSASIDS+LS+ELRQIA D +SQ NIPYKPLRAIW+A +SS RPDL  LLAGSEFV TSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS
        DVAERKAYQELVKDIAPKKPI+EPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD + SSSS S
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS

Query:  KLKKSQ
        KLKKSQ
Subjt:  KLKKSQ

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0K7F5 Uncharacterized protein (e-value: 3.0e-92; identity: 90.78%)
Query:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND   PSGLV+S+TEAIRSFLTSASIDS+LS+ELRQIA D  SQ NIPYK LRAIW+AT+SSTRPDL  LLAGSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD + SSSSRS
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS

Query:  KLKKSQ
        KLKKSQ
Subjt:  KLKKSQ

A0A1S3C6X8 uncharacterized protein LOC103497719 (e-value: 5.2e-92; identity: 90.29%)
Query:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND   PSGLV+S+TEAIRSFLTSASIDS+LS+ELRQIA D +SQ NIPYK LRAIW+AT+SSTRPDL  LLAGSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYL+GYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD + SSSSRS
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS

Query:  KLKKSQ
        KLKKSQ
Subjt:  KLKKSQ

A0A5D3BV73 Uncharacterized protein (e-value: 5.2e-92; identity: 90.29%)
Query:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND   PSGLV+S+TEAIRSFLTSASIDS+LS+ELRQIA D +SQ NIPYK LRAIW+AT+SSTRPDL  LLAGSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYL+GYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD + SSSSRS
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS

Query:  KLKKSQ
        KLKKSQ
Subjt:  KLKKSQ

A0A6J1H4N6 uncharacterized protein LOC111460360 (e-value: 1.4e-92; identity: 88.83%)
Query:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND+ HPSGLVI++TEAIRSFLTSASIDSR+SEELRQ+A + +SQ N+PYKPLRAIW+AT+SSTRPDL  LL GSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIR+S+YD + SSSSRS
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS

Query:  KLKKSQ
        KLKK+Q
Subjt:  KLKKSQ

A0A6J1L0S3 uncharacterized protein LOC111499322 (e-value: 3.4e-91; identity: 87.38%)
Query:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND+ HPSGLVI++TEAIR+FLTSASIDSR+SEELRQ+A + +SQ N+PYKPLRAIW+AT+SSTRPDL  LL GSEFVFTSPKPREKSEELKARLK LA
Subjt:  MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGY LFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIR+S+YD + SSSSRS
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYD-KQSSSSRS

Query:  KLKKSQ
        KLKK+Q
Subjt:  KLKKSQ

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT5G52980.1 CONTAINS InterPro DOMAIN/s: ATPase, vacuolar ER assembly factor, Vma12 (InterPro:IPR021013); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). (e-value: 4.8e-66; identity: 68.16%)
Query:  DRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLADVA
        D  + SGL++S TE +RSFL  AS D RLS+ELR IA D  S+  IPYK LRAIW  +D STRPDL  L +GS FVFTSPKPREKSEELK RL KL ++A
Subjt:  DRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLADVA

Query:  ERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDKQSSSSRSKLKK
        ERK Y ELVKDI PKK ++EPFSSYKDQLGFGLHV L MFTGYLVGYA FRALF  +P +SAAGGILGLV  MLVETLLFII++S  D Q  SS+S  + 
Subjt:  ERKAYQELVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDKQSSSSRSKLKK

Query:  S
        S
Subjt:  S


Sequences
CDS sequence
ATGATCAACGACCGGACACATCCGTCTGGTCTCGTCATATCTAACACCGAAGCAATTCGCTCATTTCTGACCTCAGCGTCCATAGATTCGCGACTTTCTGAGGAACTCCG
GCAGATTGCTTTGGATTTCTCTTCACAATATAACATTCCGTATAAGCCGCTCAGAGCTATCTGGTATGCTACGGATTCGTCCACCCGGCCGGACTTGTTCTGTCTTTTAG
CCGGATCGGAGTTCGTCTTTACAAGCCCTAAACCGAGGGAGAAGAGTGAAGAGTTAAAGGCTAGACTGAAGAAGCTTGCGGATGTAGCAGAGAGGAAGGCCTATCAGGAG
CTGGTGAAGGATATTGCACCTAAGAAACCAATTGATGAGCCTTTCTCTTCCTACAAAGATCAGCTGGGATTCGGTTTACACGTTGTGTTGATAATGTTTACTGGCTATCT
TGTTGGATATGCTTTATTCCGAGCATTGTTTAGGCACGATCCAATCATGAGTGCTGCTGGAGGTATCCTCGGGCTGGTTTTCGGCATGCTCGTAGAAACACTTCTTTTCA
TTATTAGATCGTCTAACTATGATAAACAATCTTCATCTTCCCGCTCTAAGCTAAAGAAAAGTCAATAG
mRNA sequence
TATAGATATGTATATTGGATAAAAATTTAGCCTCATTTAAGTAAAACGCTGAACTCCATTGTTGCTGCTTCAAGCTCTGGTCTTCTACAGCTCTCGTCAGCGCAGGATAT
CGTTGTTCTCGAACCTCAAAACCCCTTTCTACGATGATCAACGACCGGACACATCCGTCTGGTCTCGTCATATCTAACACCGAAGCAATTCGCTCATTTCTGACCTCAGC
GTCCATAGATTCGCGACTTTCTGAGGAACTCCGGCAGATTGCTTTGGATTTCTCTTCACAATATAACATTCCGTATAAGCCGCTCAGAGCTATCTGGTATGCTACGGATT
CGTCCACCCGGCCGGACTTGTTCTGTCTTTTAGCCGGATCGGAGTTCGTCTTTACAAGCCCTAAACCGAGGGAGAAGAGTGAAGAGTTAAAGGCTAGACTGAAGAAGCTT
GCGGATGTAGCAGAGAGGAAGGCCTATCAGGAGCTGGTGAAGGATATTGCACCTAAGAAACCAATTGATGAGCCTTTCTCTTCCTACAAAGATCAGCTGGGATTCGGTTT
ACACGTTGTGTTGATAATGTTTACTGGCTATCTTGTTGGATATGCTTTATTCCGAGCATTGTTTAGGCACGATCCAATCATGAGTGCTGCTGGAGGTATCCTCGGGCTGG
TTTTCGGCATGCTCGTAGAAACACTTCTTTTCATTATTAGATCGTCTAACTATGATAAACAATCTTCATCTTCCCGCTCTAAGCTAAAGAAAAGTCAATAGAGTTACTTC
TCCTGAGGTTGTGTTTGGGCAATATGCAAAAATGGAAAGAATATTCTACATGGGATATACAAAGATAGAAGAATACAACATCTCACTAGACCAAAGAATATGCAGGACAT
TCATTACAAGCGAAAGAAAAGCTAAGGTTGATGATTAAACACTGATAATAGAATGAGGGATCTTAAAGCTTTTAGTAAGATAAATTTCCTTCAAATACACAAGATGTTGT
GGGGCAGGAGAGCCTAGAGCCTTTGAGAAATGGATAGAAATAACTAGATATTTGTTCGTCTGTGTTTTTTCAAATTTTTGACAAAGGTTATTATAACCTATGTTATAATA
AATGTGGAATATTATGTTATAATAATTGTGGAATACTTTATATCCTAAACATAGACTATTATGATCGTAGATTAAACAATTCTGCATTTTGAATAACGTTATATAATTTC
ACAGATGATAATAACTCATGTTATTATAATCCACTCAATGTCTCAAATAGCTTATAATAGTCTATAAATAATTTTGCACTTCAACTCATAGACTATTAGGAGTTGTTTGG
GACGCGGAATGAGTTATAATAACAGGGGTTATAATAGTCTGTGGGTTATTATAGTCTGTGAAACATATAATATTATTTAAAACGCAGAGTAGTATAGTCTGGAGTTATAA
TAGTTTGAGTTTGG
Protein sequence
MINDRTHPSGLVISNTEAIRSFLTSASIDSRLSEELRQIALDFSSQYNIPYKPLRAIWYATDSSTRPDLFCLLAGSEFVFTSPKPREKSEELKARLKKLADVAERKAYQE
LVKDIAPKKPIDEPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDKQSSSSRSKLKKSQ
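
The CDS and protein sequences above can be spot-checked against each other by translating a few codons. This is a minimal sketch, not a full translator: the function name `translate` is hypothetical, and `CODONS` is only the subset of the standard genetic code needed for the two fragments shown (the first 24 and last 15 bases of the CDS).

```python
# Partial standard genetic code: only the codons used in the fragments below.
CODONS = {
    "ATG": "M", "ATC": "I", "AAC": "N", "GAC": "D",
    "CGG": "R", "ACA": "T", "CAT": "H", "CCG": "P",
    "AAG": "K", "AAA": "K", "AGT": "S", "CAA": "Q",
    "TAG": "*",  # stop codon
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA fragment codon by codon (illustrative helper)."""
    if len(dna) % 3 != 0:
        raise ValueError("fragment length must be a multiple of 3")
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


cds_head = "ATGATCAACGACCGGACACATCCG"  # first 24 bases of the CDS
cds_tail = "AAGAAAAGTCAATAG"           # last 15 bases of the CDS

print(translate(cds_head))  # MINDRTHP
print(translate(cds_tail))  # KKSQ*
```

The two results match the start (MINDRTHP) and end (KKSQ, followed by a stop) of the protein sequence in this record. Translating the full CDS would need the complete codon table, for example from a library such as Biopython.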