; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC06G110800 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC06G110800
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionVacuolar sorting-associated protein 62
Genome locationCicolChr06:1393871..1397661
RNA-Seq ExpressionCcUC06G110800
SyntenyCcUC06G110800
Gene Ontology termsNA
InterPro domainsIPR014807 - Cytochrome c oxidase assembly factor 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140577.1 uncharacterized protein LOC101206927 [Cucumis sativus]1.6e-8990.27Show/hide
Query:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
        MLAKRF SIFKRS  P +SS+SIKP E+ VN+S GRKAVSFVLITVTGGVALSALDDLAIY SCSSKAIEK RNNQAVIDAIGEPI KGPWYNASLAVAH
Subjt:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQPPE
        KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWI FLRPRDWDIL+MDALL+VPENEGKQKTLRINL+EKFAPAACVSCTDCQPPE
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQPPE

XP_022991411.1 uncharacterized protein LOC111488050 isoform X1 [Cucurbita maxima]1.3e-8886.91Show/hide
Query:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
        MLAKRF+SIFKRSP P +SSSS++P E+GVN+S GRKAVSFVL+TVTGGVALSALDDLAIY SCSSKAIEKA+NN+A+IDAIGEPI KGPWYNASLAVAH
Subjt:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAA--CVSCTDCQPP--EKR
        KRHSLSCTFPVSGPQGTGI+QLKAVRNGEDSWI FLRPRDWDILIMDALLHVPENEGKQ+TLRINLTEKFAPAA  CV+CTDCQ P  EKR
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAA--CVSCTDCQPP--EKR

XP_023547892.1 uncharacterized protein LOC111806704 isoform X1 [Cucurbita pepo subsp. pepo]7.8e-8987.23Show/hide
Query:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
        MLAKRF+SIFKRSP P +SSSS++P E+GVN+S GRKAVSFVL+TVTGGVALSALDDLAIY SCSSKAIEKA+NN+A+IDAIGEPI KGPWYNASLAVAH
Subjt:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAP---AACVSCTDCQPPE
        KRHSLSCTFPVSGPQGTGI+QLKAVRNGEDSWI FLRPRDWDILIMDALLHVPENEGKQ+TLRINLTEKFAP   AACV+CTDCQ PE
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAP---AACVSCTDCQPPE

XP_038876218.1 uncharacterized protein LOC120068500 isoform X1 [Benincasa hispida]4.7e-9494.59Show/hide
Query:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
        MLAKRFVSIFKRSP P +SSSSIKP EDGVN+S GRKAVSFVLITVTGGVALSALDDLAIY SCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
Subjt:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQPPE
        KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWI FLRPRDWDILIMDALLHVP NEGKQKT+RINLTEKFAPAACVSCTDCQPPE
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQPPE

XP_038876219.1 uncharacterized protein LOC120068500 isoform X2 [Benincasa hispida]3.4e-9294.05Show/hide
Query:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
        MLAKRFVSIFKRSP P  +SSSIKP EDGVN+S GRKAVSFVLITVTGGVALSALDDLAIY SCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
Subjt:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQPPE
        KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWI FLRPRDWDILIMDALLHVP NEGKQKT+RINLTEKFAPAACVSCTDCQPPE
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQPPE

TrEMBL top hitse value%identityAlignment
A0A0A0KF26 Uncharacterized protein7.6e-9090.27Show/hide
Query:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
        MLAKRF SIFKRS  P +SS+SIKP E+ VN+S GRKAVSFVLITVTGGVALSALDDLAIY SCSSKAIEK RNNQAVIDAIGEPI KGPWYNASLAVAH
Subjt:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQPPE
        KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWI FLRPRDWDIL+MDALL+VPENEGKQKTLRINL+EKFAPAACVSCTDCQPPE
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQPPE

A0A1S3CBF0 uncharacterized protein LOC103498925 isoform X16.4e-8990.27Show/hide
Query:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
        MLAKRF SIFKRS  P +SS+SIKP E+ VN+S GRKAVSFVLITVTGGVALSALDDLAIY SCSSKAIEKARNNQAV DAIGEPIAKGPWYNASLAVAH
Subjt:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQPPE
        KRHSLSCTFPVSGPQG GILQLKAVRNGEDSWI FLRPRDWDIL+MDALL+VPENEGKQKTLRINLTEKFAPAACVSCT CQPPE
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQPPE

A0A5D3DMW4 Vacuolar sorting-associated protein 626.4e-8990.27Show/hide
Query:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
        MLAKRF SIFKRS  P +SS+SIKP E+ VN+S GRKAVSFVLITVTGGVALSALDDLAIY SCSSKAIEKARNNQAV DAIGEPIAKGPWYNASLAVAH
Subjt:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQPPE
        KRHSLSCTFPVSGPQG GILQLKAVRNGEDSWI FLRPRDWDIL+MDALL+VPENEGKQKTLRINLTEKFAPAACVSCT CQPPE
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQPPE

A0A6J1GQN1 uncharacterized protein LOC111456591 isoform X12.7e-8785.56Show/hide
Query:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
        MLAKR +SIFKRSP P +SSSS++P E+GVN+S GR AVSFVL+TVTGGVALSALDDLAIY SCSSKAIEKA+NN+A+IDAIGEPI KGPWYNASLAVAH
Subjt:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAP--AACVSCTDCQPPE
        KRHSLSCTFPVSGPQGTGI+QLKAVRNGE+SWI FLRPRDWDILIMDALLHVPENEGKQ+T+RINLTEKFAP  AACV+CTDCQ PE
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAP--AACVSCTDCQPPE

A0A6J1JQM9 uncharacterized protein LOC111488050 isoform X16.4e-8986.91Show/hide
Query:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
        MLAKRF+SIFKRSP P +SSSS++P E+GVN+S GRKAVSFVL+TVTGGVALSALDDLAIY SCSSKAIEKA+NN+A+IDAIGEPI KGPWYNASLAVAH
Subjt:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAA--CVSCTDCQPP--EKR
        KRHSLSCTFPVSGPQGTGI+QLKAVRNGEDSWI FLRPRDWDILIMDALLHVPENEGKQ+TLRINLTEKFAPAA  CV+CTDCQ P  EKR
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAA--CVSCTDCQPP--EKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G20390.1 unknown protein3.9e-6265.43Show/hide
Query:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
        M A+RF S FK       SS+S      G   S GRKAVSFVLITVTGGVALSALDDL+IYR CSSKA+EK  N++ +I+AIGEPI KGPWYNASLAV+H
Subjt:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQP--PEK
        +RHS+SC+FPV GPQGTGIL LKAVRNGEDS   FL+ RDWDILIMDAL+HVP NEG Q+TLRIN+T+   P+        +P  PEK
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQP--PEK

AT2G20390.2 unknown protein1.1e-5654.91Show/hide
Query:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH
        M A+RF S FK       SS+S      G   S GRKAVSFVLITVTGGVALSALDDL+IYR CSSKA+EK  N++ +I+AIGEPI KGPWYNASLAV+H
Subjt:  MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNG------------------------------------EDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRI
        +RHS+SC+FPV GPQGTGIL LKAVRNG                                    EDS   FL+ RDWDILIMDAL+HVP NEG Q+TLRI
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNG------------------------------------EDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRI

Query:  NLTEKFAPAACVSCTDCQP--PEK
        N+T+   P+        +P  PEK
Subjt:  NLTEKFAPAACVSCTDCQP--PEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGCGAAAAGGTTCGTTTCCATATTCAAGCGCTCTCCAAATCCAGAGTCTTCCAGCAGTTCCATAAAGCCATTGGAGGATGGGGTGAATAGATCCTTGGGT
CGGAAAGCAGTCTCTTTTGTACTTATTACTGTTACTGGTGGTGTAGCTTTGAGTGCTTTAGATGACCTTGCCATTTATCGTAGCTGTAGCAGCAAAGCCATAGAG
AAAGCCAGAAACAATCAAGCAGTTATAGATGCTATTGGAGAACCCATTGCCAAAGGTCCATGGTACAATGCATCACTTGCAGTAGCTCATAAGAGACATTCTCTA
TCCTGCACGTTTCCAGTATCAGGACCACAAGGCACAGGGATCCTCCAACTGAAGGCAGTTCGTAATGGAGAGGATTCCTGGATTCCTTTTCTCCGGCCTCGAGAC
TGGGACATTCTGATCATGGATGCTCTCCTCCATGTTCCTGAAAACGAAGGTAAGCAGAAAACATTGCGTATTAATCTCACTGAGAAGTTTGCCCCCGCTGCTTGT
GTCTCATGCACTGATTGTCAGCCTCCAGAGAAGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATTAAAGACACCTTATTAGGGAAGCGCTGAGCGATTGAAGGAGCAAGCAAAGCTGAAAATGCTGGCGAAAAGGTTCGTTTCCATATTCAAGCGCTCTCCAAATCC
AGAGTCTTCCAGCAGTTCCATAAAGCCATTGGAGGATGGGGTGAATAGATCCTTGGGTCGGAAAGCAGTCTCTTTTGTACTTATTACTGTTACTGGTGGTGTAGC
TTTGAGTGCTTTAGATGACCTTGCCATTTATCGTAGCTGTAGCAGCAAAGCCATAGAGAAAGCCAGAAACAATCAAGCAGTTATAGATGCTATTGGAGAACCCAT
TGCCAAAGGTCCATGGTACAATGCATCACTTGCAGTAGCTCATAAGAGACATTCTCTATCCTGCACGTTTCCAGTATCAGGACCACAAGGCACAGGGATCCTCCA
ACTGAAGGCAGTTCGTAATGGAGAGGATTCCTGGATTCCTTTTCTCCGGCCTCGAGACTGGGACATTCTGATCATGGATGCTCTCCTCCATGTTCCTGAAAACGA
AGGTAAGCAGAAAACATTGCGTATTAATCTCACTGAGAAGTTTGCCCCCGCTGCTTGTGTCTCATGCACTGATTGTCAGCCTCCAGAGAAGAGATGAAGTCAGCT
TATAAGTTATTGAGAAACTAATATCAAATGTTAGCAGTTTCTATTAGTTTTCTGTCCCGACATAATTTTGAGTTAACCATCAAAGCGTTCGTTATCCCTACCAAA
ATAATATTTTCTTTGAGGTTGAGTTTAAACCATGGTGGAACCAGTCTAGAAGATTTAGTATTATTCAACTTTGCTGACATCAAATGTGGCAAAAGTTATATGATT
CTCCTGTAAAACCCATGTTATAACAGAAATAATAGCAATTATTCAC
Protein sequenceShow/hide protein sequence
MLAKRFVSIFKRSPNPESSSSSIKPLEDGVNRSLGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAHKRHSL
SCTFPVSGPQGTGILQLKAVRNGEDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFAPAACVSCTDCQPPEKR