CuGenDBv2

Tan0008308 (gene) of Snake gourd v1 genome

Gene ID: Tan0008308
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Vacuolar sorting-associated protein 62
Genome location: LG06:18126826..18134838
RNA-Seq Expression: Tan0008308
Synteny: Tan0008308
Gene Ontology terms: NA
InterPro domains: IPR014807 - Cytochrome c oxidase assembly factor 1
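
The genome location field can be split into a linkage group and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end

def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1

seqid, start, end = parse_location("LG06:18126826..18134838")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 8013 bp on LG06, which is longer than the mRNA listed under Sequences, as expected for a genomic span that includes introns.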


Homology
GenBank top hits | e value | % identity
XP_008459968.1 PREDICTED: uncharacterized protein LOC103498925 isoform X1 [Cucumis melo] | 9.8e-94 | 86.21
Query:  SAERRKEQRIPKLKIMAKRFISIFKRSSTPHASS-SIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPI
        +++R KEQ  PKLK++AKRF SIFKRSSTP+ASS SIKP+E  VNKSWGRKA SFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA+ DAIGEPI
Subjt:  SAERRKEQRIPKLKIMAKRFISIFKRSSTPHASS-SIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPI

Query:  VKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPES
         KGPWYNASLAVAHKRHSLSCTFPVSGPQG GILQLKAVRNGEDSWISFLRPRDWDIL+MDALL+VP NEGKQ+TLRINLTEKF PAACV+CT  QPPE+
Subjt:  VKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPES

Query:  EKR
        EKR
Subjt:  EKR

XP_023547892.1 uncharacterized protein LOC111806704 isoform X1 [Cucurbita pepo subsp. pepo] | 6.8e-95 | 86.41
Query:  SAERRKEQRIPKLKIMAKRFISIFKRSSTPHA-SSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPI
        S ER +E+R  KLK++AKRF+SIFKRS TP+A SSS++P+EEGVNKSWGRKA SFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+AIIDAIGEPI
Subjt:  SAERRKEQRIPKLKIMAKRFISIFKRSSTPHA-SSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPI

Query:  VKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTP---AACVACTDFQP
        VKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI+QLKAVRNGEDSWISFLRPRDWDILIMDALLHVP NEGKQQTLRINLTEKF P   AACVACTD Q 
Subjt:  VKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTP---AACVACTDFQP

Query:  PESEKR
        PE+EKR
Subjt:  PESEKR

XP_023547893.1 uncharacterized protein LOC111806704 isoform X2 [Cucurbita pepo subsp. pepo] | 2.7e-96 | 86.83
Query:  SAERRKEQRIPKLKIMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIV
        S ER +E+R  KLK++AKRF+SIFKRS TP+ASSS++P+EEGVNKSWGRKA SFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+AIIDAIGEPIV
Subjt:  SAERRKEQRIPKLKIMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIV

Query:  KGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTP---AACVACTDFQPP
        KGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI+QLKAVRNGEDSWISFLRPRDWDILIMDALLHVP NEGKQQTLRINLTEKF P   AACVACTD Q P
Subjt:  KGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTP---AACVACTDFQPP

Query:  ESEKR
        E+EKR
Subjt:  ESEKR

XP_038876218.1 uncharacterized protein LOC120068500 isoform X1 [Benincasa hispida] | 1.7e-93 | 90.48
Query:  IMAKRFISIFKRSSTPHA-SSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAH
        ++AKRF+SIFKRS  PHA SSSIKP+E+GVNKSWGRKA SFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA+IDAIGEPI KGPWYNASLAVAH
Subjt:  IMAKRFISIFKRSSTPHA-SSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPESEKR
        KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQ+T+RINLTEKF PAACV+CTD QPPE+E R
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPESEKR

XP_038876219.1 uncharacterized protein LOC120068500 isoform X2 [Benincasa hispida] | 6.8e-95 | 90.96
Query:  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHK
        ++AKRF+SIFKRS  PHASSSIKP+E+GVNKSWGRKA SFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA+IDAIGEPI KGPWYNASLAVAHK
Subjt:  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHK

Query:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPESEKR
        RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQ+T+RINLTEKF PAACV+CTD QPPE+E R
Subjt:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPESEKR

TrEMBL top hits | e value | % identity
A0A0A0KF26 Uncharacterized protein | 2.2e-91 | 89.42
Query:  IMAKRFISIFKRSSTPHASS-SIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAH
        ++AKRF SIFKRSSTPHASS SIKP E  VNKSWGRKA SFVLITVTGGVALSALDDLAIYHSCSSKAIEK RNNQA+IDAIGEPI KGPWYNASLAVAH
Subjt:  IMAKRFISIFKRSSTPHASS-SIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPESEKR
        KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIL+MDALL+VP NEGKQ+TLRINL+EKF PAACV+CTD QPPE+EKR
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPESEKR

A0A1S3CBF0 uncharacterized protein LOC103498925 isoform X1 | 4.7e-94 | 86.21
Query:  SAERRKEQRIPKLKIMAKRFISIFKRSSTPHASS-SIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPI
        +++R KEQ  PKLK++AKRF SIFKRSSTP+ASS SIKP+E  VNKSWGRKA SFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA+ DAIGEPI
Subjt:  SAERRKEQRIPKLKIMAKRFISIFKRSSTPHASS-SIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPI

Query:  VKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPES
         KGPWYNASLAVAHKRHSLSCTFPVSGPQG GILQLKAVRNGEDSWISFLRPRDWDIL+MDALL+VP NEGKQ+TLRINLTEKF PAACV+CT  QPPE+
Subjt:  VKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPES

Query:  EKR
        EKR
Subjt:  EKR

A0A5D3DMW4 Vacuolar sorting-associated protein 62 | 6.8e-93 | 87.37
Query:  KEQRIPKLKIMAKRFISIFKRSSTPHASS-SIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPW
        +EQ  PKLK++AKRF SIFKRSSTP+ASS SIKP+E  VNKSWGRKA SFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA+ DAIGEPI KGPW
Subjt:  KEQRIPKLKIMAKRFISIFKRSSTPHASS-SIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPW

Query:  YNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPESEKR
        YNASLAVAHKRHSLSCTFPVSGPQG GILQLKAVRNGEDSWISFLRPRDWDIL+MDALL+VP NEGKQ+TLRINLTEKF PAACV+CT  QPPE+EKR
Subjt:  YNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPESEKR

A0A6J1GQQ6 uncharacterized protein LOC111456591 isoform X2 | 6.4e-91 | 87.37
Query:  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHK
        ++AKR +SIFKRS TP+ASSS++P+EEGVNKSWGR A SFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+AIIDAIGEPIVKGPWYNASLAVAHK
Subjt:  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHK

Query:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTP--AACVACTDFQPPESEKR
        RHSLSCTFPVSGPQGTGI+QLKAVRNGE+SWISFLRPRDWDILIMDALLHVP NEGKQQT+RINLTEKF P  AACVACTD Q PE+EKR
Subjt:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTP--AACVACTDFQPPESEKR

A0A6J1JLR1 uncharacterized protein LOC111488050 isoform X2 | 3.4e-92 | 88.95
Query:  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHK
        ++AKRF+SIFKRS TP+ASSS++P+EEGVNKSWGRKA SFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+AIIDAIGEPIVKGPWYNASLAVAHK
Subjt:  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHK

Query:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAA--CVACTDFQPPESEKR
        RHSLSCTFPVSGPQGTGI+QLKAVRNGEDSWISFLRPRDWDILIMDALLHVP NEGKQQTLRINLTEKF PAA  CVACTD Q P +EKR
Subjt:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAA--CVACTDFQPPESEKR

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT2G20390.1 unknown protein | 3.3e-63 | 66.84
Query:  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHK
        + A+RF S FK SST   SS  K A  G   S+GRKA SFVLITVTGGVALSALDDL+IY  CSSKA+EK  N++ +I+AIGEPI KGPWYNASLAV+H+
Subjt:  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHK

Query:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPESEK
        RHS+SC+FPV GPQGTGIL LKAVRNGEDS   FL+ RDWDILIMDAL+HVP+NEG QQTLRIN+T+   P+        +P E EK
Subjt:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPESEK

AT2G20390.2 unknown protein | 9.2e-58 | 56.05
Query:  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHK
        + A+RF S FK SST   SS  K A  G   S+GRKA SFVLITVTGGVALSALDDL+IY  CSSKA+EK  N++ +I+AIGEPI KGPWYNASLAV+H+
Subjt:  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHK

Query:  RHSLSCTFPVSGPQGTGILQLKAVRNG------------------------------------EDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRIN
        RHS+SC+FPV GPQGTGIL LKAVRNG                                    EDS   FL+ RDWDILIMDAL+HVP+NEG QQTLRIN
Subjt:  RHSLSCTFPVSGPQGTGILQLKAVRNG------------------------------------EDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRIN

Query:  LTEKFTPAACVACTDFQPPESEK
        +T+   P+        +P E EK
Subjt:  LTEKFTPAACVACTDFQPPESEK
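
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts identities for a single block from its query and middle lines. `block_identity` is a hypothetical helper, and the result for one block is not expected to reproduce the % identity column, which is presumably computed over the whole alignment.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, block length) for one alignment block.

    The block length is taken from the query line because trailing
    spaces on the middle line are often lost when text is copied.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty query line")
    padded = midline.ljust(length)[:length]
    # Letters are identities; '+' and spaces are not counted.
    identical = sum(1 for ch in padded if ch.isalpha())
    return identical, length

# Final block of the XP_023547892.1 alignment, prefixes removed.
identical, length = block_identity("PESEKR", "PE+EKR")
print(identical, length, round(100.0 * identical / length, 2))
# prints: 5 6 83.33
```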


Sequences
CDS sequence
ATGACTTTACTAGAAATAAAAAAAAAGCATAGATTTTATTTATTCAAAAAGCGCCGCAGTGTGATTTTGGCCTGCGGAAGCGCTGAGCGAAGGAAGGAGCAAAGAATCCC
GAAGCTGAAAATTATGGCGAAAAGGTTCATTTCCATATTCAAGCGCTCTTCAACTCCACACGCTTCCAGTTCCATAAAGCCAGCGGAGGAAGGGGTGAATAAATCCTGGG
GTCGTAAAGCAGCCTCCTTTGTGCTTATTACTGTTACTGGTGGTGTAGCTTTGAGTGCTTTAGATGACCTCGCCATTTATCATAGCTGTAGCAGCAAAGCCATAGAGAAA
GCCAGAAACAATCAAGCAATTATAGATGCTATTGGAGAACCCATTGTTAAAGGTCCATGGTACAATGCATCACTTGCAGTAGCTCATAAAAGACATTCTCTATCCTGCAC
ATTTCCAGTATCAGGACCACAAGGCACAGGGATCCTCCAATTGAAGGCAGTTCGTAATGGAGAGGACTCCTGGATTTCTTTTCTCCGGCCACGTGACTGGGACATTCTAA
TCATGGACGCTCTCCTTCATGTTCCTGCAAACGAAGGTAAGCAGCAAACATTGCGCATCAATCTAACTGAGAAGTTTACCCCTGCTGCTTGTGTTGCATGCACTGATTTT
CAGCCTCCGGAGTCAGAGAAGAGATGA
mRNA sequence
AAAATAACCAAAACTCACACATAGAAAAATATCGATCTCAAAAAATGAGAAAATTTTAATAAAACCATCCAAATAATTACGAATTTTCTATTACCTAATAAATTCTAGAA
AAGGAAGACGAAGAGAGGAAGAAGAAAGAAGAAAGAAGAAAGAATAAAGAATGGGCAAACCTTGGAAAATGAACCAAAAAAAATGGTAAACCCTATGTAATATTAATCCG
TAAAAGGATGAGTGCCCTTTCGTTATTGGCTCACTGTGTCAAAAGTAGGATCCTCATTCAATTAGCAGAAACGGTAAGCAGAATTAAACGTAACGCAATGAAATCACTTT
TTCGAATTTCGTTTGATTTTCAGTAAAAACACCCTAGATTCAAACTATAGTAGTCTAGTCACTCATTTAAAATTTTACAAATACTCGAGAATATATATAATTGAGATGAA
ATCAGACAAATGACTTTACTAGAAATAAAAAAAAAGCATAGATTTTATTTATTCAAAAAGCGCCGCAGTGTGATTTTGGCCTGCGGAAGCGCTGAGCGAAGGAAGGAGCA
AAGAATCCCGAAGCTGAAAATTATGGCGAAAAGGTTCATTTCCATATTCAAGCGCTCTTCAACTCCACACGCTTCCAGTTCCATAAAGCCAGCGGAGGAAGGGGTGAATA
AATCCTGGGGTCGTAAAGCAGCCTCCTTTGTGCTTATTACTGTTACTGGTGGTGTAGCTTTGAGTGCTTTAGATGACCTCGCCATTTATCATAGCTGTAGCAGCAAAGCC
ATAGAGAAAGCCAGAAACAATCAAGCAATTATAGATGCTATTGGAGAACCCATTGTTAAAGGTCCATGGTACAATGCATCACTTGCAGTAGCTCATAAAAGACATTCTCT
ATCCTGCACATTTCCAGTATCAGGACCACAAGGCACAGGGATCCTCCAATTGAAGGCAGTTCGTAATGGAGAGGACTCCTGGATTTCTTTTCTCCGGCCACGTGACTGGG
ACATTCTAATCATGGACGCTCTCCTTCATGTTCCTGCAAACGAAGGTAAGCAGCAAACATTGCGCATCAATCTAACTGAGAAGTTTACCCCTGCTGCTTGTGTTGCATGC
ACTGATTTTCAGCCTCCGGAGTCAGAGAAGAGATGAACTCAGCTCGATTTCGAACTAACCGAACGATCAAAATATTCGTTATTCCCATCAAAATAATATTTTTATCTGGA
CTGCTCTGTCCAAATGCCTTTGAGGTTGAGTTCAAACCATAGTGGAACCAACTTAGAAGATTTATTATCCTACAATTTTGCCACATCAAATGTAGCAAAGTTAAATGGTT
GTTCATGTAACCCAAGTTATAAAAGCAATATTAGTAACTAATTCACTTTATTTTTGTCTTCA
Protein sequence
MTLLEIKKKHRFYLFKKRRSVILACGSAERRKEQRIPKLKIMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEK
ARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDF
QPPESEKR
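
The protein sequence is consistent with a translation of the CDS under the standard genetic code. As a minimal sketch, assuming the standard code applies to this nuclear gene, the snippet below translates the last line of the CDS, which starts in frame because the preceding 660 nucleotides are a multiple of three. `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Last line of the CDS sequence shown above.
print(translate("CAGCCTCCGGAGTCAGAGAAGAGATGA"))
# prints: QPPESEKR
```

The output matches the final line of the protein sequence, and the trailing TGA is the stop codon.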