; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018561 (gene) of Snake gourd v1 genome

Gene IDTan0018561
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic
Genome locationLG06:6552641..6554789
RNA-Seq ExpressionTan0018561
SyntenyTan0018561
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR003425 - CCB3/YggT


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136017.2 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X3 [Cucumis sativus]6.2e-8185.86Show/hide
Query:  SAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAKLA
        +A ACSSLSSIRR FKYHPQR+PNC+FQA +CSS LLGSFTSSK  LSLAY  PPLKPA AAYEAARTIPF LQDASMAASDF+N++ LADLDP TAKLA
Subjt:  SAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAKLA

Query:  IGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        I FLGP LS FSFLFI RIVMSWYPKLPV KFPYVIA+APTEPLL+ATRKVIPPLGGVDVTPVVWFGL+SFLNEILLGPQGLLVLLSQQVS
Subjt:  IGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

XP_022953720.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic [Cucurbita moschata]2.5e-8287.43Show/hide
Query:  SAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAKLA
        +A ACSSLS+IRR F YHPQ +PNCRFQA +CSS LLGSFTSSKIHL L YATPPLKP      AARTIPFALQDAS+AASDFMNNV+LADLDP TAKLA
Subjt:  SAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAKLA

Query:  IGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        IGFLGPFLSAFSFLFI RIVMSWYPKLPV KFPYVIA+APTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  IGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

XP_023548988.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X2 [Cucurbita pepo subsp. pepo]1.9e-8287.96Show/hide
Query:  SAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAKLA
        +A ACSSLS+IRR F YHPQ +PNCRFQA +CSS LLGSFTSSKIHL L YATPPLKP      AARTIPFALQDASMAASDFMNNVALADLDP TAKLA
Subjt:  SAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAKLA

Query:  IGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        IG LGPFLSAFSFLFI RIVMSWYPKLPV KFPYVIA+APTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  IGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

XP_038896182.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Benincasa hispida]5.1e-8382.94Show/hide
Query:  AAASAAACSSLSSIR-----------------RSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAA
        AA +AAACSSLS IR                 R FKYHPQR+PNCRFQAT+CSS +L SFT+SK+HLSLAYAT PLKPA AAYEAARTIPFALQDASM+A
Subjt:  AAASAAACSSLSSIR-----------------RSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAA

Query:  SDFMNNVALADLDPTTAKLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
        SDFMNNVALADLDP  AKLAIGFLGPFLSAFSFLFI RIVMSWYPKLPV KFPYVIA+APTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
Subjt:  SDFMNNVALADLDPTTAKLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ

Query:  GLLVLLSQQVS
        GLLVLLSQQVS
Subjt:  GLLVLLSQQVS

XP_038896183.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X2 [Benincasa hispida]2.9e-8690.21Show/hide
Query:  AAASAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTA
        AA +AAACSSLS IRR FKYHPQR+PNCRFQAT+CSS +L SFT+SK+HLSLAYAT PLKPA AAYEAARTIPFALQDASM+ASDFMNNVALADLDP  A
Subjt:  AAASAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTA

Query:  KLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        KLAIGFLGPFLSAFSFLFI RIVMSWYPKLPV KFPYVIA+APTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  KLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

TrEMBL top hitse value%identityAlignment
A0A1S3BSI7 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic6.7e-8186.98Show/hide
Query:  ASAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAKL
        A+ AACSSLSSIRR FKYHPQR+PN +FQA +CSS LLGSFTSSKI  SLAY  PPLKPA AAYEAARTIPFALQDASMAASDF+N++ LADLDP TAKL
Subjt:  ASAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAKL

Query:  AIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        AI FLGP LS FSFLFI RIVMSWYPKLPV KFPYVIA+APTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  AIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

A0A6J1CUI3 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X27.0e-7885.34Show/hide
Query:  ASAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGS-FTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAK
        A+AA CSSL+SIRR FK+HPQR+PN RFQA QCSS LLGS  TSSK+ L LA ATPPLKPA AA+E  RT PFALQDASMAASDF  N+ALADLDP TAK
Subjt:  ASAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGS-FTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAK

Query:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
        LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPV KFPYV+A+APTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
Subjt:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ

A0A6J1CUU2 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X36.3e-7985.49Show/hide
Query:  ASAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGS-FTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAK
        A+AA CSSL+SIRR FK+HPQR+PN RFQA QCSS LLGS  TSSK+ L LA ATPPLKPA AA+E  RT PFALQDASMAASDF  N+ALADLDP TAK
Subjt:  ASAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGS-FTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAK

Query:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPV KFPYV+A+APTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

A0A6J1GQG0 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic1.2e-8287.43Show/hide
Query:  SAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAKLA
        +A ACSSLS+IRR F YHPQ +PNCRFQA +CSS LLGSFTSSKIHL L YATPPLKP      AARTIPFALQDAS+AASDFMNNV+LADLDP TAKLA
Subjt:  SAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAKLA

Query:  IGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        IGFLGPFLSAFSFLFI RIVMSWYPKLPV KFPYVIA+APTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  IGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

A0A6J1JVN7 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X22.2e-7986.39Show/hide
Query:  SAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAKLA
        +A ACSSLS+IRR F YHPQ +PNCRFQA +CSS LLGSFTSSKI L L  ATP LKP      AARTIPFALQDASMAASDF NNVALADLDP TAKLA
Subjt:  SAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAKLA

Query:  IGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        IGFLGPFLSAFSFLFI RIVMSWYPKLPV KFPYVIA+APTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  IGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

SwissProt top hitse value%identityAlignment
Q8RWM7 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic1.3e-4474.6Show/hide
Query:  EAARTIPFALQDASMAASDFMNNVALADLDPTTAKLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPV
        EAA T     Q  S+  S+ + N++LADLDP TAKLAIG LGP LSAF FLFI+RIVMSWYPKLPV+KFPYV+A+APTEP+L+ TRKVIPPL GVDVTPV
Subjt:  EAARTIPFALQDASMAASDFMNNVALADLDPTTAKLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPV

Query:  VWFGLVSFLNEILLGPQGLLVLLSQQ
        VWFGLVSFL+EIL+GPQGLLVL+SQQ
Subjt:  VWFGLVSFLNEILLGPQGLLVLLSQQ

Arabidopsis top hitse value%identityAlignment
AT4G27990.1 YGGT family protein2.3e-0430.38Show/hide
Query:  LGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPL-GGVDVTPVVWFGLVSFLNEILLGPQG
        L  +L  +S + +VR+++SW+P +P ++ P        +P L   R +IPP+   +DV+P++ F ++  L  IL   +G
Subjt:  LGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPL-GGVDVTPVVWFGLVSFLNEILLGPQG

AT5G21920.1 YGGT family protein6.6e-0435.48Show/hide
Query:  FLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPT--EPLLIATRKVIPPLGGVDVTPVVWF
        FL+ ++ + +VR+V++W+P  P    P ++    T  +P L   R  IPPLGG+D++P++ F
Subjt:  FLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPT--EPLLIATRKVIPPLGGVDVTPVVWF

AT5G21920.2 YGGT family protein6.6e-0435.48Show/hide
Query:  FLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPT--EPLLIATRKVIPPLGGVDVTPVVWF
        FL+ ++ + +VR+V++W+P  P    P ++    T  +P L   R  IPPLGG+D++P++ F
Subjt:  FLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPT--EPLLIATRKVIPPLGGVDVTPVVWF

AT5G36120.1 cofactor assembly, complex C (B6F)9.1e-4674.6Show/hide
Query:  EAARTIPFALQDASMAASDFMNNVALADLDPTTAKLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPV
        EAA T     Q  S+  S+ + N++LADLDP TAKLAIG LGP LSAF FLFI+RIVMSWYPKLPV+KFPYV+A+APTEP+L+ TRKVIPPL GVDVTPV
Subjt:  EAARTIPFALQDASMAASDFMNNVALADLDPTTAKLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPV

Query:  VWFGLVSFLNEILLGPQGLLVLLSQQ
        VWFGLVSFL+EIL+GPQGLLVL+SQQ
Subjt:  VWFGLVSFLNEILLGPQGLLVLLSQQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCGCCGCTTCCGCTGCCGCCTGCTCCTCTCTCAGCTCCATTCGAAGAAGCTTCAAGTACCATCCTCAAAGAAGTCCAAACTGCAGATTTCAAGCAACCCAATG
TAGCTCATGTTTGTTGGGTTCTTTTACCTCTTCCAAGATTCATCTGTCATTGGCCTATGCCACCCCTCCATTAAAGCCAGCTTCTGCTGCATATGAAGCTGCGAGGACTA
TCCCCTTTGCCTTACAAGATGCATCAATGGCTGCCTCTGATTTCATGAACAATGTGGCCCTGGCCGACCTCGATCCCACAACGGCAAAGCTTGCGATCGGGTTTTTGGGG
CCATTTCTATCAGCATTTTCGTTTTTGTTCATTGTGAGAATAGTAATGTCTTGGTATCCCAAGTTGCCTGTGGAGAAGTTTCCATATGTTATAGCTTTTGCCCCCACAGA
ACCACTTCTAATTGCAACAAGGAAGGTGATCCCCCCTCTCGGCGGAGTCGACGTAACACCAGTCGTCTGGTTCGGATTGGTTAGTTTCCTCAACGAGATATTGCTCGGTC
CCCAAGGGCTGCTTGTCCTCCTTTCTCAACAGGTCAGCTGA
mRNA sequenceShow/hide mRNA sequence
CAAAAAAAAATCTCTTTCACAAACCCAATTCCTCCCATTGATGATAAAAGGATAGCCTTCTCCTTCTCCTTCGCCGGAGCTCCTCTGCGAAAGCCATGGCCGCCGCCGCT
TCCGCTGCCGCCTGCTCCTCTCTCAGCTCCATTCGAAGAAGCTTCAAGTACCATCCTCAAAGAAGTCCAAACTGCAGATTTCAAGCAACCCAATGTAGCTCATGTTTGTT
GGGTTCTTTTACCTCTTCCAAGATTCATCTGTCATTGGCCTATGCCACCCCTCCATTAAAGCCAGCTTCTGCTGCATATGAAGCTGCGAGGACTATCCCCTTTGCCTTAC
AAGATGCATCAATGGCTGCCTCTGATTTCATGAACAATGTGGCCCTGGCCGACCTCGATCCCACAACGGCAAAGCTTGCGATCGGGTTTTTGGGGCCATTTCTATCAGCA
TTTTCGTTTTTGTTCATTGTGAGAATAGTAATGTCTTGGTATCCCAAGTTGCCTGTGGAGAAGTTTCCATATGTTATAGCTTTTGCCCCCACAGAACCACTTCTAATTGC
AACAAGGAAGGTGATCCCCCCTCTCGGCGGAGTCGACGTAACACCAGTCGTCTGGTTCGGATTGGTTAGTTTCCTCAACGAGATATTGCTCGGTCCCCAAGGGCTGCTTG
TCCTCCTTTCTCAACAGGTCAGCTGAATCTTGAGAATGCTTCAGAGAAACATTTTTGTTTTCTTGTTTGAAGTTCATACCTGTAAGATTCTCCATGATCTGTATATGTAC
TTGATATTCTTATCTGAGCCTCCTTGGTTTTAAAAAGCAGAGTGTGAGAAGAACTTGAAATGCTTAATTAGAGAAAGTTTCCCCCCCCTCGGTTCGGCTTTGGATTGGTT
TTGATTGGTTTTGAGCTAAGACCAACACGAACGAACATGGGTCTGAAAATTTGATCTTTTTAATACCGAACTAATCGAACTTCCC
Protein sequenceShow/hide protein sequence
MAAAASAAACSSLSSIRRSFKYHPQRSPNCRFQATQCSSCLLGSFTSSKIHLSLAYATPPLKPASAAYEAARTIPFALQDASMAASDFMNNVALADLDPTTAKLAIGFLG
PFLSAFSFLFIVRIVMSWYPKLPVEKFPYVIAFAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS