; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013219 (gene) of Snake gourd v1 genome

Gene IDTan0013219
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransmembrane protein
Genome locationLG11:5586763..5595314
RNA-Seq ExpressionTan0013219
SyntenyTan0013219
Gene Ontology termsGO:0010190 - cytochrome b6f complex assembly (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN51853.2 hypothetical protein Csa_008870 [Cucumis sativus]8.9e-7165.53Show/hide
Query:  KSLSRAATLSKTGNSPFLFLTQSFYSPPFH-SKNLLISFLFFFFLTYFQLNFIFPLSLFLQPLIKKENKVSKMEENPSRATRRRRFAVDDGADLIDCSGK
        ++L    T S + +   L L QS Y  P   S+NL+ +F FF               L ++   ++  ++ KM+E+PS+ATRRRRFAVDDGADLIDCSGK
Subjt:  KSLSRAATLSKTGNSPFLFLTQSFYSPPFH-SKNLLISFLFFFFLTYFQLNFIFPLSLFLQPLIKKENKVSKMEENPSRATRRRRFAVDDGADLIDCSGK

Query:  HCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETSGGPTGEEGMPEISPGSGEEEEEMGNFSA
        HCRSCTAGLVADCVAVCCCPCSVVSFLALAL+KLPWM+GRRCLQ+A   RKKRKL+ R+GE DGAT AAET G    E+G+PE+SPGSGEE+E  GNFSA
Subjt:  HCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETSGGPTGEEGMPEISPGSGEEEEEMGNFSA

Query:  RFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN
        RFEAERIWLQLYQVGQLGFGRVSFTGN+NLWPNSN
Subjt:  RFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN

XP_008458773.1 PREDICTED: uncharacterized protein LOC103498078 [Cucumis melo]1.3e-6985.89Show/hide
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETS
        M+E+PSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALAL+KLPWM+GRRCLQ+A   RKKRKL+RR+GE DGAT AAET 
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETS

Query:  GGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN
        G    E+G+PEISPGSGEE+E  GNFSARFEAERIWLQLYQVGQLGFGRVSFTGN+NLWPNSN
Subjt:  GGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN

XP_011655414.1 uncharacterized protein LOC105435525 [Cucumis sativus]4.1e-6884.05Show/hide
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETS
        M+E+PS+ATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALAL+KLPWM+GRRCLQ+A   RKKRKL+ R+GE DGAT AAET 
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETS

Query:  GGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN
        G    E+G+PE+SPGSGEE+E  GNFSARFEAERIWLQLYQVGQLGFGRVSFTGN+NLWPNSN
Subjt:  GGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN

XP_023536558.1 uncharacterized protein LOC111797691 [Cucurbita pepo subsp. pepo]1.1e-6884.66Show/hide
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRARKKRKLIRRKGEPDGATAAAETSGGP
        MEE+PS  TRRRRF VDDG DLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALAL+KLPWM+GRRCLQRARKKRKLIRRK E DGA AA+ET  GP
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRARKKRKLIRRKGEPDGATAAAETSGGP

Query:  TGEEGMPEISPGSG---EEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN
          EE +PEI PGSG   EEEEE+GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGN+N WPNSN
Subjt:  TGEEGMPEISPGSG---EEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN

XP_038891069.1 uncharacterized protein LOC120080480 [Benincasa hispida]4.0e-7188.34Show/hide
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETS
        M+E+PSR TRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWM+GRRCLQRA   RKKRKLIRR+GE DGAT AAET 
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETS

Query:  GGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN
        G    EEG+PEISPGSGEEEE +GNFSARFEAERIWLQLYQVGQLGFGRVSFTGN+NLWPNSN
Subjt:  GGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN

TrEMBL top hitse value%identityAlignment
A0A0A0KR58 Uncharacterized protein4.3e-7165.53Show/hide
Query:  KSLSRAATLSKTGNSPFLFLTQSFYSPPFH-SKNLLISFLFFFFLTYFQLNFIFPLSLFLQPLIKKENKVSKMEENPSRATRRRRFAVDDGADLIDCSGK
        ++L    T S + +   L L QS Y  P   S+NL+ +F FF               L ++   ++  ++ KM+E+PS+ATRRRRFAVDDGADLIDCSGK
Subjt:  KSLSRAATLSKTGNSPFLFLTQSFYSPPFH-SKNLLISFLFFFFLTYFQLNFIFPLSLFLQPLIKKENKVSKMEENPSRATRRRRFAVDDGADLIDCSGK

Query:  HCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETSGGPTGEEGMPEISPGSGEEEEEMGNFSA
        HCRSCTAGLVADCVAVCCCPCSVVSFLALAL+KLPWM+GRRCLQ+A   RKKRKL+ R+GE DGAT AAET G    E+G+PE+SPGSGEE+E  GNFSA
Subjt:  HCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETSGGPTGEEGMPEISPGSGEEEEEMGNFSA

Query:  RFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN
        RFEAERIWLQLYQVGQLGFGRVSFTGN+NLWPNSN
Subjt:  RFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN

A0A1S3C8M3 uncharacterized protein LOC1034980786.2e-7085.89Show/hide
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETS
        M+E+PSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALAL+KLPWM+GRRCLQ+A   RKKRKL+RR+GE DGAT AAET 
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETS

Query:  GGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN
        G    E+G+PEISPGSGEE+E  GNFSARFEAERIWLQLYQVGQLGFGRVSFTGN+NLWPNSN
Subjt:  GGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN

A0A5A7T2E5 Uncharacterized protein6.2e-7085.89Show/hide
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETS
        M+E+PSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALAL+KLPWM+GRRCLQ+A   RKKRKL+RR+GE DGAT AAET 
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRA---RKKRKLIRRKGEPDGATAAAETS

Query:  GGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN
        G    E+G+PEISPGSGEE+E  GNFSARFEAERIWLQLYQVGQLGFGRVSFTGN+NLWPNSN
Subjt:  GGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN

A0A6J1HEE2 uncharacterized protein LOC1114632061.3e-6783.54Show/hide
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRARKKRKLIRRKGEPDGATAAAETSGGP
        MEE+PS  TRRRRF VDDG DLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALAL+KLPWM+GRRCLQRARKKRKLIRRK E DGA AA+ET+ G 
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRARKKRKLIRRKGEPDGATAAAETSGGP

Query:  TGEEGMPEISPGSGEEEEE----MGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN
          EEG+PEI PGS EEEEE    +GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGN+N WPNSN
Subjt:  TGEEGMPEISPGSGEEEEE----MGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN

A0A6J1JH40 uncharacterized protein LOC1114869584.5e-6881.66Show/hide
Query:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRARKKRKLIRRKGEPDGATAAAETSGGP
        MEE+PS  TRRRRF VDDG DLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALAL+KLPWM+GRRCLQRARKKRKLIRRK E DGA AA+ET  GP
Subjt:  MEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRARKKRKLIRRKGEPDGATAAAETSGGP

Query:  TGEEGMPEISPGSGEEEEE---------MGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN
          EEG+PEI PGS EEEEE         +GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGN+N WPNSN
Subjt:  TGEEGMPEISPGSGEEEEE---------MGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01516.1 unknown protein1.3e-2438.62Show/hide
Query:  KKENKVSKMEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQR---ARKKRKLIRRK-----
        +K ++VS+ +++ S  ++       D      CSGK CRS  A  +ADCVA+CCCPC+VV+   LA +K+PWM+GR+C+ R   ++K+ K I R+     
Subjt:  KKENKVSKMEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQR---ARKKRKLIRRK-----

Query:  ----------------------GEPDGATAAAETSGGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTG
                              GE D      E  G  T EE     +  +  +EEE    SAR EAER+WL+LYQ+G LGFGRVSFTG
Subjt:  ----------------------GEPDGATAAAETSGGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTG

AT3G11690.1 unknown protein2.0e-0432.29Show/hide
Query:  IKKENKVSKM-EENPSRATRRR---RFAVDDGADLIDCSGKHCRSCTAGLVADCVAV-CCCPCSVVSFLALALIKLPWMLGRRCLQRARKKRKLIR
        I ++ K S     +PSR+ RR+   + ++   +    C G        G  A C AV CCCPC +V+ L LA+ K+P  + RR + R+R++++L++
Subjt:  IKKENKVSKM-EENPSRATRRR---RFAVDDGADLIDCSGKHCRSCTAGLVADCVAV-CCCPCSVVSFLALALIKLPWMLGRRCLQRARKKRKLIR

AT5G06380.1 unknown protein4.5e-0430.43Show/hide
Query:  GKHCRSCTAGLVADCVAVC-CCPCSVVSFLALALIKLPWMLGRRCLQRARKKRKLIRRKGEPDGATAAAETSGGPTGEEGMPEISPGSGEEEEEMGNFSA
        G     C  G  A C A+C C PCSVV+ + LA+ KLP  L RR ++R R+K     R  + +   +  E   G + +  +  +     EEEEE  + + 
Subjt:  GKHCRSCTAGLVADCVAVC-CCPCSVVSFLALALIKLPWMLGRRCLQRARKKRKLIRRKGEPDGATAAAETSGGPTGEEGMPEISPGSGEEEEEMGNFSA

Query:  RFEAERIWLQLYQVG
            + +W + Y  G
Subjt:  RFEAERIWLQLYQVG

AT5G14690.1 unknown protein1.9e-2636.74Show/hide
Query:  MEENPSRATRR-------RRFAVDD-----------GADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCL---QRARKKRK
        MEENP R +RR       +  AVD+             D + CS K CRS  A  +ADCVA+CCCPC++++ L L L+K+PWM+GRRCL    R +KKR+
Subjt:  MEENPSRATRR-------RRFAVDD-----------GADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCL---QRARKKRK

Query:  LIRRK-------------------------------------------GEPDGATAAAETSGGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQL
        +I R+                                           G+ D      E  G  T EE   E +     E+ +    SAR EAER+WL+L
Subjt:  LIRRK-------------------------------------------GEPDGATAAAETSGGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQL

Query:  YQVGQLGFGRVSFTG
        YQ+G LGFGRVSFTG
Subjt:  YQVGQLGFGRVSFTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAATTTCGGAAGCAGTTCCCAATATTCGCTGTCACTCACTCCCCATCGGTACTCATGAATATCAACCCTCTTCCTTCTTCTTCTTTCCCTCTCCTCTTTTTTCTTT
TTTCTTTTCTTTTTTAATTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTCGTCAGTTTTTTAATTCCGTTTATTCAAAACGGCTTATTTTGTAAATCTCTGTCACGCGCTG
CCACGCTTTCTAAAACTGGTAACTCTCCTTTTCTTTTTCTCACCCAATCCTTCTATTCACCCCCTTTTCACTCGAAAAATCTACTCATTTCCTTCCTTTTTTTTTTTTTT
TTAACTTATTTTCAACTAAATTTCATTTTCCCCTTGTCTTTGTTCTTGCAGCCTCTAATCAAGAAAGAAAATAAAGTCTCAAAAATGGAAGAAAACCCATCTCGAGCCAC
ACGGCGGCGGAGATTCGCCGTGGACGACGGCGCCGATCTAATCGACTGCTCCGGCAAGCATTGCCGGTCGTGCACCGCCGGGCTGGTGGCCGATTGCGTCGCCGTCTGCT
GCTGCCCGTGCTCCGTGGTCAGCTTCTTGGCTCTGGCCCTCATCAAACTGCCGTGGATGCTCGGCCGGCGGTGTCTGCAGCGGGCGAGGAAGAAGAGGAAATTGATTCGC
CGGAAAGGGGAACCCGATGGCGCCACGGCGGCGGCCGAAACTAGTGGGGGTCCAACGGGAGAGGAGGGAATGCCGGAAATTTCGCCGGGGTCCGGCGAGGAAGAAGAAGA
GATGGGGAATTTCAGTGCGAGATTTGAAGCAGAGAGAATTTGGTTGCAATTGTATCAGGTGGGTCAGTTGGGTTTTGGAAGAGTTTCATTCACTGGGAATACAAATCTAT
GGCCCAACTCCAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAATTTCGGAAGCAGTTCCCAATATTCGCTGTCACTCACTCCCCATCGGTACTCATGAATATCAACCCTCTTCCTTCTTCTTCTTTCCCTCTCCTCTTTTTTCTTT
TTTCTTTTCTTTTTTAATTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTCGTCAGTTTTTTAATTCCGTTTATTCAAAACGGCTTATTTTGTAAATCTCTGTCACGCGCTG
CCACGCTTTCTAAAACTGGTAACTCTCCTTTTCTTTTTCTCACCCAATCCTTCTATTCACCCCCTTTTCACTCGAAAAATCTACTCATTTCCTTCCTTTTTTTTTTTTTT
TTAACTTATTTTCAACTAAATTTCATTTTCCCCTTGTCTTTGTTCTTGCAGCCTCTAATCAAGAAAGAAAATAAAGTCTCAAAAATGGAAGAAAACCCATCTCGAGCCAC
ACGGCGGCGGAGATTCGCCGTGGACGACGGCGCCGATCTAATCGACTGCTCCGGCAAGCATTGCCGGTCGTGCACCGCCGGGCTGGTGGCCGATTGCGTCGCCGTCTGCT
GCTGCCCGTGCTCCGTGGTCAGCTTCTTGGCTCTGGCCCTCATCAAACTGCCGTGGATGCTCGGCCGGCGGTGTCTGCAGCGGGCGAGGAAGAAGAGGAAATTGATTCGC
CGGAAAGGGGAACCCGATGGCGCCACGGCGGCGGCCGAAACTAGTGGGGGTCCAACGGGAGAGGAGGGAATGCCGGAAATTTCGCCGGGGTCCGGCGAGGAAGAAGAAGA
GATGGGGAATTTCAGTGCGAGATTTGAAGCAGAGAGAATTTGGTTGCAATTGTATCAGGTGGGTCAGTTGGGTTTTGGAAGAGTTTCATTCACTGGGAATACAAATCTAT
GGCCCAACTCCAATTAG
Protein sequenceShow/hide protein sequence
MGISEAVPNIRCHSLPIGTHEYQPSSFFFFPSPLFSFFFSFLILSLSLSLSLFVSFLIPFIQNGLFCKSLSRAATLSKTGNSPFLFLTQSFYSPPFHSKNLLISFLFFFF
LTYFQLNFIFPLSLFLQPLIKKENKVSKMEENPSRATRRRRFAVDDGADLIDCSGKHCRSCTAGLVADCVAVCCCPCSVVSFLALALIKLPWMLGRRCLQRARKKRKLIR
RKGEPDGATAAAETSGGPTGEEGMPEISPGSGEEEEEMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNTNLWPNSN