; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001938 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001938
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiontrigger factor-like isoform X3
Genome locationchr4:37324136..37332383
RNA-Seq ExpressionLag0001938
SyntenyLag0001938
Gene Ontology termsGO:0000413 - protein peptidyl-prolyl isomerization (biological process)
GO:0015031 - protein transport (biological process)
GO:0043335 - protein unfolding (biological process)
GO:0051083 - 'de novo' cotranslational protein folding (biological process)
GO:0061077 - chaperone-mediated protein folding (biological process)
GO:0003755 - peptidyl-prolyl cis-trans isomerase activity (molecular function)
GO:0043022 - ribosome binding (molecular function)
GO:0044183 - protein folding chaperone (molecular function)
InterPro domainsIPR005215 - Trigger factor
IPR008881 - Trigger factor, ribosome-binding, bacterial
IPR036611 - Trigger factor ribosome-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155449.1 uncharacterized protein LOC111022592 isoform X2 [Momordica charantia]7.1e-8685.78Show/hide
Query:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTR
        A+ F      KVIYHKQTADCFTPSL C SVSFPSQ+ YSSRK SL    R+LPAACAVLSEDVSVSSSQF+DFSVT A  +NENRELKIRVEVSG KTR
Subjt:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTR

Query:  AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL
        AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPS+VYKQVIKKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDE FFFDAIIQL
Subjt:  AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL

Query:  KETS
         +T+
Subjt:  KETS

XP_022155452.1 uncharacterized protein LOC111022592 isoform X4 [Momordica charantia]1.3e-8485.78Show/hide
Query:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL---RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTRA
        A+ F      KVIYHKQTADCFTPSL C SVSFPSQ+ YSSRK SL   R+LPAACAVLSEDVSVSSSQF+DFSVT A  +NENRELKIRVEVSG KTRA
Subjt:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL---RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTRA

Query:  IFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEK-EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL
        IFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPS+VYKQVIKKVINSTVAAYVEK EALKVGKDLRIEQSYEDLEDQFEPDE FFFDAIIQL
Subjt:  IFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEK-EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL

Query:  KETS
         +T+
Subjt:  KETS

XP_038876223.1 trigger factor isoform X1 [Benincasa hispida]1.2e-8585.37Show/hide
Query:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTR
        A+ F   T  KVIYHKQTAD FTP + C +VSFPSQM Y SRKFSL    RYLPAACAVLSEDVSVSSSQF+DFSVT+ATN  EN+ELKIRVEVSGAKTR
Subjt:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTR

Query:  AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEK-EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQ
        AIFNNVFD+MVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVI++VINSTVAAYVEK EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQ
Subjt:  AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEK-EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQ

Query:  LKETS
        LKE++
Subjt:  LKETS

XP_038876224.1 trigger factor isoform X2 [Benincasa hispida]4.9e-8785.78Show/hide
Query:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTR
        A+ F   T  KVIYHKQTAD FTP + C +VSFPSQM Y SRKFSL    RYLPAACAVLSEDVSVSSSQF+DFSVT+ATN  EN+ELKIRVEVSGAKTR
Subjt:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTR

Query:  AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL
        AIFNNVFD+MVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVI++VINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL
Subjt:  AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL

Query:  KETS
        KE++
Subjt:  KETS

XP_038876225.1 trigger factor isoform X3 [Benincasa hispida]1.0e-8486.8Show/hide
Query:  FTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTRAIFNNVFD
        +  VIYHKQTAD FTP + C +VSFPSQM Y SRKFSL    RYLPAACAVLSEDVSVSSSQF+DFSVT+ATN  EN+ELKIRVEVSGAKTRAIFNNVFD
Subjt:  FTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTRAIFNNVFD

Query:  EMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEK-EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQLKETS
        +MVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVI++VINSTVAAYVEK EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQLKE++
Subjt:  EMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEK-EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQLKETS

TrEMBL top hitse value%identityAlignment
A0A0A0KBW0 Trigger_N domain-containing protein3.5e-8382.35Show/hide
Query:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTR
        A+ F      K IYHKQT D FTP+L C  VSFPSQ+RY SRK SL    RYLPAACAVLSE+VSVSSSQF+DFSVT+ TN  EN+ELKIRVEVSGAKTR
Subjt:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTR

Query:  AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL
        AIFN VFD MVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIK+VINSTVAAYVEKEALKVGKDLRI+QSYEDLEDQFEPDE FFFDAIIQL
Subjt:  AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL

Query:  KETS
        KE++
Subjt:  KETS

A0A6J1DMZ5 uncharacterized protein LOC111022592 isoform X18.4e-8585.37Show/hide
Query:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTR
        A+ F      KVIYHKQTADCFTPSL C SVSFPSQ+ YSSRK SL    R+LPAACAVLSEDVSVSSSQF+DFSVT A  +NENRELKIRVEVSG KTR
Subjt:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTR

Query:  AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEK-EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQ
        AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPS+VYKQVIKKVINSTVAAYVEK EALKVGKDLRIEQSYEDLEDQFEPDE FFFDAIIQ
Subjt:  AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEK-EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQ

Query:  LKETS
        L +T+
Subjt:  LKETS

A0A6J1DN00 uncharacterized protein LOC111022592 isoform X56.0e-8385.29Show/hide
Query:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL---RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTRA
        A+ F      KVIYHKQTADCFTPSL C SVSFPSQ+ YSSRK SL   R+LPAACAVLS DVSVSSSQF+DFSVT A  +NENRELKIRVEVSG KTRA
Subjt:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL---RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTRA

Query:  IFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEK-EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL
        IFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPS+VYKQVIKKVINSTVAAYVEK EALKVGKDLRIEQSYEDLEDQFEPDE FFFDAIIQL
Subjt:  IFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEK-EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL

Query:  KETS
         +T+
Subjt:  KETS

A0A6J1DPD9 uncharacterized protein LOC111022592 isoform X46.5e-8585.78Show/hide
Query:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL---RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTRA
        A+ F      KVIYHKQTADCFTPSL C SVSFPSQ+ YSSRK SL   R+LPAACAVLSEDVSVSSSQF+DFSVT A  +NENRELKIRVEVSG KTRA
Subjt:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL---RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTRA

Query:  IFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEK-EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL
        IFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPS+VYKQVIKKVINSTVAAYVEK EALKVGKDLRIEQSYEDLEDQFEPDE FFFDAIIQL
Subjt:  IFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEK-EALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL

Query:  KETS
         +T+
Subjt:  KETS

A0A6J1DQB1 uncharacterized protein LOC111022592 isoform X23.4e-8685.78Show/hide
Query:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTR
        A+ F      KVIYHKQTADCFTPSL C SVSFPSQ+ YSSRK SL    R+LPAACAVLSEDVSVSSSQF+DFSVT A  +NENRELKIRVEVSG KTR
Subjt:  ASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSL----RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTR

Query:  AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL
        AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPS+VYKQVIKKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDE FFFDAIIQL
Subjt:  AIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQL

Query:  KETS
         +T+
Subjt:  KETS

SwissProt top hitse value%identityAlignment
B1XL18 Trigger factor3.7e-0523.08Show/hide
Query:  ELKIRVEVSGAKTRAIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQF
        ++ + +EV    T+  +++   ++ A    +PGFR+ K      +P+ IL++ LGP+++   V++ +I+ ++ A + +E ++   + +++ S++DL   +
Subjt:  ELKIRVEVSGAKTRAIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQF

Query:  EPDEKFFFDAIIQLKET
        +P E   F A + +  T
Subjt:  EPDEKFFFDAIIQLKET

Arabidopsis top hitse value%identityAlignment
AT2G30695.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast stroma, chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Trigger factor, ribosome-binding, bacterial (InterPro:IPR008881); Has 253 Blast hits to 253 proteins in 72 species: Archae - 0; Bacteria - 138; Metazoa - 0; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 75 (source: NCBI BLink).3.5e-4358.44Show/hide
Query:  RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTRAIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVI
        R   A CA  S+   V +S  D+  +        + E+K+ V+VSG KT+ +FN+VF++MVA AQPIPGFRRVKGGKTPNIP+D+LLEILG SKVYKQVI
Subjt:  RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTRAIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVI

Query:  KKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQLKETS
        KK+INS +  YV++E LKVGK+L + QSYEDLE+ FEP E F FDA I+L+E S
Subjt:  KKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQLKETS

AT2G30695.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Trigger factor, ribosome-binding, bacterial (InterPro:IPR008881); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).3.5e-4358.44Show/hide
Query:  RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTRAIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVI
        R   A CA  S+   V +S  D+  +        + E+K+ V+VSG KT+ +FN+VF++MVA AQPIPGFRRVKGGKTPNIP+D+LLEILG SKVYKQVI
Subjt:  RYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKTRAIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVI

Query:  KKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQLKETS
        KK+INS +  YV++E LKVGK+L + QSYEDLE+ FEP E F FDA I+L+E S
Subjt:  KKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQLKETS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTGCGACCGCAACCGCAACCGCAACGGTAACCAATATTGCCTCAGAATTCCGGCGGCCAACATTCACCAAAGTTATATATCACAAGCAGACAGCCGATTGCTT
TACTCCAAGCCTTGCCTGCAAAAGTGTGTCTTTTCCCAGTCAAATGAGATACAGTAGTAGAAAATTTTCTTTGAGGTATCTTCCAGCTGCTTGTGCTGTGTTATCAGAAG
ATGTGAGTGTTTCTTCTTCTCAGTTTGACGACTTCTCTGTCACTGATGCTACTAATATCAATGAGAATAGAGAACTAAAGATTCGTGTAGAGGTGTCTGGTGCCAAAACT
CGAGCAATTTTCAACAATGTCTTTGATGAAATGGTTGCTGAAGCCCAGCCTATTCCAGGCTTTCGGAGAGTGAAAGGAGGAAAGACACCAAACATACCCCGAGACATTCT
ATTAGAAATACTGGGACCTTCTAAGGTTTACAAGCAAGTTATCAAGAAAGTTATCAACTCCACTGTGGCTGCATATGTGGAAAAGGAAGCTCTAAAAGTGGGTAAAGACT
TGAGAATAGAACAAAGCTATGAGGATCTTGAAGACCAATTTGAACCAGATGAAAAGTTCTTTTTTGATGCCATTATTCAGCTCAAGGAAACAAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTGCGACCGCAACCGCAACCGCAACGGTAACCAATATTGCCTCAGAATTCCGGCGGCCAACATTCACCAAAGTTATATATCACAAGCAGACAGCCGATTGCTT
TACTCCAAGCCTTGCCTGCAAAAGTGTGTCTTTTCCCAGTCAAATGAGATACAGTAGTAGAAAATTTTCTTTGAGGTATCTTCCAGCTGCTTGTGCTGTGTTATCAGAAG
ATGTGAGTGTTTCTTCTTCTCAGTTTGACGACTTCTCTGTCACTGATGCTACTAATATCAATGAGAATAGAGAACTAAAGATTCGTGTAGAGGTGTCTGGTGCCAAAACT
CGAGCAATTTTCAACAATGTCTTTGATGAAATGGTTGCTGAAGCCCAGCCTATTCCAGGCTTTCGGAGAGTGAAAGGAGGAAAGACACCAAACATACCCCGAGACATTCT
ATTAGAAATACTGGGACCTTCTAAGGTTTACAAGCAAGTTATCAAGAAAGTTATCAACTCCACTGTGGCTGCATATGTGGAAAAGGAAGCTCTAAAAGTGGGTAAAGACT
TGAGAATAGAACAAAGCTATGAGGATCTTGAAGACCAATTTGAACCAGATGAAAAGTTCTTTTTTGATGCCATTATTCAGCTCAAGGAAACAAGCTGA
Protein sequenceShow/hide protein sequence
MASATATATATVTNIASEFRRPTFTKVIYHKQTADCFTPSLACKSVSFPSQMRYSSRKFSLRYLPAACAVLSEDVSVSSSQFDDFSVTDATNINENRELKIRVEVSGAKT
RAIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPRDILLEILGPSKVYKQVIKKVINSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDEKFFFDAIIQLKETS