; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004561 (gene) of Snake gourd v1 genome

Gene IDTan0004561
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG01:102527196..102527704
RNA-Seq ExpressionTan0004561
SyntenyTan0004561
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB2629575.1 pentatricopeptide repeat-containing protein [Pyrus ussuriensis x Pyrus communis]1.9e-1756.6Show/hide
Query:  MGIK-WRNSRSQSIRLGQS--CVSNEQESKGFGWQILWQKIKKEKKKIFSSSAV--EFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPS-IVSR
        MGI+ WRNSRSQSIRLG +    SN  +S+  GWQ  W++ K E+KK F SS V  + ++SY+P  Y  NFDQG G  EPDNL RSFSARFADPS ++  
Subjt:  MGIK-WRNSRSQSIRLGQS--CVSNEQESKGFGWQILWQKIKKEKKKIFSSSAV--EFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPS-IVSR

Query:  NLRLLD
        N  LLD
Subjt:  NLRLLD

KAG6591653.1 hypothetical protein SDJN03_13999, partial [Cucurbita argyrosperma subsp. sororia]5.9e-3580.39Show/hide
Query:  MGIKWRNSRSQSIRLGQSCVS-NEQESKG-FGWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNLRL
        MGIKWRNSRSQSIRLGQSC S NEQESK  FGWQILWQK KKEK++IFS S+VE RSSYNPNAY LNF+Q    S+PD+LCRSFSARFADPSIVSR+ RL
Subjt:  MGIKWRNSRSQSIRLGQSCVS-NEQESKG-FGWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNLRL

Query:  LD
        LD
Subjt:  LD

KGN60340.2 hypothetical protein Csa_000895 [Cucumis sativus]8.2e-3782.18Show/hide
Query:  MGIKWRNSRSQSIRLGQSCV-SNEQESKGFGWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNLRLL
        MG+KWRNSRSQSIRLGQSCV SNEQESK  GWQILW+K+KKEK+K+FS S+VE RSSYNPNAY LNFD+    SEPDNL RSFSARFADPSIVSRNLRLL
Subjt:  MGIKWRNSRSQSIRLGQSCV-SNEQESKGFGWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNLRLL

Query:  D
        D
Subjt:  D

XP_022936244.1 uncharacterized protein LOC111442916 [Cucurbita moschata]4.5e-3580.39Show/hide
Query:  MGIKWRNSRSQSIRLGQSC-VSNEQESKG-FGWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNLRL
        MGIKWRNSRSQSIRLGQSC  SNEQESK  FGWQILWQK KKEK++IFS S+VE RSSYNPNAY LNF+Q    S+PD+LCRSFSARFADPSIVSR+ RL
Subjt:  MGIKWRNSRSQSIRLGQSC-VSNEQESKG-FGWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNLRL

Query:  LD
        LD
Subjt:  LD

XP_023536137.1 uncharacterized protein LOC111797385 [Cucurbita pepo subsp. pepo]2.2e-3479.41Show/hide
Query:  MGIKWRNSRSQSIRLGQSC-VSNEQESKG-FGWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNLRL
        MGIKWRNSRSQSIRLGQSC  SNEQESK   GWQILWQK KKEK++IFS S+VE RSSYNPNAY LNF+Q    S+PD+LCRSFSARFADPSIVSR+ RL
Subjt:  MGIKWRNSRSQSIRLGQSC-VSNEQESKG-FGWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNLRL

Query:  LD
        LD
Subjt:  LD

TrEMBL top hitse value%identityAlignment
A0A0A0LJH9 Uncharacterized protein4.0e-3782.18Show/hide
Query:  MGIKWRNSRSQSIRLGQSCV-SNEQESKGFGWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNLRLL
        MG+KWRNSRSQSIRLGQSCV SNEQESK  GWQILW+K+KKEK+K+FS S+VE RSSYNPNAY LNFD+    SEPDNL RSFSARFADPSIVSRNLRLL
Subjt:  MGIKWRNSRSQSIRLGQSCV-SNEQESKGFGWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNLRLL

Query:  D
        D
Subjt:  D

A0A251RD41 Uncharacterized protein1.2e-1758.1Show/hide
Query:  MGIK-WRNSRSQSIRLGQSCV-SNEQESKGFGWQILWQKIKKE-KKKIFSSSAVEFR-SSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNL
        MGIK W NSRSQS+RLGQ  + SN  +    GWQ  W+K K + KKK FSSS V  + +SY+P  Y  NFD+G+G  EPDNL RSFSARFADPS++  N 
Subjt:  MGIK-WRNSRSQSIRLGQSCV-SNEQESKGFGWQILWQKIKKE-KKKIFSSSAVEFR-SSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNL

Query:  R-LLD
        R LLD
Subjt:  R-LLD

A0A5N5HP14 Pentatricopeptide repeat-containing protein9.2e-1856.6Show/hide
Query:  MGIK-WRNSRSQSIRLGQS--CVSNEQESKGFGWQILWQKIKKEKKKIFSSSAV--EFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPS-IVSR
        MGI+ WRNSRSQSIRLG +    SN  +S+  GWQ  W++ K E+KK F SS V  + ++SY+P  Y  NFDQG G  EPDNL RSFSARFADPS ++  
Subjt:  MGIK-WRNSRSQSIRLGQS--CVSNEQESKGFGWQILWQKIKKEKKKIFSSSAV--EFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPS-IVSR

Query:  NLRLLD
        N  LLD
Subjt:  NLRLLD

A0A5N5I1S5 Pentatricopeptide repeat-containing protein9.2e-1856.6Show/hide
Query:  MGIK-WRNSRSQSIRLGQS--CVSNEQESKGFGWQILWQKIKKEKKKIFSSSAV--EFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPS-IVSR
        MGI+ WRNSRSQSIRLG +    SN  +S+  GWQ  W++ K E+KKIF SS V  + ++SY+P+ Y  NFDQG    EPDNL RSFSARFADPS ++  
Subjt:  MGIK-WRNSRSQSIRLGQS--CVSNEQESKGFGWQILWQKIKKEKKKIFSSSAV--EFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPS-IVSR

Query:  NLRLLD
        N  LLD
Subjt:  NLRLLD

A0A6J1F7R9 uncharacterized protein LOC1114429162.2e-3580.39Show/hide
Query:  MGIKWRNSRSQSIRLGQSC-VSNEQESKG-FGWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNLRL
        MGIKWRNSRSQSIRLGQSC  SNEQESK  FGWQILWQK KKEK++IFS S+VE RSSYNPNAY LNF+Q    S+PD+LCRSFSARFADPSIVSR+ RL
Subjt:  MGIKWRNSRSQSIRLGQSC-VSNEQESKG-FGWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNLRL

Query:  LD
        LD
Subjt:  LD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G25735.1 unknown protein2.5e-0737.63Show/hide
Query:  RLGQSCVSNEQESKGF----GWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGC---SEPDNLCRSFSARFADPSIVSRNLRLL
        R  +SC   +Q  +       W++L  K+K    +  S+  V    +Y P  Y LNFDQG G     EP+NL RSFS RFADP+ +     LL
Subjt:  RLGQSCVSNEQESKGF----GWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGC---SEPDNLCRSFSARFADPSIVSRNLRLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAATCAAATGGCGGAATTCACGTAGCCAAAGCATCAGGTTGGGTCAGAGCTGTGTTTCGAATGAACAAGAAAGCAAAGGATTTGGATGGCAGATTTTGTGGCAAAA
AATCAAGAAGGAGAAGAAGAAAATCTTCAGTTCTTCTGCTGTTGAATTCCGTTCTTCTTATAATCCAAATGCTTATGAGTTAAATTTTGATCAAGGAATTGGCTGCTCTG
AGCCTGATAATCTCTGCAGATCCTTCTCTGCTCGTTTTGCTGATCCATCAATCGTCTCAAGGAACTTGAGATTGTTGGATTGA
mRNA sequenceShow/hide mRNA sequence
CACGCATTCTCTTCACCCAAAAAAAAAAAAAAAAACTCTTTCGGCGGCTCTTTACATTTATAGAGTTTCAGGCCAATCTCTATTCCCTTTTAAACCACCATCGCCAAATG
ATCTTCTCCATTTTTCTCTCAATTTTTGTACATTAAATTCATCGACCCTTTTGATTTTTGCTTCGAATATTCATAGTTTTGGCTTCAAATCAGCAAATGGGAATCAAATG
GCGGAATTCACGTAGCCAAAGCATCAGGTTGGGTCAGAGCTGTGTTTCGAATGAACAAGAAAGCAAAGGATTTGGATGGCAGATTTTGTGGCAAAAAATCAAGAAGGAGA
AGAAGAAAATCTTCAGTTCTTCTGCTGTTGAATTCCGTTCTTCTTATAATCCAAATGCTTATGAGTTAAATTTTGATCAAGGAATTGGCTGCTCTGAGCCTGATAATCTC
TGCAGATCCTTCTCTGCTCGTTTTGCTGATCCATCAATCGTCTCAAGGAACTTGAGATTGTTGGATTGA
Protein sequenceShow/hide protein sequence
MGIKWRNSRSQSIRLGQSCVSNEQESKGFGWQILWQKIKKEKKKIFSSSAVEFRSSYNPNAYELNFDQGIGCSEPDNLCRSFSARFADPSIVSRNLRLLD