CuGenDBv2

Tan0004704 (gene) of Snake gourd v1 genome

Gene ID: Tan0004704
Organism: Trichosanthes anguina (Snake gourd v1)
Description: pentatricopeptide repeat-containing protein At4g20090
Genome location: LG01:72125723..72131959
RNA-Seq Expression: Tan0004704
Synteny: Tan0004704
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6594382.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 2.7e-45, identity: 75.4%)
Query:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR
        MPKCSKHQLNLLRI+  K PGL FHSVSR FPTF PFFYFSSFALSPNPTP+S+PEKD+++ELSV GKIFKS PQLGS+KL DATFYSLIENYAS GEFR
Subjt:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR

Query:  LIEQVLDRMKHEGQLFLDELMVRLLK
        LIE VLDRMK EG++ ++   + + K
Subjt:  LIEQVLDRMKHEGQLFLDELMVRLLK

XP_022146575.1 pentatricopeptide repeat-containing protein At4g20090 [Momordica charantia] (e-value: 1.2e-40, identity: 69.84%)
Query:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR
        MPKCSKH L LL IA  K PGL F+ VSR FPTFSPFFY +SFAL  NPTP+SKP+KDD NELS+ G+IFKS PQ GS+KL DATFYSLIENYAS GEFR
Subjt:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR

Query:  LIEQVLDRMKHEGQLFLDELMVRLLK
        LIEQVLDRMK EG++ ++   + + K
Subjt:  LIEQVLDRMKHEGQLFLDELMVRLLK

XP_022926533.1 pentatricopeptide repeat-containing protein At4g20090 [Cucurbita moschata] (e-value: 2.7e-45, identity: 75.4%)
Query:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR
        MPKCSKHQLNLLRI+  K PGL FHSVSR FPTF PFFYFSSFALSPNPTP+S+PEKD+++ELSV GKIFKS PQLGS+KL DATFYSLIENYAS GEFR
Subjt:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR

Query:  LIEQVLDRMKHEGQLFLDELMVRLLK
        LIE VLDRMK EG++ ++   + + K
Subjt:  LIEQVLDRMKHEGQLFLDELMVRLLK

XP_023003433.1 pentatricopeptide repeat-containing protein At4g20090 [Cucurbita maxima] (e-value: 8.0e-45, identity: 75.4%)
Query:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR
        MPKCSKHQLNLLRIA  K PGL FH VSR FPTF PFFYFSSFALSPNPTP+S+PEKD+++ELSV GKIFKS PQLGS+KL DATFYSLIENYAS GEFR
Subjt:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR

Query:  LIEQVLDRMKHEGQLFLDELMVRLLK
        LIE VLDRMK EG++ ++   + + K
Subjt:  LIEQVLDRMKHEGQLFLDELMVRLLK

XP_023518291.1 pentatricopeptide repeat-containing protein At4g20090 [Cucurbita pepo subsp. pepo] (e-value: 1.2e-45, identity: 76.19%)
Query:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR
        MPKCSKHQLNLLRIA  K PGL FHSVSR FPTF PFFYFSSFALSPNPTP+S+PEKD+++ELSV GKIFKS PQLGS+KL DATFYSLIENYAS GEFR
Subjt:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR

Query:  LIEQVLDRMKHEGQLFLDELMVRLLK
        LIE VLDRMK EG++ ++   + + K
Subjt:  LIEQVLDRMKHEGQLFLDELMVRLLK

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LP34 Uncharacterized protein (e-value: 7.8e-22, identity: 52.38%)
Query:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR
        MPK S HQLN L I+ HKP  L            SPFFYFSS  LS N TP      D +NELS+  +IFKS PQ GS+KL DATFY LIENYA+  EF 
Subjt:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR

Query:  LIEQVLDRMKHEGQLFLDELMVRLLK
         I QVLDRMK EG++  + + + + K
Subjt:  LIEQVLDRMKHEGQLFLDELMVRLLK

A0A1S3CAZ3 pentatricopeptide repeat-containing protein At4g20090 (e-value: 5.1e-21, identity: 50.79%)
Query:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR
        MPK S HQLN L I+ HKP  LP            PF YFSS  LS N TP      D +NELS+  ++FKS PQ GS+K+ DATFY LIENYA+ GEF 
Subjt:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR

Query:  LIEQVLDRMKHEGQLFLDELMVRLLK
        LI QVLDRMK E ++  + + + + K
Subjt:  LIEQVLDRMKHEGQLFLDELMVRLLK

A0A6J1CYY5 pentatricopeptide repeat-containing protein At4g20090 (e-value: 5.8e-41, identity: 69.84%)
Query:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR
        MPKCSKH L LL IA  K PGL F+ VSR FPTFSPFFY +SFAL  NPTP+SKP+KDD NELS+ G+IFKS PQ GS+KL DATFYSLIENYAS GEFR
Subjt:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR

Query:  LIEQVLDRMKHEGQLFLDELMVRLLK
        LIEQVLDRMK EG++ ++   + + K
Subjt:  LIEQVLDRMKHEGQLFLDELMVRLLK

A0A6J1EIB7 pentatricopeptide repeat-containing protein At4g20090 (e-value: 1.3e-45, identity: 75.4%)
Query:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR
        MPKCSKHQLNLLRI+  K PGL FHSVSR FPTF PFFYFSSFALSPNPTP+S+PEKD+++ELSV GKIFKS PQLGS+KL DATFYSLIENYAS GEFR
Subjt:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR

Query:  LIEQVLDRMKHEGQLFLDELMVRLLK
        LIE VLDRMK EG++ ++   + + K
Subjt:  LIEQVLDRMKHEGQLFLDELMVRLLK

A0A6J1KWH5 pentatricopeptide repeat-containing protein At4g20090 (e-value: 3.9e-45, identity: 75.4%)
Query:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR
        MPKCSKHQLNLLRIA  K PGL FH VSR FPTF PFFYFSSFALSPNPTP+S+PEKD+++ELSV GKIFKS PQLGS+KL DATFYSLIENYAS GEFR
Subjt:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFR

Query:  LIEQVLDRMKHEGQLFLDELMVRLLK
        LIE VLDRMK EG++ ++   + + K
Subjt:  LIEQVLDRMKHEGQLFLDELMVRLLK

SwissProt top hits (e-value, % identity, alignment)
O49436 Pentatricopeptide repeat-containing protein At4g20090 (e-value: 4.0e-07, identity: 30.67%)
Query:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYF-SSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEF
        MPKC       +RI+F          +S N   FS    F SS ++SPNP   S    ++  E  +  K+FKS P++GS KL D+T  S+IE+YA+ G+F
Subjt:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYF-SSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEF

Query:  RLIEQVLDRMKHEGQLFLDELMVRLLKRERK--LSSLRIDMLVRCLHR-RHQLGLESFKGCAN
          +E++L R++ E ++ ++   + + +   K  L    +D+  R +   R +  ++SF    N
Subjt:  RLIEQVLDRMKHEGQLFLDELMVRLLKRERK--LSSLRIDMLVRCLHR-RHQLGLESFKGCAN

Arabidopsis top hits (e-value, % identity, alignment)
AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein (e-value: 2.9e-08, identity: 30.67%)
Query:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYF-SSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEF
        MPKC       +RI+F          +S N   FS    F SS ++SPNP   S    ++  E  +  K+FKS P++GS KL D+T  S+IE+YA+ G+F
Subjt:  MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYF-SSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEF

Query:  RLIEQVLDRMKHEGQLFLDELMVRLLKRERK--LSSLRIDMLVRCLHR-RHQLGLESFKGCAN
          +E++L R++ E ++ ++   + + +   K  L    +D+  R +   R +  ++SF    N
Subjt:  RLIEQVLDRMKHEGQLFLDELMVRLLKRERK--LSSLRIDMLVRCLHR-RHQLGLESFKGCAN


Sequences
CDS sequence
ATGCCAAAATGTTCAAAACACCAATTAAACCTCCTCAGAATCGCTTTTCATAAGCCGCCTGGGCTTCCTTTCCATTCAGTTTCACGCAACTTCCCAACTTTTTCGCCTTT
CTTCTATTTTTCATCTTTCGCCCTTTCTCCCAACCCTACTCCCCAGAGCAAACCCGAGAAGGATGATGAAAATGAGCTCTCAGTATTAGGTAAAATATTCAAATCCGACC
CGCAACTGGGTTCCCATAAATTGGAGGATGCAACATTTTATAGTCTCATTGAAAACTATGCAAGTTTTGGGGAATTTCGTTTGATAGAGCAAGTTTTGGATAGAATGAAA
CACGAAGGACAACTGTTTTTAGATGAGCTCATGGTAAGGTTACTCAAGCGAGAGAGAAAATTATCTTCCTTAAGAATTGACATGCTGGTAAGATGTCTGCACCGGAGGCA
TCAACTTGGTCTAGAGTCATTCAAAGGATGTGCAAACCAAAGAAGATTCAAGAAACCATAG
mRNA sequence
TTTGGTGAAAAAAAAGAGGGAAGTTTTTTTCGTTTTGCGCCATTTTACCTTAATTGAGATCAATTTTTTTTTCATTTTGGCTTCTTCTTCGCCCGATTTCGACTTCCCCA
GCTGTGTGCTCATCCTCGTCTCTTCTTCTTCTTCACACGGCGCAATCGACTTCCTTTGCTCATCAACCCTGGATGTATGCTTCTCTCTAACTCTGCTCATAAGCGTTGAT
TCTACACTGCACATACTCTGAATCTGCACCGTAAAGAAAGCCTTTCGACCCCATGCCAAAATGTTCAAAACACCAATTAAACCTCCTCAGAATCGCTTTTCATAAGCCGC
CTGGGCTTCCTTTCCATTCAGTTTCACGCAACTTCCCAACTTTTTCGCCTTTCTTCTATTTTTCATCTTTCGCCCTTTCTCCCAACCCTACTCCCCAGAGCAAACCCGAG
AAGGATGATGAAAATGAGCTCTCAGTATTAGGTAAAATATTCAAATCCGACCCGCAACTGGGTTCCCATAAATTGGAGGATGCAACATTTTATAGTCTCATTGAAAACTA
TGCAAGTTTTGGGGAATTTCGTTTGATAGAGCAAGTTTTGGATAGAATGAAACACGAAGGACAACTGTTTTTAGATGAGCTCATGGTAAGGTTACTCAAGCGAGAGAGAA
AATTATCTTCCTTAAGAATTGACATGCTGGTAAGATGTCTGCACCGGAGGCATCAACTTGGTCTAGAGTCATTCAAAGGATGTGCAAACCAAAGAAGATTCAAGAAACCA
TAGATTTGTTTTGCAGAAGCATGTATGGACAATGATGATTGTACTTTGCTATCCTTTGGGATGTCTAGTATATGGAGCATGATGTCTCGGGAGAACGCTGCTATGAGTTC
TGAGGTGGTTAGATTTTTAGTACTTTTGATAATGGACTCTGGAACTACAAATTCAAAATCTGATCGCACATATTCTTGGCTTGGAATGACAATATCCTGTCCATCTGTTC
GTTCTGGAATTGAAGGCCAAGTGACTGCAACTGGTTGTTCAACTTCTTGGCTCAGTCTCGCAGATTTGAGTGGAGCGTATTCAACAAATTGACAAAGAAGAGACAAACAC
TTATTCAAATAATGTGTCCAAGGACGCAATAAGTAAAGTTCTTGGTTCTGATCGTGGCCATATTAGAGGACTTGGATTTGGAGTGACCTTTTCAAAGTTATCTGCATTGA
CTCAAAGAGATGACAACTATGCCAAGCTTGAAGAAAAGTAAAAAAAGATGGAAGGCAAATGTCTTAAATGAGATCTTTATTGGCTCACGTACTCCAATGACAAGTTGATG
AAAAAATATCATTGGGAGTACTTCTTGGTCACCAAAACAACTAGCTCGAGAGTTATATATTTTTATTTTGATGAATGAAACATTTTGTAATTTAATGAATAGTAATTGAC
AAAGTTATATTTAGTGACATCCCGAAGTTGATATTCATTACTATATCTATTTTGTGAAC
Protein sequence
MPKCSKHQLNLLRIAFHKPPGLPFHSVSRNFPTFSPFFYFSSFALSPNPTPQSKPEKDDENELSVLGKIFKSDPQLGSHKLEDATFYSLIENYASFGEFRLIEQVLDRMK
HEGQLFLDELMVRLLKRERKLSSLRIDMLVRCLHRRHQLGLESFKGCANQRRFKKP