; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS007597 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS007597
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPsbP domain-containing protein
Genome locationscaffold836:311220..312769
RNA-Seq ExpressionMS007597
SyntenyMS007597
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009654 - photosystem II oxygen evolving complex (cellular component)
GO:0019898 - extrinsic component of membrane (cellular component)
GO:0005509 - calcium ion binding (molecular function)
InterPro domainsIPR002683 - PsbP, C-terminal
IPR016123 - Mog1/PsbP, alpha/beta/alpha sandwich


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030019.1 PsbP domain-containing protein 3, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]3.3e-9771.53Show/hide
Query:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE
        MASL  PSPSAVIQRPR WRF  SSLSNGIAI IR++ +  V CS  NIDI D + C W SGVNRREI+LG+ L+AFSFQ VVS ALAES          
Subjt:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE

Query:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG
              VVAED+RTYTDEANKFRLVIPQDW VGNGEPNGFK VTAF+P+ET SSN                          VSVVISGLGPDFTRMESFG
Subjt:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG

Query:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ
        KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAIGMASNGWYNRLYT+TGQ
Subjt:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ

XP_022155884.1 psbP domain-containing protein 3, chloroplastic isoform X1 [Momordica charantia]5.9e-12381.91Show/hide
Query:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE
        MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALS FSFQ VVSN+LAES          
Subjt:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE

Query:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG
             VVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSN                          VSVVISGLGPDFTRMESFG
Subjt:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG

Query:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQVTCSKTLI
        KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ+   +T++
Subjt:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQVTCSKTLI

XP_022155885.1 psbP domain-containing protein 3, chloroplastic isoform X2 [Momordica charantia]5.0e-12283.94Show/hide
Query:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE
        MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALS FSFQ VVSN+LAES          
Subjt:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE

Query:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG
             VVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSN                          VSVVISGLGPDFTRMESFG
Subjt:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG

Query:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ
        KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ
Subjt:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ

XP_022946502.1 psbP domain-containing protein 3, chloroplastic [Cucurbita moschata]3.3e-9771.53Show/hide
Query:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE
        MASL  PSPSAVIQRPR WRF  SSLSNGIAI IR++ +  V CS  NIDI D + C W SGVNRREI+LG+ L+AFSFQ VVS ALAES          
Subjt:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE

Query:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG
              VVAED+RTYTDEANKFRLVIPQDW VGNGEPNGFK VTAF+P+ET SSN                          VSVVISGLGPDFTRMESFG
Subjt:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG

Query:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ
        KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAIGMASNGWYNRLYT+TGQ
Subjt:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ

XP_038885576.1 psbP domain-containing protein 3, chloroplastic [Benincasa hispida]6.8e-10374.82Show/hide
Query:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE
        MASL  PSPSAVIQRPR WRF +SS SNG+ I IRSK +  V CS NNI+IS+ Q CYWASGVNRREIMLGI L+AFSFQ VVSNALAES          
Subjt:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE

Query:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG
             V+VAED+RTYTDEANKFRL IPQDW VGNGEPNGFKSVTAF+PQETSSSN                          VSVVISGLGPDFTRMESFG
Subjt:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG

Query:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ
        KVEEFADTLVSGLDRSWKRPPGVAAKLI+CRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ
Subjt:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ

TrEMBL top hitse value%identityAlignment
A0A0A0KDI5 PsbP domain-containing protein3.8e-9169.06Show/hide
Query:  LSMASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSC--NNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAF
        ++MASL   SPSAVI RP   RF +SSLSNG +I I  +S   V CS   N+I  S+ +  Y ASGVNRREIMLGI  +AFSFQ V SNALAES      
Subjt:  LSMASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSC--NNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAF

Query:  LPLERFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRM
                 VVVAED+RTYTDEANKF LVIPQDW VGNGEPNGFKSVTAF+PQETS+SN                          VSVVISGLGPD+TRM
Subjt:  LPLERFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRM

Query:  ESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ
        ESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGM+SNGWYNRLYTITGQ
Subjt:  ESFGKVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ

A0A6J1DNN6 psbP domain-containing protein 3, chloroplastic isoform X22.4e-12283.94Show/hide
Query:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE
        MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALS FSFQ VVSN+LAES          
Subjt:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE

Query:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG
             VVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSN                          VSVVISGLGPDFTRMESFG
Subjt:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG

Query:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ
        KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ
Subjt:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ

A0A6J1DRN2 psbP domain-containing protein 3, chloroplastic isoform X12.9e-12381.91Show/hide
Query:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE
        MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALS FSFQ VVSN+LAES          
Subjt:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE

Query:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG
             VVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSN                          VSVVISGLGPDFTRMESFG
Subjt:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG

Query:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQVTCSKTLI
        KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ+   +T++
Subjt:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQVTCSKTLI

A0A6J1G3W9 psbP domain-containing protein 3, chloroplastic1.6e-9771.53Show/hide
Query:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE
        MASL  PSPSAVIQRPR WRF  SSLSNGIAI IR++ +  V CS  NIDI D + C W SGVNRREI+LG+ L+AFSFQ VVS ALAES          
Subjt:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE

Query:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG
              VVAED+RTYTDEANKFRLVIPQDW VGNGEPNGFK VTAF+P+ET SSN                          VSVVISGLGPDFTRMESFG
Subjt:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG

Query:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ
        KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAIGMASNGWYNRLYT+TGQ
Subjt:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ

A0A6J1KI01 psbP domain-containing protein 3, chloroplastic1.0e-9671.17Show/hide
Query:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE
        MASL  PSPSAVIQRPR WRF  SSLSNGIAI IR++ +  V CS  NIDI D + C W SGVNRREI LG+ L+AFSFQ VVS ALAE+          
Subjt:  MASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLE

Query:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG
              VVAED+RTYTDEANKFRLVIPQDW VGNGEPNGFK VTAF+P+ET SSN                          VSVVISGLGPDFTRMESFG
Subjt:  RFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFG

Query:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ
        KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGE R HLYSAIGMASNGWYNRLYT+TGQ
Subjt:  KVEEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQ

SwissProt top hitse value%identityAlignment
Q9S720 PsbP domain-containing protein 3, chloroplastic4.1e-6651.71Show/hide
Query:  PWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWAS----GVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLERFRVIVVVAEDFR
        PW     S SN       S+    +  + + +D S+ +    +S    G+ RR++ML IA S F     +S A AE+                  +E FR
Subjt:  PWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWAS----GVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLERFRVIVVVAEDFR

Query:  TYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFGKVEEFADTLVSGL
         YTDE NKF + IPQDW VG  EPNGFKS+TAFYPQETS+SN                          VS+ I+GLGPDFTRMESFGKVE FA+TLVSGL
Subjt:  TYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFGKVEEFADTLVSGL

Query:  DRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQVT
        DRSW++P GV AKLID R+SKG YYIEYTLQNPGE+RKHLYSAIGMA+NGWYNRLYT+TGQ T
Subjt:  DRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQVT

Arabidopsis top hitse value%identityAlignment
AT1G76450.1 Photosystem II reaction center PsbP family protein2.9e-6751.71Show/hide
Query:  PWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWAS----GVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLERFRVIVVVAEDFR
        PW     S SN       S+    +  + + +D S+ +    +S    G+ RR++ML IA S F     +S A AE+                  +E FR
Subjt:  PWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWAS----GVNRREIMLGIALSAFSFQTVVSNALAESGTDFAFLPLERFRVIVVVAEDFR

Query:  TYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFGKVEEFADTLVSGL
         YTDE NKF + IPQDW VG  EPNGFKS+TAFYPQETS+SN                          VS+ I+GLGPDFTRMESFGKVE FA+TLVSGL
Subjt:  TYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFGKVEEFADTLVSGL

Query:  DRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQVT
        DRSW++P GV AKLID R+SKG YYIEYTLQNPGE+RKHLYSAIGMA+NGWYNRLYT+TGQ T
Subjt:  DRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQVT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AGAGGACAAATTGTTCAAAAGAAATCTTCTATCTGTAAACTAGTGGAGTTGAGCATGGCGTCTCTTCCTTCTCCCTCACCGAGCGCTGTAATCCAGCGCCCTCGTCCATG
GCGGTTCAGAGAATCATCACTCTCCAATGGAATCGCCATTCATATCCGCTCGAAGTCAAAACCCGGCGTTCTCTGCTCCTGCAACAACATTGACATCTCCGATCCACAGC
TCTGTTATTGGGCGAGTGGAGTCAATAGACGAGAAATTATGCTAGGGATTGCATTGAGCGCGTTTTCTTTTCAAACTGTGGTTTCTAATGCCTTGGCTGAGAGTGGTACT
GATTTTGCCTTTCTTCCTCTCGAGCGTTTTCGAGTGATAGTTGTTGTCGCTGAGGATTTTCGGACGTACACGGATGAAGCGAATAAGTTCAGATTGGTGATTCCTCAAGA
TTGGGTTGTGGGTAATGGTGAACCGAATGGATTCAAGTCGGTTACGGCTTTTTATCCACAAGAAACTTCAAGTTCCAATGGTATTTTCAGATCTTCTCCCCAGTTCTCTT
CCTGCCTCGTTAGTTTTGAATATTTTGATAAACTGAAATCTGGTTCAGTCAGTGTTGTAATCTCGGGGCTTGGTCCTGATTTCACGAGAATGGAATCCTTTGGCAAGGTT
GAAGAATTTGCTGATACCCTGGTGAGCGGACTGGACAGAAGCTGGAAAAGGCCACCAGGTGTGGCGGCGAAACTTATAGACTGTAGATCATCTAAAGGGATATATTACAT
AGAGTACACGCTGCAGAATCCAGGTGAAAGCCGCAAACATTTATACTCTGCAATTGGGATGGCATCCAATGGCTGGTACAACAGACTTTACACCATAACAGGACAGGTAA
CTTGCTCAAAAACACTTATCTTG
mRNA sequenceShow/hide mRNA sequence
AGAGGACAAATTGTTCAAAAGAAATCTTCTATCTGTAAACTAGTGGAGTTGAGCATGGCGTCTCTTCCTTCTCCCTCACCGAGCGCTGTAATCCAGCGCCCTCGTCCATG
GCGGTTCAGAGAATCATCACTCTCCAATGGAATCGCCATTCATATCCGCTCGAAGTCAAAACCCGGCGTTCTCTGCTCCTGCAACAACATTGACATCTCCGATCCACAGC
TCTGTTATTGGGCGAGTGGAGTCAATAGACGAGAAATTATGCTAGGGATTGCATTGAGCGCGTTTTCTTTTCAAACTGTGGTTTCTAATGCCTTGGCTGAGAGTGGTACT
GATTTTGCCTTTCTTCCTCTCGAGCGTTTTCGAGTGATAGTTGTTGTCGCTGAGGATTTTCGGACGTACACGGATGAAGCGAATAAGTTCAGATTGGTGATTCCTCAAGA
TTGGGTTGTGGGTAATGGTGAACCGAATGGATTCAAGTCGGTTACGGCTTTTTATCCACAAGAAACTTCAAGTTCCAATGGTATTTTCAGATCTTCTCCCCAGTTCTCTT
CCTGCCTCGTTAGTTTTGAATATTTTGATAAACTGAAATCTGGTTCAGTCAGTGTTGTAATCTCGGGGCTTGGTCCTGATTTCACGAGAATGGAATCCTTTGGCAAGGTT
GAAGAATTTGCTGATACCCTGGTGAGCGGACTGGACAGAAGCTGGAAAAGGCCACCAGGTGTGGCGGCGAAACTTATAGACTGTAGATCATCTAAAGGGATATATTACAT
AGAGTACACGCTGCAGAATCCAGGTGAAAGCCGCAAACATTTATACTCTGCAATTGGGATGGCATCCAATGGCTGGTACAACAGACTTTACACCATAACAGGACAGGTAA
CTTGCTCAAAAACACTTATCTTG
Protein sequenceShow/hide protein sequence
RGQIVQKKSSICKLVELSMASLPSPSPSAVIQRPRPWRFRESSLSNGIAIHIRSKSKPGVLCSCNNIDISDPQLCYWASGVNRREIMLGIALSAFSFQTVVSNALAESGT
DFAFLPLERFRVIVVVAEDFRTYTDEANKFRLVIPQDWVVGNGEPNGFKSVTAFYPQETSSSNGIFRSSPQFSSCLVSFEYFDKLKSGSVSVVISGLGPDFTRMESFGKV
EEFADTLVSGLDRSWKRPPGVAAKLIDCRSSKGIYYIEYTLQNPGESRKHLYSAIGMASNGWYNRLYTITGQVTCSKTLIL