CuGenDBv2

Sgr029788 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029788
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Photosystem II 10 kDa polypeptide, chloroplastic
Genome location: tig00153490:1332894..1342548
RNA-Seq Expression: Sgr029788
Synteny: Sgr029788
Gene Ontology terms: GO:0015979 - photosynthesis (biological process)
GO:0009536 - plastid (cellular component)
GO:0009654 - photosystem II oxygen evolving complex (cellular component)
InterPro domains: IPR006814 - Photosystem II PsbR


Homology
GenBank top hits | e value | %identity | Alignment
BBH01538.1 FAD/NAD(P)-binding oxidoreductase family protein [Prunus dulcis] | 2.8e-139 | 51.59
Query:  SKVAVVGSG------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPDVLSLVRE
        +KVAVVGSG      ++A + +S++LF+S              +R P              GG M+   E+AEDGKEL FDHGAP+F  ++ +V  LV E
Subjt:  SKVAVVGSG------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPDVLSLVRE

Query:  WESKHLCAEWKESFGIFDCFSNQFTSIEQEG-VSGRYVGTPGMNSICKALCHEP----------------------------------FEGIVASDKNIV
        WE+K L A W+E FG FD  SN+F  +EQEG +S RYVG PGMNS+C+ALCHEP                                  F+G+VA+DKN+V
Subjt:  WESKHLCAEWKESFGIFDCFSNQFTSIEQEG-VSGRYVGTPGMNSICKALCHEP----------------------------------FEGIVASDKNIV

Query:  SPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSI--------PVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERV
        SPRFTSVTGRPPPL             DIPV PCFALM+AF +PLSS+         + GFSIKNS+VLSWA+CDSSKPGRS+SSERW+LHSTMEYA+ +
Subjt:  SPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSI--------PVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERV

Query:  IAEYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGA---PRVALPIT
        IA+ GLQKP++ATL KVAEEL+QELQ MGL+I +P F KAHRWG+AFPA SIAREEKCLWD +KRLAICGDFCVSP+        + GA   P+    + 
Subjt:  IAEYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGA---PRVALPIT

Query:  FLRLSRFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASG
                + S+ LQ   +++R R +   S + +  L     +    ++ +  +    S++R    SFKV+ASG KKIKTDTPYG  G MNL++G DASG
Subjt:  FLRLSRFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASG

Query:  RKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNT
        RK  GKGVYQFVDKYGANVDGYSPIY+  +WSPSGDVY GG+TGLAIWAVTL G+LA G L ++ T
Subjt:  RKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNT

KAF1866503.1 hypothetical protein Lal_00017886 [Lupinus albus] | 4.7e-139 | 51.58
Query:  TAMSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSS
        T M++   KVAVVGSG        ++A + + ++LF+S              +R P              GG M+   E  EDGKEL FDHGAP+F+VS 
Subjt:  TAMSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSS

Query:  PDVLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHE----------------------------------PFEGI
         ++L LV+EW+++ L AEWKE FG FD  + +F SI QEG+  RYVG PGMNSICKALC+E                                   F+G+
Subjt:  PDVLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHE----------------------------------PFEGI

Query:  VASDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERV
        VASDKNIVSPR   VTGR PPL             ++   PCFA+MLAF +PLSSIPVKGFS KNSK+LSWAYCDSSKP RS++SERW+LHST EYA  +
Subjt:  VASDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERV

Query:  IAEYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFLR
        IA+ GL+KP+D TL KVAE+L+QE Q  GL+I +PF+ +AHRWGSAFPA SIA +EKCLWD SKRLAICGDFCVSPN        + GA    L    LR
Subjt:  IAEYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFLR

Query:  LSRFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKG
        L                                     ++S++SS+       R+S       SFKV ASG KKIKTD P GI G MNLR+G+DASGRKG
Subjt:  LSRFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKG

Query:  KGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNTSALVQ
         GKGVYQ+VDKYGANVDGYSPIY  ++WSP+GDVY GG+TGLAIWAVTLAGLLAGGALLVYNTSAL Q
Subjt:  KGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNTSALVQ

KAF7140053.1 hypothetical protein RHSIM_Rhsim06G0237200 [Rhododendron simsii] | 4.6e-134 | 50.35
Query:  MSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPD
        M+ VL+KVAV+GSG        ++A + LS+++ DS              +R P              GG     +E   DG+EL FDHGAPYFT S+ D
Subjt:  MSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPD

Query:  VLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHEP----------------------------------FEGIVA
        VL L+R+WES+ L AEWKE+FG FD  S QF +IE+EG   +YVG PGMNSIC+ALC+EP                                  F+G+VA
Subjt:  VLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHEP----------------------------------FEGIVA

Query:  SDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA
        SDKNIVSPRFT+VTGRPPPL             +IPV PCFALMLAFE+PLS IPVKGFS  NS+VLSWA+C+SSKP RS+ SE+W+LHST +YA+ VIA
Subjt:  SDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA

Query:  EYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFLRLS
        + GLQKP++  L  VA+EL+QE Q  GL I  PFF KAHRWGSAFPAASIA EEKCLWDG KRLAICGDFCVSPN        + G     + + +L   
Subjt:  EYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFLRLS

Query:  RFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKG
                                            R++    + R            D +   + A+   K      +GING M L  GLDASGRKGKG
Subjt:  RFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKG

Query:  KGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNTSALVQ
        KGVYQFVDKYGANVDGYSPIY+T DWSPSGDVY GG+TGLAIWAVTL G+LAGGALLVYNTSAL Q
Subjt:  KGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNTSALVQ

OIW11634.1 hypothetical protein TanjilG_24840 [Lupinus angustifolius] | 3.7e-136 | 50.71
Query:  MSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPD
        M++   KVAVVGSG        ++A + + ++LF+S              +R P              GG M+   E  EDGKEL FDHGAP+F+VS  +
Subjt:  MSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPD

Query:  VLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHE----------------------------------PFEGIVA
        +L LV+EW+S+   AEWKE FG FD  + +F +I QEG+  RYVG PGMNSICKALC+E                                   F+G+VA
Subjt:  VLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHE----------------------------------PFEGIVA

Query:  SDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA
        SDKNIVSPR   VTGR PPL             ++   PCFA+MLAF +PLSSIPVK  S KNSK+LSWAYCDSSKPGRS++SERW+LHST EYA  +IA
Subjt:  SDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA

Query:  EYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFLRLS
        + GL+KP++ TL KVAE+L+QE Q  GL+I +PF+ +AHRWGSAFPA SIA +EKCLWD SKRLAICGDFCVSPN        + GA    L       +
Subjt:  EYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFLRLS

Query:  RFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKG
          R   +P+ F+++    R           G+                I + SS       SFK+ ASG KKIKTD P GI G MNLR+G+DASGRKG G
Subjt:  RFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKG

Query:  KGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNTSALVQ
        KGVYQ+VDKYGANVDGYSPIY  ++WSP+GDVY GG+TGLAIWAVTLAGLLAGG LLVYNTSAL Q
Subjt:  KGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNTSALVQ

XP_023523201.1 uncharacterized protein LOC111787465 [Cucurbita pepo subsp. pepo] | 1.5e-132 | 64.99
Query:  MSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPD
        MSSVLSKVAVVGSG        S+A + +S++LF+S              +R P              GG M+   EIAEDGKEL FDHGAPYFTVSS D
Subjt:  MSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPD

Query:  VLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHEP----------------------------------FEGIVA
        +LSLVREWESK+LCAEWKESFG+FDCFSNQF+SIEQEGVSGRYVGTPGMNSICKALCHEP                                  FEG+VA
Subjt:  VLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHEP----------------------------------FEGIVA

Query:  SDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA
        SDKNIVSPRFT+VTGR PPL             +IP IPCFALML FEQPL+SIPVKGFSI NSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA
Subjt:  SDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA

Query:  EYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFL
        EYGLQKP+DATLKKVAEELYQE Q +GLSIPRPFFMKAHRWGSAFP ASIA EEKCLWDGSKR+AICGDFCVSPN +   +  +  A +    +++L
Subjt:  EYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFL

TrEMBL top hits | e value | %identity | Alignment
A0A1J7HZ28 Photosystem II 10 kDa polypeptide, chloroplastic | 1.8e-136 | 50.71
Query:  MSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPD
        M++   KVAVVGSG        ++A + + ++LF+S              +R P              GG M+   E  EDGKEL FDHGAP+F+VS  +
Subjt:  MSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPD

Query:  VLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHE----------------------------------PFEGIVA
        +L LV+EW+S+   AEWKE FG FD  + +F +I QEG+  RYVG PGMNSICKALC+E                                   F+G+VA
Subjt:  VLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHE----------------------------------PFEGIVA

Query:  SDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA
        SDKNIVSPR   VTGR PPL             ++   PCFA+MLAF +PLSSIPVK  S KNSK+LSWAYCDSSKPGRS++SERW+LHST EYA  +IA
Subjt:  SDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA

Query:  EYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFLRLS
        + GL+KP++ TL KVAE+L+QE Q  GL+I +PF+ +AHRWGSAFPA SIA +EKCLWD SKRLAICGDFCVSPN        + GA    L       +
Subjt:  EYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFLRLS

Query:  RFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKG
          R   +P+ F+++    R           G+                I + SS       SFK+ ASG KKIKTD P GI G MNLR+G+DASGRKG G
Subjt:  RFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKG

Query:  KGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNTSALVQ
        KGVYQ+VDKYGANVDGYSPIY  ++WSP+GDVY GG+TGLAIWAVTLAGLLAGG LLVYNTSAL Q
Subjt:  KGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNTSALVQ

A0A4Y1RCD7 Photosystem II 10 kDa polypeptide, chloroplastic | 1.3e-139 | 51.59
Query:  SKVAVVGSG------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPDVLSLVRE
        +KVAVVGSG      ++A + +S++LF+S              +R P              GG M+   E+AEDGKEL FDHGAP+F  ++ +V  LV E
Subjt:  SKVAVVGSG------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPDVLSLVRE

Query:  WESKHLCAEWKESFGIFDCFSNQFTSIEQEG-VSGRYVGTPGMNSICKALCHEP----------------------------------FEGIVASDKNIV
        WE+K L A W+E FG FD  SN+F  +EQEG +S RYVG PGMNS+C+ALCHEP                                  F+G+VA+DKN+V
Subjt:  WESKHLCAEWKESFGIFDCFSNQFTSIEQEG-VSGRYVGTPGMNSICKALCHEP----------------------------------FEGIVASDKNIV

Query:  SPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSI--------PVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERV
        SPRFTSVTGRPPPL             DIPV PCFALM+AF +PLSS+         + GFSIKNS+VLSWA+CDSSKPGRS+SSERW+LHSTMEYA+ +
Subjt:  SPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSI--------PVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERV

Query:  IAEYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGA---PRVALPIT
        IA+ GLQKP++ATL KVAEEL+QELQ MGL+I +P F KAHRWG+AFPA SIAREEKCLWD +KRLAICGDFCVSP+        + GA   P+    + 
Subjt:  IAEYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGA---PRVALPIT

Query:  FLRLSRFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASG
                + S+ LQ   +++R R +   S + +  L     +    ++ +  +    S++R    SFKV+ASG KKIKTDTPYG  G MNL++G DASG
Subjt:  FLRLSRFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASG

Query:  RKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNT
        RK  GKGVYQFVDKYGANVDGYSPIY+  +WSPSGDVY GG+TGLAIWAVTL G+LA G L ++ T
Subjt:  RKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNT

A0A6A5M395 Photosystem II 10 kDa polypeptide, chloroplastic | 2.3e-139 | 51.58
Query:  TAMSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSS
        T M++   KVAVVGSG        ++A + + ++LF+S              +R P              GG M+   E  EDGKEL FDHGAP+F+VS 
Subjt:  TAMSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSS

Query:  PDVLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHE----------------------------------PFEGI
         ++L LV+EW+++ L AEWKE FG FD  + +F SI QEG+  RYVG PGMNSICKALC+E                                   F+G+
Subjt:  PDVLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHE----------------------------------PFEGI

Query:  VASDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERV
        VASDKNIVSPR   VTGR PPL             ++   PCFA+MLAF +PLSSIPVKGFS KNSK+LSWAYCDSSKP RS++SERW+LHST EYA  +
Subjt:  VASDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERV

Query:  IAEYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFLR
        IA+ GL+KP+D TL KVAE+L+QE Q  GL+I +PF+ +AHRWGSAFPA SIA +EKCLWD SKRLAICGDFCVSPN        + GA    L    LR
Subjt:  IAEYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFLR

Query:  LSRFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKG
        L                                     ++S++SS+       R+S       SFKV ASG KKIKTD P GI G MNLR+G+DASGRKG
Subjt:  LSRFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKG

Query:  KGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNTSALVQ
         GKGVYQ+VDKYGANVDGYSPIY  ++WSP+GDVY GG+TGLAIWAVTLAGLLAGGALLVYNTSAL Q
Subjt:  KGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNTSALVQ

A0A6J1C466 uncharacterized protein LOC111008213 | 1.6e-132 | 65.24
Query:  MSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPD
        MSSVL+KVAVVGSG        S+A + +S++LF+S              +R P              GG M+   EIAEDGKEL FDHGAPYFTVSSPD
Subjt:  MSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPD

Query:  VLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHEP----------------------------------FEGIVA
        VLS+VREWES++LCAEW+E FGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALC+EP                                  FE IVA
Subjt:  VLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHEP----------------------------------FEGIVA

Query:  SDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA
        SDKNIVSPRFTSVTGR PPL             +IPVI CFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKP RS+SSERWILHSTMEYAERVIA
Subjt:  SDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA

Query:  EYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFL
        EYGLQKP+DATLKKVAEELYQELQ MG SIP PFFMKAHRWGSAFP ASIAREEKCLWDGSKRLA+CGDFCVSPN +   +  +  A ++   +++L
Subjt:  EYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFL

A0A6J1FTZ8 uncharacterized protein LOC111446875 | 3.5e-132 | 64.74
Query:  MSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPD
        MSSVLSKVAVVGSG        S+A + +S++LF+S              +R P              GG M+   EIAEDGKEL FDHGAPYFTV+S D
Subjt:  MSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFTVSSPD

Query:  VLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHEP----------------------------------FEGIVA
        VLSLVREWESK+LCAEWKESFG+FDCFSNQF+SIEQEGVSGRYVGTPGMNSICKALCHEP                                  FEG+VA
Subjt:  VLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHEP----------------------------------FEGIVA

Query:  SDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA
        SDKNIVSPRFT+VTGR PPL             +IP IPCFALML FEQPL+SIPVKGFSI NSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA
Subjt:  SDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIA

Query:  EYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFL
        EYGLQKP+DATLKKVAEELYQE Q +GLSIPRPFFMKAHRWGSAFP ASI  EEKCLWDGSKR+AICGDFCVSPN +   +  +  A +    +++L
Subjt:  EYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFL

SwissProt top hits | e value | %identity | Alignment
P06183 Photosystem II 10 kDa polypeptide, chloroplastic | 2.3e-43 | 73.02
Query:  TSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGL
        T +L +  +    S++R    SFKV ASG KK+KTD PYGING+M LR+G+DASGRK KGKGVYQ+VDKYGANVDGYSPIY+T +WSPSGDVYVGG+TGL
Subjt:  TSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGL

Query:  AIWAVTLAGLLAGGALLVYNTSALVQ
        AIWAVTL G+LAGGALLVYNTSAL Q
Subjt:  AIWAVTLAGLLAGGALLVYNTSALVQ

P27202 Photosystem II 10 kDa polypeptide, chloroplastic | 3.6e-41 | 80
Query:  SFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNT
        SFK+ ASG KKIKTD P+GING+M+LR+G+DASGRKGKG GVY++VDKYGANVDGYSPIY+  +WS SGDVY GG TGLAIWAVTLAG+LAGGALLVYNT
Subjt:  SFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNT

Query:  SALVQ
        SAL Q
Subjt:  SALVQ

P49108 Photosystem II 10 kDa polypeptide, chloroplastic | 2.8e-41 | 75.22
Query:  SVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAG
        S++R    SF++ ASG KKIKTD P+G+NG+M+LR+G+DASGRKGKG GVY+FVDKYGANVDGYSPIY+  +WS SGDVY GG TGLAIWAVTLAG+LAG
Subjt:  SVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAG

Query:  GALLVYNTSALVQ
        GALLVYNTSAL Q
Subjt:  GALLVYNTSALVQ

Q40163 Photosystem II 10 kDa polypeptide, chloroplastic | 5.0e-43 | 73.02
Query:  TSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGL
        T +L +  +    S++R    SFKV ASG KK+KTD PYGING+M LR+G+DASGRK KGKGVYQ+VDKYGANVDGYSPIY+T +WSPSGDVYVGG+TGL
Subjt:  TSSLHRREIGQRSSVSREDFRSFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGL

Query:  AIWAVTLAGLLAGGALLVYNTSALVQ
        AIWAVTL G+LAGGALLVYNTSAL Q
Subjt:  AIWAVTLAGLLAGGALLVYNTSALVQ

Q40519 Photosystem II 10 kDa polypeptide, chloroplastic | 2.0e-44 | 83.81
Query:  SFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNT
        SF+V+ASG KK+KTD PYGING+M+LR+G+DASGRK KGKGVYQFVDKYGANVDGYSPIY+T DWSPSGDVYVGG+TGLAIWAVTL G+LAGGALLV+NT
Subjt:  SFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNT

Query:  SALVQ
        SAL Q
Subjt:  SALVQ

Arabidopsis top hits | e value | %identity | Alignment
AT1G55980.1 FAD/NAD(P)-binding oxidoreductase family protein | 3.7e-81 | 46.41
Query:  PMRAVAVQLVVAGWWRRQSTAMSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIA
        PM A  +   +     R+  +  +V  KVAV+GSG        ++A + +S+++FDS               R P              GG M+   EI 
Subjt:  PMRAVAVQLVVAGWWRRQSTAMSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIA

Query:  EDGKELFFDHGAPYFTVSSPDVLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHEP-------------------
        EDGKEL FDHGAP+F VS+ D ++LV EWES+   +EWK+ FG FDC SN+F  I+QEG + +YVG PGMNSI KALC+E                    
Subjt:  EDGKELFFDHGAPYFTVSSPDVLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHEP-------------------

Query:  ---------------FEGIVASDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGR
                       F+G+VASDKNIVSPRFT VTG PPPL             +IPV+PCF+LMLAF++PLSSIPVKG S KNS++LSWA+C+S+KPGR
Subjt:  ---------------FEGIVASDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGR

Query:  SSSSERWILHSTMEYAERVIAEYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRW
        S+ SERWILHST +YA  VIA+ GLQK +  TL K++EE+++E Q  GL    PFFMKAHRW
Subjt:  SSSSERWILHSTMEYAERVIAEYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRW

AT1G56000.1 FAD/NAD(P)-binding oxidoreductase family protein | 2.3e-99 | 50.53
Query:  RQSTAMSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFT
        R+  +  +V  KVAV+GSG        ++A + +S+++FDS               R P              GG M+   EI EDGKEL FDHGAP+F 
Subjt:  RQSTAMSSVLSKVAVVGSG--------SIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFT

Query:  VSSPDVLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHEP----------------------------------F
        VS+ D ++LV EWES+   +EWK+ FG FDC SN+F  I+QEG + +YVG PGMNSI KALC+E                                   F
Subjt:  VSSPDVLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHEP----------------------------------F

Query:  EGIVASDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYA
        +G+VASDKNIVSPRFT VTG PPPL             +IPV+PCF+LMLAF++PLSSIPVKG S KNS++LSWA+C+S+KPGRS+ SERWILHST +YA
Subjt:  EGIVASDKNIVSPRFTSVTGRPPPL-----------AADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYA

Query:  ERVIAEYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPN
          VIA+ GLQK +  TL K++EE+++E Q  GL    PFFMKAHRWGSAFPA SIA EE+CLWD ++ LAICGDFCVSPN
Subjt:  ERVIAEYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPN

AT1G79040.1 photosystem II subunit R | 2.6e-42 | 80
Query:  SFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNT
        SFK+ ASG KKIKTD P+GING+M+LR+G+DASGRKGKG GVY++VDKYGANVDGYSPIY+  +WS SGDVY GG TGLAIWAVTLAG+LAGGALLVYNT
Subjt:  SFKVEASGKKKIKTDTPYGINGAMNLRNGLDASGRKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNT

Query:  SALVQ
        SAL Q
Subjt:  SALVQ

AT3G04650.1 FAD/NAD(P)-binding oxidoreductase family protein | 1.1e-16 | 25.25
Query:  LFFDHGAPYFTVSSPDVLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHE-------------------------
        L FDH A +FT      + LV  W  K L  EWK + G  +     F+         RY+   GM S+  +L  E                         
Subjt:  LFFDHGAPYFTVSSPDVLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHE-------------------------

Query:  -------PFEGIVASDKNIVSPRFTSVTGRP--PPLAADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSER-WILHSTMEY
                F+ IV +     + R  S +G P        + +   +AL+ AF+ PL ++  +G  +K  + LSW   +S+K G   +    W   ST  Y
Subjt:  -------PFEGIVASDKNIVSPRFTSVTGRP--PPLAADIPVIPCFALMLAFEQPLSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSER-WILHSTMEY

Query:  AERVIAEYGLQKPTDATLKKVAEELYQELQ-GMGL---SIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGA
         ++   +   +     T +KV   + Q ++  +GL   S+P+P + +   WG+A P  + A    C++D   R  ICGD+ +  N   ++  AI GA
Subjt:  AERVIAEYGLQKPTDATLKKVAEELYQELQ-GMGL---SIPRPFFMKAHRWGSAFPAASIAREEKCLWDGSKRLAICGDFCVSPNEDYIQIIAIHGA


Sequences
CDS sequence
ATGTTACCTATGCGGGCAGTGGCCGTGCAGCTGGTGGTGGCAGGGTGGTGGAGAAGACAAAGCACCGCCATGAGCTCCGTTCTGTCGAAAGTTGCCGTCGTGGGAAGTGG
AAGTATAGCGACTCATCATCTTTCTCTCTCTCTTTTTGATTCTTTTCTGGAGCCGCTTGCGCATGGAGCCTCGCCAGAAATGGGGTCTCGGTCACCTTGTTCGAGTCTGC
TAGAGGGCCCGGTGGAAGAATGTCCCAACGGAGGTGCGATGAATCTCCTCCTAGAAATTGCAGAAGATGGGAAAGAACTGTTTTTCGATCACGGTGCCCCATATTTTACT
GTCAGCTCCCCAGACGTATTGAGTCTTGTTCGTGAGTGGGAATCAAAACATCTTTGTGCAGAATGGAAAGAGAGTTTTGGGATTTTTGATTGCTTTTCCAATCAATTCAC
CAGCATAGAACAGGAAGGAGTAAGTGGCAGATATGTGGGTACTCCGGGCATGAATTCTATTTGCAAAGCATTATGCCATGAACCGTTTGAGGGAATTGTTGCATCAGACA
AAAACATAGTTTCTCCAAGGTTTACCAGTGTAACAGGACGACCGCCACCTCTTGCTGCGGACATTCCTGTTATTCCGTGTTTTGCTCTGATGCTTGCATTTGAACAGCCT
CTCTCTTCGATACCCGTCAAAGGTTTCTCTATAAAGAATTCTAAAGTTCTAAGCTGGGCTTACTGTGACAGCAGCAAACCGGGGCGTTCATCTTCTAGTGAACGGTGGAT
TTTGCATTCAACAATGGAGTACGCAGAGAGAGTGATTGCTGAATATGGGCTTCAGAAACCTACAGATGCCACGTTGAAAAAAGTGGCTGAAGAACTCTATCAAGAGTTGC
AAGGCATGGGACTTAGCATACCCCGTCCATTCTTTATGAAAGCTCACAGATGGGGGAGTGCTTTTCCTGCTGCAAGCATAGCCAGAGAGGAAAAGTGCCTCTGGGATGGA
AGCAAGCGGCTGGCCATCTGTGGAGATTTCTGTGTTAGCCCTAACGAAGATTATATCCAGATTATCGCCATTCACGGCGCGCCACGTGTCGCCCTACCAATCACATTTCT
CAGATTATCAAGGTTCCGAGCTTCGTCTGCGCCATTGCAATTTTCGAGCAGAAGCAGAAGAAGAAGAAGAAGAAGAAGAAACTCATCGATAGGAAATGGCGGCCTCGGTC
ATGGCTTCCGTGAGTCTCAAACCAGCTCCCTTCACCGTCGAGAAATCGGCCAGAGGTCTTCCGTCTCTCGCGAGGACTTCCGGTCTTTCAAGGTCGAGGCCAGTGGCAAG
AAGAAGATCAAGACCGACACACCCTACGGAATCAACGGTGCAATGAATTTGAGGAACGGACTTGATGCATCGGGAAGGAAGGGCAAGGGAAAGGGAGTCTATCAATTCGT
CGACAAGTACGGTGCCAATGTCGATGGCTACAGTCCCATCTACGATACCAAAGATTGGTCTCCAAGTGGTGATGTCTACGTCGGTGGTTCTACTGGATTAGCCATTTGGG
CAGTGACCCTGGCTGGGCTTCTTGCTGGAGGGGCTCTTCTTGTGTACAACACCAGTGCTTTGGTGCAGTAA
mRNA sequence
ATGTTACCTATGCGGGCAGTGGCCGTGCAGCTGGTGGTGGCAGGGTGGTGGAGAAGACAAAGCACCGCCATGAGCTCCGTTCTGTCGAAAGTTGCCGTCGTGGGAAGTGG
AAGTATAGCGACTCATCATCTTTCTCTCTCTCTTTTTGATTCTTTTCTGGAGCCGCTTGCGCATGGAGCCTCGCCAGAAATGGGGTCTCGGTCACCTTGTTCGAGTCTGC
TAGAGGGCCCGGTGGAAGAATGTCCCAACGGAGGTGCGATGAATCTCCTCCTAGAAATTGCAGAAGATGGGAAAGAACTGTTTTTCGATCACGGTGCCCCATATTTTACT
GTCAGCTCCCCAGACGTATTGAGTCTTGTTCGTGAGTGGGAATCAAAACATCTTTGTGCAGAATGGAAAGAGAGTTTTGGGATTTTTGATTGCTTTTCCAATCAATTCAC
CAGCATAGAACAGGAAGGAGTAAGTGGCAGATATGTGGGTACTCCGGGCATGAATTCTATTTGCAAAGCATTATGCCATGAACCGTTTGAGGGAATTGTTGCATCAGACA
AAAACATAGTTTCTCCAAGGTTTACCAGTGTAACAGGACGACCGCCACCTCTTGCTGCGGACATTCCTGTTATTCCGTGTTTTGCTCTGATGCTTGCATTTGAACAGCCT
CTCTCTTCGATACCCGTCAAAGGTTTCTCTATAAAGAATTCTAAAGTTCTAAGCTGGGCTTACTGTGACAGCAGCAAACCGGGGCGTTCATCTTCTAGTGAACGGTGGAT
TTTGCATTCAACAATGGAGTACGCAGAGAGAGTGATTGCTGAATATGGGCTTCAGAAACCTACAGATGCCACGTTGAAAAAAGTGGCTGAAGAACTCTATCAAGAGTTGC
AAGGCATGGGACTTAGCATACCCCGTCCATTCTTTATGAAAGCTCACAGATGGGGGAGTGCTTTTCCTGCTGCAAGCATAGCCAGAGAGGAAAAGTGCCTCTGGGATGGA
AGCAAGCGGCTGGCCATCTGTGGAGATTTCTGTGTTAGCCCTAACGAAGATTATATCCAGATTATCGCCATTCACGGCGCGCCACGTGTCGCCCTACCAATCACATTTCT
CAGATTATCAAGGTTCCGAGCTTCGTCTGCGCCATTGCAATTTTCGAGCAGAAGCAGAAGAAGAAGAAGAAGAAGAAGAAACTCATCGATAGGAAATGGCGGCCTCGGTC
ATGGCTTCCGTGAGTCTCAAACCAGCTCCCTTCACCGTCGAGAAATCGGCCAGAGGTCTTCCGTCTCTCGCGAGGACTTCCGGTCTTTCAAGGTCGAGGCCAGTGGCAAG
AAGAAGATCAAGACCGACACACCCTACGGAATCAACGGTGCAATGAATTTGAGGAACGGACTTGATGCATCGGGAAGGAAGGGCAAGGGAAAGGGAGTCTATCAATTCGT
CGACAAGTACGGTGCCAATGTCGATGGCTACAGTCCCATCTACGATACCAAAGATTGGTCTCCAAGTGGTGATGTCTACGTCGGTGGTTCTACTGGATTAGCCATTTGGG
CAGTGACCCTGGCTGGGCTTCTTGCTGGAGGGGCTCTTCTTGTGTACAACACCAGTGCTTTGGTGCAGTAA
Protein sequence
MLPMRAVAVQLVVAGWWRRQSTAMSSVLSKVAVVGSGSIATHHLSLSLFDSFLEPLAHGASPEMGSRSPCSSLLEGPVEECPNGGAMNLLLEIAEDGKELFFDHGAPYFT
VSSPDVLSLVREWESKHLCAEWKESFGIFDCFSNQFTSIEQEGVSGRYVGTPGMNSICKALCHEPFEGIVASDKNIVSPRFTSVTGRPPPLAADIPVIPCFALMLAFEQP
LSSIPVKGFSIKNSKVLSWAYCDSSKPGRSSSSERWILHSTMEYAERVIAEYGLQKPTDATLKKVAEELYQELQGMGLSIPRPFFMKAHRWGSAFPAASIAREEKCLWDG
SKRLAICGDFCVSPNEDYIQIIAIHGAPRVALPITFLRLSRFRASSAPLQFSSRSRRRRRRRRNSSIGNGGLGHGFRESQTSSLHRREIGQRSSVSREDFRSFKVEASGK
KKIKTDTPYGINGAMNLRNGLDASGRKGKGKGVYQFVDKYGANVDGYSPIYDTKDWSPSGDVYVGGSTGLAIWAVTLAGLLAGGALLVYNTSALVQ
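
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the variable `cds_start` are illustrative assumptions, and only the first 30 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Hypothetical helper for illustration; unknown codons become 'X'.
    """
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the Sgr029788 CDS as listed on this page.
cds_start = "ATGTTACCTATGCGGGCAGTGGCCGTGCAG"
print(translate(cds_start))  # MLPMRAVAVQ
```

The printed prefix agrees with the first ten residues of the protein sequence (MLPMRAVAVQ). Passing the full CDS, with line breaks removed, should allow the same comparison over the whole sequence.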