CuGenDBv2

Tan0016374 (gene) of Snake gourd v1 genome

Gene ID: Tan0016374
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Acetyl-coenzyme A carboxylase carboxyl transferase subunit beta, chloroplastic
Genome location: Contig00034:82604..89226
RNA-Seq Expression: Tan0016374
Synteny: Tan0016374
Gene Ontology terms:
GO:0006633 - fatty acid biosynthetic process (biological process)
GO:0015979 - photosynthesis (biological process)
GO:0022900 - electron transport chain (biological process)
GO:0009317 - acetyl-CoA carboxylase complex (cellular component)
GO:0009507 - chloroplast (cellular component)
GO:0009522 - photosystem I (cellular component)
GO:0031361 - integral component of thylakoid membrane (cellular component)
GO:0020037 - heme binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0009055 - electron transfer activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0003989 - acetyl-CoA carboxylase activity (molecular function)
InterPro domains:
IPR036826 - Cytochrome f large domain superfamily
IPR034733 - Acetyl-CoA carboxylase
IPR029045 - ClpP/crotonase-like domain superfamily
IPR024094 - Cytochrome f large domain
IPR011762 - Acetyl-coenzyme A carboxyltransferase, N-terminal
IPR011054 - Rudiment single hybrid motif
IPR003359 - Photosystem I Ycf4, assembly
IPR002325 - Cytochrome f
IPR000438 - Acetyl-CoA carboxylase carboxyl transferase, beta subunit
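The genome location in this record, Contig00034:82604..89226, follows a "sequence:start..end" pattern. The sketch below shows one way to split such a string into its parts and compute the span length. The function names are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Assumed pattern for location strings such as "Contig00034:82604..89226".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group("seqid"), int(match.group("start")), int(match.group("end"))


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Contig00034:82604..89226")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 6623 positions on Contig00034.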


Homology
GenBank top hits (with E-value, % identity, and alignment)
KAF3449781.1 hypothetical protein FNV43_RR05859 [Rhamnella rubrinervis] (E-value 3.8e-182, 63.73% identity)
Query:  MSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEYATNQF
        + SSDRIELSIDPGTWDPMDEDMVS DPI+FHS+EEPYKDRID  QRKTGLTEA+QTGTG+LNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEYATNQF
Subjt:  MSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEYATNQF

Query:  LPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQAAEYLF
        LPLILVCASGGARMQEGSLSLMQMAKISSAL+D+QS KKLFYV+ILTSPTTGGVTASFGMLGDIII EPNAY+AFAGKRVIEQTLNKT+PEGSQAAEYLF
Subjt:  LPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQAAEYLF

Query:  HKGLFDLIVPRNPLK----------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKE----------ECSFEPKQTYTEQPENCLCPK-----
        HKGLFD IVPRNPLK                VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKE          E   +     T   EN    +     
Subjt:  HKGLFDLIVPRNPLK----------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKE----------ECSFEPKQTYTEQPENCLCPK-----

Query:  -----FL-----CGYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN-------------------------------
             FL       YENPREATGRIVCANCHLANKPVDIEVPQA+LPDTVFEAVVRIPYD+QLKQVLAN                               
Subjt:  -----FL-----CGYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN-------------------------------

Query:  --------------------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA------------------
                                                                GGNRGRGQIYPDGSKSNNNVYNATA                   
Subjt:  --------------------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA------------------

Query:  ---------------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK
                       GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLL FL    LAQIF + + K
Subjt:  ---------------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK

KAF4348566.1 hypothetical protein F8388_000245 [Cannabis sativa] (E-value 2.2e-182, 61.2% identity)
Query:  HLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEYAT
        HL   SSDRIEL IDPGTWDPMDEDMVSLDPIEFHSEEEPYK+R+DSYQRKTGLTEA+QTGTG+LNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEYAT
Subjt:  HLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEYAT

Query:  NQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQAAE
        NQ LPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQ +E
Subjt:  NQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQAAE

Query:  YLFHKGLFDLIVPRNPLK---------------------------------------------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEV
        YLFHKGLFDLIVPRN LK                                                   VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEV
Subjt:  YLFHKGLFDLIVPRNPLK---------------------------------------------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEV

Query:  KE----------ECSFEPKQTYTEQPENCLCPKF--------------LCGYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQL
        KE          E   +     T   EN    +               + GYENPREATGRIVCANCHLANKPVDIEVPQA+LPDTVFEAVVRIPYD+QL
Subjt:  KE----------ECSFEPKQTYTEQPENCLCPKF--------------LCGYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQL

Query:  KQVLAN---------------------------------------------------------------------------------------GGNRGRG
        KQVLAN                                                                                       GGNRGRG
Subjt:  KQVLAN---------------------------------------------------------------------------------------GGNRGRG

Query:  QIYPDGSKSNNNVYNATAA---------------------------------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFF
        QIYPDGSKSNNNVYNATAA                                 GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLL F
Subjt:  QIYPDGSKSNNNVYNATAA---------------------------------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFF

Query:  LHLYFLAQIFFLKRNK
        L    LAQIF + + K
Subjt:  LHLYFLAQIFFLKRNK

KAF8376859.1 hypothetical protein HHK36_031455 [Tetracentron sinense] (E-value 1.1e-173, 63.81% identity)
Query:  MDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEYATNQFLPLILVCASGGARMQEGS
        MDEDMVS+DPIEFHSEEEPYKDRIDSYQRKTGLTEA+QTG G+LNGIP+AIGVMDFQFMGGSMGSVVGEKITRLIEYATN+ LPLI+VCASGGARMQEGS
Subjt:  MDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEYATNQFLPLILVCASGGARMQEGS

Query:  LSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQAAEYLFHKGLFDLIVPRNPLK---
        LSLMQMAKISSA YDYQSNKKLFYV+ILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVP+GSQAAEYLFHKGLFD IVPRNPLK   
Subjt:  LSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQAAEYLFHKGLFDLIVPRNPLK---

Query:  -------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKE----------ECSFEPKQTYTEQPENCLCPK-----------FLC----GYENP
                     VCIFRWGFPG NRRIFLRFLM+DIQSIRIEV+E          E   +     T   EN L P+           FL     GYENP
Subjt:  -------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKE----------ECSFEPKQTYTEQPENCLCPK-----------FLC----GYENP

Query:  REATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN-------------------------------------------------
        REATGRIVCANCHLANKPVDIEVPQA+LPDTVFEAVVRIPYDMQLKQVLAN                                                 
Subjt:  REATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN-------------------------------------------------

Query:  --------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA---------------------------------GPE
                                              GGNRGRGQIYPDGSKSNN VYNATAA                                 GPE
Subjt:  --------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA---------------------------------GPE

Query:  LLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK
        LLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFL    LAQIF + + K
Subjt:  LLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK

KAG6735390.1 hypothetical protein POTOM_062021 [Populus tomentosa] (E-value 3.2e-197, 71.93% identity)
Query:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK
        MNICE+CGYHLKMSSSDRIELSIDPGTWDPMDE+M SLDPI+FHSEEEPYKDRIDSYQ+KTGLTEAIQTG G+LNGIPVAIGVMDFQFMGGSMGSVVGEK
Subjt:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK

Query:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
        ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKL YV+ILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
Subjt:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT

Query:  VPEGSQAAEYLFHKGLFDLIVPRNPLK----------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKE----------ECSFEPKQTYTEQP
        VPEGSQAAE+LFHKGLFD IVPRN LK                VCIFRWGFPGKNRRI LR  MKDIQSIRIEVKE          E   +     T   
Subjt:  VPEGSQAAEYLFHKGLFDLIVPRNPLK----------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKE----------ECSFEPKQTYTEQP

Query:  ENCLCPK-----------FLC----GYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN-------------------
        EN L P+           FL     GYENPREATGRIVCANCHLANKPV IEVPQA+LPDTVFEAVVRIPYDMQLKQVLAN                   
Subjt:  ENCLCPK-----------FLC----GYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN-------------------

Query:  ---------------GGNRGRGQIYPDGSKSNNNVYNATAA---------------------------------GPELLVSEGESIKLDQPLTSNPNVGG
                       GGNRGRGQIYPDGSKSNN VYNATAA                                 GPELLVSEGESIKLDQPLTSNPNVGG
Subjt:  ---------------GGNRGRGQIYPDGSKSNNNVYNATAA---------------------------------GPELLVSEGESIKLDQPLTSNPNVGG

Query:  FGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK
        FGQGDAEIVLQDPLRVQGLLFFL    LAQIF + + K
Subjt:  FGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK

TXG46669.1 hypothetical protein EZV62_027824 [Acer yangbiense] (E-value 9.0e-192, 64.86% identity)
Query:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK
        MNICE+CGY+LK+SSSDRIELS+DPGTWDPMD+ MVSLDPIEFHSEEEPYKDRIDSYQ+KTGLTEA+QTGTG+LNGIPVAIGVMDFQFMGGSMGSVVGEK
Subjt:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK

Query:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
        ITRLIEYATN+ LPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYV+ILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
Subjt:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT

Query:  VPEGSQAAEYLFHKGLFDLIVPRNPLK----------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKEECSFEPKQTYTE----------QP
        VPEGSQAAEYLFHKGLFD IVPRNPLK                VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKE   +  +  Y E          + 
Subjt:  VPEGSQAAEYLFHKGLFDLIVPRNPLK----------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKEECSFEPKQTYTE----------QP

Query:  ENCLCPK-----------FL-----CGYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN------------------
        +  L P+           FL      G+ENPREATGRIVCANCHLANKPVDIEVPQA+LPDTVFEAVVRIPYDMQLKQVLAN                  
Subjt:  ENCLCPK-----------FL-----CGYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN------------------

Query:  ---------------------------------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA-----
                                                                             GGNRGRGQIYPDG+KSNN VYNATAA     
Subjt:  ---------------------------------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA-----

Query:  ----------------------------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK
                                    GPEL VSEGESIK DQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFL    LAQIF + + K
Subjt:  ----------------------------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK

TrEMBL top hits (with E-value, % identity, and alignment)
A0A4U5MZQ8 Cytochrome f (E-value 7.3e-163, 62.55% identity)
Query:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK
        MNICE+CGYHLKMSSSDRIEL IDPGTWDPMDE+M SLDPI+FHSEEEPYKDRIDSY++KTGLTEAIQTG G+LNGIPVAIGVMDFQFMGGSMGSVVGEK
Subjt:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK

Query:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
        ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKL YV+ILTSPTTGGVTASFGMLGDIIIAEPNAYIAFA            
Subjt:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT

Query:  VPEGSQAAEYLFHKGLFDLIVPRNPLKVCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKE----------ECSFEPKQTYTEQPENCLCPK---------
                                            GKNRRI LR  MKDIQSIRIEVKE          E   +     T   EN L P+         
Subjt:  VPEGSQAAEYLFHKGLFDLIVPRNPLKVCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKE----------ECSFEPKQTYTEQPENCLCPK---------

Query:  --FLC----GYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN-----------------------------------
          FL     GYENPREATGRIVCANCHLANKPV IEVPQA+LPDTVFEAVVRIPYDMQLKQVLAN                                   
Subjt:  --FLC----GYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN-----------------------------------

Query:  ----------------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAAGPELLVSEGESIKLDQPLTSNP
                                                            GGNRGRGQIYPDGSKSNN VYNAT AGPELLVSEGESIKLDQPLTSNP
Subjt:  ----------------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAAGPELLVSEGESIKLDQPLTSNP

Query:  NVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK
        NVGGFGQGDAEIVLQDPLRVQGLLFFL    LAQIF + + K
Subjt:  NVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK

A0A5C7GQ84 Multifunctional fusion protein (E-value 4.4e-192, 64.86% identity)
Query:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK
        MNICE+CGY+LK+SSSDRIELS+DPGTWDPMD+ MVSLDPIEFHSEEEPYKDRIDSYQ+KTGLTEA+QTGTG+LNGIPVAIGVMDFQFMGGSMGSVVGEK
Subjt:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK

Query:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
        ITRLIEYATN+ LPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYV+ILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
Subjt:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT

Query:  VPEGSQAAEYLFHKGLFDLIVPRNPLK----------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKEECSFEPKQTYTE----------QP
        VPEGSQAAEYLFHKGLFD IVPRNPLK                VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKE   +  +  Y E          + 
Subjt:  VPEGSQAAEYLFHKGLFDLIVPRNPLK----------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKEECSFEPKQTYTE----------QP

Query:  ENCLCPK-----------FL-----CGYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN------------------
        +  L P+           FL      G+ENPREATGRIVCANCHLANKPVDIEVPQA+LPDTVFEAVVRIPYDMQLKQVLAN                  
Subjt:  ENCLCPK-----------FL-----CGYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN------------------

Query:  ---------------------------------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA-----
                                                                             GGNRGRGQIYPDG+KSNN VYNATAA     
Subjt:  ---------------------------------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA-----

Query:  ----------------------------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK
                                    GPEL VSEGESIK DQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFL    LAQIF + + K
Subjt:  ----------------------------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK

A0A5N5IZR4 Cytochrome f (E-value 1.4e-169, 61.91% identity)
Query:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK
        MNICE+CGYHLKMSSSDRIELSIDPGTWDPMDEDM SLDPI+FHSEEEPYKDRIDSYQ+KTGLTEAIQTG G+LNGIPVAIGVMDFQFMGGSMGSVVGEK
Subjt:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK

Query:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
        ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKIS+ALYDYQSNKKL YV+ILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAG           
Subjt:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT

Query:  VPEGSQAAEYLFHKGLFDLIVPRNPLKVCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKE----------ECSFEPKQTYTEQPENCLCPK---------
                        FD    R    VCIFRWGFPGKNRRI LRF MKDIQSIRIEVKE          E   +     T   EN L P+         
Subjt:  VPEGSQAAEYLFHKGLFDLIVPRNPLKVCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKE----------ECSFEPKQTYTEQPENCLCPK---------

Query:  --FLC----GYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN-----------------------------------
          FL     GYENPREATGRIVCANCHLANKPV IEVPQA+LPDTVFEAVVRIPYDMQLKQVLAN                                   
Subjt:  --FLC----GYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN-----------------------------------

Query:  ----------------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA----------------------
                                                            GGNRGRGQIYPDGSKSNN VYNATAA                      
Subjt:  ----------------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA----------------------

Query:  -----------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK
                   GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFL    LAQIF + + K
Subjt:  -----------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK

A0A5N6L4X5 Cytochrome f (E-value 9.8e-168, 61.11% identity)
Query:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK
        M+ICE CG+HLKMSSSDRIELSIDPGTW PMDEDMVSLDPIEF+ EEEPYKDRIDSYQ KTGLTEA+QTGTG+LNGIPVAIG+MDFQFMGGSMGSVVGEK
Subjt:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK

Query:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
        ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQ NKKLFYV+ILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAG           
Subjt:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT

Query:  VPEGSQAAEYLFHKGLFDLIVPRNPLKVCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKEECSFEPKQTYTE----------QPENCLCPK---------
                        FD    R    VCIFRWGFPGKNRRI L+FLMKDIQSIRIEVKE   +  +  Y E          + +  L P+         
Subjt:  VPEGSQAAEYLFHKGLFDLIVPRNPLKVCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKEECSFEPKQTYTE----------QPENCLCPK---------

Query:  --FL-----CGYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN----------------------------------
          FL      GYENPREATGRIVCANCHLANKPVDIEVPQA+LPDTVFEAVVRIPYDMQLKQVLAN                                  
Subjt:  --FL-----CGYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN----------------------------------

Query:  -----------------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA---------------------
                                                             GGNRGRGQIYPDGSKSNNNVYNATAA                     
Subjt:  -----------------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA---------------------

Query:  ------------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK
                    GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLR QGLLFFL    LAQIF + + K
Subjt:  ------------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK

A0A7J6DRG2 Cytochrome f (E-value 1.1e-182, 61.2% identity)
Query:  HLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEYAT
        HL   SSDRIEL IDPGTWDPMDEDMVSLDPIEFHSEEEPYK+R+DSYQRKTGLTEA+QTGTG+LNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEYAT
Subjt:  HLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEYAT

Query:  NQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQAAE
        NQ LPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQ +E
Subjt:  NQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQAAE

Query:  YLFHKGLFDLIVPRNPLK---------------------------------------------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEV
        YLFHKGLFDLIVPRN LK                                                   VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEV
Subjt:  YLFHKGLFDLIVPRNPLK---------------------------------------------------VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEV

Query:  KE----------ECSFEPKQTYTEQPENCLCPKF--------------LCGYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQL
        KE          E   +     T   EN    +               + GYENPREATGRIVCANCHLANKPVDIEVPQA+LPDTVFEAVVRIPYD+QL
Subjt:  KE----------ECSFEPKQTYTEQPENCLCPKF--------------LCGYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQL

Query:  KQVLAN---------------------------------------------------------------------------------------GGNRGRG
        KQVLAN                                                                                       GGNRGRG
Subjt:  KQVLAN---------------------------------------------------------------------------------------GGNRGRG

Query:  QIYPDGSKSNNNVYNATAA---------------------------------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFF
        QIYPDGSKSNNNVYNATAA                                 GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLL F
Subjt:  QIYPDGSKSNNNVYNATAA---------------------------------GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFF

Query:  LHLYFLAQIFFLKRNK
        L    LAQIF + + K
Subjt:  LHLYFLAQIFFLKRNK

SwissProt top hits (with E-value, % identity, and alignment)
B1A944 Acetyl-coenzyme A carboxylase carboxyl transferase subunit beta, chloroplastic (E-value 5.1e-121, 93.83% identity)
Query:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK
        MNICE+CGY+LKMSSSDRIEL +DPGTW+PMDE+MVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEA+QTGTG+LNGIP+A+GVMDFQFMGGSMGSVVGEK
Subjt:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK

Query:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
        ITRLIEYATN+FLPLI+VCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYV+ILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
Subjt:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT

Query:  VPEGSQAAEYLFHKGLFDLIVPRNPLK
        VPEGSQAAEYLFHKGLFD IVPRNPLK
Subjt:  VPEGSQAAEYLFHKGLFDLIVPRNPLK

Q06GQ1 Acetyl-coenzyme A carboxylase carboxyl transferase subunit beta, chloroplastic (E-value 1.1e-120, 93.83% identity)
Query:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK
        MNICE+CGYHLKMSSSDRIELSIDPGTWDPMDEDMVS+DPIEFHSEEEPYKDR+DSYQRKTGLTEA+QTG G+LNGIPVAIGVMDFQFMGGSMGSVVGEK
Subjt:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK

Query:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
        ITRLIEYATN+ LP+I+VCASGGARMQEGSLSLMQMAKISS  YDYQSN+KLFYV+ILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
Subjt:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT

Query:  VPEGSQAAEYLFHKGLFDLIVPRNPLK
        VPEGSQAAEYLFHKGLFDLIVPRNPLK
Subjt:  VPEGSQAAEYLFHKGLFDLIVPRNPLK

Q09X08 Acetyl-coenzyme A carboxylase carboxyl transferase subunit beta, chloroplastic (E-value 5.1e-121, 96.04% identity)
Query:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK
        MNICE CG HLKMSSSDRIEL IDPGTWDPMDEDMVSLDPIEF+SEEEPYKDRIDSYQRKTGLTEA+QTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK
Subjt:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK

Query:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
        ITRLIEYATNQ LPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFY+AILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
Subjt:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT

Query:  VPEGSQAAEYLFHKGLFDLIVPRNPLK
        VPEGSQ AEYLFHKGLFD IVPRNPLK
Subjt:  VPEGSQAAEYLFHKGLFDLIVPRNPLK

Q49KY9 Acetyl-coenzyme A carboxylase carboxyl transferase subunit beta, chloroplastic (E-value 3.5e-122, 95.15% identity)
Query:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK
        MNIC++CGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLT+A+QTGTG+LNGIP+AIGVMDFQFMGGSMGSVVGEK
Subjt:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK

Query:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
        ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKIS+ALYDYQS+KKLFYV+ILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNK 
Subjt:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT

Query:  VPEGSQAAEYLFHKGLFDLIVPRNPLK
        VPEGSQAAEYLFHKGLFD IVPRNPLK
Subjt:  VPEGSQAAEYLFHKGLFDLIVPRNPLK

Q68RZ7 Acetyl-coenzyme A carboxylase carboxyl transferase subunit beta, chloroplastic (E-value 2.5e-120, 94.27% identity)
Query:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK
        MN+CE+CGYHLKMSSSDRIEL IDPGTWDPMDEDMVSLDPIEFHSEEEPYKDR+DSYQRKTGLTEA+QTG G+LN IPVAIGVMDFQFMGGSMGSVVGEK
Subjt:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK

Query:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
        ITRLIEYAT +FLPLI+VCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYV ILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
Subjt:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT

Query:  VPEGSQAAEYLFHKGLFDLIVPRNPLK
        VPEGSQAAEYLF KGLFDLIVPRNPLK
Subjt:  VPEGSQAAEYLFHKGLFDLIVPRNPLK

Arabidopsis top hits (with E-value, % identity, and alignment)
ATCG00500.1 acetyl-CoA carboxylase carboxyl transferase subunit beta (E-value 5.1e-108, 85.02% identity)
Query:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK
        MN+CE+CG++LKMSSS+RIELSIDPGTW+PMDEDMVS DPI+FHS+EEPYK+RIDS Q+ TGLT+A+QTGTG+LNGIPVA+GVMDF+FMGGSMGSVVGEK
Subjt:  MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEK

Query:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT
        ITRLIEYATNQ LPLILVC+SGGARMQEGSLSLMQMAKISS L DYQS+KKLFY++ILTSPTTGGVTASFGMLGDIIIAEP AYIAFAGKRVIEQTL K 
Subjt:  ITRLIEYATNQFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKT

Query:  VPEGSQAAEYLFHKGLFDLIVPRNPLK
        VPEGSQAAE L  KGL D IVPRN LK
Subjt:  VPEGSQAAEYLFHKGLFDLIVPRNPLK

ATCG00520.1 unfolded protein binding (E-value 2.2e-10, 86.11% identity)
Query:  VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKEECS
        V IFRWGFPGK+RRIFLRF MKDIQSIRIEVKE  S
Subjt:  VCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKEECS

ATCG00540.1 photosynthetic electron transfer A (E-value 9.4e-54, 49.06% identity)
Query:  YENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN---------------------------------------------
        YENPREATGRIVCANCHLANKPVDIEVPQ +LPDTVFEAVV+IPYDMQLKQVLAN                                             
Subjt:  YENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPYDMQLKQVLAN---------------------------------------------

Query:  ------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA--------------------------------
                                                  GGNRGRGQIYPDGSKSNN VYNATA                                 
Subjt:  ------------------------------------------GGNRGRGQIYPDGSKSNNNVYNATAA--------------------------------

Query:  -GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK
         G ELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFL    LAQIF + + K
Subjt:  -GPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNK


Sequences
CDS sequence
ATGAATATTTGTGAAGAATGTGGATACCATTTGAAAATGAGTAGTTCAGATAGAATCGAACTTTCGATTGATCCAGGTACTTGGGACCCTATGGATGAAGACATGGTCTC
TTTGGATCCCATTGAATTTCATTCGGAGGAGGAACCTTATAAAGATCGTATTGATTCTTATCAAAGAAAGACAGGATTAACTGAGGCTATTCAAACAGGCACAGGTAAAT
TAAACGGTATTCCCGTAGCAATTGGGGTTATGGATTTTCAGTTTATGGGGGGTAGTATGGGATCCGTAGTAGGCGAGAAAATCACCCGTTTGATCGAGTATGCTACCAAT
CAATTTTTACCTCTTATTTTAGTGTGTGCTTCTGGAGGAGCACGCATGCAAGAAGGAAGTTTGAGCTTGATGCAAATGGCCAAAATATCTTCCGCTTTATATGATTATCA
ATCAAATAAAAAATTATTCTATGTAGCAATCCTTACATCTCCTACTACTGGTGGGGTGACAGCTAGTTTTGGTATGTTGGGGGATATCATTATTGCCGAACCCAATGCCT
ACATCGCATTTGCGGGTAAAAGAGTAATTGAACAAACATTGAATAAGACAGTCCCTGAGGGTTCACAGGCGGCTGAATATTTATTCCATAAGGGCTTATTCGATCTAATC
GTACCACGTAATCCTTTAAAAGTGTGTATTTTTCGTTGGGGGTTTCCTGGAAAAAATCGTCGAATCTTCCTACGATTCCTTATGAAAGACATTCAGTCCATCAGAATAGA
AGTTAAAGAGGAATGCTCATTTGAGCCAAAGCAGACGTATACGGAACAGCCAGAAAACTGTCTTTGTCCGAAATTTTTATGCGGTTATGAAAATCCACGAGAAGCAACCG
GCCGTATTGTCTGTGCCAACTGCCATTTAGCTAATAAGCCCGTGGATATCGAGGTTCCACAAGCGATACTTCCTGATACTGTATTTGAAGCAGTTGTTCGAATTCCTTAT
GATATGCAACTGAAACAAGTTCTTGCGAATGGCGGGAACAGGGGAAGGGGTCAGATTTATCCCGACGGGAGTAAGAGTAATAATAATGTTTATAATGCTACAGCAGCAGG
ACCAGAACTTCTTGTTTCAGAGGGCGAATCTATCAAACTTGATCAACCATTAACGAGTAATCCTAATGTGGGAGGGTTTGGTCAGGGAGATGCGGAAATAGTCCTTCAAG
ATCCATTACGTGTCCAAGGTCTTTTGTTCTTTTTGCATCTGTATTTTTTGGCACAAATTTTTTTCTTAAAAAGAAACAAAACCGAAGATTGGGTAAAATGTGAATCCGAC
TGGTTGGTTTATAACAATTCAGGAAGCAGATTCTCATAG
mRNA sequence
GAAAGTTCTAATGATCTAGATGTAACTCAAAAATACAGGCATTTGTGGGTTCAATGCGAAAATTGTTATGGATTAAATTATAAGAAATTTTTGAAATCAAAAATGAATAT
TTGTGAAGAATGTGGATACCATTTGAAAATGAGTAGTTCAGATAGAATCGAACTTTCGATTGATCCAGGTACTTGGGACCCTATGGATGAAGACATGGTCTCTTTGGATC
CCATTGAATTTCATTCGGAGGAGGAACCTTATAAAGATCGTATTGATTCTTATCAAAGAAAGACAGGATTAACTGAGGCTATTCAAACAGGCACAGGTAAATTAAACGGT
ATTCCCGTAGCAATTGGGGTTATGGATTTTCAGTTTATGGGGGGTAGTATGGGATCCGTAGTAGGCGAGAAAATCACCCGTTTGATCGAGTATGCTACCAATCAATTTTT
ACCTCTTATTTTAGTGTGTGCTTCTGGAGGAGCACGCATGCAAGAAGGAAGTTTGAGCTTGATGCAAATGGCCAAAATATCTTCCGCTTTATATGATTATCAATCAAATA
AAAAATTATTCTATGTAGCAATCCTTACATCTCCTACTACTGGTGGGGTGACAGCTAGTTTTGGTATGTTGGGGGATATCATTATTGCCGAACCCAATGCCTACATCGCA
TTTGCGGGTAAAAGAGTAATTGAACAAACATTGAATAAGACAGTCCCTGAGGGTTCACAGGCGGCTGAATATTTATTCCATAAGGGCTTATTCGATCTAATCGTACCACG
TAATCCTTTAAAAGTGTGTATTTTTCGTTGGGGGTTTCCTGGAAAAAATCGTCGAATCTTCCTACGATTCCTTATGAAAGACATTCAGTCCATCAGAATAGAAGTTAAAG
AGGAATGCTCATTTGAGCCAAAGCAGACGTATACGGAACAGCCAGAAAACTGTCTTTGTCCGAAATTTTTATGCGGTTATGAAAATCCACGAGAAGCAACCGGCCGTATT
GTCTGTGCCAACTGCCATTTAGCTAATAAGCCCGTGGATATCGAGGTTCCACAAGCGATACTTCCTGATACTGTATTTGAAGCAGTTGTTCGAATTCCTTATGATATGCA
ACTGAAACAAGTTCTTGCGAATGGCGGGAACAGGGGAAGGGGTCAGATTTATCCCGACGGGAGTAAGAGTAATAATAATGTTTATAATGCTACAGCAGCAGGACCAGAAC
TTCTTGTTTCAGAGGGCGAATCTATCAAACTTGATCAACCATTAACGAGTAATCCTAATGTGGGAGGGTTTGGTCAGGGAGATGCGGAAATAGTCCTTCAAGATCCATTA
CGTGTCCAAGGTCTTTTGTTCTTTTTGCATCTGTATTTTTTGGCACAAATTTTTTTCTTAAAAAGAAACAAAACCGAAGATTGGGTAAAATGTGAATCCGACTGGTTGGT
TTATAACAATTCAGGAAGCAGATTCTCATAG
Protein sequence
MNICEECGYHLKMSSSDRIELSIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAIQTGTGKLNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEYATN
QFLPLILVCASGGARMQEGSLSLMQMAKISSALYDYQSNKKLFYVAILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQAAEYLFHKGLFDLI
VPRNPLKVCIFRWGFPGKNRRIFLRFLMKDIQSIRIEVKEECSFEPKQTYTEQPENCLCPKFLCGYENPREATGRIVCANCHLANKPVDIEVPQAILPDTVFEAVVRIPY
DMQLKQVLANGGNRGRGQIYPDGSKSNNNVYNATAAGPELLVSEGESIKLDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLHLYFLAQIFFLKRNKTEDWVKCESD
WLVYNNSGSRFS
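The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below is a minimal illustration that assumes the standard genetic code. The helper names are hypothetical, and only the first and last few codons of the CDS are used, so it is a spot check rather than a full validation of the record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 36 and last 12 nucleotides of the CDS listed above.
print(translate("ATGAATATTTGTGAAGAATGTGGATACCATTTGAAA"))
print(translate("AGATTCTCATAG"))
```

The first call yields the opening residues of the protein sequence (MNICEECGYHLK), and the second yields its last three residues followed by the stop codon.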