; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018592 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018592
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionAIR carboxylase
Genome locationChr04:5590330..5597911
RNA-Seq ExpressionHG10018592
SyntenyHG10018592
Gene Ontology termsGO:0006189 - 'de novo' IMP biosynthetic process (biological process)
GO:0004638 - phosphoribosylaminoimidazole carboxylase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0043727 - 5-amino-4-imidazole carboxylate lyase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR003135 - ATP-grasp fold, ATP-dependent carboxylate-amine ligase-type
IPR005875 - Phosphoribosylaminoimidazole carboxylase, ATPase subunit
IPR011054 - Rudiment single hybrid motif
IPR011761 - ATP-grasp fold
IPR013815 - ATP-grasp fold, subdomain 1
IPR016185 - Pre-ATP-grasp domain superfamily
IPR040686 - Phosphoribosylaminoimidazole carboxylase, C-terminal domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578313.1 Phosphoribosylaminoimidazole carboxylase, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.1e-22995.94Show/hide
Query:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE
        MA QPQRKN TLHCS S HSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVG+FDDS TVQEFAKRCE
Subjt:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE

Query:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA
        VLTVEIEHVDVATLEILE+QGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQI+DLEGAKKAGDIFGYPLMIKSKR AYDGRGNAVAKSVEELSSA
Subjt:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA

Query:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE
        IFALGGFER LYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVA+KAVSSLEGAGIFAVELFLTEDGQILLNE
Subjt:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE

Query:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM
        VAPRPHNSGHHTIESCKTSQ+EQHLRAVLGLPLGDPSM+TPAAIMYNILGEDEGEPGFYLAHQLM RALSISGAFVHWYNK EMRKQRKMGHITIVGRSM
Subjt:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM

Query:  SVVENLLASTLDEASEKPA
        SVVENLL+ TLDEASEKPA
Subjt:  SVVENLLASTLDEASEKPA

XP_022938864.1 phosphoribosylaminoimidazole carboxylase, chloroplastic-like isoform X1 [Cucurbita moschata]5.6e-22995.7Show/hide
Query:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE
        MA QPQRKN TLHCS S HSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVG+FDDS TVQEFAKRCE
Subjt:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE

Query:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA
        VLTVEIEHVDVATLEILE+QGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQI+DLEGAKKAGDIFGYPLMIKSKR AYDGRGNAVAKSVEELSSA
Subjt:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA

Query:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE
        IFALGGFER LYVEKWAPFVKELSVVIARGRDN MACYPVVETIHKENICHIVKAPASVSWEIKKLATDVA+KAVSSLEGAGIFAVELFLTEDGQILLNE
Subjt:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE

Query:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM
        VAPRPHNSGHHTIESCKTSQ+EQHLRAVLGLPLGDPSM+TPAAIMYNILGEDEGEPGFYLAHQLM RALSISGAFVHWYNK EMRKQRKMGHITIVGRSM
Subjt:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM

Query:  SVVENLLASTLDEASEKPA
        SVVENLL+ TLDEASEKPA
Subjt:  SVVENLLASTLDEASEKPA

XP_022992775.1 phosphoribosylaminoimidazole carboxylase, chloroplastic-like isoform X1 [Cucurbita maxima]2.4e-22794.99Show/hide
Query:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE
        MA QPQRKN TLHCS S HSTETLPRKDEI+VHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVG+FDDS TVQEFA RCE
Subjt:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE

Query:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA
        VLTVEIEHVDVATLEILE+QGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEF QI+DLEGAKKAGDIFGYPLMIKSKR AYDGRGNAVAKSVEELSSA
Subjt:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA

Query:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE
        IFALGGFER LYVEKWAPFVKELSVVIARGRDNSMACYPVV+TIHKENICHIVKAPASVSWEIKKLATDVA+KAVSSLEGAGIFAVELFLTEDGQILLNE
Subjt:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE

Query:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM
        VAPRPHNSGHHTIESCKTSQ+EQHLRAVLGLPLGDPSM+TPAAIMYNILGEDEGEPGFYLAHQLM RALSISGAFVHWYNK EMRKQRKMGHITIVGRSM
Subjt:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM

Query:  SVVENLLASTLDEASEKPA
        SVVENLL+ TLDEASEKPA
Subjt:  SVVENLLASTLDEASEKPA

XP_023549739.1 phosphoribosylaminoimidazole carboxylase, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo]7.3e-22995.23Show/hide
Query:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE
        MA QPQRKN TLHCS S HSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVG+FDDS TVQEFAKRCE
Subjt:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE

Query:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA
        VLTVEIEHVDVATLEILE+QGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQI+DLEGAKKAGDIFGYPLMIKSKR AYDGRGNAVAKSVEELSSA
Subjt:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA

Query:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE
        IFALGGFER LYVEKWAPFVKELSV+IARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVA+KAVSSLEGAGIFAVELF+TEDGQILLNE
Subjt:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE

Query:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM
        VAPRPHNSGHHTIESCKTSQ+EQHLRAVLGLPLGDPSM+TPAAIMYNILGEDEGEPGFYLAHQLM RALSISGAFVHWYNK EMRKQRKMGHITIVGRSM
Subjt:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM

Query:  SVVENLLASTLDEASEKPA
        SV+ENLL+ TLDEASEKPA
Subjt:  SVVENLLASTLDEASEKPA

XP_038885999.1 phosphoribosylaminoimidazole carboxylase, chloroplastic-like isoform X1 [Benincasa hispida]1.1e-22795.47Show/hide
Query:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE
        MAHQPQRKNQ+LHCS S   T+TLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLS+KIAILDPLVNCPASSLAY+HMVGNFDDS TVQEFAKRC+
Subjt:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE

Query:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA
        VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDK+LQKVHFSQHGIPLPEFMQIDDLE AKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSS+
Subjt:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA

Query:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE
        I ALGGFER LYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE
Subjt:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE

Query:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM
        VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSM+TPAAIMYNILGEDEGEPGFYLAHQLM RALSISGAFVHWYNKPEMRKQRKMGHITIVGRS+
Subjt:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM

Query:  SVVENLLASTLDEASEKPA
        SVV+NLLASTLDEASEKPA
Subjt:  SVVENLLASTLDEASEKPA

TrEMBL top hitse value%identityAlignment
A0A0A0LST5 AIR carboxylase1.5e-22494.26Show/hide
Query:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE
        MAHQP RKN  LHCS S HSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKI+ILDPLVNCPASSLAYYHMVGNFDDS+TVQEFAKRC+
Subjt:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE

Query:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA
        VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEF+QIDDLE AKKAG IFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA
Subjt:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA

Query:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE
        I ALGGFER LYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHK+NICHIVKAPASVSWEIKKLA DVAYKAV+SLEGAGIFAVELFLTEDGQILLNE
Subjt:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE

Query:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM
        VAPRPHNSGHHTIESCKTSQFEQHLRAVLGL LGDPSM+TPAAIM NILGEDEGEPGFY+AHQLM RALSISGAFVHWYNKPEMR+QRKMGHITIVGR M
Subjt:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM

Query:  SVVENLLASTLDEASEKP
        SVVENLLAS LDEASEKP
Subjt:  SVVENLLASTLDEASEKP

A0A1S3B3F9 AIR carboxylase1.5e-22493.79Show/hide
Query:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE
        M HQP RKNQ+LHCS + HST+TLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKI+ILDPLVNCPASSLAYYHMVGNFDDS TVQEFAKRC+
Subjt:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE

Query:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA
        VLTVEIEHVDVATLEILEQQG+DCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQI DLE AKKAG IFGYPLMIKSKRLAYDGRGNAVAKS+EELSSA
Subjt:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA

Query:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE
        I ALGGFER LYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHK+NICHIVKAPASVSWEIKKLA DVAYKAV+SLEGAGIFAVELFLTEDGQILLNE
Subjt:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE

Query:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM
        VAPRPHNSGHHTIESCKTSQFEQHLRAVLGL LGDPSM+TPAAIM NILGEDEGEPGFYLAHQLM RALSISGAFVHWYNKPEMR+QRKMGHITIVGRSM
Subjt:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM

Query:  SVVENLLASTLDEASEKPA
        SVVENLLAS LDEASEKPA
Subjt:  SVVENLLASTLDEASEKPA

A0A5D3C3X4 AIR carboxylase1.5e-22493.79Show/hide
Query:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE
        M HQP RKNQ+LHCS + HST+TLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKI+ILDPLVNCPASSLAYYHMVGNFDDS TVQEFAKRC+
Subjt:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE

Query:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA
        VLTVEIEHVDVATLEILEQQG+DCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQI DLE AKKAG IFGYPLMIKSKRLAYDGRGNAVAKS+EELSSA
Subjt:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA

Query:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE
        I ALGGFER LYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHK+NICHIVKAPASVSWEIKKLA DVAYKAV+SLEGAGIFAVELFLTEDGQILLNE
Subjt:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE

Query:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM
        VAPRPHNSGHHTIESCKTSQFEQHLRAVLGL LGDPSM+TPAAIM NILGEDEGEPGFYLAHQLM RALSISGAFVHWYNKPEMR+QRKMGHITIVGRSM
Subjt:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM

Query:  SVVENLLASTLDEASEKPA
        SVVENLLAS LDEASEKPA
Subjt:  SVVENLLASTLDEASEKPA

A0A6J1FFD1 AIR carboxylase2.7e-22995.7Show/hide
Query:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE
        MA QPQRKN TLHCS S HSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVG+FDDS TVQEFAKRCE
Subjt:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE

Query:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA
        VLTVEIEHVDVATLEILE+QGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQI+DLEGAKKAGDIFGYPLMIKSKR AYDGRGNAVAKSVEELSSA
Subjt:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA

Query:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE
        IFALGGFER LYVEKWAPFVKELSVVIARGRDN MACYPVVETIHKENICHIVKAPASVSWEIKKLATDVA+KAVSSLEGAGIFAVELFLTEDGQILLNE
Subjt:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE

Query:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM
        VAPRPHNSGHHTIESCKTSQ+EQHLRAVLGLPLGDPSM+TPAAIMYNILGEDEGEPGFYLAHQLM RALSISGAFVHWYNK EMRKQRKMGHITIVGRSM
Subjt:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM

Query:  SVVENLLASTLDEASEKPA
        SVVENLL+ TLDEASEKPA
Subjt:  SVVENLLASTLDEASEKPA

A0A6J1JWN2 AIR carboxylase1.1e-22794.99Show/hide
Query:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE
        MA QPQRKN TLHCS S HSTETLPRKDEI+VHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVG+FDDS TVQEFA RCE
Subjt:  MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCE

Query:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA
        VLTVEIEHVDVATLEILE+QGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEF QI+DLEGAKKAGDIFGYPLMIKSKR AYDGRGNAVAKSVEELSSA
Subjt:  VLTVEIEHVDVATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSA

Query:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE
        IFALGGFER LYVEKWAPFVKELSVVIARGRDNSMACYPVV+TIHKENICHIVKAPASVSWEIKKLATDVA+KAVSSLEGAGIFAVELFLTEDGQILLNE
Subjt:  IFALGGFERDLYVEKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNE

Query:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM
        VAPRPHNSGHHTIESCKTSQ+EQHLRAVLGLPLGDPSM+TPAAIMYNILGEDEGEPGFYLAHQLM RALSISGAFVHWYNK EMRKQRKMGHITIVGRSM
Subjt:  VAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSM

Query:  SVVENLLASTLDEASEKPA
        SVVENLL+ TLDEASEKPA
Subjt:  SVVENLLASTLDEASEKPA

SwissProt top hitse value%identityAlignment
P0C017 Phosphoribosylaminoimidazole carboxylase3.6e-8544.94Show/hide
Query:  KIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSL-----AYYHMVGNFDDSKTVQEFAKRCEVLTVEIEHVDVATLEILEQQGI-DCQPRASTI
        K VG+LGGGQLGRML   A+ L I + ILD     PA        ++ H  G F     +++ A  C++LTVEIEHV+   LE +E++G+ + QP   TI
Subjt:  KIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSL-----AYYHMVGNFDDSKTVQEFAKRCEVLTVEIEHVDVATLEILEQQGI-DCQPRASTI

Query:  RIIQDKYLQKVHFSQHGIPLPEFMQI---DDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSV--EELSSAIFALGGFERDLYVEKWAPFVKELSVV
        R+IQ+KY QK + ++ G+ +  F ++      E  K      G PLM+K+K LAYDGRGN+  KS   E++ +++  LG  +R LY E WAPFVKE++V+
Subjt:  RIIQDKYLQKVHFSQHGIPLPEFMQI---DDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSV--EELSSAIFALGGFERDLYVEKWAPFVKELSVV

Query:  IARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNEVAPRPHNSGHHTIESCKTSQFEQHLR
        + R ++  +  Y  VETIH+E+I  +  AP      + + A ++A KAV  LEGAGIF VE+FL  DG++LLNE+APRPHNSGHHTIE+C TSQFE HLR
Subjt:  IARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNEVAPRPHNSGHHTIESCKTSQFEQHLR

Query:  AVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSMSVVENLLASTL
        A+L LPLG  ++  P+A M NILG            ++   AL++ GA VH Y K E RK RKMGHIT+   S + +   L + L
Subjt:  AVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSMSVVENLLASTL

P15567 Phosphoribosylaminoimidazole carboxylase8.7e-9247.67Show/hide
Query:  VSEK-IVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSL--AYYHMVGNFDDSKTVQEFAKRCEVLTVEIEHVDVATLEILEQQGIDCQPRASTI
        +SEK +VG+LGGGQLGRM+ +AA +L+IK  ILD   N PA  +     H+  +F D   + E +K+C +LT EIEH++   L  +  + +  +P  +T+
Subjt:  VSEK-IVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSL--AYYHMVGNFDDSKTVQEFAKRCEVLTVEIEHVDVATLEILEQQGIDCQPRASTI

Query:  RIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSAIFALGGFERDLYVEKWAPFVKELSVVIARGR
        R IQDKYLQK H     I LPEF    D E  +KAG  FGYP ++KSK LAYDGRGN V     E+ +AI ALG  +R LYVEK+ PF  E++V++ R  
Subjt:  RIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSAIFALGGFERDLYVEKWAPFVKELSVVIARGR

Query:  DNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQ-ILLNEVAPRPHNSGHHTIESCKTSQFEQHLRAVLG
        D  +  YP  ETI K+N+CH+V APA + + I++ A  +A  AV + EGAGI+ VE+F+ +DG+ ILLNE+APRPHNSGH+TIE+C TSQFE HLRA+ G
Subjt:  DNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQ-ILLNEVAPRPHNSGHHTIESCKTSQFEQHLRAVLG

Query:  LPLGD----PSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSMSVVENLLASTLD
        LP  +     S  T  A+M NILG D+ +       ++  R+LSI GA +H Y K E RK RKMGH+TI+  S    E      LD
Subjt:  LPLGD----PSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSMSVVENLLASTLD

P50504 Phosphoribosylaminoimidazole carboxylase8.7e-9246.84Show/hide
Query:  VSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSL-AYYHMVGNFDDSKTVQEFAKRCEVLTVEIEHVDVATLE-ILEQQGIDCQPRASTIR
        +  KIVG+LGGGQLGRM+ +AA++L+IK  +LD + N PA  + +  H+ G+F D K++ + A++C++LTVEIEHVDV  L+ + E+ G++  P   TI+
Subjt:  VSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSL-AYYHMVGNFDDSKTVQEFAKRCEVLTVEIEHVDVATLE-ILEQQGIDCQPRASTIR

Query:  IIQDKYLQKVHFSQHGIPLPEFMQI--DDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSAIFALGGFERDLYVEKWAPFVKELSVVIARG
        +IQDKYLQK H  QHGI + E + +  +D +   + G+ F YP M+KS+ LAYDGRGN V K+ E +  A+  L   +R LY EKW PF KEL+V++ R 
Subjt:  IIQDKYLQKVHFSQHGIPLPEFMQI--DDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSAIFALGGFERDLYVEKWAPFVKELSVVIARG

Query:  RDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNEVAPRPHNSGHHTIESCKTSQFEQHLRAVLG
         +  +  YP VETIHK NICH+V APA VS  I   A+ +A  AV S  G GIF VE+FL  + +IL+NE+APRPHNSGH+TI++C TSQFE H+RAV+G
Subjt:  RDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNEVAPRPHNSGHHTIESCKTSQFEQHLRAVLG

Query:  LPL----GDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSMSVVENLLASTLDEASEKPAGI
        LP+       S  T  AIM N+LG++E         ++  RAL    A V+ Y K   R  RKMGHI IV  SM   E+ L   + ++S+ P  +
Subjt:  LPL----GDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSMSVVENLLASTLDEASEKPAGI

P55195 Phosphoribosylaminoimidazole carboxylase, chloroplastic (Fragment)2.5e-15571.77Show/hide
Query:  GVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCEVLTVEIEHVDVATLEILEQQGIDCQPRASTIRI
        G+ E +VGVLGGGQLGRM+CQAAS+++IK+ +LDP  NCPASSL+Y+HMVG+FD+S  V+EFAKRC VLTVEIEHVDV TLE LE+QG+DCQP+AST+RI
Subjt:  GVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCEVLTVEIEHVDVATLEILEQQGIDCQPRASTIRI

Query:  IQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSAIFALGGFERDLYVEKWAPFVKELSVVIARGRDN
        IQDKY QKV      IPLPEFM+IDDL+   K  D      MIKS+RLAYDGRGN VAKS EELSSA+ ALGGF+R LY EKWAPFVKEL+V++ARGRDN
Subjt:  IQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSAIFALGGFERDLYVEKWAPFVKELSVVIARGRDN

Query:  SMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNEVAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPL
        S++CYPVVE +   +ICHIVK+PA+V+W+ ++LA +VA+ AV SLE  G+FAVELFLT++G+ILLNEVAPRPHNSGHHTIESC TSQFEQHL AV+GLPL
Subjt:  SMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNEVAPRPHNSGHHTIESCKTSQFEQHLRAVLGLPL

Query:  GDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSMSVVENLLASTLD
        GDPSM TPAAIMYNILGE+EGE GF LAHQLM RA++I GA VHWY+KPEMRKQRKM HITIVG S+S +E+ LA  L+
Subjt:  GDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSMSVVENLLASTLD

Q01930 Phosphoribosylaminoimidazole carboxylase1.2e-8546.46Show/hide
Query:  VSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYY--HMVGNFDDSKTVQEFAKRCEVLTVEIEHVDV-ATLEILEQQGIDCQPRASTI
        +  + VG+LGGGQLGRM+ +AA +L+IK  IL+     PA  +     H+ G+F+D K + E A +C+VLTVEIEHVD  A +E+ +  GI   P   TI
Subjt:  VSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYY--HMVGNFDDSKTVQEFAKRCEVLTVEIEHVDV-ATLEILEQQGIDCQPRASTI

Query:  RIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGA-KKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSAIFALGGFERDLYVEKWAPFVKELSVVIARG
         +I+DKYLQK H  ++GI + E   ++    + ++ G  +G+P M+KS+ +AYDGRGN V K    +  A+  L   +R LY EKWAPF KEL+V++ R 
Subjt:  RIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGA-KKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSAIFALGGFERDLYVEKWAPFVKELSVVIARG

Query:  RDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNEVAPRPHNSGHHTIESCKTSQFEQHLRAVLG
         D  +  YP VETIH+ NICH V APA V+  ++K A  +A  AV S  GAGIF VE+FL ++G +L+NE+APRPHNSGH+TI++C TSQFE H+RA+ G
Subjt:  RDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNEVAPRPHNSGHHTIESCKTSQFEQHLRAVLG

Query:  LPL--GDPSMLTPA--AIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSMSVVENLL
        LP+      + TP+  AIM N+LG DE    F    ++  RAL    A V+ Y K   R  RKMGHI IV +SM+  E  L
Subjt:  LPL--GDPSMLTPA--AIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSMSVVENLL

Arabidopsis top hitse value%identityAlignment
AT2G37690.1 phosphoribosylaminoimidazole carboxylase, putative / AIR carboxylase, putative4.2e-18276.49Show/hide
Query:  CSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCEVLTVEIEHVDVAT
        CS SH ++     ++   VHGVSEKIVGVLGGGQLGRMLCQAAS+L+IK+ ILDP  NC AS+L+Y HMV +FDDS TV+EFAKRC VLTVEIEHVDV T
Subjt:  CSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCEVLTVEIEHVDVAT

Query:  LEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSAIFALGGFERDLYV
        LE LE+QG+DCQP+ASTIRIIQDKY+QKVHFSQHGIPLPEFM+I D+EGA+KAG++FGYPLMIKSKRLAYDGRGNAVA + +ELSSA+ ALGGF R LY+
Subjt:  LEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSAIFALGGFERDLYV

Query:  EKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNEVAPRPHNSGHHTI
        EKWAPFVKEL+V++ARGRD SM CYPVVETIH++NICHIVKAPA V W+I KLATDVA KAV SLEGAG+FAVELFLTED QILLNEVAPRPHNSGH TI
Subjt:  EKWAPFVKELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNEVAPRPHNSGHHTI

Query:  ESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSMSVVENLLASTLDE
        E C TSQFEQHLRAV+GLPLGDPSM TPA+IMYNILGED+GE GF LAH+L+ RAL I GA VHWY+KPEMRKQRKMGHIT+VG+SM ++E  L   L E
Subjt:  ESCKTSQFEQHLRAVLGLPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSMSVVENLLASTLDE

Query:  ASEK
         S +
Subjt:  ASEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCATCAACCACAGCGCAAGAATCAGACCCTACACTGCTCCCCATCGCATCATTCTACTGAAACACTTCCCCGGAAAGACGAAATTGCTGTTCATGGAGTTTCTGA
AAAAATTGTTGGTGTACTAGGAGGGGGCCAATTGGGTCGTATGCTATGTCAAGCAGCCTCCAAACTGTCCATCAAAATCGCAATTCTCGATCCACTTGTAAACTGCCCTG
CTAGTTCGTTGGCTTACTATCACATGGTTGGTAACTTTGATGACAGTAAAACTGTTCAAGAATTTGCTAAGAGATGTGAAGTATTAACAGTTGAAATTGAACATGTTGAT
GTGGCCACGCTAGAGATTCTCGAGCAGCAAGGAATTGATTGTCAACCTAGAGCATCTACCATCCGTATAATTCAGGACAAATATCTCCAAAAAGTTCATTTTTCTCAGCA
TGGAATTCCATTGCCCGAGTTCATGCAGATCGACGATCTTGAAGGTGCAAAGAAAGCAGGTGATATATTTGGTTATCCTCTCATGATTAAGAGCAAAAGATTAGCTTATG
ATGGACGTGGAAACGCTGTTGCAAAAAGTGTGGAGGAGCTTTCTTCTGCCATTTTTGCTCTAGGTGGATTTGAGCGTGACTTATATGTTGAGAAGTGGGCGCCTTTTGTG
AAGGAACTGTCGGTTGTTATTGCAAGAGGTAGAGACAACTCCATGGCATGCTATCCTGTTGTTGAAACTATTCATAAGGAAAATATTTGTCACATTGTCAAAGCTCCGGC
TAGTGTGTCATGGGAGATCAAGAAACTTGCAACCGACGTTGCATACAAAGCTGTTAGCTCTCTGGAAGGCGCTGGCATTTTTGCTGTGGAATTGTTTCTTACAGAAGATG
GTCAGATTTTACTGAATGAAGTGGCTCCTAGACCTCATAATAGTGGTCATCACACAATTGAGTCTTGCAAGACCTCTCAATTTGAGCAGCATTTGCGGGCTGTCCTTGGT
CTTCCCCTTGGAGATCCATCAATGCTAACCCCAGCTGCTATTATGTACAATATTCTAGGTGAAGATGAGGGGGAACCTGGTTTTTATCTTGCTCATCAACTGATGCTAAG
GGCATTGAGTATTTCTGGAGCTTTTGTTCATTGGTATAATAAACCGGAAATGAGAAAGCAAAGAAAGATGGGTCACATCACGATTGTTGGCCGTTCTATGAGCGTTGTTG
AAAATTTATTAGCATCAACGCTTGATGAAGCTTCTGAAAAACCTGCAGGGATCGGCCATTCTTCTGCGTTCTTTCTTTCAACGAGTGGGAGGTTAATATCACAATCATGT
ATTCTTTGCTTCAAGTGTTTTTTTGAACCTGCTAACTGTTGTATTAATTTAAGTATGGTTTCAAAGGAGCAGTTCCAATGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCCATCAACCACAGCGCAAGAATCAGACCCTACACTGCTCCCCATCGCATCATTCTACTGAAACACTTCCCCGGAAAGACGAAATTGCTGTTCATGGAGTTTCTGA
AAAAATTGTTGGTGTACTAGGAGGGGGCCAATTGGGTCGTATGCTATGTCAAGCAGCCTCCAAACTGTCCATCAAAATCGCAATTCTCGATCCACTTGTAAACTGCCCTG
CTAGTTCGTTGGCTTACTATCACATGGTTGGTAACTTTGATGACAGTAAAACTGTTCAAGAATTTGCTAAGAGATGTGAAGTATTAACAGTTGAAATTGAACATGTTGAT
GTGGCCACGCTAGAGATTCTCGAGCAGCAAGGAATTGATTGTCAACCTAGAGCATCTACCATCCGTATAATTCAGGACAAATATCTCCAAAAAGTTCATTTTTCTCAGCA
TGGAATTCCATTGCCCGAGTTCATGCAGATCGACGATCTTGAAGGTGCAAAGAAAGCAGGTGATATATTTGGTTATCCTCTCATGATTAAGAGCAAAAGATTAGCTTATG
ATGGACGTGGAAACGCTGTTGCAAAAAGTGTGGAGGAGCTTTCTTCTGCCATTTTTGCTCTAGGTGGATTTGAGCGTGACTTATATGTTGAGAAGTGGGCGCCTTTTGTG
AAGGAACTGTCGGTTGTTATTGCAAGAGGTAGAGACAACTCCATGGCATGCTATCCTGTTGTTGAAACTATTCATAAGGAAAATATTTGTCACATTGTCAAAGCTCCGGC
TAGTGTGTCATGGGAGATCAAGAAACTTGCAACCGACGTTGCATACAAAGCTGTTAGCTCTCTGGAAGGCGCTGGCATTTTTGCTGTGGAATTGTTTCTTACAGAAGATG
GTCAGATTTTACTGAATGAAGTGGCTCCTAGACCTCATAATAGTGGTCATCACACAATTGAGTCTTGCAAGACCTCTCAATTTGAGCAGCATTTGCGGGCTGTCCTTGGT
CTTCCCCTTGGAGATCCATCAATGCTAACCCCAGCTGCTATTATGTACAATATTCTAGGTGAAGATGAGGGGGAACCTGGTTTTTATCTTGCTCATCAACTGATGCTAAG
GGCATTGAGTATTTCTGGAGCTTTTGTTCATTGGTATAATAAACCGGAAATGAGAAAGCAAAGAAAGATGGGTCACATCACGATTGTTGGCCGTTCTATGAGCGTTGTTG
AAAATTTATTAGCATCAACGCTTGATGAAGCTTCTGAAAAACCTGCAGGGATCGGCCATTCTTCTGCGTTCTTTCTTTCAACGAGTGGGAGGTTAATATCACAATCATGT
ATTCTTTGCTTCAAGTGTTTTTTTGAACCTGCTAACTGTTGTATTAATTTAAGTATGGTTTCAAAGGAGCAGTTCCAATGCTAG
Protein sequenceShow/hide protein sequence
MAHQPQRKNQTLHCSPSHHSTETLPRKDEIAVHGVSEKIVGVLGGGQLGRMLCQAASKLSIKIAILDPLVNCPASSLAYYHMVGNFDDSKTVQEFAKRCEVLTVEIEHVD
VATLEILEQQGIDCQPRASTIRIIQDKYLQKVHFSQHGIPLPEFMQIDDLEGAKKAGDIFGYPLMIKSKRLAYDGRGNAVAKSVEELSSAIFALGGFERDLYVEKWAPFV
KELSVVIARGRDNSMACYPVVETIHKENICHIVKAPASVSWEIKKLATDVAYKAVSSLEGAGIFAVELFLTEDGQILLNEVAPRPHNSGHHTIESCKTSQFEQHLRAVLG
LPLGDPSMLTPAAIMYNILGEDEGEPGFYLAHQLMLRALSISGAFVHWYNKPEMRKQRKMGHITIVGRSMSVVENLLASTLDEASEKPAGIGHSSAFFLSTSGRLISQSC
ILCFKCFFEPANCCINLSMVSKEQFQC