; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G010120 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G010120
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionG-box-binding factor 4-like isoform X1
Genome locationCmo_Chr01:7373490..7378502
RNA-Seq ExpressionCmoCh01G010120
SyntenyCmoCh01G010120
Gene Ontology termsGO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR043452 - Plant bZIP transcription factors


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607637.1 G-box-binding factor 4, partial [Cucurbita argyrosperma subsp. sororia]7.6e-10398.98Show/hide
Query:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG
        MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQED+R+LNPLNCFPQFEEEMIVGFGNGGEISGRSG
Subjt:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG

Query:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC
        KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC
Subjt:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC

KAG7037241.1 G-box-binding factor 4, partial [Cucurbita argyrosperma subsp. argyrosperma]4.4e-10399.49Show/hide
Query:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG
        MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQED+RILNPLNCFPQFEEEMIVGFGNGGEISGRSG
Subjt:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG

Query:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC
        KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC
Subjt:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC

XP_022926188.1 G-box-binding factor 4-like isoform X1 [Cucurbita moschata]3.4e-103100Show/hide
Query:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG
        MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG
Subjt:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG

Query:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC
        KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC
Subjt:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC

XP_022981612.1 G-box-binding factor 4-like isoform X1 [Cucurbita maxima]1.5e-9896.43Show/hide
Query:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG
        MKK NQIFRTAAAAKVVANPIHMSAMDMDSKL GPST S GVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEE+IVGFGNGGEISGRSG
Subjt:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG

Query:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC
        KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPP++LCPGHSFEC
Subjt:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC

XP_023521221.1 G-box-binding factor 4-like isoform X1 [Cucurbita pepo subsp. pepo]5.4e-10197.5Show/hide
Query:  MKKPNQIFRTAAA----AKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEIS
        MKKPNQIFRTAAA    AKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEIS
Subjt:  MKKPNQIFRTAAA----AKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEIS

Query:  GRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC
        GRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPR LCPGHSFEC
Subjt:  GRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC

TrEMBL top hitse value%identityAlignment
A0A1S3CH58 G-box-binding factor 4-like isoform X36.2e-4256.56Show/hide
Query:  MKKPNQIFRT-----AAAAKVVANPIHMSAMD-----MD-----SKLDG------PSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFP
        MKK NQIFRT     AAAAK      H S +D     +D     S LDG      PS+ S  VDD+WR+   E +            ED+ ILNPL+C  
Subjt:  MKKPNQIFRT-----AAAAKVVANPIHMSAMD-----MD-----SKLDG------PSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFP

Query:  QF-----EEEMIVGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMET
         F     E++  VGFGN  +I  R GKRRR  MEPMD+AALQRQRRMIKNRESAARSRERK AHQ+ELE IA+RLEEEN RLLK+KAER KERLKQLM  
Subjt:  QF-----EEEMIVGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMET

Query:  VIPVVEKRRPPRVLCPGHSFE
        VIPV+EK+R P+V+C G SFE
Subjt:  VIPVVEKRRPPRVLCPGHSFE

A0A6J1EEF8 G-box-binding factor 4-like isoform X25.7e-9694.9Show/hide
Query:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG
        MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG
Subjt:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG

Query:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC
        KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQK          LMETVIPVVEKRRPPRVLCPGHSFEC
Subjt:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC

A0A6J1EHB8 G-box-binding factor 4-like isoform X11.6e-103100Show/hide
Query:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG
        MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG
Subjt:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG

Query:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC
        KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC
Subjt:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC

A0A6J1IUG5 G-box-binding factor 4-like isoform X17.2e-9996.43Show/hide
Query:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG
        MKK NQIFRTAAAAKVVANPIHMSAMDMDSKL GPST S GVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEE+IVGFGNGGEISGRSG
Subjt:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG

Query:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC
        KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPP++LCPGHSFEC
Subjt:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC

A0A6J1J2K3 G-box-binding factor 4-like isoform X22.5e-9191.33Show/hide
Query:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG
        MKK NQIFRTAAAAKVVANPIHMSAMDMDSKL GPST S GVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEE+IVGFGNGGEISGRSG
Subjt:  MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSG

Query:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC
        KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQK          LMETVIPVVEKRRPP++LCPGHSFEC
Subjt:  KRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC

SwissProt top hitse value%identityAlignment
P42777 G-box-binding factor 42.7e-2644.39Show/hide
Query:  ANPIHMSAMDMDSKLDGPSTPSGG--VDDVW-------------REKAVEEMMRWEDFI-------GVKAQEDVRI----LN---------PLNCFPQFE
        + P  M A+D+D  +   ++ + G  VDDVW             +E+  E++M  EDF+       G   + DV+I    LN         P+     F 
Subjt:  ANPIHMSAMDMDSKLDGPSTPSGG--VDDVW-------------REKAVEEMMRWEDFI-------GVKAQEDVRI----LN---------PLNCFPQFE

Query:  EEMIVGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKR
         +M+ G   GG      GKR R  ME MD+AA QRQ+RMIKNRESAARSRERK A+QVELE +AA+LEEEN +LLK+  E  KER K+LME +IPV EK 
Subjt:  EEMIVGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKR

Query:  RPP-RVLCPGHSFE
        RPP R L   HS E
Subjt:  RPP-RVLCPGHSFE

Q0JHF1 bZIP transcription factor 121.4e-1937.23Show/hide
Query:  AAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISG---RSGKRRRAPM
        AAAA  V  P   +A         P+  +G       E  +E+ +  E   G   +++  + +P       + ++++GF NG E++G       R+R  M
Subjt:  AAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISG---RSGKRRRAPM

Query:  EPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFE
        +PMD AA+QRQ+RMIKNRESAARSRERK A+  ELE +  +LEEEN ++ K++ E+ ++RLK+L E V+PV+ ++   R L   +S E
Subjt:  EPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFE

Q9C5Q2 ABSCISIC ACID-INSENSITIVE 5-like protein 35.2e-0644.58Show/hide
Query:  FPQFEEEMIVGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQK
        +P  E + +V  G   +     G R+R   E +++   +RQ+RMIKNRESAARSR RK A+  ELE+  +RLEEEN +L + K
Subjt:  FPQFEEEMIVGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQK

Q9LES3 ABSCISIC ACID-INSENSITIVE 5-like protein 24.3e-0847.19Show/hide
Query:  GKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVL
        G++R A  E +++   +RQ+RMIKNRESAARSR RK A+  ELE+  +RLEEEN RL KQK           +E ++P V    P R L
Subjt:  GKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVL

Q9SJN0 Protein ABSCISIC ACID-INSENSITIVE 51.4e-0641.84Show/hide
Query:  VGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRP
        +G   G ++ G  G R+R    P+++   +RQRRMIKNRESAARSR RK A+ VELE    +L+EEN +L    AE  ++R +Q  E++    + + P
Subjt:  VGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRP

Arabidopsis top hitse value%identityAlignment
AT1G03970.1 G-box binding factor 41.9e-2744.39Show/hide
Query:  ANPIHMSAMDMDSKLDGPSTPSGG--VDDVW-------------REKAVEEMMRWEDFI-------GVKAQEDVRI----LN---------PLNCFPQFE
        + P  M A+D+D  +   ++ + G  VDDVW             +E+  E++M  EDF+       G   + DV+I    LN         P+     F 
Subjt:  ANPIHMSAMDMDSKLDGPSTPSGG--VDDVW-------------REKAVEEMMRWEDFI-------GVKAQEDVRI----LN---------PLNCFPQFE

Query:  EEMIVGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKR
         +M+ G   GG      GKR R  ME MD+AA QRQ+RMIKNRESAARSRERK A+QVELE +AA+LEEEN +LLK+  E  KER K+LME +IPV EK 
Subjt:  EEMIVGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKR

Query:  RPP-RVLCPGHSFE
        RPP R L   HS E
Subjt:  RPP-RVLCPGHSFE

AT2G36270.1 Basic-leucine zipper (bZIP) transcription factor family protein9.8e-0841.84Show/hide
Query:  VGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRP
        +G   G ++ G  G R+R    P+++   +RQRRMIKNRESAARSR RK A+ VELE    +L+EEN +L    AE  ++R +Q  E++    + + P
Subjt:  VGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRP

AT2G41070.1 Basic-leucine zipper (bZIP) transcription factor family protein3.7e-0744.58Show/hide
Query:  FPQFEEEMIVGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQK
        +P  E + +V  G   +     G R+R   E +++   +RQ+RMIKNRESAARSR RK A+  ELE+  +RLEEEN +L + K
Subjt:  FPQFEEEMIVGFGNGGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQK

AT3G56850.1 ABA-responsive element binding protein 33.0e-0947.19Show/hide
Query:  GKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVL
        G++R A  E +++   +RQ+RMIKNRESAARSR RK A+  ELE+  +RLEEEN RL KQK           +E ++P V    P R L
Subjt:  GKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVL

AT5G44080.1 Basic-leucine zipper (bZIP) transcription factor family protein1.0e-2845.15Show/hide
Query:  SAMDMDSKLDGPSTPSGG--VDDVWR-----------EKAVEEMMRWEDFIGVKAQED--------------VRILN--------PLNCFPQFE--EEMI
        +A D+     G  T  GG  VD++WR           E+  EE+M  EDF+   A ED              + + N        P N F   +  E  I
Subjt:  SAMDMDSKLDGPSTPSGG--VDDVWR-----------EKAVEEMMRWEDFIGVKAQED--------------VRILN--------PLNCFPQFE--EEMI

Query:  VGFGNGGEI--SGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVE--KR
        V FGNG ++   G  GKR R  +EP+D+AA QRQRRMIKNRESAARSRERK A+QVELE +AA+LEEEN  L K+  ++ KER ++LME VIPVVE  K+
Subjt:  VGFGNGGEI--SGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVE--KR

Query:  RPPRVL
        +PPR L
Subjt:  RPPRVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAACCAAATCAGATATTTAGAACAGCAGCAGCAGCAAAAGTAGTAGCTAATCCCATTCATATGTCCGCCATGGATATGGATAGTAAGCTTGATGGGCCATCTAC
TCCCTCTGGAGGTGTGGACGATGTTTGGAGGGAGAAGGCGGTTGAGGAGATGATGAGATGGGAGGATTTTATTGGAGTGAAAGCGCAGGAGGATGTACGGATCTTGAATC
CCTTGAATTGTTTCCCCCAATTTGAAGAGGAGATGATTGTCGGGTTTGGGAATGGCGGTGAAATTAGTGGGAGATCGGGGAAGAGAAGGCGCGCCCCCATGGAGCCCATG
GATGAGGCTGCCTTGCAAAGACAACGGAGGATGATTAAGAACAGGGAGTCCGCTGCCAGATCCAGAGAAAGGAAACATGCACATCAAGTTGAGTTAGAGTTAATAGCTGC
CCGACTTGAGGAAGAGAACCACCGATTATTGAAACAGAAGGCTGAGAGAGGGAAGGAACGACTAAAGCAGCTGATGGAAACAGTGATCCCAGTTGTGGAGAAACGAAGAC
CGCCGCGAGTCCTATGTCCGGGTCACTCCTTCGAATGCTAG
mRNA sequenceShow/hide mRNA sequence
AGAAGATCTGCGGGGAATCAATCGAGCATAATGGCGCCACAGAATTCGAGAGATTCGCAACTGTCGCCTTCTTTTTCTTCTTCCTTCCTTTCTCCTCCTCCTCTTCTGTT
CTTGACGCACACAAATTTCGGATCCATCTCCGATGAAGAAACCAAATCAGATATTTAGAACAGCAGCAGCAGCAAAAGTAGTAGCTAATCCCATTCATATGTCCGCCATG
GATATGGATAGTAAGCTTGATGGGCCATCTACTCCCTCTGGAGGTGTGGACGATGTTTGGAGGGAGAAGGCGGTTGAGGAGATGATGAGATGGGAGGATTTTATTGGAGT
GAAAGCGCAGGAGGATGTACGGATCTTGAATCCCTTGAATTGTTTCCCCCAATTTGAAGAGGAGATGATTGTCGGGTTTGGGAATGGCGGTGAAATTAGTGGGAGATCGG
GGAAGAGAAGGCGCGCCCCCATGGAGCCCATGGATGAGGCTGCCTTGCAAAGACAACGGAGGATGATTAAGAACAGGGAGTCCGCTGCCAGATCCAGAGAAAGGAAACAT
GCACATCAAGTTGAGTTAGAGTTAATAGCTGCCCGACTTGAGGAAGAGAACCACCGATTATTGAAACAGAAGGCTGAGAGAGGGAAGGAACGACTAAAGCAGCTGATGGA
AACAGTGATCCCAGTTGTGGAGAAACGAAGACCGCCGCGAGTCCTATGTCCGGGTCACTCCTTCGAATGCTAGCCTAAATGTCACACCAACACACCAACACACCAACACC
AATATCCTTTTCAAATCAAGCTCCTCTAAATGTTTGAGTTCAGAT
Protein sequenceShow/hide protein sequence
MKKPNQIFRTAAAAKVVANPIHMSAMDMDSKLDGPSTPSGGVDDVWREKAVEEMMRWEDFIGVKAQEDVRILNPLNCFPQFEEEMIVGFGNGGEISGRSGKRRRAPMEPM
DEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEEENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFEC