; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0042187 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0042187
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionComponent of oligomeric Golgi complex 3
Genome locationchr13:38257148..38265773
RNA-Seq ExpressionLag0042187
SyntenyLag0042187
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006886 - intracellular protein transport (biological process)
GO:0006891 - intra-Golgi vesicle-mediated transport (biological process)
GO:0007030 - Golgi organization (biological process)
GO:0009860 - pollen tube growth (biological process)
GO:0005801 - cis-Golgi network (cellular component)
GO:0016020 - membrane (cellular component)
GO:0017119 - Golgi transport complex (cellular component)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0042803 - protein homodimerization activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008094 - DNA-dependent ATPase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR036397 - Ribonuclease H superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR012337 - Ribonuclease H-like superfamily
IPR007265 - Conserved oligomeric Golgi complex, subunit 3
IPR002156 - Ribonuclease H domain
IPR000679 - Zinc finger, GATA-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596156.1 Conserved oligomeric Golgi complex subunit 3, partial [Cucurbita argyrosperma subsp. sororia]9.8e-8395.81Show/hide
Query:  SASMAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWF
        S SMAAKA+PLGLPKSGAISKGYNFAS WEQNAPLTEQQQAAIATLCHAVAERP P DLAQDRIGGKENALSISVKDT+NEDSDAIEAVLVNTNQFYKWF
Subjt:  SASMAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWF

Query:  SDLESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        SDLESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVD+TLDLFNELQLQHQAVATKTRTLHDACDRL
Subjt:  SDLESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

KAG7027701.1 Conserved oligomeric Golgi complex subunit 3 [Cucurbita argyrosperma subsp. argyrosperma]4.9e-8296.34Show/hide
Query:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL
        MAAKA+PLGLPKSGAISKGYNFAS WEQNAPLTEQQQAAIATLCHAVAERP P DLAQDRIGGKENALSISVKDT+NEDSDAIEAVLVNTNQFYKWFSDL
Subjt:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL

Query:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVD+TLDLFNELQLQHQAVATKTRTLHDACDRL
Subjt:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

XP_022159147.1 conserved oligomeric Golgi complex subunit 3 isoform X1 [Momordica charantia]8.3e-8296.34Show/hide
Query:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL
        MAAKA PLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDR G KENALSISVKDTTNEDSD IEAVLVNTNQFYKWFSDL
Subjt:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL

Query:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        ESAMKSETEEKYHHYLNSLTDRIRTCDDIL QVDDTLDLF+ELQLQHQAVATKTRTLHDACDRL
Subjt:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

XP_023539647.1 conserved oligomeric Golgi complex subunit 3 [Cucurbita pepo subsp. pepo]4.9e-8296.34Show/hide
Query:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL
        MAAKA+PLGLPKSGAISKGYNFAS WEQNAPLTEQQQAAIATLCHAVAERP P DLAQDRIGGKENALSISVKDT+NEDSDAIEAVLVNTNQFYKWFSDL
Subjt:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL

Query:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVD+TLDLFNELQLQHQAVATKTRTLHDACDRL
Subjt:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

XP_038892000.1 conserved oligomeric Golgi complex subunit 3 [Benincasa hispida]2.4e-8195.73Show/hide
Query:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL
        MAAK APLGLPKSGAISKGYNFASTWEQNAPLTEQQQAA+ATLCHA+AERPFPVDLAQDRI GKENALSISVKDT +EDSDAIEAVLVNTNQFYKWFSDL
Subjt:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL

Query:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        ESAMKSETEEKYHHYLNSLTDRIRTCD ILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
Subjt:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

TrEMBL top hitse value%identityAlignment
A0A5A7UE42 Component of oligomeric Golgi complex 33.4e-8196.34Show/hide
Query:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL
        MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATL HAVAERPFPVDLAQDRIGGKENALSISVK+TTN+DSDA+EAVLVNTNQFYKWFSDL
Subjt:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL

Query:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        ESAMKSETEEKYHHYLNSLTDRIRTCD ILRQVDDTL LFNELQLQHQAVATKTRTLHDACDRL
Subjt:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

A0A5D3CUT0 Component of oligomeric Golgi complex 33.4e-8196.34Show/hide
Query:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL
        MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATL HAVAERPFPVDLAQDRIGGKENALSISVK+TTN+DSDA+EAVLVNTNQFYKWFSDL
Subjt:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL

Query:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        ESAMKSETEEKYHHYLNSLTDRIRTCD ILRQVDDTL LFNELQLQHQAVATKTRTLHDACDRL
Subjt:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

A0A6J1DZ07 Component of oligomeric Golgi complex 34.0e-8296.34Show/hide
Query:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL
        MAAKA PLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDR G KENALSISVKDTTNEDSD IEAVLVNTNQFYKWFSDL
Subjt:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL

Query:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        ESAMKSETEEKYHHYLNSLTDRIRTCDDIL QVDDTLDLF+ELQLQHQAVATKTRTLHDACDRL
Subjt:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

A0A6J1FRH7 Component of oligomeric Golgi complex 31.5e-8195.73Show/hide
Query:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL
        MAAKA+PLGLPKSGAISKGYNFAS WEQNAPLTEQQQAAIATLCHAVAERP P DLAQDRIGGKENALSISVKDT+NEDSDAIEAVLVNTNQFYKWFSDL
Subjt:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL

Query:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        ESAMKSETEEKYHHYLNSLTDRI TCDDILRQVD+TLDLFNELQLQHQAVATKTRTLHDACDRL
Subjt:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

A0A6J1I362 Component of oligomeric Golgi complex 33.4e-8195.12Show/hide
Query:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL
        MAA A+PLGLPKSGAISKGYNFAS WEQNAPLTEQQQAAIATLCHAVAERP P DLAQDRIGGKENALSISVKDT+NEDSDAIEAVLVNTNQFYKWFSDL
Subjt:  MAAKAAPLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDL

Query:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        ESAMKSETEEKYHHYLNSLTDRIRTCDDIL QVD+TLDLFNELQLQHQAVATKTRTLHDACDRL
Subjt:  ESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

SwissProt top hitse value%identityAlignment
F4HQ84 Conserved oligomeric Golgi complex subunit 31.6e-6476.36Show/hide
Query:  MAAKAA-PLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSD
        MA KAA    LPKSGAISKGYNFASTWEQ+APLTEQQQAAI +L HAVAERPFP +L  + +   EN LS+SV+DT   DS AIEAVLVNTNQFYKWF+D
Subjt:  MAAKAA-PLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSD

Query:  LESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        LESAMKSETEEKY HY+++LT+RI+TCD+IL QVD+TLDLFNELQLQHQ V TKT+TLHDACDRL
Subjt:  LESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

P0C2F6 Putative ribonuclease H protein At1g657507.0e-0737.5Show/hide
Query:  NQRSWWNTLWKTKVPSKIKLFIWKAYHDCLPTNLGLWRRGMDVSNVCNICKVETERIDHALCGC
        N  S++N LWK +VP ++K F+W   +  + T     RR +  SNVC +CK   E + H L  C
Subjt:  NQRSWWNTLWKTKVPSKIKLFIWKAYHDCLPTNLGLWRRGMDVSNVCNICKVETERIDHALCGC

Q16ZN9 Conserved oligomeric Golgi complex subunit 32.9e-0527.97Show/hide
Query:  WEQN----APLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDLESAMKSETEEKYHHYLNSLTD
        WEQ     APL+  Q   I  L  ++     P   A   +  +E  LS+  K TT  D     +V+ +T  F  W++ ++S +    ++ Y  Y   L  
Subjt:  WEQN----APLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDLESAMKSETEEKYHHYLNSLTD

Query:  RIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        R   CD +L ++D +L+   +L  +++ V+ KT +LH A + L
Subjt:  RIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein1.3e-0523.84Show/hide
Query:  KIKLFIWKAYHDCLPTNLGLWRRGMDVSNVCNICKVETERIDHALCGCERSKKICDFMFTREDSDISVANNFADRLVCLASRLSTKEFETLCITL-----
        KIKLF+WKA    LP    L RR +  +  C  C    E   H L  C+ + ++ + +   +   I +     + L  L   +          TL     
Subjt:  KIKLFIWKAYHDCLPTNLGLWRRGMDVSNVCNICKVETERIDHALCGCERSKKICDFMFTREDSDISVANNFADRLVCLASRLSTKEFETLCITL-----

Query:  WAIWNDKNCFLLQ--------------KPIMEWSSRCEWINQYWEDTRRPNREVDLFTNSEAGTHEPL-YVDIAARKENSVVGVTHVFSDAAVTPNEIGV
        W IW  +N  + Q              +  + W S    + +    T RPN    L       TH+ L YVD A ++++S+ G   VF   + +  EI  
Subjt:  WAIWNDKNCFLLQ--------------KPIMEWSSRCEWINQYWEDTRRPNREVDLFTNSEAGTHEPL-YVDIAARKENSVVGVTHVFSDAAVTPNEIGV

Query:  GMGVVIRDAAGRIQGAMHSFKARLLSPLATE------VQDSLLAIKIINKEQEPIWEVQHWITQIQEMSQYFSKLYFVHIGREYNKRADALAKKALRDSQ
              R     +     + K+ +L  L  E      + DS   +  +N     + E+   + +I+ +   F  + F  I R  N  ADA AK +L  S 
Subjt:  GMGVVIRDAAGRIQGAMHSFKARLLSPLATE------VQDSLLAIKIINKEQEPIWEVQHWITQIQEMSQYFSKLYFVHIGREYNKRADALAKKALRDSQ

Query:  SI
        +I
Subjt:  SI

AT1G73430.1 sec34-like family protein1.2e-6576.36Show/hide
Query:  MAAKAA-PLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSD
        MA KAA    LPKSGAISKGYNFASTWEQ+APLTEQQQAAI +L HAVAERPFP +L  + +   EN LS+SV+DT   DS AIEAVLVNTNQFYKWF+D
Subjt:  MAAKAA-PLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSD

Query:  LESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        LESAMKSETEEKY HY+++LT+RI+TCD+IL QVD+TLDLFNELQLQHQ V TKT+TLHDACDRL
Subjt:  LESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

AT1G73430.2 sec34-like family protein1.2e-6576.36Show/hide
Query:  MAAKAA-PLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSD
        MA KAA    LPKSGAISKGYNFASTWEQ+APLTEQQQAAI +L HAVAERPFP +L  + +   EN LS+SV+DT   DS AIEAVLVNTNQFYKWF+D
Subjt:  MAAKAA-PLGLPKSGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSD

Query:  LESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL
        LESAMKSETEEKY HY+++LT+RI+TCD+IL QVD+TLDLFNELQLQHQ V TKT+TLHDACDRL
Subjt:  LESAMKSETEEKYHHYLNSLTDRIRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRL

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.1e-0629.13Show/hide
Query:  SWWNTLWKTKVPSKIKLFIWKAYHDCLPTNLGLWRRGMDVSNVCNICKVETERIDHALCGCERSKKICDFMFTREDSDISVANNFADRLVCLASRLSTKE
        +W   +W  K+  KIKL IWKA ++ LP    L  R + +   C  C+ + E I H L  C                       FA R V + S +  KE
Subjt:  SWWNTLWKTKVPSKIKLFIWKAYHDCLPTNLGLWRRGMDVSNVCNICKVETERIDHALCGCERSKKICDFMFTREDSDISVANNFADRLVCLASRLSTKE

Query:  FET
        ++T
Subjt:  FET

AT4G29090.1 Ribonuclease H-like superfamily protein1.1e-0420.45Show/hide
Query:  WNTLWKTKVPSKIKLFIWKAYHDCLPTNLGLWRRGMDVSNVCNICKVETERIDHALCGCERSKKICDFMFTREDSDISVANNFADRL---------VCLA
        +  +WK++   KI+ F+WK   + LP    L  R +   + C  C    E ++H L  C  ++      +      I +   +AD +         +   
Subjt:  WNTLWKTKVPSKIKLFIWKAYHDCLPTNLGLWRRGMDVSNVCNICKVETERIDHALCGCERSKKICDFMFTREDSDISVANNFADRL---------VCLA

Query:  SRLSTKEFETLCITLWAIWNDKNCFLLQKPIMEWSSRCEWINQYWEDTRRPNREVDLFTNSEA-GTHEPLYVDIAARKENSVVGVTHVFSDAAVTPNEIG
        +    K  + +   LW +W ++N  + +    E++++ E + +  +D      E  + T +E+ GT   +      R            +DA    +   
Subjt:  SRLSTKEFETLCITLWAIWNDKNCFLLQKPIMEWSSRCEWINQYWEDTRRPNREVDLFTNSEA-GTHEPLYVDIAARKENSVVGVTHVFSDAAVTPNEIG

Query:  VGMGVVIRDAAGRIQGAMHSFKARLLSPLATEVQ---------------------DSLLAIKIINKEQEPIW-EVQHWITQIQEMSQYFSKLYFVHIGRE
         G+G V+R+  G ++        +L S L  E++                     DS + I+I+N ++  IW  ++  I  +Q +   F+++ FV I RE
Subjt:  VGMGVVIRDAAGRIQGAMHSFKARLLSPLATEVQ---------------------DSLLAIKIINKEQEPIW-EVQHWITQIQEMSQYFSKLYFVHIGRE

Query:  YNKRADALAKKAL
         N  A+ +A+++L
Subjt:  YNKRADALAKKAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCTTCCGCTGCGTGGGGACTTTTATTCCCTTCACTACCAACCAACGATCTTGGTGGAATACTCTTTGGAAGACTAAGGTTCCCTCCAAGATTAAGCTATTCATTTG
GAAGGCTTACCATGACTGCTTACCAACAAATTTGGGTCTTTGGAGACGTGGTATGGATGTTTCAAATGTGTGCAACATTTGCAAAGTAGAGACGGAAAGAATTGACCATG
CTCTATGTGGGTGTGAACGTTCAAAGAAAATTTGTGATTTTATGTTCACACGTGAGGACTCTGATATTAGTGTTGCTAACAACTTTGCGGACCGATTGGTATGTTTAGCG
AGTCGCCTTAGCACCAAGGAGTTTGAAACATTATGTATTACTCTTTGGGCAATTTGGAATGATAAGAATTGTTTCCTATTACAAAAACCCATAATGGAGTGGTCCTCACG
GTGTGAGTGGATAAACCAGTATTGGGAGGACACAAGGAGGCCGAATAGGGAGGTGGATTTATTTACCAATTCTGAGGCTGGAACCCATGAGCCGTTGTATGTGGATATTG
CTGCGAGGAAGGAGAATTCGGTGGTGGGTGTCACCCATGTTTTCTCGGATGCTGCTGTGACACCTAACGAGATTGGCGTTGGCATGGGTGTGGTTATACGAGATGCTGCC
GGACGAATACAAGGTGCTATGCATAGCTTTAAGGCCCGATTGCTATCTCCCTTAGCGACAGAGGTGCAAGATTCCCTTCTTGCTATTAAGATAATCAATAAGGAGCAGGA
GCCGATTTGGGAGGTTCAACATTGGATTACTCAAATCCAGGAGATGAGTCAATATTTTTCGAAGCTGTATTTTGTCCATATTGGAAGAGAGTACAATAAGAGGGCTGATG
CTTTAGCAAAGAAGGCTCTTAGAGATTCACAATCTATTTTATGGTTGTCCCACGTTCCAACGTGTTCGGCCTCCATGGCTGCCAAGGCCGCCCCTCTTGGTTTACCAAAG
TCCGGTGCAATTTCCAAGGGTTACAATTTTGCTTCTACCTGGGAGCAGAATGCTCCTCTAACGGAGCAACAGCAAGCGGCGATTGCGACGCTCTGTCATGCTGTCGCAGA
GCGACCGTTTCCCGTTGATCTGGCACAAGACCGTATAGGTGGCAAGGAAAATGCCTTGTCTATTTCGGTTAAGGATACCACCAACGAAGATTCTGATGCTATTGAAGCCG
TTTTGGTCAATACCAATCAGTTCTACAAATGGTTTTCTGATCTTGAATCAGCCATGAAATCCGAGACAGAGGAGAAATACCACCACTACTTGAACTCTTTAACAGATCGC
ATACGAACATGTGATGATATACTTCGTCAGGTCGATGATACGTTGGACTTATTTAACGAACTACAATTGCAACATCAAGCTGTGGCAACAAAGACTAGAACACTTCATGA
TGCATGTGATAGACTGAGGGAGTTATGTGCAGGGGTAGCTATGCGAAGGATCATTTCTAAGGGGGTGAGGGGTAGCATTCTTAAGGGCTTTCAGGGGGAGGATAGGGTGG
TTCTATCTCATCTCCAGTTCGCTGATGATATGATGTTTTTCTGTTATGGTAAAGGGGAATCCTTTCGTAATTTGAATCTTGTCCTCTCGTTTTTTGAGGCCATTTCAGGA
GGGAAGCTGAGGATCTGGACCACTTGCTTTGGGGTTGTAAGTTTGCTTCGGCAGATGACGTTCGAGGAGCTTTCTCTTTATCCGCCCTTCCGTGAAGAGGGCGCTTTCTA
TGGCAGGCTGGGGCAGAGGAAGATTTGGACCATATCCTATGGAAGTGCCAGTTTGCCCATTCGGTGTGGAGTTTCTTTTATGATGCGTTCGGGATTCAGGCGAGACGTTC
AGAGACTACAGGGAAATGATCCAGGAGTTCCTCCTCCATCCGCCTTTTCGCGATAA
mRNA sequenceShow/hide mRNA sequence
ATGCGCTTCCGCTGCGTGGGGACTTTTATTCCCTTCACTACCAACCAACGATCTTGGTGGAATACTCTTTGGAAGACTAAGGTTCCCTCCAAGATTAAGCTATTCATTTG
GAAGGCTTACCATGACTGCTTACCAACAAATTTGGGTCTTTGGAGACGTGGTATGGATGTTTCAAATGTGTGCAACATTTGCAAAGTAGAGACGGAAAGAATTGACCATG
CTCTATGTGGGTGTGAACGTTCAAAGAAAATTTGTGATTTTATGTTCACACGTGAGGACTCTGATATTAGTGTTGCTAACAACTTTGCGGACCGATTGGTATGTTTAGCG
AGTCGCCTTAGCACCAAGGAGTTTGAAACATTATGTATTACTCTTTGGGCAATTTGGAATGATAAGAATTGTTTCCTATTACAAAAACCCATAATGGAGTGGTCCTCACG
GTGTGAGTGGATAAACCAGTATTGGGAGGACACAAGGAGGCCGAATAGGGAGGTGGATTTATTTACCAATTCTGAGGCTGGAACCCATGAGCCGTTGTATGTGGATATTG
CTGCGAGGAAGGAGAATTCGGTGGTGGGTGTCACCCATGTTTTCTCGGATGCTGCTGTGACACCTAACGAGATTGGCGTTGGCATGGGTGTGGTTATACGAGATGCTGCC
GGACGAATACAAGGTGCTATGCATAGCTTTAAGGCCCGATTGCTATCTCCCTTAGCGACAGAGGTGCAAGATTCCCTTCTTGCTATTAAGATAATCAATAAGGAGCAGGA
GCCGATTTGGGAGGTTCAACATTGGATTACTCAAATCCAGGAGATGAGTCAATATTTTTCGAAGCTGTATTTTGTCCATATTGGAAGAGAGTACAATAAGAGGGCTGATG
CTTTAGCAAAGAAGGCTCTTAGAGATTCACAATCTATTTTATGGTTGTCCCACGTTCCAACGTGTTCGGCCTCCATGGCTGCCAAGGCCGCCCCTCTTGGTTTACCAAAG
TCCGGTGCAATTTCCAAGGGTTACAATTTTGCTTCTACCTGGGAGCAGAATGCTCCTCTAACGGAGCAACAGCAAGCGGCGATTGCGACGCTCTGTCATGCTGTCGCAGA
GCGACCGTTTCCCGTTGATCTGGCACAAGACCGTATAGGTGGCAAGGAAAATGCCTTGTCTATTTCGGTTAAGGATACCACCAACGAAGATTCTGATGCTATTGAAGCCG
TTTTGGTCAATACCAATCAGTTCTACAAATGGTTTTCTGATCTTGAATCAGCCATGAAATCCGAGACAGAGGAGAAATACCACCACTACTTGAACTCTTTAACAGATCGC
ATACGAACATGTGATGATATACTTCGTCAGGTCGATGATACGTTGGACTTATTTAACGAACTACAATTGCAACATCAAGCTGTGGCAACAAAGACTAGAACACTTCATGA
TGCATGTGATAGACTGAGGGAGTTATGTGCAGGGGTAGCTATGCGAAGGATCATTTCTAAGGGGGTGAGGGGTAGCATTCTTAAGGGCTTTCAGGGGGAGGATAGGGTGG
TTCTATCTCATCTCCAGTTCGCTGATGATATGATGTTTTTCTGTTATGGTAAAGGGGAATCCTTTCGTAATTTGAATCTTGTCCTCTCGTTTTTTGAGGCCATTTCAGGA
GGGAAGCTGAGGATCTGGACCACTTGCTTTGGGGTTGTAAGTTTGCTTCGGCAGATGACGTTCGAGGAGCTTTCTCTTTATCCGCCCTTCCGTGAAGAGGGCGCTTTCTA
TGGCAGGCTGGGGCAGAGGAAGATTTGGACCATATCCTATGGAAGTGCCAGTTTGCCCATTCGGTGTGGAGTTTCTTTTATGATGCGTTCGGGATTCAGGCGAGACGTTC
AGAGACTACAGGGAAATGATCCAGGAGTTCCTCCTCCATCCGCCTTTTCGCGATAA
Protein sequenceShow/hide protein sequence
MRFRCVGTFIPFTTNQRSWWNTLWKTKVPSKIKLFIWKAYHDCLPTNLGLWRRGMDVSNVCNICKVETERIDHALCGCERSKKICDFMFTREDSDISVANNFADRLVCLA
SRLSTKEFETLCITLWAIWNDKNCFLLQKPIMEWSSRCEWINQYWEDTRRPNREVDLFTNSEAGTHEPLYVDIAARKENSVVGVTHVFSDAAVTPNEIGVGMGVVIRDAA
GRIQGAMHSFKARLLSPLATEVQDSLLAIKIINKEQEPIWEVQHWITQIQEMSQYFSKLYFVHIGREYNKRADALAKKALRDSQSILWLSHVPTCSASMAAKAAPLGLPK
SGAISKGYNFASTWEQNAPLTEQQQAAIATLCHAVAERPFPVDLAQDRIGGKENALSISVKDTTNEDSDAIEAVLVNTNQFYKWFSDLESAMKSETEEKYHHYLNSLTDR
IRTCDDILRQVDDTLDLFNELQLQHQAVATKTRTLHDACDRLRELCAGVAMRRIISKGVRGSILKGFQGEDRVVLSHLQFADDMMFFCYGKGESFRNLNLVLSFFEAISG
GKLRIWTTCFGVVSLLRQMTFEELSLYPPFREEGAFYGRLGQRKIWTISYGSASLPIRCGVSFMMRSGFRRDVQRLQGNDPGVPPPSAFSR