; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017566 (gene) of Snake gourd v1 genome

Gene IDTan0017566
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA replication complex GINS protein PSF1-like
Genome locationLG06:2956971..2960901
RNA-Seq ExpressionTan0017566
SyntenyTan0017566
Gene Ontology termsGO:1902983 - DNA strand elongation involved in mitotic DNA replication (biological process)
GO:0000811 - GINS complex (cellular component)
InterPro domainsIPR005339 - GINS complex, subunit Psf1
IPR021151 - GINS subunit, domain A
IPR036224 - GINS, helical bundle-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135842.1 probable DNA replication complex GINS protein PSF1 [Cucumis sativus]5.7e-9892.42Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI
        MYGRKACQLVKELAS EKGQLTHFNSDLFEQVISECQQHHLELQSL+RK+QEEGLDLQTTKNEDHFGAL+HHLALVRNKRCLMAYV+NRAEIIRSL+WK+
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI

Query:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEEL
        +G  +PP IQEKLSNSE+EYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFA  SIHFLKRTDAEQYISRGLMEEL
Subjt:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEEL

XP_008461089.1 PREDICTED: probable DNA replication complex GINS protein PSF1 [Cucumis melo]7.4e-9892.46Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI
        MYGRKACQLV ELAS EKGQLTHFNSDLFEQVISECQQHHLELQSL+RK+QEEGLDLQTTKNEDHFGAL+HHLALVRNKRCLMAYV+NRAE IRSLIWK+
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI

Query:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR
        LG  +PP IQEKLSNSE+EYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFA  SIHFLKRTDAEQYISRGLMEELR
Subjt:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR

XP_022139358.1 DNA replication complex GINS protein PSF1-like [Momordica charantia]4.8e-9790.4Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI
        MYGRKACQLVKELA+ EKGQLTHFNSDLF+QVISEC+QHHLELQSL+RKIQEEGLDLQTTKNEDHFGAL+HHL+LVRNKRCLMAYVYNRAEIIRSLIWK+
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI

Query:  LGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR
           LPP IQEKLSNSE+EYFKKHSARLK YMS+LEL+LTVDMVPPKDPYIQVRVLDDIGEGI+LSDDKTANFAR SIHFLKRTDAEQ+ISRGLMEELR
Subjt:  LGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR

XP_022964026.1 probable DNA replication complex GINS protein PSF1 [Cucurbita moschata]1.3e-9793.47Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI
        MYGRKACQLVKELAS EKGQL  FNSDLFEQVISECQQHHLELQSLLRKIQ+EGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAE IRSLIWKI
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI

Query:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR
        +G  LPP IQEKLSNSE+EYFKKHSARLKEYMSK+ELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFA  SIHFLKRTDAEQYISRGLMEELR
Subjt:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR

XP_038897311.1 probable DNA replication complex GINS protein PSF1 [Benincasa hispida]3.3e-9892.46Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI
        MYGRKACQLVKEL S EKGQLTHFNSDLFEQVISECQQHHLELQSL+RK+QEEGLDLQTTKNEDHFGAL+HHLALVRNKRCLMAYV+NRAEIIRSLIWKI
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI

Query:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR
        +G  +PP IQEKLSNSE+EYFKKHSAR+KEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFA  SIHFLKRTDAEQYISRGLMEELR
Subjt:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR

TrEMBL top hitse value%identityAlignment
A0A0A0K8U8 Sld5 domain-containing protein2.8e-9892.42Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI
        MYGRKACQLVKELAS EKGQLTHFNSDLFEQVISECQQHHLELQSL+RK+QEEGLDLQTTKNEDHFGAL+HHLALVRNKRCLMAYV+NRAEIIRSL+WK+
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI

Query:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEEL
        +G  +PP IQEKLSNSE+EYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFA  SIHFLKRTDAEQYISRGLMEEL
Subjt:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEEL

A0A1S3CDY2 probable DNA replication complex GINS protein PSF13.6e-9892.46Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI
        MYGRKACQLV ELAS EKGQLTHFNSDLFEQVISECQQHHLELQSL+RK+QEEGLDLQTTKNEDHFGAL+HHLALVRNKRCLMAYV+NRAE IRSLIWK+
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI

Query:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR
        LG  +PP IQEKLSNSE+EYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFA  SIHFLKRTDAEQYISRGLMEELR
Subjt:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR

A0A5A7UU94 Putative DNA replication complex GINS protein PSF13.6e-9892.46Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI
        MYGRKACQLV ELAS EKGQLTHFNSDLFEQVISECQQHHLELQSL+RK+QEEGLDLQTTKNEDHFGAL+HHLALVRNKRCLMAYV+NRAE IRSLIWK+
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI

Query:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR
        LG  +PP IQEKLSNSE+EYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFA  SIHFLKRTDAEQYISRGLMEELR
Subjt:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR

A0A6J1HHQ1 probable DNA replication complex GINS protein PSF16.2e-9893.47Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI
        MYGRKACQLVKELAS EKGQL  FNSDLFEQVISECQQHHLELQSLLRKIQ+EGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAE IRSLIWKI
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI

Query:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR
        +G  LPP IQEKLSNSE+EYFKKHSARLKEYMSK+ELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFA  SIHFLKRTDAEQYISRGLMEELR
Subjt:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR

A0A6J1KI79 probable DNA replication complex GINS protein PSF16.2e-9893.47Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI
        MYGRKACQLVKELAS EKGQL  FNSDLFEQVISECQQHHLELQSLLRKIQ+EGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAE IRSLIWKI
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI

Query:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR
        +G  LPP IQEKLSNSE+EYFKKHSARLKEYMSK+ELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFA  SIHFLKRTDAEQYISRGLMEELR
Subjt:  LGP-LPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELR

SwissProt top hitse value%identityAlignment
A4IFH4 DNA replication complex GINS protein PSF11.1e-2235.64Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGL-DLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWK
        M+  KA +LV+EL  A +G+L  FN D   QV+ E +  + + QS + + +  G  DL  T           H +L+RN+RC +AY+Y+R   IR+L W+
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGL-DLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWK

Query:  ILGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKL----ELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLME
            LP  ++  +S  E ++F ++   L  YM  L     LD+T DM PPK  YI+VR L D GE   + D  +    + S HFL R   EQ I +G++E
Subjt:  ILGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKL----ELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLME

Query:  EL
         +
Subjt:  EL

Q14691 DNA replication complex GINS protein PSF11.4e-2235.64Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEG-LDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWK
        M+  KA +L++EL  A +GQL  FN D   QV+ E +  + + QS + + +  G  DL  T           H +L+RN+RC +AY+Y+R   IR+L W+
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEG-LDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWK

Query:  ILGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKL----ELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLME
            LP  ++  ++  E E+F  +   L  YM  L     LD+T DM PPK  YI+VR L D GE   + D  +    + S HFL R   EQ I +G++E
Subjt:  ILGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKL----ELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLME

Query:  EL
         +
Subjt:  EL

Q54HR6 Probable DNA replication complex GINS protein PSF15.9e-2132Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHF--GALVHHLALVRNKRCLMAYVYNRAEIIRSLIW
        M+ + A +L+KEL   +   + H+N    +  I E    + +L   + + +E+       K E  +   A+  H ++ R+KRC++AY+  R   I+   W
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHF--GALVHHLALVRNKRCLMAYVYNRAEIIRSLIW

Query:  KI-LGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEEL
              LP  ++E+LS +E ++F ++   L EY SK+ LDLT+D  PPK+ YI+VRV+ ++GE +VL+   T N    + HFLKR+D    +  G +E +
Subjt:  KI-LGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEEL

Q7ZT47 DNA replication complex GINS protein PSF19.7e-2435.64Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEG-LDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWK
        M+  KA +L++EL  A  GQL  FN D   Q++ E +  + + Q+ + + + EG  DL  T           H  L+RN+RC++AY+Y+R   IR+L W+
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEG-LDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWK

Query:  ILGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKL----ELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLME
            LP  ++  +S  E ++F ++   L  YM  L     LD+T DM PPK  YI+VR L D GE   + D  T    + S HFL R   EQ I +G++E
Subjt:  ILGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKL----ELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLME

Query:  EL
         +
Subjt:  EL

Q9CZ15 DNA replication complex GINS protein PSF18.2e-2336.63Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGL-DLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWK
        M+  KA +LV+EL  A +GQL  FN D   QV+ E +  + + QS + + +  G  DL  T           H AL+RN+RC +AY+Y+R   IR+L W+
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGL-DLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWK

Query:  ILGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKL----ELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLME
            LP  ++  +S  E E+F  +   L  YM  L     LD+T D+ PPK  YI+VR L D GE   + D  +    + S HFL R   EQ I +G++E
Subjt:  ILGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKL----ELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLME

Query:  EL
         +
Subjt:  EL

Arabidopsis top hitse value%identityAlignment
AT1G80190.1 partner of SLD five 13.0e-7367Show/hide
Query:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI
        MYGRK  QL+K+ A+ EKGQL  FNS LF++ I EC Q+H  +QSL+RK+Q+EGLD+Q  +N DH+GAL+HHLAL+RNKRCLMAYVYNRAEI+R L W++
Subjt:  MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKI

Query:  ---LGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEEL
           L  LP  IQEKL+  E+EYFK HS  LK YM K+ ++L VDMVPPKDPYI+VR+LDDI EGIVLS DKT NFAR S+HFLKRTDAE YI+RG MEEL
Subjt:  ---LGPLPPGIQEKLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGGGAGAAAAGCATGCCAATTGGTGAAAGAGCTAGCAAGCGCCGAGAAAGGACAACTAACACACTTCAATAGCGACCTATTTGAACAGGTCATTTCAGAATGCCA
GCAGCATCATCTTGAGCTTCAATCCTTGTTGAGGAAAATACAGGAAGAAGGATTGGATCTACAGACGACTAAGAATGAAGATCATTTTGGCGCGCTCGTTCACCACCTCG
CATTAGTTCGCAATAAACGCTGCCTCATGGCTTATGTGTACAATCGAGCTGAGATTATAAGAAGTTTGATATGGAAGATATTAGGACCGCTTCCACCAGGAATACAAGAG
AAGCTCAGCAACTCAGAGCAAGAGTACTTTAAGAAGCATTCTGCACGTCTAAAAGAATACATGTCGAAACTCGAGTTGGATTTGACTGTGGATATGGTGCCACCTAAGGA
TCCATATATCCAAGTAAGAGTACTCGACGACATCGGTGAAGGAATCGTACTGAGTGATGATAAAACAGCAAACTTTGCACGGTTCTCCATACATTTTCTTAAACGAACAG
ATGCTGAGCAATACATCTCACGGGGTTTAATGGAGGAACTTAGAGACTGA
mRNA sequenceShow/hide mRNA sequence
CAAAATCCCTAATTCAGAAGTTCGTTCCCCCTGCGTCAGTTCGTCTTCTCTTTTCCAGTCGATTCTCAGAAGTTTCCGATAGGTTTCGATCTCTGTATATTTGCGTCCTA
ATAGCGATTTTGAACGGAAATTTTCCATTATGTATGGGAGAAAAGCATGCCAATTGGTGAAAGAGCTAGCAAGCGCCGAGAAAGGACAACTAACACACTTCAATAGCGAC
CTATTTGAACAGGTCATTTCAGAATGCCAGCAGCATCATCTTGAGCTTCAATCCTTGTTGAGGAAAATACAGGAAGAAGGATTGGATCTACAGACGACTAAGAATGAAGA
TCATTTTGGCGCGCTCGTTCACCACCTCGCATTAGTTCGCAATAAACGCTGCCTCATGGCTTATGTGTACAATCGAGCTGAGATTATAAGAAGTTTGATATGGAAGATAT
TAGGACCGCTTCCACCAGGAATACAAGAGAAGCTCAGCAACTCAGAGCAAGAGTACTTTAAGAAGCATTCTGCACGTCTAAAAGAATACATGTCGAAACTCGAGTTGGAT
TTGACTGTGGATATGGTGCCACCTAAGGATCCATATATCCAAGTAAGAGTACTCGACGACATCGGTGAAGGAATCGTACTGAGTGATGATAAAACAGCAAACTTTGCACG
GTTCTCCATACATTTTCTTAAACGAACAGATGCTGAGCAATACATCTCACGGGGTTTAATGGAGGAACTTAGAGACTGACTGTTACACTGAAGATGGAGTTTAGTTGGAG
GTGTTTTTTAGAGCCAAATGAGAAAGAAGTGAAAATACTAGCCATCCCATTTCTGTATAAATTATTATATTTAAAGACAGATGTTTGTTAAATAACTTATTGTTAGTATG
ACTTTTGTCCCAATCACAGCCGAAATAAGTCGATCAGGAAGCTTTCCCAGAAATCTATACCACTTTCCTTTATAAATTTGATTTTAC
Protein sequenceShow/hide protein sequence
MYGRKACQLVKELASAEKGQLTHFNSDLFEQVISECQQHHLELQSLLRKIQEEGLDLQTTKNEDHFGALVHHLALVRNKRCLMAYVYNRAEIIRSLIWKILGPLPPGIQE
KLSNSEQEYFKKHSARLKEYMSKLELDLTVDMVPPKDPYIQVRVLDDIGEGIVLSDDKTANFARFSIHFLKRTDAEQYISRGLMEELRD