CuGenDBv2

Tan0004491 (gene) of Snake gourd v1 genome

Gene ID: Tan0004491
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Replication protein A subunit
Genome location: LG06:16914063..16919191
RNA-Seq Expression: Tan0004491
Synteny: Tan0004491
Gene Ontology terms:
  GO:0006281 - DNA repair (biological process)
  GO:0003677 - DNA binding (molecular function)
InterPro domains:
  IPR012340 - Nucleic acid-binding, OB-fold
  IPR013955 - Replication factor A, C-terminal
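The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str):
    """Split 'SEQID:START..END' into (seqid, start, end, length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
    return seqid, start, end, end - start + 1


print(parse_location("LG06:16914063..16919191"))
```

Under the inclusive-coordinate assumption, the gene spans 5129 bp on LG06.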


Homology
GenBank top hits (e-value, % identity, alignment)
KAG5512925.1 hypothetical protein RHGRI_038656 [Rhododendron griersonianum] (e-value: 6.3e-15; identity: 25.98%)
Query:  IGELKPFQRGWSAKVTIIKKFSIQKFKNKK------------EIHLNIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNND
        I +L+P+ R W+ KVTI+++  ++ +   K            E    IQ  LFNDVI  F + F    +Y I++G +KP+  +Y ++H + E+ L+ N  
Subjt:  IGELKPFQRGWSAKVTIIKKFSIQKFKNKK------------EIHLNIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNND

Query:  VK----ELECNTFYLKKDNFCSFEFGNNNRTNTPLTGAKLQDDNYS--------------KKMEVLLMDN--KFNTLKLALWDDLAENYKPT------TY
        V+    E+  N+F  +   F SFE   +   N  +  AKL +++Y               KKM             ++   ++  A+ +K          
Subjt:  VK----ELECNTFYLKKDNFCSFEFGNNNRTNTPLTGAKLQDDNYS--------------KKMEVLLMDN--KFNTLKLALWDDLAENYKPT------TY

Query:  CQKCENKNSTFSRKYVLRMLVSDGEEESYITLFYATDYIIGCSATEYFNDLKSK
        C KC + N   S +Y++++ V DG ++  +TLF A   ++GC+ +E+   L  +
Subjt:  CQKCENKNSTFSRKYVLRMLVSDGEEESYITLFYATDYIIGCSATEYFNDLKSK

ONK55571.1 uncharacterized protein A4U43_UnF1510 [Asparagus officinalis] (e-value: 3.6e-10; identity: 29.94%)
Query:  NMLIGELKPFQRGWSAKVTIIKKFSIQKFK----NKKEIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNN
        N++I ELK +Q+ WS  V +++  +I  FK    NK+++ L       IQ  L N +IEKF ++   G+TY I +G +KP  ++Y +I+P+ E++L+   
Subjt:  NMLIGELKPFQRGWSAKVTIIKKFSIQKFK----NKKEIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNN

Query:  DVKELECNTFYLK-KDNFCSFEFGNN--------NRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAE
         ++E +      K K  F SF+  +         +      T   +  +  SK+ E+++M+ +F+ + + LW DLAE
Subjt:  DVKELECNTFYLK-KDNFCSFEFGNN--------NRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAE

ONK60194.1 uncharacterized protein A4U43_C08F15380 [Asparagus officinalis] (e-value: 6.3e-15; identity: 28.42%)
Query:  NMLIGELKPFQRGWSAKVTIIKKFSIQKFK----NKKEIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNN
        N+ I ELK +Q+ WS  V +++  +I  FK    NK+++ L       IQ  LFN +IEKF  +    +TY I +G +KP  ++Y +I+P+ E++L+   
Subjt:  NMLIGELKPFQRGWSAKVTIIKKFSIQKFK----NKKEIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNN

Query:  DVKELECNTFYLK-KDNFCSFE----FGNN----NRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAE----------NYKPTTYCQKCEN
         ++E +      K K  F SF+    F +     +     +T   +  +  SK+ EV++M+ +F+ + + LW DLAE          + KP         
Subjt:  DVKELECNTFYLK-KDNFCSFE----FGNN----NRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAE----------NYKPTTYCQKCEN

Query:  KN--------STFSRKYVLRMLVSDGEEESYITLFYATDYIIGCSATEY---FNDLKSK-WTKQESDLARKQIVVQEI
        +               Y LR+ V+DG + +Y+TLF   + +IGC+A  Y   F D  S+ + K +S L ++ + + +I
Subjt:  KN--------STFSRKYVLRMLVSDGEEESYITLFYATDYIIGCSATEY---FNDLKSK-WTKQESDLARKQIVVQEI

ONK81712.1 uncharacterized protein A4U43_C01F32110, partial [Asparagus officinalis] (e-value: 5.6e-11; identity: 32.2%)
Query:  NMLIGELKPFQRGWSAKVTIIKKFSIQKFK----NKKEIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNN
        N+ I ELK +Q+ WS  V +++  +I  FK    NK+++ L       IQ  LFN +IEKF  +   G+TY I +G +KP  ++Y +I+P+ E++L+   
Subjt:  NMLIGELKPFQRGWSAKVTIIKKFSIQKFK----NKKEIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNN

Query:  DVKELECNTFYLK-KDNFCSFE----FGNN----NRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAE
         ++E +      K K  F SF+    F N     +      T   +  +  SK+ EV++M+ +F+ + + LW DLAE
Subjt:  DVKELECNTFYLK-KDNFCSFE----FGNN----NRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAE

XP_022155232.1 replication protein A 70 kDa DNA-binding subunit E-like [Momordica charantia] (e-value: 1.0e-41; identity: 52.2%)
Query:  MNMLIGELKPFQRGWSAKVTIIKKFSIQKFKNKKEIHLN-------------IQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISL
        MNM+I +L+PFQ+GW+  VTI++KF IQ   N+K   L              IQ+ LFND+IEKFG++ + GKTY INDGNIKPI +RYM++H K EISL
Subjt:  MNMLIGELKPFQRGWSAKVTIIKKFSIQKFKNKKEIHLN-------------IQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISL

Query:  SKNNDVKELECNTFYLKKDNFCSFEFGNNNRT---------NTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAEN
        S+N+ VKE+E  TF+L K +FC+FE   + ++           P+   K +D NYSKK EVLLMDNK NT+KL LWDDLAEN
Subjt:  SKNNDVKELECNTFYLKKDNFCSFEFGNNNRT---------NTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAEN

TrEMBL top hits (e-value, % identity, alignment)
A0A1R3L7I6 DUF223 domain-containing protein (e-value: 1.7e-10; identity: 29.94%)
Query:  NMLIGELKPFQRGWSAKVTIIKKFSIQKFK----NKKEIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNN
        N++I ELK +Q+ WS  V +++  +I  FK    NK+++ L       IQ  L N +IEKF ++   G+TY I +G +KP  ++Y +I+P+ E++L+   
Subjt:  NMLIGELKPFQRGWSAKVTIIKKFSIQKFK----NKKEIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNN

Query:  DVKELECNTFYLK-KDNFCSFEFGNN--------NRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAE
         ++E +      K K  F SF+  +         +      T   +  +  SK+ E+++M+ +F+ + + LW DLAE
Subjt:  DVKELECNTFYLK-KDNFCSFEFGNN--------NRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAE

A0A2R6QH86 Replication protein A subunit (e-value: 1.5e-09; identity: 25.27%)
Query:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------------EIHLNIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE
        L P+Q  W+ KV +  K +++ +KN +            E    IQ T+FN+   KF + FQ+GK Y I+ G +K    ++M++H  +E++L++N+DV+E
Subjt:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------------EIHLNIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE

Query:  LECNTFYLKKDNFCSFEFGN-----------------NNRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAEN
              ++ +  F                         N + T     K+ ++   K+ ++ + D    T+ ++LW+DLA N
Subjt:  LECNTFYLKKDNFCSFEFGN-----------------NNRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAEN

A0A5J9WJ31 Replication protein A subunit (e-value: 1.1e-09; identity: 26.4%)
Query:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------EIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE
        L P+Q  W  KV +  K +++ +KN +       + L       IQ T+FN+  +KF  IF+MGK Y I+ G+++    ++ ++   +E++L++N  V+E
Subjt:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------EIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE

Query:  LECNTF-------YLKKDNFCSFEFGNN--------NRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLA
         E  TF       ++K D   S+  G             +  L+  +  DD    K ++++ D+   T+ ++LW+DL+
Subjt:  LECNTF-------YLKKDNFCSFEFGNN--------NRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLA

A0A5P1FUD9 DUF223 domain-containing protein (Fragment) (e-value: 2.7e-11; identity: 32.2%)
Query:  NMLIGELKPFQRGWSAKVTIIKKFSIQKFK----NKKEIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNN
        N+ I ELK +Q+ WS  V +++  +I  FK    NK+++ L       IQ  LFN +IEKF  +   G+TY I +G +KP  ++Y +I+P+ E++L+   
Subjt:  NMLIGELKPFQRGWSAKVTIIKKFSIQKFK----NKKEIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNN

Query:  DVKELECNTFYLK-KDNFCSFE----FGNN----NRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAE
         ++E +      K K  F SF+    F N     +      T   +  +  SK+ EV++M+ +F+ + + LW DLAE
Subjt:  DVKELECNTFYLK-KDNFCSFE----FGNN----NRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAE

A0A6J1DNS8 replication protein A 70 kDa DNA-binding subunit E-like (e-value: 5.0e-42; identity: 52.2%)
Query:  MNMLIGELKPFQRGWSAKVTIIKKFSIQKFKNKKEIHLN-------------IQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISL
        MNM+I +L+PFQ+GW+  VTI++KF IQ   N+K   L              IQ+ LFND+IEKFG++ + GKTY INDGNIKPI +RYM++H K EISL
Subjt:  MNMLIGELKPFQRGWSAKVTIIKKFSIQKFKNKKEIHLN-------------IQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISL

Query:  SKNNDVKELECNTFYLKKDNFCSFEFGNNNRT---------NTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAEN
        S+N+ VKE+E  TF+L K +FC+FE   + ++           P+   K +D NYSKK EVLLMDNK NT+KL LWDDLAEN
Subjt:  SKNNDVKELECNTFYLKKDNFCSFEFGNNNRT---------NTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAEN

SwissProt top hits (e-value, % identity, alignment)
Q10Q08 Replication protein A 70 kDa DNA-binding subunit B (e-value: 1.2e-11; identity: 25.84%)
Query:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------EIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE
        L P+Q  W  KV +  K +++ +KN +       + L       IQ T+FN+  +KF  +F++GK Y I+ G+++    ++ ++H  +E++L++N  V+E
Subjt:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------EIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE

Query:  LECNTF-------YLKKDNFCSFEFGNN-------NRTNTPLTGAKLQDDNYS-KKMEVLLMDNKFNTLKLALWDDLA
         E  TF       ++K D    +  G          ++ +P    + + DN +  K ++++ D+   T+ ++LW+DLA
Subjt:  LECNTF-------YLKKDNFCSFEFGNN-------NRTNTPLTGAKLQDDNYS-KKMEVLLMDNKFNTLKLALWDDLA

Q6YZ49 Replication protein A 70 kDa DNA-binding subunit A (e-value: 1.1e-06; identity: 28.71%)
Query:  IGELKPFQRGWSAKVTIIKKFSIQKFKNKK------EIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNND
        I  L P+Q  W+ K  +  K  I+++ N K         L       I++T FN ++++F E+ ++GK YV++ GN++P    Y  ++ ++EI L   + 
Subjt:  IGELKPFQRGWSAKVTIIKKFSIQKFKNKK------EIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNND

Query:  V
        V
Subjt:  V

Q9FHJ6 Replication protein A 70 kDa DNA-binding subunit C (e-value: 7.3e-06; identity: 22.63%)
Query:  IGELKPFQRGWSAKVTIIKKFSIQKFKNKK-EIHL-----------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNND
        I  L P+Q  W+ KV +  K  +++F N + E  L            I++T FND +++F +   +G  Y+I+ GN+KP    +  +   +EI L   + 
Subjt:  IGELKPFQRGWSAKVTIIKKFSIQKFKNKK-EIHL-----------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNND

Query:  VKELECN----TFYLKKDNFCSFEFGNNNRTN---------TPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDL--AENYKPTTYC
        ++  E +     ++    N    E   NN T          +P      ++    +K  + L D    ++++ +W +   AE  K    C
Subjt:  VKELECN----TFYLKKDNFCSFEFGNNNRTN---------TPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDL--AENYKPTTYC

Q9FME0 Replication protein A 70 kDa DNA-binding subunit D (e-value: 2.0e-11; identity: 25.7%)
Query:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------------EIHLNIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE
        L P+Q  W+ KV +  K  ++ +KN +            E    IQ T+FND   KF + FQ+GK Y I+ G++K    ++ ++   +E++L++N++V+E
Subjt:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------------EIHLNIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE

Query:  LECNTFYL--KKDNFCSFE--------------FGNNNRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLA
              ++   K NF   E               G     +  ++  +  D+    K ++ L D    T+ ++LW+DLA
Subjt:  LECNTFYL--KKDNFCSFE--------------FGNNNRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLA

Q9SD82 Replication protein A 70 kDa DNA-binding subunit B (e-value: 1.9e-09; identity: 24.58%)
Query:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------------EIHLNIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE
        L P+Q  W+ KV +  K  ++ +KN +            E    IQ T+FN    KF + F+MGK Y I+ G++K    ++ ++   +E++L++N++V+E
Subjt:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------------EIHLNIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE

Query:  LECNTFYL--KKDNFCSFE--------------FGNNNRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLA
              +    K NF   +               G     +  ++  +  D+    K ++ L D    T+ ++LW+DLA
Subjt:  LECNTFYL--KKDNFCSFE--------------FGNNNRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLA

Arabidopsis top hits (e-value, % identity, alignment)
AT2G06510.1 replication protein A 1A (e-value: 1.2e-06; identity: 26.47%)
Query:  IGELKPFQRGWSAKVTIIKKFSIQKFKNKK------EIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNND
        I  L P+Q  W+ K  +  K  I+++ N K         L       I++T FN ++++F ++ ++GK Y+I+ G++KP    +  +  ++EI L   + 
Subjt:  IGELKPFQRGWSAKVTIIKKFSIQKFKNKK------EIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNND

Query:  VK
        V+
Subjt:  VK

AT2G06510.2 replication protein A 1A (e-value: 1.2e-06; identity: 26.47%)
Query:  IGELKPFQRGWSAKVTIIKKFSIQKFKNKK------EIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNND
        I  L P+Q  W+ K  +  K  I+++ N K         L       I++T FN ++++F ++ ++GK Y+I+ G++KP    +  +  ++EI L   + 
Subjt:  IGELKPFQRGWSAKVTIIKKFSIQKFKNKK------EIHL------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNND

Query:  VK
        V+
Subjt:  VK

AT5G08020.1 RPA70-kDa subunit B (e-value: 1.3e-10; identity: 24.58%)
Query:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------------EIHLNIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE
        L P+Q  W+ KV +  K  ++ +KN +            E    IQ T+FN    KF + F+MGK Y I+ G++K    ++ ++   +E++L++N++V+E
Subjt:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------------EIHLNIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE

Query:  LECNTFYL--KKDNFCSFE--------------FGNNNRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLA
              +    K NF   +               G     +  ++  +  D+    K ++ L D    T+ ++LW+DLA
Subjt:  LECNTFYL--KKDNFCSFE--------------FGNNNRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLA

AT5G45400.1 Replication factor-A protein 1-related (e-value: 5.2e-07; identity: 22.63%)
Query:  IGELKPFQRGWSAKVTIIKKFSIQKFKNKK-EIHL-----------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNND
        I  L P+Q  W+ KV +  K  +++F N + E  L            I++T FND +++F +   +G  Y+I+ GN+KP    +  +   +EI L   + 
Subjt:  IGELKPFQRGWSAKVTIIKKFSIQKFKNKK-EIHL-----------NIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNND

Query:  VKELECN----TFYLKKDNFCSFEFGNNNRTN---------TPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDL--AENYKPTTYC
        ++  E +     ++    N    E   NN T          +P      ++    +K  + L D    ++++ +W +   AE  K    C
Subjt:  VKELECN----TFYLKKDNFCSFEFGNNNRTN---------TPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDL--AENYKPTTYC

AT5G61000.1 Replication factor-A protein 1-related (e-value: 1.4e-12; identity: 25.7%)
Query:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------------EIHLNIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE
        L P+Q  W+ KV +  K  ++ +KN +            E    IQ T+FND   KF + FQ+GK Y I+ G++K    ++ ++   +E++L++N++V+E
Subjt:  LKPFQRGWSAKVTIIKKFSIQKFKNKK------------EIHLNIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKE

Query:  LECNTFYL--KKDNFCSFE--------------FGNNNRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLA
              ++   K NF   E               G     +  ++  +  D+    K ++ L D    T+ ++LW+DLA
Subjt:  LECNTFYL--KKDNFCSFE--------------FGNNNRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLA


Sequences
CDS sequence
ATGGCAGACATGAATATGCTTATTGGTGAGCTCAAGCCTTTTCAAAGAGGATGGAGTGCTAAAGTCACTATCATAAAGAAATTTTCCATTCAAAAATTCAAGAACAAAAA
GGAGATCCACTTAAATATTCAAATCACTTTGTTCAATGATGTTATTGAAAAATTTGGAGAAATTTTTCAAATGGGAAAAACGTATGTGATCAATGATGGCAACATCAAAC
CTATTTATATGAGATACATGAGCATTCATCCAAAATTTGAGATATCATTATCCAAAAACAATGACGTCAAAGAGCTTGAATGTAACACATTCTATCTTAAGAAGGATAAC
TTTTGCAGCTTTGAATTTGGGAACAATAATCGAACAAATACTCCCCTCACAGGTGCAAAACTTCAGGATGACAATTATTCTAAAAAAATGGAAGTTCTACTTATGGACAA
TAAATTTAATACACTGAAATTGGCATTGTGGGACGATTTGGCAGAAAACTACAAGCCAACAACGTATTGTCAAAAATGTGAGAATAAAAATTCAACATTTTCACGAAAAT
ACGTATTGAGAATGCTGGTATCAGATGGTGAGGAGGAAAGCTACATAACTCTTTTTTATGCTACGGATTATATAATTGGATGCAGTGCAACTGAATATTTCAATGATCTC
AAGTCAAAATGGACAAAGCAGGAGAGTGATCTTGCACGTAAACAAATTGTTGTTCAAGAAATTTGCAGTGTCGTTTCCATGACTAGCAAAAAACACATAAAAGGTGAAGG
GAAGAAAATGATAGAAATGCAAGAAAAGAATAAGCATGCCAAAATAAAATAA
mRNA sequence
ATGGCAGACATGAATATGCTTATTGGTGAGCTCAAGCCTTTTCAAAGAGGATGGAGTGCTAAAGTCACTATCATAAAGAAATTTTCCATTCAAAAATTCAAGAACAAAAA
GGAGATCCACTTAAATATTCAAATCACTTTGTTCAATGATGTTATTGAAAAATTTGGAGAAATTTTTCAAATGGGAAAAACGTATGTGATCAATGATGGCAACATCAAAC
CTATTTATATGAGATACATGAGCATTCATCCAAAATTTGAGATATCATTATCCAAAAACAATGACGTCAAAGAGCTTGAATGTAACACATTCTATCTTAAGAAGGATAAC
TTTTGCAGCTTTGAATTTGGGAACAATAATCGAACAAATACTCCCCTCACAGGTGCAAAACTTCAGGATGACAATTATTCTAAAAAAATGGAAGTTCTACTTATGGACAA
TAAATTTAATACACTGAAATTGGCATTGTGGGACGATTTGGCAGAAAACTACAAGCCAACAACGTATTGTCAAAAATGTGAGAATAAAAATTCAACATTTTCACGAAAAT
ACGTATTGAGAATGCTGGTATCAGATGGTGAGGAGGAAAGCTACATAACTCTTTTTTATGCTACGGATTATATAATTGGATGCAGTGCAACTGAATATTTCAATGATCTC
AAGTCAAAATGGACAAAGCAGGAGAGTGATCTTGCACGTAAACAAATTGTTGTTCAAGAAATTTGCAGTGTCGTTTCCATGACTAGCAAAAAACACATAAAAGGTGAAGG
GAAGAAAATGATAGAAATGCAAGAAAAGAATAAGCATGCCAAAATAAAATAA
Protein sequence
MADMNMLIGELKPFQRGWSAKVTIIKKFSIQKFKNKKEIHLNIQITLFNDVIEKFGEIFQMGKTYVINDGNIKPIYMRYMSIHPKFEISLSKNNDVKELECNTFYLKKDN
FCSFEFGNNNRTNTPLTGAKLQDDNYSKKMEVLLMDNKFNTLKLALWDDLAENYKPTTYCQKCENKNSTFSRKYVLRMLVSDGEEESYITLFYATDYIIGCSATEYFNDL
KSKWTKQESDLARKQIVVQEICSVVSMTSKKHIKGEGKKMIEMQEKNKHAKIK
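
The CDS and protein sequences listed above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch rather than the database's own procedure: it assumes the standard genetic code, the names `CODON_TABLE` and `translate` are illustrative, and only the first 51 and last 18 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the start and end of the CDS sequence above.
cds_start = "ATGGCAGACATGAATATGCTTATTGGTGAGCTCAAGCCTTTTCAAAGAGGA"
cds_end = "CATGCCAAAATAAAATAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two translated excerpts should agree with the beginning (MADMNMLIGELKPFQRG) and the end (HAKIK) of the protein sequence listed above.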