; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G08400 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G08400
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationChr4:6140236..6142965
RNA-Seq ExpressionCSPI04G08400
SyntenyCSPI04G08400
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN53624.1 hypothetical protein Csa_015395 [Cucumis sativus]1.8e-9696.72Show/hide
Query:  MTLDSQAPNVRKKVKVKRERPDLGMLKKGKMVGTPELDKLIKIEKGLLPFKGPLPDFLPDPIKAFKWKKFFIVEGDKVPSTVKAISKLYDLPNGSYAYPD
        MTLDSQAPNVRKKVKVKRERPDLGMLKKGKMVGTPELDKLIKIEKGLLPFKGPLPDFLPDPIKAFKWKKFFIVEG+KVPSTVKAISKLYDLPNGSYAYPD
Subjt:  MTLDSQAPNVRKKVKVKRERPDLGMLKKGKMVGTPELDKLIKIEKGLLPFKGPLPDFLPDPIKAFKWKKFFIVEGDKVPSTVKAISKLYDLPNGSYAYPD

Query:  QRIIDNPMRSDMLPTRHDSIVSIEHELVLYYILMKQPFNLRSIINGALLVWRRNPKGAKPFPSTMEKLCLKYLPTLARYHKLP
        QRIIDNPMRSDMLPTRHDSIV IEHELVLYYILMKQPFNLRSIINGAL VWRRNPKGAKPFPSTMEKLCLKYLPTLARY + P
Subjt:  QRIIDNPMRSDMLPTRHDSIVSIEHELVLYYILMKQPFNLRSIINGALLVWRRNPKGAKPFPSTMEKLCLKYLPTLARYHKLP

WP_217833161.1 DDE-type integrase/transposase/recombinase, partial [Synechococcus sp. PCC 7002]7.0e-5665.62Show/hide
Query:  MRPFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREI
        M PFP S  + YIL+A DYVSKWVEA +C KND  TVS+FLKK I + FGTPRAIISDEG HF+N IIT +L K+N+ H++  AYHPQ N QAE++N+EI
Subjt:  MRPFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREI

Query:  KKILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYS
        K ILEK+V+ S KDW + LD ALWAYRT +KTPIGMSPY +VF KACHL LEL+HKA ++
Subjt:  KKILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYS

XP_030479372.1 uncharacterized protein LOC115696618 [Cannabis sativa]1.2e-5565.62Show/hide
Query:  MRPFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREI
        M PFPQS  ++YIL+A DYVSKWVEAI+  KND   V +FL K++ T FGTPRA+ISDEG HFVN ++  +LAKY+++HKI  AYHPQ N QAE+SNREI
Subjt:  MRPFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREI

Query:  KKILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYS
        K ILEK+VN + KDW   LD ALWAYRTAYKTP+GMSPY +V+ KACHLP+EL+HKA ++
Subjt:  KKILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYS

XP_038887969.1 uncharacterized protein LOC120077927 [Benincasa hispida]8.3e-5760.11Show/hide
Query:  FPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIKKI
        FP S  H YILL  DYVSKWV+AISC  ND  TVS+FL+KNI T FGT  A ISDEG HF+N I++K+L KYN+ HKI   YHPQ N +AEVSNREIK +
Subjt:  FPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIKKI

Query:  LEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYSEEAQKPNL-TARDS----ILEVNQWMRQEEK
        LEK+VN + KDW    D ALWAY T YKTPIGMSPY +VFRKACHLPLEL+HKA ++ +    +L +A D+    +LE+++W  Q  K
Subjt:  LEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYSEEAQKPNL-TARDS----ILEVNQWMRQEEK

XP_038889328.1 uncharacterized protein K02A2.6-like [Benincasa hispida]5.3e-5658.59Show/hide
Query:  MRPFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREI
        M PFP S    YI LA DYVSKWVE ++C +ND  TVS+FL +NI THFGT RA++SDEG HF+N II+K LAKYN+RH I  AYHPQ N QAEVSNREI
Subjt:  MRPFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREI

Query:  KKILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYSEEAQKPNLTA-----RDSILEVNQW---MRQEEKFYTE
        K ILEK+VN S K+    LD  LWAYRTAYKTPI MSPY ++F KACHLPLELKHKA ++ +    NL A     +  + E+ +W     +  K Y E
Subjt:  KKILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYSEEAQKPNLTA-----RDSILEVNQW---MRQEEKFYTE

TrEMBL top hitse value%identityAlignment
A0A0A0KXQ2 Uncharacterized protein8.8e-9796.72Show/hide
Query:  MTLDSQAPNVRKKVKVKRERPDLGMLKKGKMVGTPELDKLIKIEKGLLPFKGPLPDFLPDPIKAFKWKKFFIVEGDKVPSTVKAISKLYDLPNGSYAYPD
        MTLDSQAPNVRKKVKVKRERPDLGMLKKGKMVGTPELDKLIKIEKGLLPFKGPLPDFLPDPIKAFKWKKFFIVEG+KVPSTVKAISKLYDLPNGSYAYPD
Subjt:  MTLDSQAPNVRKKVKVKRERPDLGMLKKGKMVGTPELDKLIKIEKGLLPFKGPLPDFLPDPIKAFKWKKFFIVEGDKVPSTVKAISKLYDLPNGSYAYPD

Query:  QRIIDNPMRSDMLPTRHDSIVSIEHELVLYYILMKQPFNLRSIINGALLVWRRNPKGAKPFPSTMEKLCLKYLPTLARYHKLP
        QRIIDNPMRSDMLPTRHDSIV IEHELVLYYILMKQPFNLRSIINGAL VWRRNPKGAKPFPSTMEKLCLKYLPTLARY + P
Subjt:  QRIIDNPMRSDMLPTRHDSIVSIEHELVLYYILMKQPFNLRSIINGALLVWRRNPKGAKPFPSTMEKLCLKYLPTLARYHKLP

A0A151QL68 Transposon Ty3-G Gag-Pol polyprotein1.6e-5363.12Show/hide
Query:  MRPFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREI
        M PFP S    YILLA DYVSKW+EA+   K+D  TV++F+K NIL  FG PRAIISD+G HF N +   +LAK+ +RHK+   YHPQ N QAEVSNRE+
Subjt:  MRPFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREI

Query:  KKILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYS
        KKILE++V  S KDW   L+ ALWAYRTAYKTPIGMSPY +VF KACHLP+EL+HKA ++
Subjt:  KKILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYS

A0A1U7Y2Z2 uncharacterized protein LOC1042404703.9e-5248.26Show/hide
Query:  MRPFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREI
        M PFP S+ + YILLA DYVSKWVEAI+   ND + V+ F+KKNI + FGTPRA+ISDEG HF N ++  +L KY +RH++  AYHPQ + QA VSNREI
Subjt:  MRPFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREI

Query:  KKILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYSEEAQKPNLTA--RDSILEVNQWMRQEEKFYTELSDLGTGV
        KKILEK V+ + K W   LD ALWAYRTAYK PIG SPY +V+ KACHLP+EL+HKA ++ +    N+ A     ++++N+      K  +  S L    
Subjt:  KKILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYSEEAQKPNLTA--RDSILEVNQWMRQEEKFYTELSDLGTGV

Query:  ETTRLAGNSSCYTHLKHGAGKVLVDCKEIR
        E  R+    +      +G  K LV+   ++
Subjt:  ETTRLAGNSSCYTHLKHGAGKVLVDCKEIR

A0A5A7U2P4 Integrase catalytic domain-containing protein3.0e-5270.13Show/hide
Query:  PFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIKK
        PF QS  H+YILL  DYVSKWVEAIS VK DV+TVS+FLKKNI + FGTPRA+I+DEG HF+NHIITK+L  YNI HK+  A  PQ N Q +V NREI K
Subjt:  PFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIKK

Query:  ILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHK
        ILEK+VN S KD  DHLDS L AY TAYKTPIGMSPY +VF KACHLP EL+ K
Subjt:  ILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHK

A0A803R2M6 Uncharacterized protein2.3e-5265Show/hide
Query:  MRPFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREI
        M PFP S +++YILLA DYVSKWVEA +   ND  TV RFL+KNI T FGTPRAIISDEG HF N     +L++Y +RH+  + YHPQ N QAE+SNREI
Subjt:  MRPFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREI

Query:  KKILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYS
        K ILEK V  S KDW   LD ALWAYRTA+KTPIGMSPY +VF KACHLP+EL+HKA ++
Subjt:  KKILEKLVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYS

SwissProt top hitse value%identityAlignment
O92815 Gag-Pol polyprotein2.4e-1128.72Show/hide
Query:  KNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIK-KILEK
        K  +Y L+  D  SKW E I C K D  TV   L K+I+  +G P  I SD+G HF   I  ++     +  K+    HP+ +   E +NR +K KI++ 
Subjt:  KNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIK-KILEK

Query:  LVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYSEEAQKPNLTARDSILE-VNQWMRQEEKFYTELSD
                W + L   L   R   K   G+SP+ IV  +    P++  + +  S       L A D+++  +N+  RQ   ++ ++ D
Subjt:  LVNHSCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYSEEAQKPNLTARDSILE-VNQWMRQEEKFYTELSD

P03359 Gag-Pol polyprotein2.2e-1236.3Show/hide
Query:  YILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIKKILEKL-VNH
        Y+L+  D  S WVEA        +TV + + + IL  FG P+ + SD G  FV  +   +  +  I  K+  AY PQ + Q E  NR IK+ L KL +  
Subjt:  YILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIKKILEKL-VNH

Query:  SCKDWVDHLDSALWAYRTAYKTP--IGMSPYGIVF
          KDWV  L  AL   R    TP   G++PY I++
Subjt:  SCKDWVDHLDSALWAYRTAYKTP--IGMSPYGIVF

P03360 Gag-Pol polyprotein (Fragment)5.4e-1134.33Show/hide
Query:  YILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIKKILEKLVNHS
        Y+L+  D  S WVEA    +     V + L  +I+  FG P  I SD G  FV  +  ++    N+  K+  AY PQ + Q E  NR +K+ + KL   +
Subjt:  YILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIKKILEKLVNHS

Query:  CKDWVDHLDSALWAYRTAYKTP--IGMSPYGIVF
          DWV  L  AL   R    TP   G+SP+ I++
Subjt:  CKDWVDHLDSALWAYRTAYKTP--IGMSPYGIVF

P21414 Gag-Pol polyprotein6.4e-1234.51Show/hide
Query:  PQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIKKIL
        P    + Y+L+  D  S WVEA        + V + + + IL  FG P+ + SD G  FV  +   +  +  I  K+  AY PQ + Q E  NR IK+ L
Subjt:  PQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIKKIL

Query:  EKL-VNHSCKDWVDHLDSALWAYRTAYKTP--IGMSPYGIVF
         KL +    KDWV  L  AL   R    TP   G++PY I++
Subjt:  EKL-VNHSCKDWVDHLDSALWAYRTAYKTP--IGMSPYGIVF

Q9TTC1 Gag-Pol polyprotein1.1e-1136.57Show/hide
Query:  YILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIKKILEKL-VNH
        Y+L+  D  S WVEA        +TV + + + IL  FG P+ + SD G  FV  +   +  +  I  K+  AY PQ + Q E  NR IK+ L KL +  
Subjt:  YILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIKKILEKL-VNH

Query:  SCKDWVDHLDSALWAYRTAYKTP--IGMSPYGIV
          KDWV  L  AL   R    TP   G++PY I+
Subjt:  SCKDWVDHLDSALWAYRTAYKTP--IGMSPYGIV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGACCATTTCCTCAATCTAAAAATCATGTTTACATCTTGTTGGCTGAAGACTATGTTTCTAAGTGGGTTGAAGCAATTTCCTGTGTTAAGAATGATGTAGTTACAGT
GAGTAGATTTCTGAAAAAGAACATCCTTACACATTTTGGGACCCCTAGAGCAATTATCAGCGATGAAGGACTCCATTTCGTTAATCATATCATCACTAAGGTGCTTGCAA
AGTACAATATAAGACATAAGATAGACATTGCCTATCATCCGCAAATAAATAGTCAAGCAGAAGTATCCAACAGAGAAATTAAGAAGATCTTAGAAAAATTGGTAAATCAT
TCCTGCAAGGATTGGGTAGATCACCTAGACTCTGCGCTTTGGGCATATCGTACAGCGTACAAGACGCCAATTGGGATGTCCCCTTATGGGATAGTTTTTAGGAAAGCTTG
CCACTTACCGCTAGAATTGAAACACAAGGCATCTTATTCGGAGGAAGCTCAAAAACCTAACCTAACCGCTAGAGACTCCATTCTTGAAGTTAATCAATGGATGAGGCAAG
AAGAGAAATTTTATACTGAATTATCAGACTTAGGCACAGGGGTGGAAACCACCCGTTTAGCAGGCAATTCGTCGTGTTACACGCATTTGAAGCATGGGGCGGGGAAAGTG
CTTGTTGACTGCAAGGAGATAAGAGGAGAGGCAAGTTTGCAAATTCCATCACACCTCTTCGATCAAGTTAAGAAATTCATAGGACATGAGCGAGTAGACCTGATGACCCT
GGACTCCCAAGCTCCCAATGTTCGCAAAAAGGTAAAAGTAAAGCGGGAACGTCCAGACCTGGGGATGTTAAAAAAAGGGAAAATGGTGGGCACCCCTGAACTAGACAAGC
TCATTAAGATCGAAAAGGGATTACTACCCTTTAAGGGCCCACTACCTGACTTCCTTCCTGACCCAATTAAGGCGTTCAAATGGAAAAAGTTCTTTATCGTGGAAGGAGAC
AAAGTTCCCTCCACCGTAAAAGCGATTAGCAAGTTGTATGATTTACCTAATGGTTCTTACGCATATCCCGACCAAAGAATTATTGACAACCCAATGAGGAGTGATATGCT
CCCAACTCGCCATGACAGCATAGTTTCCATTGAACATGAGTTAGTTCTATACTATATCTTGATGAAGCAGCCATTTAATTTGAGGAGTATAATTAATGGAGCTCTCCTTG
TCTGGAGGAGGAACCCTAAGGGCGCAAAGCCTTTTCCGTCTACCATGGAGAAGTTATGTTTGAAGTACTTGCCCACCCTCGCGAGATACCACAAACTCCCATGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGACCATTTCCTCAATCTAAAAATCATGTTTACATCTTGTTGGCTGAAGACTATGTTTCTAAGTGGGTTGAAGCAATTTCCTGTGTTAAGAATGATGTAGTTACAGT
GAGTAGATTTCTGAAAAAGAACATCCTTACACATTTTGGGACCCCTAGAGCAATTATCAGCGATGAAGGACTCCATTTCGTTAATCATATCATCACTAAGGTGCTTGCAA
AGTACAATATAAGACATAAGATAGACATTGCCTATCATCCGCAAATAAATAGTCAAGCAGAAGTATCCAACAGAGAAATTAAGAAGATCTTAGAAAAATTGGTAAATCAT
TCCTGCAAGGATTGGGTAGATCACCTAGACTCTGCGCTTTGGGCATATCGTACAGCGTACAAGACGCCAATTGGGATGTCCCCTTATGGGATAGTTTTTAGGAAAGCTTG
CCACTTACCGCTAGAATTGAAACACAAGGCATCTTATTCGGAGGAAGCTCAAAAACCTAACCTAACCGCTAGAGACTCCATTCTTGAAGTTAATCAATGGATGAGGCAAG
AAGAGAAATTTTATACTGAATTATCAGACTTAGGCACAGGGGTGGAAACCACCCGTTTAGCAGGCAATTCGTCGTGTTACACGCATTTGAAGCATGGGGCGGGGAAAGTG
CTTGTTGACTGCAAGGAGATAAGAGGAGAGGCAAGTTTGCAAATTCCATCACACCTCTTCGATCAAGTTAAGAAATTCATAGGACATGAGCGAGTAGACCTGATGACCCT
GGACTCCCAAGCTCCCAATGTTCGCAAAAAGGTAAAAGTAAAGCGGGAACGTCCAGACCTGGGGATGTTAAAAAAAGGGAAAATGGTGGGCACCCCTGAACTAGACAAGC
TCATTAAGATCGAAAAGGGATTACTACCCTTTAAGGGCCCACTACCTGACTTCCTTCCTGACCCAATTAAGGCGTTCAAATGGAAAAAGTTCTTTATCGTGGAAGGAGAC
AAAGTTCCCTCCACCGTAAAAGCGATTAGCAAGTTGTATGATTTACCTAATGGTTCTTACGCATATCCCGACCAAAGAATTATTGACAACCCAATGAGGAGTGATATGCT
CCCAACTCGCCATGACAGCATAGTTTCCATTGAACATGAGTTAGTTCTATACTATATCTTGATGAAGCAGCCATTTAATTTGAGGAGTATAATTAATGGAGCTCTCCTTG
TCTGGAGGAGGAACCCTAAGGGCGCAAAGCCTTTTCCGTCTACCATGGAGAAGTTATGTTTGAAGTACTTGCCCACCCTCGCGAGATACCACAAACTCCCATGGTGA
Protein sequenceShow/hide protein sequence
MRPFPQSKNHVYILLAEDYVSKWVEAISCVKNDVVTVSRFLKKNILTHFGTPRAIISDEGLHFVNHIITKVLAKYNIRHKIDIAYHPQINSQAEVSNREIKKILEKLVNH
SCKDWVDHLDSALWAYRTAYKTPIGMSPYGIVFRKACHLPLELKHKASYSEEAQKPNLTARDSILEVNQWMRQEEKFYTELSDLGTGVETTRLAGNSSCYTHLKHGAGKV
LVDCKEIRGEASLQIPSHLFDQVKKFIGHERVDLMTLDSQAPNVRKKVKVKRERPDLGMLKKGKMVGTPELDKLIKIEKGLLPFKGPLPDFLPDPIKAFKWKKFFIVEGD
KVPSTVKAISKLYDLPNGSYAYPDQRIIDNPMRSDMLPTRHDSIVSIEHELVLYYILMKQPFNLRSIINGALLVWRRNPKGAKPFPSTMEKLCLKYLPTLARYHKLPW