; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G012000 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G012000
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionUlp1-like peptidase
Genome locationCG_Chr09:12916529..12919519
RNA-Seq ExpressionClCG09G012000
SyntenyClCG09G012000
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN49944.1 hypothetical protein Csa_000148 [Cucumis sativus]3.2e-4556.04Show/hide
Query:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV
        NRFP QA+S SSHIG  N+++KEKLT  QLD+F+RTVFGRFVDMD+VF+SPLVHH L+RE  V++   + M F+LNG    F K+EFLLVTGLW SP   
Subjt:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV

Query:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS
           V    +L TKYF       ++ + FEE YK L F  DLD VKV+ VY  EL + GK++TKS I+KSLLDDV+D    NS
Subjt:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS

XP_008437500.1 PREDICTED: uncharacterized protein LOC103482899 isoform X1 [Cucumis melo]2.5e-4556.59Show/hide
Query:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV
        NRFP QA+S SSHIG  N+++KEKLT  QLD+F+RTVFGRFVDMD+VF+SPLVHH L+RE  V++   + M F+LNG    F K+EFLLVTGLW SP   
Subjt:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV

Query:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS
           V    +L TKYF       ++ I FEE YK L F  DLD VKV+ VY  EL + GK++TKS I+KSLLDDV+D    NS
Subjt:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS

XP_008437501.1 PREDICTED: uncharacterized protein LOC103482899 isoform X2 [Cucumis melo]2.5e-4556.59Show/hide
Query:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV
        NRFP QA+S SSHIG  N+++KEKLT  QLD+F+RTVFGRFVDMD+VF+SPLVHH L+RE  V++   + M F+LNG    F K+EFLLVTGLW SP   
Subjt:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV

Query:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS
           V    +L TKYF       ++ I FEE YK L F  DLD VKV+ VY  EL + GK++TKS I+KSLLDDV+D    NS
Subjt:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS

XP_011654656.1 uncharacterized protein LOC105435430 isoform X1 [Cucumis sativus]3.2e-4556.04Show/hide
Query:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV
        NRFP QA+S SSHIG  N+++KEKLT  QLD+F+RTVFGRFVDMD+VF+SPLVHH L+RE  V++   + M F+LNG    F K+EFLLVTGLW SP   
Subjt:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV

Query:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS
           V    +L TKYF       ++ + FEE YK L F  DLD VKV+ VY  EL + GK++TKS I+KSLLDDV+D    NS
Subjt:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS

XP_031741885.1 uncharacterized protein LOC105435430 isoform X2 [Cucumis sativus]3.2e-4556.04Show/hide
Query:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV
        NRFP QA+S SSHIG  N+++KEKLT  QLD+F+RTVFGRFVDMD+VF+SPLVHH L+RE  V++   + M F+LNG    F K+EFLLVTGLW SP   
Subjt:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV

Query:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS
           V    +L TKYF       ++ + FEE YK L F  DLD VKV+ VY  EL + GK++TKS I+KSLLDDV+D    NS
Subjt:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS

TrEMBL top hitse value%identityAlignment
A0A0A0KM59 DUF1985 domain-containing protein1.6e-4556.04Show/hide
Query:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV
        NRFP QA+S SSHIG  N+++KEKLT  QLD+F+RTVFGRFVDMD+VF+SPLVHH L+RE  V++   + M F+LNG    F K+EFLLVTGLW SP   
Subjt:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV

Query:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS
           V    +L TKYF       ++ + FEE YK L F  DLD VKV+ VY  EL + GK++TKS I+KSLLDDV+D    NS
Subjt:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS

A0A1S3ATU8 uncharacterized protein LOC103482899 isoform X11.2e-4556.59Show/hide
Query:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV
        NRFP QA+S SSHIG  N+++KEKLT  QLD+F+RTVFGRFVDMD+VF+SPLVHH L+RE  V++   + M F+LNG    F K+EFLLVTGLW SP   
Subjt:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV

Query:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS
           V    +L TKYF       ++ I FEE YK L F  DLD VKV+ VY  EL + GK++TKS I+KSLLDDV+D    NS
Subjt:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS

A0A1S3AUB0 uncharacterized protein LOC103482899 isoform X21.2e-4556.59Show/hide
Query:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV
        NRFP QA+S SSHIG  N+++KEKLT  QLD+F+RTVFGRFVDMD+VF+SPLVHH L+RE  V++   + M F+LNG    F K+EFLLVTGLW SP   
Subjt:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV

Query:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS
           V    +L TKYF       ++ I FEE YK L F  DLD VKV+ VY  EL + GK++TKS I+KSLLDDV+D    NS
Subjt:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS

A0A5A7TGU0 Ulp1-like peptidase2.0e-4556.59Show/hide
Query:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV
        NRFP QA+S SSHIG  N+++KEKLT  QLD+F+RTVFGRFVDMD+VF+SPLVHH L+RE  V++   + M F+LNG    F K+EFLLVTGLW SP   
Subjt:  NRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVV

Query:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS
           V    +L TKYF       ++ I FEE YK L F  DLD VKV+ VY  EL + GK++TKS I+KSLLDDV+D    NS
Subjt:  PCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS

A0A6J1DSS5 uncharacterized protein LOC1110239695.2e-3347.87Show/hide
Query:  KGERANRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFR-RTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLW
        K   A+RFPAQ TS  SH+   N++I +KLT  QLDMFR RT+FGRFVD+D++F S LVH+FL+RE  V +   + M FD+ G  V F K EFLL+TGLW
Subjt:  KGERANRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFR-RTVFGRFVDMDIVFNSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLW

Query:  RSPVVVPCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS
        RS   V     +   LR +YF  + +  + L  FE+ YK + F  D D VKVS +Y  E+ + GKNK KSN+DK L   V+D    N+
Subjt:  RSPVVVPCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSGKNKTKSNIDKSLLDDVDDTHVDNS

SwissProt top hitse value%identityAlignment
Q9LJ64 Pollen-specific leucine-rich repeat extensin-like protein 11.3e-0440.14Show/hide
Query:  DTLRTSPIPPPTSRPLPPILIALLSSRSPSIPRPLPPILIPPPRLRPPHLLILSLRPPILSLRPPPILSLRPPPILSLRPPPISSPRPPI-----PISTP
        +T  TSP  PP   P PP          P +  P PP+  PPP +  P   + S  PP+ S  PPP+ S  PPP +   PPP+ SP PP+     P+ +P
Subjt:  DTLRTSPIPPPTSRPLPPILIALLSSRSPSIPRPLPPILIPPPRLRPPHLLILSLRPPILSLRPPPILSLRPPPILSLRPPPISSPRPPI-----PISTP

Query:  PPPILSLIP--LPPPPIPSLIPPPPSPIPRPSPISILIPPQP
        PPP+ S  P    PPP P   PPPP+PI  P P  +  PP P
Subjt:  PPPILSLIP--LPPPPIPSLIPPPPSPIPRPSPISILIPPQP

Q9XIB6 Pollen-specific leucine-rich repeat extensin-like protein 21.3e-0438.27Show/hide
Query:  PVDPPVVSKRRVWSGEHDHTLVDDRVAKVEAKVDTLRTSPIPPPTSRPLPPILIALLSSRSPSIPRPLPPILIPPPRLRPPHLLILSLRPPILS----LR
        PV PP  ++       +D + V +R +    KV+  R  P  PP   P PP   + + S  P +  P PP+   PP   PPH  + S  PP+ S      
Subjt:  PVDPPVVSKRRVWSGEHDHTLVDDRVAKVEAKVDTLRTSPIPPPTSRPLPPILIALLSSRSPSIPRPLPPILIPPPRLRPPHLLILSLRPPILS----LR

Query:  PPPILSLRPPPILSLRPPPISSPRPPIPISTPPPPILSLIPLPPPPIPSLIPPPPSPIPRPS
        PPP+ S  PPP+ S  PPP+ SP PP P+ +PPPP  S    PPPP+ S  PPPP+  P P+
Subjt:  PPPILSLRPPPILSLRPPPISSPRPPIPISTPPPPILSLIPLPPPPIPSLIPPPPSPIPRPS

Arabidopsis top hitse value%identityAlignment
AT1G49490.1 Leucine-rich repeat (LRR) family protein9.1e-0638.27Show/hide
Query:  PVDPPVVSKRRVWSGEHDHTLVDDRVAKVEAKVDTLRTSPIPPPTSRPLPPILIALLSSRSPSIPRPLPPILIPPPRLRPPHLLILSLRPPILS----LR
        PV PP  ++       +D + V +R +    KV+  R  P  PP   P PP   + + S  P +  P PP+   PP   PPH  + S  PP+ S      
Subjt:  PVDPPVVSKRRVWSGEHDHTLVDDRVAKVEAKVDTLRTSPIPPPTSRPLPPILIALLSSRSPSIPRPLPPILIPPPRLRPPHLLILSLRPPILS----LR

Query:  PPPILSLRPPPILSLRPPPISSPRPPIPISTPPPPILSLIPLPPPPIPSLIPPPPSPIPRPS
        PPP+ S  PPP+ S  PPP+ SP PP P+ +PPPP  S    PPPP+ S  PPPP+  P P+
Subjt:  PPPILSLRPPPILSLRPPPISSPRPPIPISTPPPPILSLIPLPPPPIPSLIPPPPSPIPRPS

AT3G19020.1 Leucine-rich repeat (LRR) family protein9.1e-0640.14Show/hide
Query:  DTLRTSPIPPPTSRPLPPILIALLSSRSPSIPRPLPPILIPPPRLRPPHLLILSLRPPILSLRPPPILSLRPPPILSLRPPPISSPRPPI-----PISTP
        +T  TSP  PP   P PP          P +  P PP+  PPP +  P   + S  PP+ S  PPP+ S  PPP +   PPP+ SP PP+     P+ +P
Subjt:  DTLRTSPIPPPTSRPLPPILIALLSSRSPSIPRPLPPILIPPPRLRPPHLLILSLRPPILSLRPPPILSLRPPPILSLRPPPISSPRPPI-----PISTP

Query:  PPPILSLIP--LPPPPIPSLIPPPPSPIPRPSPISILIPPQP
        PPP+ S  P    PPP P   PPPP+PI  P P  +  PP P
Subjt:  PPPILSLIP--LPPPPIPSLIPPPPSPIPRPSPISILIPPQP

AT4G13340.1 Leucine-rich repeat (LRR) family protein5.0e-0440.97Show/hide
Query:  SPIPPPTSRPLPPILIALLSS----RSPSIPRPLPPILIPPPRLRPPHLLILSLRPPILSLRPPPILSLRPPPILSLRPPPISSPRPPIPISTPPPPILS
        +P+PPP S P PP    + S+     SP  P P PP+  PPP   PP   + S  PP     PPP+ S  PPP     PPP+ SP PP P   PPPP+ S
Subjt:  SPIPPPTSRPLPPILIALLSS----RSPSIPRPLPPILIPPPRLRPPHLLILSLRPPILSLRPPPILSLRPPPILSLRPPPISSPRPPIPISTPPPPILS

Query:  LIPLPPPPIPSLI----------PPPPSPIPRPSPISILIPPQP
          P PPPP P  +           PPP P P P+P+    PP P
Subjt:  LIPLPPPPIPSLI----------PPPPSPIPRPSPISILIPPQP

AT4G33970.1 Leucine-rich repeat (LRR) family protein1.7e-0441.1Show/hide
Query:  TLRTSPIPPPTSRPLPPILIALLSSRSPSIPRPLPPILIPPP----RLRPPHLLILSLRPPILSLRPPPILSLRPPPILSLRPPPISSPRPPIPISTPPP
        T R SP P P + P PP+         P +  P PP+  PPP       PP   + S  PP+ S  PPP+ S  PPP +   PPP+ SP PP P+ +PPP
Subjt:  TLRTSPIPPPTSRPLPPILIALLSSRSPSIPRPLPPILIPPP----RLRPPHLLILSLRPPILSLRPPPILSLRPPPILSLRPPPISSPRPPIPISTPPP

Query:  PILSLIPLPPPPIPSLIPPPP--SPIPRPSPISILIPPQPTSLISS
        P+ S    PPPP P   PPPP  SP P  SP  +  PP     I+S
Subjt:  PILSLIPLPPPPIPSLIPPPP--SPIPRPSPISILIPPQPTSLISS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCCGGGGCAGGGCGAGGGGGTTCCCAATGCCTGCCCCGTGGGGGGAATTTACCCCATCCTTGCCCCCGTCCCTGCCATTTGTAGGCAGGGATGGGGGCGAGGATT
CCCTGTCGGGGAAACAGGTCCCCGTAGGACCCATTCTTCATTATTCTCTGCGAGAGCAAGCAAGGGAGAGCGAGCGAATCGTTTTCCTGCCCAAGCTACTAGCTTTTCAT
CACACATTGGAGTTGTGAATCGGCTCATTAAAGAGAAGCTAACAGATGGGCAGCTAGATATGTTTAGGAGGACGGTGTTTGGGAGGTTTGTGGACATGGACATAGTGTTC
AACAGTCCATTAGTCCACCATTTTCTGGTGAGGGAGGTACACGTCCAGAACCATAATGATGAGACCATGTACTTCGACCTCAATGGTCATACTGTAAAATTCTTCAAGGA
CGAATTTCTACTAGTCACTGGACTCTGGAGATCACCAGTGGTAGTTCCTTGCACCGTTGCAACTATCGGTGCACTTAGGACGAAATATTTCCACCGTCTGTGTGAGGCGA
AGATACACCTGATCGCCTTTGAGGAGAATTACAAGGGGCTACAATTTGAGACTGACCTAGATGTGGTGAAGGTTAGTTTTGTATACATGTATGAGCTTGGGTTGAGTGGG
AAGAACAAGACGAAGAGCAACATCGATAAATCGTTGCTCGATGACGTTGATGACACACATGTCGACAATAGTGAGGAGCACCCTCTCGCTCCAAATCATCATGAGGAGTG
GCCTGTTGATCCTCCAGTTGTATCCAAGAGGAGGGTTTGGAGTGGTGAGCATGACCACACTTTAGTCGATGACCGGGTTGCAAAAGTGGAGGCGAAGGTAGATACGTTGA
GGACGTCCCCGATCCCACCACCCACATCGAGACCACTACCACCCATCCTGATAGCACTGCTGAGCTCGAGATCACCATCCATACCAAGACCACTACCACCCATCTTGATA
CCCCCACCGAGACTGAGACCACCACACCTTCTTATCCTGAGCCTGAGACCACCCATATTGAGTCTGAGACCACCACCCATATTGAGTCTGAGACCACCACCCATCCTGAG
CTTGAGACCACCACCCATCTCGAGCCCGAGACCACCCATCCCGATCTCGACACCACCACCACCCATCCTGAGCTTGATACCACTACCACCACCACCCATCCCGAGCCTGA
TACCACCACCACCATCACCTATCCCAAGACCATCACCCATCTCAATCCTGATACCACCACAACCAACATCATTGATCTCATCATTGCAGACATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGCCGGGGCAGGGCGAGGGGGTTCCCAATGCCTGCCCCGTGGGGGGAATTTACCCCATCCTTGCCCCCGTCCCTGCCATTTGTAGGCAGGGATGGGGGCGAGGATT
CCCTGTCGGGGAAACAGGTCCCCGTAGGACCCATTCTTCATTATTCTCTGCGAGAGCAAGCAAGGGAGAGCGAGCGAATCGTTTTCCTGCCCAAGCTACTAGCTTTTCAT
CACACATTGGAGTTGTGAATCGGCTCATTAAAGAGAAGCTAACAGATGGGCAGCTAGATATGTTTAGGAGGACGGTGTTTGGGAGGTTTGTGGACATGGACATAGTGTTC
AACAGTCCATTAGTCCACCATTTTCTGGTGAGGGAGGTACACGTCCAGAACCATAATGATGAGACCATGTACTTCGACCTCAATGGTCATACTGTAAAATTCTTCAAGGA
CGAATTTCTACTAGTCACTGGACTCTGGAGATCACCAGTGGTAGTTCCTTGCACCGTTGCAACTATCGGTGCACTTAGGACGAAATATTTCCACCGTCTGTGTGAGGCGA
AGATACACCTGATCGCCTTTGAGGAGAATTACAAGGGGCTACAATTTGAGACTGACCTAGATGTGGTGAAGGTTAGTTTTGTATACATGTATGAGCTTGGGTTGAGTGGG
AAGAACAAGACGAAGAGCAACATCGATAAATCGTTGCTCGATGACGTTGATGACACACATGTCGACAATAGTGAGGAGCACCCTCTCGCTCCAAATCATCATGAGGAGTG
GCCTGTTGATCCTCCAGTTGTATCCAAGAGGAGGGTTTGGAGTGGTGAGCATGACCACACTTTAGTCGATGACCGGGTTGCAAAAGTGGAGGCGAAGGTAGATACGTTGA
GGACGTCCCCGATCCCACCACCCACATCGAGACCACTACCACCCATCCTGATAGCACTGCTGAGCTCGAGATCACCATCCATACCAAGACCACTACCACCCATCTTGATA
CCCCCACCGAGACTGAGACCACCACACCTTCTTATCCTGAGCCTGAGACCACCCATATTGAGTCTGAGACCACCACCCATATTGAGTCTGAGACCACCACCCATCCTGAG
CTTGAGACCACCACCCATCTCGAGCCCGAGACCACCCATCCCGATCTCGACACCACCACCACCCATCCTGAGCTTGATACCACTACCACCACCACCCATCCCGAGCCTGA
TACCACCACCACCATCACCTATCCCAAGACCATCACCCATCTCAATCCTGATACCACCACAACCAACATCATTGATCTCATCATTGCAGACATAGATCCGACCACCACCG
ATCCCAATCTGGCCACCACCTCTATCGGGCCGATCTCAGTGGCCATTCTTGATCCTTCTGTAAGACATTGAAACATGCAAGTGGAAGCAATTGAATCTTAAACACTCTAA
TTTGCTTTAATTTGAAAGAAAAACATGCTGAATTTCAAAGGTAAAATAAAGAGCTAAGAAATATCACATTTGTAGCACTAAAAACTTGCAATTTATCCTTCAATTTGATC
ACA
Protein sequenceShow/hide protein sequence
MGPGQGEGVPNACPVGGIYPILAPVPAICRQGWGRGFPVGETGPRRTHSSLFSARASKGERANRFPAQATSFSSHIGVVNRLIKEKLTDGQLDMFRRTVFGRFVDMDIVF
NSPLVHHFLVREVHVQNHNDETMYFDLNGHTVKFFKDEFLLVTGLWRSPVVVPCTVATIGALRTKYFHRLCEAKIHLIAFEENYKGLQFETDLDVVKVSFVYMYELGLSG
KNKTKSNIDKSLLDDVDDTHVDNSEEHPLAPNHHEEWPVDPPVVSKRRVWSGEHDHTLVDDRVAKVEAKVDTLRTSPIPPPTSRPLPPILIALLSSRSPSIPRPLPPILI
PPPRLRPPHLLILSLRPPILSLRPPPILSLRPPPILSLRPPPISSPRPPIPISTPPPPILSLIPLPPPPIPSLIPPPPSPIPRPSPISILIPPQPTSLISSLQT