CuGenDBv2

Clc01G21890 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G21890
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Sulfate adenylyltransferase subunit 2
Genome location: ClcChr01:33277388..33286703
RNA-Seq Expression: Clc01G21890
Synteny: Clc01G21890
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0016740 - transferase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits    e value    % identity    Alignment
XP_004134798.1 uncharacterized protein LOC101207146 [Cucumis sativus]    4.5e-93    87.26
Query:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGG--
        MLQ LNLRPP PILAL  S+SDDPTCS LPL RPRN  HNWALLQS LKCNGRFSCLFSDNR+EEQARKALESALGGKKNEFEKWNNEIKKREE+GGG  
Subjt:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGG--

Query:  -GGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVS
         GGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYL+VAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRK+S SN AE E ISNK     VS
Subjt:  -GGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVS

Query:  AKERVARKWGSD
        AK+RVARKWGSD
Subjt:  AKERVARKWGSD

XP_008440068.1 PREDICTED: uncharacterized protein LOC103484656 isoform X1 [Cucumis melo]    1.6e-90    86.73
Query:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRRE-EQARKALESALGGKKNEFEKWNNEIKKREEMGGG-
        MLQ LNLR   PILAL  S+SDDPTCSALPLLRPRN THNWALL SNLKCNGRFSCLFS+NRRE EQARKALESALGGKKNEFEKWNNEIKKREE+GGG 
Subjt:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRRE-EQARKALESALGGKKNEFEKWNNEIKKREEMGGG-

Query:  GGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSA
        GGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAV GIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTFVTSK LRK+S SN AE E ISNK     V+A
Subjt:  GGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSA

Query:  KERVARKWGSD
        K+RVARKWGSD
Subjt:  KERVARKWGSD

XP_008440069.1 PREDICTED: uncharacterized protein LOC103484656 isoform X2 [Cucumis melo]    6.5e-92    87.14
Query:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGG-G
        MLQ LNLR   PILAL  S+SDDPTCSALPLLRPRN THNWALL SNLKCNGRFSCLFS+NRREEQARKALESALGGKKNEFEKWNNEIKKREE+GGG G
Subjt:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGG-G

Query:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK
        GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAV GIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTFVTSK LRK+S SN AE E ISNK     V+AK
Subjt:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK

Query:  ERVARKWGSD
        +RVARKWGSD
Subjt:  ERVARKWGSD

XP_023545023.1 uncharacterized protein LOC111804448 [Cucurbita pepo subsp. pepo]    4.2e-91    86.67
Query:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGG-GG
        MLQVLNL PPT  LAL  SISDDPT S LPLLRPRNATH WALLQS LKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEM G GG
Subjt:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGG-GG

Query:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK
        GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKG +LLAV+ NPLLYALRGTRNGLT VTSKILRK   SN AEF  ISN+ VSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK

Query:  ERVARKWGSD
        +RVARKW +D
Subjt:  ERVARKWGSD

XP_038881277.1 uncharacterized protein LOC120072833 [Benincasa hispida]    4.1e-94    90.95
Query:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGGG-
        MLQVLNLRPPTPILAL  SIS D TCSAL LLRPRNATHNWALLQSNLKCNGRFSCLF DNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGGG 
Subjt:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGGG-

Query:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK
        GGRGGWFGSG WFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILR TS SN AE E ISNK+    VSAK
Subjt:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK

Query:  ERVARKWGSD
        ERVA+KWGSD
Subjt:  ERVARKWGSD

TrEMBL top hits    e value    % identity    Alignment
A0A1S3AZU3 uncharacterized protein LOC103484656 isoform X1    7.8e-91    86.73
Query:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRRE-EQARKALESALGGKKNEFEKWNNEIKKREEMGGG-
        MLQ LNLR   PILAL  S+SDDPTCSALPLLRPRN THNWALL SNLKCNGRFSCLFS+NRRE EQARKALESALGGKKNEFEKWNNEIKKREE+GGG 
Subjt:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRRE-EQARKALESALGGKKNEFEKWNNEIKKREEMGGG-

Query:  GGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSA
        GGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAV GIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTFVTSK LRK+S SN AE E ISNK     V+A
Subjt:  GGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSA

Query:  KERVARKWGSD
        K+RVARKWGSD
Subjt:  KERVARKWGSD

A0A1S3B0A1 uncharacterized protein LOC103484656 isoform X2    3.2e-92    87.14
Query:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGG-G
        MLQ LNLR   PILAL  S+SDDPTCSALPLLRPRN THNWALL SNLKCNGRFSCLFS+NRREEQARKALESALGGKKNEFEKWNNEIKKREE+GGG G
Subjt:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGG-G

Query:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK
        GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAV GIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTFVTSK LRK+S SN AE E ISNK     V+AK
Subjt:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK

Query:  ERVARKWGSD
        +RVARKWGSD
Subjt:  ERVARKWGSD

A0A6J1BSB9 uncharacterized protein LOC111005307    4.0e-87    81.43
Query:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGGG-
        ML++LNL PP      P SI D+ T S +P +RPRN+ HNWA LQ+ LKCN RFSCLFSDNRREEQARKALESALG KKNEFEKWNNEIKKREEMGGGG 
Subjt:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGGG-

Query:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK
        GG+GGWFGSGGWFGWSDD FWPEAQQTSLAVLGIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTF+TSKILRK+S  N AEF+ ISN++VSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK

Query:  ERVARKWGSD
        E+VARKWGSD
Subjt:  ERVARKWGSD

A0A6J1GG46 uncharacterized protein LOC111453631    8.6e-90    84.76
Query:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGG-GG
        MLQVLNL P +  LAL  S+SDDPT S LPLLRPRNATH WALLQS LKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEM G GG
Subjt:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGG-GG

Query:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK
        GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKG +L+AV+ NPLLYALRGTRNGLT VTSKILRK   SN AEF+ ISN+ VSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK

Query:  ERVARKWGSD
        +RVARKW +D
Subjt:  ERVARKWGSD

A0A6J1IV71 uncharacterized protein LOC111478850    4.7e-88    84.29
Query:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGG-GG
        MLQVLNL PPT  LAL  SISDD T S LPL RPRNATH WALLQS LKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEM G GG
Subjt:  MLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGG-GG

Query:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK
        GGRGGWFGSGGWFGWSDDQFW EAQQTSLAVLGIIVMYLIVAKG +LLAV+ NPLLYALRGTRNGLT VTSK LRK   +N AEF+ ISN+ VSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSVSNSAEFEAISNKQVSGHVSAK

Query:  ERVARKWGSD
        +RVARKW +D
Subjt:  ERVARKWGSD

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
AT4G22370.1 unknown protein    4.8e-08    39.13
Query:  IIRGLQDSLVFSVQPTTKDGSMVGIKWRVGWHKPLIGSEKGVNIHSHHIYTGKLLIGNFEMLLDPLLQL
        +I  L   +   V+PT KDG  VG++W++   K  I   KG + H  H+Y GKLLI N EM ++P+  +
Subjt:  IIRGLQDSLVFSVQPTTKDGSMVGIKWRVGWHKPLIGSEKGVNIHSHHIYTGKLLIGNFEMLLDPLLQL

AT5G20130.1 unknown protein    4.1e-44    59.43
Query:  WALLQSNLKCNGRFSCLFS-DNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGG----GGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIV
        ++L  S  K  GRFSCLFS  N+REEQARK+LESALGGKKNEFEKW+ EIKKREE GGG    GGG GGWFG GGWF  S D FW EAQQ +  +L I+ 
Subjt:  WALLQSNLKCNGRFSCLFS-DNRREEQARKALESALGGKKNEFEKWNNEIKKREEMGGG----GGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIV

Query:  MYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIL-RKTSVSNSAEFEAISNKQVSGHVSAKERVARKWGSD
        +Y++VAKGE++ A V NPLLYALRGTR GL+ ++SK++ R+ S  +    E +  K+ S   +AKE V RKWGSD
Subjt:  MYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIL-RKTSVSNSAEFEAISNKQVSGHVSAKERVARKWGSD


Sequences
CDS sequence
ATGGAAAACCATCAAATCCCAAAATTTCACTCTTACAATACAATGGCTGTTTATTCATTAGCAAATTCCATCCATTTCCATATCCCCAACACATTCTCAAACTCAACCTC
CAACAACCTCAATGCCACCACCTCTCTAAGCAGTTGTGTTCCTCCGGCGCTCTCGGTCAACCGTCGAGGCTCTTTGTGCGTTAAATGCCGTGGTAGTGCCAAGCCGGAAA
ATAAAAACCACGACGAAAATGACCCTCTTGAAACAATCGACAAACTTTACAAAACCATCAAGAAGAAAGACATCGTCGAATTGTCTAATAAAATGTGGAATATGGTGTCC
AATATCATAAGGGGATTACAAGATAGCCTAGTATTTTCAGTGCAGCCAACAACAAAAGATGGCTCGATGGTGGGCATTAAATGGAGAGTAGGGTGGCATAAACCGCTCAT
AGGGTCCGAAAAAGGAGTCAATATCCATTCTCATCATATCTATACCGGAAAATTGCTTATTGGAAATTTTGAAATGTTATTGGATCCTCTTCTTCAACTCGGGCCAACAA
AGATGATGAAATGGAATGAAGAGTCAAAGTCAAAAGAGAAGAGAATTGCATCAATGTGTTTGGGTGTTTTCCTCCTCCTTGTATCACTCTTTTGTCTCCAGTTTTCTGTT
CTTATCCCTTTGAATGCAAAACAAACTCAAAACCCTGAGGCCAAGGCTAGCGCTCGTTTTTCTTCTCCGACTCTGCCCATCGCATTTGAAAGCGCAGAGGCCAGCGTTTA
TCGGCCGGGATATGCGAAAACAATGCTTCAGGTTCTCAATCTAAGACCGCCAACTCCGATTCTGGCCCTACCGATCTCGATTTCCGATGACCCAACTTGCTCCGCCCTCC
CATTACTTCGCCCTCGTAATGCAACGCACAATTGGGCGCTTTTACAGTCCAACCTCAAGTGCAACGGCAGATTCTCTTGCCTTTTCTCAGACAATAGAAGAGAGGAACAG
GCAAGGAAGGCATTAGAAAGTGCACTTGGGGGAAAGAAAAATGAATTTGAGAAATGGAATAATGAAATAAAGAAAAGAGAGGAGATGGGTGGTGGTGGAGGTGGACGAGG
AGGTTGGTTTGGATCGGGAGGATGGTTTGGTTGGTCTGATGACCAATTCTGGCCAGAAGCACAACAGACTAGTCTTGCTGTTTTAGGTATAATTGTCATGTATCTCATAG
TTGCGAAAGGTGAACTGTTGCTTGCTGTTGTTTTCAACCCACTGCTTTATGCTTTGCGAGGAACGAGAAATGGATTGACTTTTGTTACTTCAAAAATTTTGAGAAAGACC
TCTGTTAGTAATTCTGCTGAGTTTGAGGCGATTTCAAACAAACAAGTCTCTGGCCATGTCTCTGCCAAAGAGAGAGTTGCAAGGAAATGGGGGAGCGATTGA
mRNA sequence
ATGGAAAACCATCAAATCCCAAAATTTCACTCTTACAATACAATGGCTGTTTATTCATTAGCAAATTCCATCCATTTCCATATCCCCAACACATTCTCAAACTCAACCTC
CAACAACCTCAATGCCACCACCTCTCTAAGCAGTTGTGTTCCTCCGGCGCTCTCGGTCAACCGTCGAGGCTCTTTGTGCGTTAAATGCCGTGGTAGTGCCAAGCCGGAAA
ATAAAAACCACGACGAAAATGACCCTCTTGAAACAATCGACAAACTTTACAAAACCATCAAGAAGAAAGACATCGTCGAATTGTCTAATAAAATGTGGAATATGGTGTCC
AATATCATAAGGGGATTACAAGATAGCCTAGTATTTTCAGTGCAGCCAACAACAAAAGATGGCTCGATGGTGGGCATTAAATGGAGAGTAGGGTGGCATAAACCGCTCAT
AGGGTCCGAAAAAGGAGTCAATATCCATTCTCATCATATCTATACCGGAAAATTGCTTATTGGAAATTTTGAAATGTTATTGGATCCTCTTCTTCAACTCGGGCCAACAA
AGATGATGAAATGGAATGAAGAGTCAAAGTCAAAAGAGAAGAGAATTGCATCAATGTGTTTGGGTGTTTTCCTCCTCCTTGTATCACTCTTTTGTCTCCAGTTTTCTGTT
CTTATCCCTTTGAATGCAAAACAAACTCAAAACCCTGAGGCCAAGGCTAGCGCTCGTTTTTCTTCTCCGACTCTGCCCATCGCATTTGAAAGCGCAGAGGCCAGCGTTTA
TCGGCCGGGATATGCGAAAACAATGCTTCAGGTTCTCAATCTAAGACCGCCAACTCCGATTCTGGCCCTACCGATCTCGATTTCCGATGACCCAACTTGCTCCGCCCTCC
CATTACTTCGCCCTCGTAATGCAACGCACAATTGGGCGCTTTTACAGTCCAACCTCAAGTGCAACGGCAGATTCTCTTGCCTTTTCTCAGACAATAGAAGAGAGGAACAG
GCAAGGAAGGCATTAGAAAGTGCACTTGGGGGAAAGAAAAATGAATTTGAGAAATGGAATAATGAAATAAAGAAAAGAGAGGAGATGGGTGGTGGTGGAGGTGGACGAGG
AGGTTGGTTTGGATCGGGAGGATGGTTTGGTTGGTCTGATGACCAATTCTGGCCAGAAGCACAACAGACTAGTCTTGCTGTTTTAGGTATAATTGTCATGTATCTCATAG
TTGCGAAAGGTGAACTGTTGCTTGCTGTTGTTTTCAACCCACTGCTTTATGCTTTGCGAGGAACGAGAAATGGATTGACTTTTGTTACTTCAAAAATTTTGAGAAAGACC
TCTGTTAGTAATTCTGCTGAGTTTGAGGCGATTTCAAACAAACAAGTCTCTGGCCATGTCTCTGCCAAAGAGAGAGTTGCAAGGAAATGGGGGAGCGATTGATTTTTCTT
CCCATTTACTTTTGGCCTTAATGTTTCAACTATTCAAATAGGTTGCTATGCGAATTTTTGGAAAATGTTTGGCTCCTAGAATTAGTTTTTTCTTTTCTTCTTTTTTTTAT
GTTGGTTGCTTTCTTTCATTTTAAAGAATAGCTTCCTTAAACGAGATGCTTGAGTTTGTTCCCCTTTTCAACAAAAACCACCCCAAAAGAGACAAAAAAAAAATTGAATT
ATATATATATATAGGGTAGGGAAATCTGAACCCTCGCGATCACTAACACATCTATATGCTAGTTAAGCTATTTTGCTTTCGCTGAATTATGTAATCTTTAAAACTTGTTA
TATTTGAAATTTATTTTCATAAACATACATGTCTACTTTAGTTGTTTACAGATAAAAAAACATAATTATTGTACAATTCAATGAATATTTCCATGCTGAACACTTTCCCT
TGACTTGTTTTTTTTAGTACAAGAATTAAATTATTATAAGGCAAATTGCAAAAACCACCCTAAAGTATATGGTAGTAGTTATAATTACACACTTCAAACTTTCAATTTTA
AAAATTAAGTTCTTAAACTTATATAAAAGTTTGAATTGAGTCAAATTGAACATTTAAGGTTATATAATTGTATAAGTTGTAACATTTTATAACTTATATTGATGGTCTAA
TTTAAATACTTGTAAAAGTTTAGGGGTACCTTTTTTACAATTGAAAGTTTGAAAAGTATAATTGCAACTACCACCATACTTTGAGGGTGATTTTTGTAATTTGTCCAAAA
TAATAATTATTCTTTCTTTAGAAATGCCAACATTGTGTTGCAAATCATCTCTAAGCTGTATCAAACATCATCTGTCCCTCTAAAAACCAAAAGAGTAAAAGGGAATAAAA
ACTTGACTTTTTTGGATCAATGATTGTGGGGTCGGACTATTGACTTTTATCTCATGTGATATTTGATTTATTAAACTGTGTTATCAAATTTCCAATAAAAACTTCAATCA
TTTAAGCTATCCACTTTTGAAATGGCAGGTGATATTTTATCCATTAAACTATGTTATCAAATTCACAATAAAAACTTCAATCATTCAACACAATTCGCACGATCCGAGAT
TTAGTTGAAAGTATATAACTTTGGTCATAAGAAATCACTCAGTAAGAAAAGCTGTTGTTTTCTATCTCAAATATCAATTAACATAAATAAATAAAATAAAGATTGCTTAT
GAACAAACCCTTTAGAAGATTACATACAAATTGTCCCGTGGAATTTCTTAAAATAATTTTAAAATAAAAAGGTAATAGAGAAGTTCAATTTTTATGTTTACCTTTATTTA
GGTCCTTTTATCTTTCAATTTTATCCATTTAAACATTGAATTTGAACAAGTGTTGGTACTAATACCTCAAACTAAATTTACAAATTGCTTATGAGAAGCAAGTCACAAAA
TCAAGTGAAAAAAAAGTTCAAACTTTAACAGCAAAATAAAAGTCCGAGACTTCAAAGATATCAAAACTTAAACCAACTGTGGATATTAATTTCACAATTAATTAAAGGCA
TGGTTTAAGTTGCACCACTTATTGAACACAAAAATTTCCCAAAAAACAGAAAAAACAAAACCCAAATCAAAGATCCAGCTAAGCATGGAATTGAGCAATCACTTTCAAGC
ATTTAAAGAGTTGAAATCATAAATTTCAAAAATAAGTTGGAGAAGACAACCACAGAAGTTATAAGAACATGAAGATTGTAGGCGGCAGCAACAGATTTCCATTAAAACCA
ATGATTTGCAAGAATCTTCTCCGCAAGAACAGCTGAATTTGGTGACAGGAAAACCCTGGATTCTTGGCCTTCGTTGTACCAAACAATGCCCTATGACCTCGAAATGGGTC
TTGACAACAATGAAGATGGAAATGCATCTCCTGAGACCAAGAAAATAGGTGGAGTTGAAGAAGAAGAAGGAGATGAAAGTGTTGGGTTGAGTAGGAAAATGAGTGAAACT
TCTATATGTGCAGCAGAGGATGATGAAGATGAAGAGGGAAGGAAGATTGAGTTAGGTCCTCAGTGTACATTGAAAGAACAACTTGAGAAAGATAAGGATGATGAAAGCTT
GAGGAGGTGGAAGGAGCAGCTTCTAGGAAGTGTGGATATTGCAGCTGTTGGAGAAACTTTGGAACCAGAGGTAAAGATTCTGAGCCTAGCGATTAGAACTCCAGGAAGGC
CGGACATTGTTCTACAAGTTCCTGAGAATGGAAATCCAAAAGGGCTATGGTTTACATTGAAAGAAGGTAGCCGTTACAGCTTGATCTTCACCTTCCAAGTTAGCAATAAC
ATTGTTGCAGGTCTCAAATACGCCAACACAGTCTGGAAAACTGGTGTCAAAGTGGATAGTTCAAAAGAAATGCTGGGAACTTTTAGTCCTCAGGAAGAACCTTACACCCA
TGAAATGCCTGAAGATACAACCCCATCTGGGATCTTTGCTCGAGGATCATATTCAGCAAGAACCAAGTTTGTTGATGATGATGACAAGTGCTACCTGGAAATCAACTACA
CATTTGATATAAGGAAAGATTGGCAATCATCTTAAAAAACTTCAAGATTCTCTGTTAATGATTGGTTTTGCATTTGATCAAAATATCTTCTCTTTTGAAGCTCATTCATA
AATTTGGTTTGGATGTCCAAATATGTCATGTACTCTCTCTTTCTTTTAAGTCTTGATTCTTTCATGTACGTGCATGAGACTGATGGCCTCAACTCAAATTATGCAAATGC
ATATTATTGATAAAATTCATTCGGAGCTGTTCATGAAAAAGGGTCAATTATTCAATTTCCTTCCAACATTTGGGC
Protein sequence
MENHQIPKFHSYNTMAVYSLANSIHFHIPNTFSNSTSNNLNATTSLSSCVPPALSVNRRGSLCVKCRGSAKPENKNHDENDPLETIDKLYKTIKKKDIVELSNKMWNMVS
NIIRGLQDSLVFSVQPTTKDGSMVGIKWRVGWHKPLIGSEKGVNIHSHHIYTGKLLIGNFEMLLDPLLQLGPTKMMKWNEESKSKEKRIASMCLGVFLLLVSLFCLQFSV
LIPLNAKQTQNPEAKASARFSSPTLPIAFESAEASVYRPGYAKTMLQVLNLRPPTPILALPISISDDPTCSALPLLRPRNATHNWALLQSNLKCNGRFSCLFSDNRREEQ
ARKALESALGGKKNEFEKWNNEIKKREEMGGGGGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKT
SVSNSAEFEAISNKQVSGHVSAKERVARKWGSD