; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G22810 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G22810
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF1635)
Genome locationClcChr02:34863627..34865410
RNA-Seq ExpressionClc02G22810
SyntenyClc02G22810
Gene Ontology termsNA
InterPro domainsIPR012862 - Protein of unknown function DUF1635


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8646892.1 hypothetical protein Csa_020913 [Cucumis sativus]1.1e-7385.79Show/hide
Query:  MENQRNALLGWAYFFQGK--------NIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQR
        MENQR+ALLGWAYFFQGK        NIEELRHSLF ATLELEQTRVAVQEELRKRDEQ I LKDLFS+AIRERDEANEKFQKLLIEKLLL Q+QQQQQR
Subjt:  MENQRNALLGWAYFFQGK--------NIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQR

Query:  TDPHSGISSIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
        TDPHSGISSIEDE KKGIDSINGFS SDCEESIVSSPAIDPIPPPQFPP +   PP+  +ELV EKPLPEKGKLLEAVMKAGPLLQTLLV
Subjt:  TDPHSGISSIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV

XP_008465402.1 PREDICTED: uncharacterized protein LOC103503032 [Cucumis melo]5.9e-7589.01Show/hide
Query:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS
        MENQR+ALL WAYFFQGKNIEELRHSLF ATLELEQTRVAVQEELRKRDEQ I LKDLFS+AIRERDEANEKFQKLLIEKLLL   QQQQQRTDPHSGIS
Subjt:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS

Query:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
        SIEDE KKGIDSINGFS SDCEESIVSSPAIDPIPPPQFPPA+   PP+  +ELV EKPLPEKGKLLEAVMKAGPLLQTLLV
Subjt:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV

XP_011657021.1 uncharacterized protein LOC101221076 [Cucumis sativus]6.9e-7689.56Show/hide
Query:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS
        MENQR+ALLGWAYFFQGKNIEELRHSLF ATLELEQTRVAVQEELRKRDEQ I LKDLFS+AIRERDEANEKFQKLLIEKLLL Q+QQQQQRTDPHSGIS
Subjt:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS

Query:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
        SIEDE KKGIDSINGFS SDCEESIVSSPAIDPIPPPQFPP +   PP+  +ELV EKPLPEKGKLLEAVMKAGPLLQTLLV
Subjt:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV

XP_022140453.1 uncharacterized protein LOC111011124 [Momordica charantia]1.2e-7587.91Show/hide
Query:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS
        MENQR+ALL WAYFFQGKN+EELRHSLF ATLELEQTRVAVQEELRKRDEQ I LKDLF RAIRERDEANEKFQKL+IEKLLL     QQQRTDPHSGIS
Subjt:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS

Query:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
        SIEDE +KGIDS NGFSSSDCEESIVSSPAIDPIPPPQFPPA  QAPP+ A+ELV EKPLPEKGKLLEAVMKAGPLLQTLLV
Subjt:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV

XP_038900961.1 uncharacterized protein LOC120088005 [Benincasa hispida]1.8e-7689.01Show/hide
Query:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS
        MENQR+ALL WAYFFQGKNIEELRHSLF ATLELEQTRVAVQEELRKRDEQF+ LKDLFSRAIRERD+ANEKFQKLLIEKLLL   QQQQQRTDPHSGIS
Subjt:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS

Query:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
        SIEDE KKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPA  Q  P+  +E V EKPLPEKGKLLEAVMKAGPLLQTLLV
Subjt:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV

TrEMBL top hitse value%identityAlignment
A0A0A0KAW2 Uncharacterized protein3.4e-7689.56Show/hide
Query:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS
        MENQR+ALLGWAYFFQGKNIEELRHSLF ATLELEQTRVAVQEELRKRDEQ I LKDLFS+AIRERDEANEKFQKLLIEKLLL Q+QQQQQRTDPHSGIS
Subjt:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS

Query:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
        SIEDE KKGIDSINGFS SDCEESIVSSPAIDPIPPPQFPP +   PP+  +ELV EKPLPEKGKLLEAVMKAGPLLQTLLV
Subjt:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV

A0A1S3CNT5 uncharacterized protein LOC1035030322.8e-7589.01Show/hide
Query:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS
        MENQR+ALL WAYFFQGKNIEELRHSLF ATLELEQTRVAVQEELRKRDEQ I LKDLFS+AIRERDEANEKFQKLLIEKLLL   QQQQQRTDPHSGIS
Subjt:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS

Query:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
        SIEDE KKGIDSINGFS SDCEESIVSSPAIDPIPPPQFPPA+   PP+  +ELV EKPLPEKGKLLEAVMKAGPLLQTLLV
Subjt:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV

A0A5A7T6U2 Protein enabled-like protein4.5e-7385.26Show/hide
Query:  MENQRNALLGWAYFFQGK--------NIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQR
        MENQR+ALL WAYFFQGK        NIEELRHSLF ATLELEQTRVAVQEELRKRDEQ I LKDLFS+AIRERDEANEKFQKLLIEKLLL   QQQQQR
Subjt:  MENQRNALLGWAYFFQGK--------NIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQR

Query:  TDPHSGISSIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
        TDPHSGISSIEDE KKGIDSINGFS SDCEESIVSSPAIDPIPPPQFPPA+   PP+  +ELV EKPLPEKGKLLEAVMKAGPLLQTLLV
Subjt:  TDPHSGISSIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV

A0A6J1CI14 uncharacterized protein LOC1110111245.7e-7687.91Show/hide
Query:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS
        MENQR+ALL WAYFFQGKN+EELRHSLF ATLELEQTRVAVQEELRKRDEQ I LKDLF RAIRERDEANEKFQKL+IEKLLL     QQQRTDPHSGIS
Subjt:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS

Query:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
        SIEDE +KGIDS NGFSSSDCEESIVSSPAIDPIPPPQFPPA  QAPP+ A+ELV EKPLPEKGKLLEAVMKAGPLLQTLLV
Subjt:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV

A0A6J1FXG5 uncharacterized protein LOC1114484727.0e-7487.36Show/hide
Query:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS
        MENQR+ALLGWAYFFQGKNIEELRHSLF ATLELEQTRVAVQEELRKRDEQ I LKDLFSRAIRERDEANEKFQKL IEK LL      QQRTDPHSG+S
Subjt:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS

Query:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
        SIEDE KKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPAL      TA++LV EKPLPEKGKLLEAVMKAGPLLQTLLV
Subjt:  SIEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G28140.1 Protein of unknown function (DUF1635)8.3e-2746.15Show/hide
Query:  QRNALLGWAYFFQG--KNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGISS
        Q+N LL W++  Q   K IEELRHSL + T+ELEQTR+   EEL  +D+Q +QLKDL ++AI+E+DEA E+++++L+++  L Q Q  +++ DP      
Subjt:  QRNALLGWAYFFQG--KNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGISS

Query:  IEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMEL-VQEKPLPEKGKLLEAVMKAGPLLQTLLV
                  +INGF SSD EESIVSS          F P ++       +EL   E  LPEKGKLL+AV+KAGPLLQTLL+
Subjt:  IEDEQKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMEL-VQEKPLPEKGKLLEAVMKAGPLLQTLLV

AT2G28690.1 Protein of unknown function (DUF1635)2.4e-1033.33Show/hide
Query:  LGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGISSIEDEQKK
        LG  +F+Q ++++ELR  L Y++ ELE  +    EE +   E+   L  L   A +ERDEA ++ QKLL  K               +S I+        
Subjt:  LGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGISSIEDEQKK

Query:  GIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPA-------LQQAPPNTAM--ELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
         +DS   F      E    +   +PI   +F          L++  P  A+  E+++ K LPEKGKLL+ VM++GPLLQTLLV
Subjt:  GIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPA-------LQQAPPNTAM--ELVQEKPLPEKGKLLEAVMKAGPLLQTLLV

AT3G44940.1 Protein of unknown function (DUF1635)1.6e-3852.36Show/hide
Query:  NQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQR------TDPH
        +Q N+LL W YF  GK  EELR +L Y T+ELEQT++   EELRKRDEQ I L+D+ ++ ++ERDEA EK+  LL+  LLLQQ+QQQ Q       T P 
Subjt:  NQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQR------TDPH

Query:  SGISS-IEDE----QKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
        SG SS IEDE    Q+  ++S   FSSSD EESI+S   IDP+   Q    ++ +       L+ +KPLPEKGKLL+AV+KAGPLLQTLL+
Subjt:  SGISS-IEDE----QKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV

AT5G22930.1 Protein of unknown function (DUF1635)2.1e-3854.3Show/hide
Query:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS
        M+NQ NALLGW Y+   K  EE+R SL Y TLEL+QT++   EE+RKRDEQ I LKD+ ++ I+ERDEA EK Q+L+ +   LQQQ+     T P SG S
Subjt:  MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGIS

Query:  SIEDE--QKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAM--ELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
        SIEDE  Q + + S   FSSSDCEESI+ SP    + PP  P  L++      +   L+Q+KPLPEKGKLL+AV+KAGPLLQTLL+
Subjt:  SIEDE--QKKGIDSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAM--ELVQEKPLPEKGKLLEAVMKAGPLLQTLLV

AT5G59760.1 Protein of unknown function (DUF1635)2.6e-0430.95Show/hide
Query:  KNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGISSIEDEQKKGIDSINGFS
        ++I+E+R +L     ELE  ++   E+ R   E+  QL +L     +ERDEA ++  + +       QQ    +     +  S  +D            S
Subjt:  KNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGISSIEDEQKKGIDSINGFS

Query:  SSDCEESIVSSPAIDPIPPPQFPPA---LQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV
        SS  E S   +    P+   + P A    QQ  P     LV  K  PE GKLL+AV++AGPLL+TLL+
Subjt:  SSDCEESIVSSPAIDPIPPPQFPPA---LQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAACCAACGTAATGCTCTTCTGGGTTGGGCTTACTTTTTCCAAGGAAAGAACATTGAAGAATTGAGGCATTCGCTTTTCTACGCGACCCTAGAGTTGGAACAAAC
AAGAGTTGCGGTTCAAGAGGAGCTAAGAAAGAGAGATGAACAGTTCATTCAACTCAAGGATCTTTTCAGCAGAGCCATTAGAGAGAGAGATGAAGCAAATGAAAAGTTTC
AGAAGTTACTCATCGAGAAGCTTTTACTACAACAGCAACAGCAGCAGCAGCAGCGAACAGATCCTCATTCAGGAATTTCGAGCATTGAAGATGAACAAAAGAAAGGGATC
GATTCCATCAATGGCTTTTCGTCGTCAGATTGTGAAGAAAGCATTGTCTCATCTCCTGCCATTGACCCAATTCCACCACCACAGTTTCCTCCAGCTCTGCAGCAAGCACC
ACCGAACACGGCGATGGAATTGGTCCAAGAGAAGCCATTGCCTGAAAAAGGCAAGCTCTTGGAAGCAGTGATGAAAGCAGGTCCTTTGCTTCAGACTCTGCTTGTGGGCC
GGACCTCTGCCGGAGTGGAGACATCCACCACCACCACTCGAATCCTTCGAAATCCCACCAGTGTCAATGCCTTCGACTCCTCCGCCACAGCTGCTCCGAGATTCTCCAGT
CACCTTCAATGGCAGTAG
mRNA sequenceShow/hide mRNA sequence
CTTAAATTGAGATTAGAATAGTAAAACATGGGAATGACCAGTTATAAATGGAATTTGTTAAGAGATTCTTTCTGTGAGTTCAAGTGCAAGACATGGAGGATGGATCATAG
GATGTTAGGAATTGGCTTCTGCTAGGTTCTTGGCGGTTTCTGCTTTTTTCTTCTTCACAAAAAGTAACAAATCTGTCAAAAGAAAACAGTGTCAATTAAAGATTTCTTAT
AAGAACTTCAACAACTGCCAGTCTGAAACCATATCCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTATATCTACTTTTTTTGTTCTCTGTGAAACTTTTGGA
GTTCAGTTATGGAAAACCAACGTAATGCTCTTCTGGGTTGGGCTTACTTTTTCCAAGGAAAGAACATTGAAGAATTGAGGCATTCGCTTTTCTACGCGACCCTAGAGTTG
GAACAAACAAGAGTTGCGGTTCAAGAGGAGCTAAGAAAGAGAGATGAACAGTTCATTCAACTCAAGGATCTTTTCAGCAGAGCCATTAGAGAGAGAGATGAAGCAAATGA
AAAGTTTCAGAAGTTACTCATCGAGAAGCTTTTACTACAACAGCAACAGCAGCAGCAGCAGCGAACAGATCCTCATTCAGGAATTTCGAGCATTGAAGATGAACAAAAGA
AAGGGATCGATTCCATCAATGGCTTTTCGTCGTCAGATTGTGAAGAAAGCATTGTCTCATCTCCTGCCATTGACCCAATTCCACCACCACAGTTTCCTCCAGCTCTGCAG
CAAGCACCACCGAACACGGCGATGGAATTGGTCCAAGAGAAGCCATTGCCTGAAAAAGGCAAGCTCTTGGAAGCAGTGATGAAAGCAGGTCCTTTGCTTCAGACTCTGCT
TGTGGGCCGGACCTCTGCCGGAGTGGAGACATCCACCACCACCACTCGAATCCTTCGAAATCCCACCAGTGTCAATGCCTTCGACTCCTCCGCCACAGCTGCTCCGAGAT
TCTCCAGTCACCTTCAATGGCAGTAGCCACATTACCAACTGTGGAAACAGAAAAAGGGCTTTCTTGGAAGGCTCTGATTCTCCAACAGAGACAAAGTGTCGAAGGCTTGC
TCCCTGCTGACCATTACAAAAACAAAGGGCAGCAAAAACATAATCATAGTCTAATATTAGTAAAGAGCTAATGAAGATGAAAAGAGAATCGGGTTAGTGGCTGATTTTAG
TACTTGTACAGAGGTTTGTGTTATAGGTAAGAAGAAACTCTCTCCTTCTACGTTTCAGGAAGCTACAACAAAAGAAGTCTCCCTCTTTTCTCCCCTTTCTGTCATCACTC
TGACTCAGTTGCCTAGAGTGAAGTTTTTATGGCCACCCTTTTTACTTCACACTCAACTTTTTCTTTTTCGTACTCCGTCCGAGAGAAACAGACGGGGTCAAATTTTTCCA
GAATTGTTTTAGCCATATAAGATGCAATTCCAATTCTGAGATTTCTCCTGTCTGATTGCTTGTCTGAGATTAAGCAAAAATACATAATATAACAAACTCATTGAAGACAA
TTAGTCGGCGAACATATTCCAAGTTCAACTATAATAACCATTATCAACAATTTGCTGGTGTATGTGACTCAAGTTCAACTGCGATATCACCTGTGCCTAACATGTAATAT
ATGTAAATACAAAACAGAACATAAGAAGTCAATAAACA
Protein sequenceShow/hide protein sequence
MENQRNALLGWAYFFQGKNIEELRHSLFYATLELEQTRVAVQEELRKRDEQFIQLKDLFSRAIRERDEANEKFQKLLIEKLLLQQQQQQQQRTDPHSGISSIEDEQKKGI
DSINGFSSSDCEESIVSSPAIDPIPPPQFPPALQQAPPNTAMELVQEKPLPEKGKLLEAVMKAGPLLQTLLVGRTSAGVETSTTTTRILRNPTSVNAFDSSATAAPRFSS
HLQWQ