CuGenDBv2

Tan0018713 (gene) of Snake gourd v1 genome

Gene ID: Tan0018713
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Ulp1-like peptidase
Genome location: LG04:77845566..77847217
RNA-Seq Expression: Tan0018713
Synteny: Tan0018713
Gene Ontology terms: NA
InterPro domains: IPR015410 - Domain of unknown function DUF1985
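As an illustration of how the genome location field can be used, the following minimal sketch splits it into a sequence name, start, and end, and computes the length of the genomic span. The helper name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts.

    Hypothetical helper; the format is inferred from this record only.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG04:77845566..77847217")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 1652 bp on LG04.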


Homology
GenBank top hits    e value    % identity    Alignment
XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]    1.6e-53    42.43
Query:  LKIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSV
        L I  +D FPA +T  +H+ K+ T IK  L+ +Q+ +FRQTCFG +LD  +VFNG LIH+ LLREV E R D+ISF +  K VSF + EF+LITGL + +
Subjt:  LKIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSV

Query:  REVPEDVSSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------LKNAI
          V   +   RLR +Y  +   VKCSEL   F    F +D++ VK+ + Y IEL M  +E KQ +D  LL  +D WE F                LKNA+
Subjt:  REVPEDVSSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------LKNAI

Query:  CGKASSYKNKKVKKP---ETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFASTTANVILELVPTEMESTYMVRM
          K S Y+ K    P   ETY+LY F YA QVWAYE +S+L++          IPR+LRWSC  S  +R++  EVF +T + V   L+ T+ +  +MVR+
Subjt:  CGKASSYKNKKVKKP---ETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFASTTANVILELVPTEMESTYMVRM

Query:  LEQP
        +  P
Subjt:  LEQP

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]    1.9e-59    44.44
Query:  MAFVLKIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGL
        M   LKI  DD FPAA++  +H+ K+ + +K  L+ SQ+ +F QTCFG +L  ++VFNG L+H+ LLREV E + D+ISF +    VSF + EF+LITGL
Subjt:  MAFVLKIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGL

Query:  RYSVREVPEDVSSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------L
        R+++  V EDV + RLR+ Y  +   VKCSEL   F    FEND++AVK+A+ Y IEL M  +E K  +D +LL  +D WE F                L
Subjt:  RYSVREVPEDVSSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------L

Query:  KNAICGKASSYKNKKV---KKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFASTTANVILELVPTEME
        KNA+  K   YK K        ETY+LY F YA QVWAYE +S+L+ RVA  +N   IPR+LRWSC  S ++ ++ REVF +  + V++ L  T++E
Subjt:  KNAICGKASSYKNKKV---KKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFASTTANVILELVPTEME

XP_031736793.1 uncharacterized protein LOC116402085 [Cucumis sativus]    9.5e-43    39.19
Query:  KIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSVR
        KI P  HF + ++  +HL  S   IK  L   Q+ LFR T FGH LD +I+FNG LIHY LLREV + R D ISF I + V SF R EFN++TGL  S  
Subjt:  KIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSVR

Query:  EVPEDVSSNRLRVKYLNNSRIVKCSELYATF-PLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------LKNAI
        E  + V ++RL  K+    + +  S+L  TF      ++DD+ VK+AL Y IE+ +  ++ +  VD  L    D+W +FN               LK A+
Subjt:  EVPEDVSSNRLRVKYLNNSRIVKCSELYATF-PLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------LKNAI

Query:  CGKASSYKNKKVKKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFAST--TANVILELVPTEMESTYM
          + +  KNK  K    YT+  F  ALQVWAYE + ++T      V+   IPR+LRW C  SP   ++ R+VF S     NV++E++P E E   M
Subjt:  CGKASSYKNKKVKKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFAST--TANVILELVPTEMESTYM

XP_031739159.1 uncharacterized protein LOC116402876 isoform X1 [Cucumis sativus]    9.5e-43    39.19
Query:  KIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSVR
        KI P  HF + ++  +HL  S   IK  L   Q+ LFR T FGH LD +I+FNG LIHY LLREV + R D ISF I + V SF R EFN++TGL  S  
Subjt:  KIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSVR

Query:  EVPEDVSSNRLRVKYLNNSRIVKCSELYATF-PLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------LKNAI
        E  + V ++RL  K+    + +  S+L  TF      ++DD+ VK+AL Y IE+ +  ++ +  VD  L    D+W +FN               LK A+
Subjt:  EVPEDVSSNRLRVKYLNNSRIVKCSELYATF-PLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------LKNAI

Query:  CGKASSYKNKKVKKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFAST--TANVILELVPTEMESTYM
          + +  KNK  K    YT+  F  ALQVWAYE + ++T      V+   IPR+LRW C  SP   ++ R+VF S     NV++E++P E E   M
Subjt:  CGKASSYKNKKVKKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFAST--TANVILELVPTEMESTYM

XP_031741736.1 uncharacterized protein LOC116403931, partial [Cucumis sativus]    9.5e-43    39.19
Query:  KIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSVR
        KI P  HF + ++  +HL  S   IK  L   Q+ LFR T FGH LD +I+FNG LIHY LLREV + R D ISF I + V SF R EFN++TGL  S  
Subjt:  KIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSVR

Query:  EVPEDVSSNRLRVKYLNNSRIVKCSELYATF-PLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------LKNAI
        E  + V ++RL  K+    + +  S+L  TF      ++DD+ VK+AL Y IE+ +  ++ +  VD  L    D+W +FN               LK A+
Subjt:  EVPEDVSSNRLRVKYLNNSRIVKCSELYATF-PLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------LKNAI

Query:  CGKASSYKNKKVKKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFAST--TANVILELVPTEMESTYM
          + +  KNK  K    YT+  F  ALQVWAYE + ++T      V+   IPR+LRW C  SP   ++ R+VF S     NV++E++P E E   M
Subjt:  CGKASSYKNKKVKKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFAST--TANVILELVPTEMESTYM

TrEMBL top hits    e value    % identity    Alignment
A0A5A7U2U8 MuDRA-like transposase    1.9e-41    36.11
Query:  FVLKIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVE-VRPDIISFYIIDKVVSFSRVEFNLITGLR
        F + +  D +FPA ++C  H  K  ++IK  L++ Q+ +F +T FG LL+ ++VFNG+LIH+FLLR++ E    D I F ++ K V F++ EFN+ITGL 
Subjt:  FVLKIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVE-VRPDIISFYIIDKVVSFSRVEFNLITGLR

Query:  YSVREVPEDVSSNRLRVKYL--NNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESF--------------NR
         +   + +D  S RL+       N +++ C E+   F    F NDD+AVK+ L   IE VM  ++ K   DM++L  +D+ E+F              N 
Subjt:  YSVREVPEDVSSNRLRVKYL--NNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESF--------------NR

Query:  LKNAICGKASSYKNKKVKKPET---YTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFASTTANV
        LK ++ GK  +Y+ KK +  +    Y +  ++ A QVWAYE++S+    +AT  +   IPRILRW+C  +PSY+M+   +F +   NV
Subjt:  LKNAICGKASSYKNKKVKKPET---YTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFASTTANV

A0A6J1DJX9 uncharacterized protein LOC111020757    7.5e-54    42.43
Query:  LKIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSV
        L I  +D FPA +T  +H+ K+ T IK  L+ +Q+ +FRQTCFG +LD  +VFNG LIH+ LLREV E R D+ISF +  K VSF + EF+LITGL + +
Subjt:  LKIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSV

Query:  REVPEDVSSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------LKNAI
          V   +   RLR +Y  +   VKCSEL   F    F +D++ VK+ + Y IEL M  +E KQ +D  LL  +D WE F                LKNA+
Subjt:  REVPEDVSSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------LKNAI

Query:  CGKASSYKNKKVKKP---ETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFASTTANVILELVPTEMESTYMVRM
          K S Y+ K    P   ETY+LY F YA QVWAYE +S+L++          IPR+LRWSC  S  +R++  EVF +T + V   L+ T+ +  +MVR+
Subjt:  CGKASSYKNKKVKKP---ETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFASTTANVILELVPTEMESTYMVRM

Query:  LEQP
        +  P
Subjt:  LEQP

A0A6J1DRZ7 uncharacterized protein LOC111023847    9.2e-60    44.44
Query:  MAFVLKIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGL
        M   LKI  DD FPAA++  +H+ K+ + +K  L+ SQ+ +F QTCFG +L  ++VFNG L+H+ LLREV E + D+ISF +    VSF + EF+LITGL
Subjt:  MAFVLKIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGL

Query:  RYSVREVPEDVSSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------L
        R+++  V EDV + RLR+ Y  +   VKCSEL   F    FEND++AVK+A+ Y IEL M  +E K  +D +LL  +D WE F                L
Subjt:  RYSVREVPEDVSSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------L

Query:  KNAICGKASSYKNKKV---KKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFASTTANVILELVPTEME
        KNA+  K   YK K        ETY+LY F YA QVWAYE +S+L+ RVA  +N   IPR+LRWSC  S ++ ++ REVF +  + V++ L  T++E
Subjt:  KNAICGKASSYKNKKV---KKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFASTTANVILELVPTEME

A0A6J1DSS5 uncharacterized protein LOC111023969    1.3e-42    34.55
Query:  MAFVLKIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFR-QTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITG
        M   LK+   D FPA +T  SHL+ +  II + L+ +Q+ +FR +T FG  +D  ++F   L+HYFLLREVV+ RPD++ F I+  +V+FS+ EF L+TG
Subjt:  MAFVLKIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFR-QTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITG

Query:  L-RYSVREVPEDVSSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSREM-KQLVDMNLLSTIDNWESFNR------------
        L R S R + + VS NRLR +Y  +   ++  E    +  + F NDD+AVK++L Y  E+VM  +   K  VD +L   +++ + FN             
Subjt:  L-RYSVREVPEDVSSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSREM-KQLVDMNLLSTIDNWESFNR------------

Query:  --LKNAICGKASSYKNK---KVKKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFASTTANVILELVPTEME
          L++A+  K  +YKNK     K    Y+L  F  A QVWAYEI+ SL       ++ T +PRI R+SC  S + +++ R+VF S+   +   LV +E E
Subjt:  --LKNAICGKASSYKNK---KVKKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFASTTANVILELVPTEME

Query:  STYMVRMLEQPNSNAPSHPSNEATQMFDRECLPKETDDEHVNVTPEANVDQVIRSP
          Y     +   +   +      +   DR+  P+  D  H     +A  DQ I +P
Subjt:  STYMVRMLEQPNSNAPSHPSNEATQMFDRECLPKETDDEHVNVTPEANVDQVIRSP

A0A6N0C7Z4 Ulp1-like peptidase    4.6e-43    39.19
Query:  KIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSVR
        KI P  HF + ++  +HL  S   IK  L   Q+ LFR T FGH LD +I+FNG LIHY LLREV + R D ISF I + V SF R EFN++TGL  S  
Subjt:  KIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSVR

Query:  EVPEDVSSNRLRVKYLNNSRIVKCSELYATF-PLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------LKNAI
        E  + V ++RL  K+    + +  S+L  TF      ++DD+ VK+AL Y IE+ +  ++ +  VD  L    D+W +FN               LK A+
Subjt:  EVPEDVSSNRLRVKYLNNSRIVKCSELYATF-PLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNR--------------LKNAI

Query:  CGKASSYKNKKVKKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFAST--TANVILELVPTEMESTYM
          + +  KNK  K    YT+  F  ALQVWAYE + ++T      V+   IPR+LRW C  SP   ++ R+VF S     NV++E++P E E   M
Subjt:  CGKASSYKNKKVKKPETYTLYAFLYALQVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFAST--TANVILELVPTEMESTYM

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
AT5G35050.1 Domain of unknown function (DUF1985)    2.2e-05    27.63
Query:  ITIIKKILSQSQIALFRQTCFGHLLDA-------------SIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSVR--EVPEDV
        + IIK+ILS S  +    T  G ++ A              + F+G+L+ + ++R++V  R D I F I +K + FS  EF+L+TGL   ++  EVP D 
Subjt:  ITIIKKILSQSQIALFRQTCFGHLLDA-------------SIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSVR--EVPEDV

Query:  SSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSR
          +      L +   +K  +  A   ++   +D+E+       I+ L +F R
Subjt:  SSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSR
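The % identity column can be related to the alignment text itself. In BLAST-style pairwise output, the middle line of each block shows a letter where the two residues are identical, `+` for a conservative substitution, and a space otherwise. The sketch below counts the letters in the middle lines of the AT5G35050.1 alignment above and divides by the number of aligned columns (taken here as the total length of the Query lines, gaps included). The function name `percent_identity` is hypothetical, and this is an illustration of the arithmetic, not the database's own pipeline.

```python
from typing import List


def percent_identity(midlines: List[str], query_blocks: List[str]) -> float:
    """Identical positions (letters in the middle line) over aligned columns."""
    identities = sum(ch.isalpha() for line in midlines for ch in line)
    aligned_length = sum(len(block) for block in query_blocks)
    return 100.0 * identities / aligned_length


# Query lines of the AT5G35050.1 alignment, without the "Query:" prefix.
query_blocks = [
    "ITIIKKILSQSQIALFRQTCFGHLLDA-------------SIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSVR--EVPEDV",
    "SSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSR",
]

# Middle lines of the same alignment; only letters are counted,
# so the exact amount of whitespace does not affect the result.
midlines = [
    "+ IIK+ILS S  +    T  G ++ A              + F+G+L+ + ++R++V  R D I F I +K + FS  EF+L+TGL   ++  EVP D ",
    "  +      L +   +K  +  A   ++   +D+E+       I+ L +F R",
]

identities = sum(ch.isalpha() for line in midlines for ch in line)
aligned_length = sum(len(block) for block in query_blocks)
pct = percent_identity(midlines, query_blocks)
print(identities, aligned_length, round(pct, 2))
```

The rounded value agrees with the 27.63 reported for this hit.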


Sequences
CDS sequence
ATGGCATTTGTGCTGAAAATCCAACCGGATGACCATTTCCCTGCTGCAATTACTTGTTGTTCACACCTAACGAAATCAATCACCATTATAAAAAAGATACTAAGTCAATC
TCAAATCGCCCTGTTTCGCCAAACTTGTTTTGGACACTTATTGGACGCTTCCATAGTTTTCAATGGAAAACTTATCCATTACTTTCTCCTACGAGAGGTCGTAGAGGTCA
GACCGGATATAATAAGCTTCTATATAATAGATAAGGTAGTATCATTTAGTAGGGTAGAGTTCAACTTGATCACTGGTCTGCGTTATAGCGTTCGGGAGGTGCCTGAGGAT
GTGTCTAGTAACCGACTACGGGTCAAGTACCTGAATAACTCTAGAATTGTGAAATGCTCAGAGTTGTATGCAACATTCCCGTTGATGAACTTCGAAAATGACGATGAGGC
GGTGAAGATGGCCCTGTTCTACATTATCGAGCTTGTGATGTTCAGTCGCGAGATGAAACAATTGGTAGACATGAACCTTCTATCTACCATTGATAATTGGGAGTCATTCA
ACAGATTGAAAAATGCTATTTGTGGCAAGGCCTCATCGTACAAAAACAAGAAGGTGAAGAAACCGGAGACATACACATTGTATGCATTCCTTTATGCGCTCCAGGTGTGG
GCTTACGAGATTGTATCGTCCTTGACTAATCGGGTTGCGACTCATGTTAATACCACCGGTATCCCACGTATACTAAGGTGGTCATGTGGTACTTCACCTTCATACAGGAT
GATTATGCGGGAAGTGTTTGCATCCACAACGGCAAACGTCATATTAGAACTGGTGCCAACTGAAATGGAATCAACCTACATGGTACGAATGCTAGAACAACCGAATTCGA
ATGCTCCATCCCATCCATCAAATGAGGCAACTCAAATGTTCGATAGAGAGTGTCTTCCGAAAGAGACTGATGATGAACATGTGAACGTCACGCCTGAAGCGAATGTTGAT
CAAGTAATACGATCGCCATCAGCTGGTGATGCATCTACAAGTCCAAGCAAGGGAGTACGTGCGACATGTGTATGCCAAACACTGCTTCCCCCCATATGTCAACAGCTATT
AGAGTTGCAGAATAATATGACGACGATGCATCGGGCTTTTGATACACTTAAAACAAGTGTTCAAGAAGTGCGCAATCTGGTGATCAGAGATCGAGGTCGATTAGAACTCG
ATTTGAACATCGAAGTTCCTGACCATCCTGTGTCGGACGACAATAATCTAAAATCAGGGCGTGCAGACGATGCAGATGAGGACGAAACAGATCCTCGACCTTCTGTAGTA
GGTCATTGTAGCATCGATCTGACAACCGTGTCGATGGGGGGTGCGGTACCCCCCATCGAGACAACGCCGGTGGAAGTGGGTTCCAGATAA
mRNA sequence
ATGGCATTTGTGCTGAAAATCCAACCGGATGACCATTTCCCTGCTGCAATTACTTGTTGTTCACACCTAACGAAATCAATCACCATTATAAAAAAGATACTAAGTCAATC
TCAAATCGCCCTGTTTCGCCAAACTTGTTTTGGACACTTATTGGACGCTTCCATAGTTTTCAATGGAAAACTTATCCATTACTTTCTCCTACGAGAGGTCGTAGAGGTCA
GACCGGATATAATAAGCTTCTATATAATAGATAAGGTAGTATCATTTAGTAGGGTAGAGTTCAACTTGATCACTGGTCTGCGTTATAGCGTTCGGGAGGTGCCTGAGGAT
GTGTCTAGTAACCGACTACGGGTCAAGTACCTGAATAACTCTAGAATTGTGAAATGCTCAGAGTTGTATGCAACATTCCCGTTGATGAACTTCGAAAATGACGATGAGGC
GGTGAAGATGGCCCTGTTCTACATTATCGAGCTTGTGATGTTCAGTCGCGAGATGAAACAATTGGTAGACATGAACCTTCTATCTACCATTGATAATTGGGAGTCATTCA
ACAGATTGAAAAATGCTATTTGTGGCAAGGCCTCATCGTACAAAAACAAGAAGGTGAAGAAACCGGAGACATACACATTGTATGCATTCCTTTATGCGCTCCAGGTGTGG
GCTTACGAGATTGTATCGTCCTTGACTAATCGGGTTGCGACTCATGTTAATACCACCGGTATCCCACGTATACTAAGGTGGTCATGTGGTACTTCACCTTCATACAGGAT
GATTATGCGGGAAGTGTTTGCATCCACAACGGCAAACGTCATATTAGAACTGGTGCCAACTGAAATGGAATCAACCTACATGGTACGAATGCTAGAACAACCGAATTCGA
ATGCTCCATCCCATCCATCAAATGAGGCAACTCAAATGTTCGATAGAGAGTGTCTTCCGAAAGAGACTGATGATGAACATGTGAACGTCACGCCTGAAGCGAATGTTGAT
CAAGTAATACGATCGCCATCAGCTGGTGATGCATCTACAAGTCCAAGCAAGGGAGTACGTGCGACATGTGTATGCCAAACACTGCTTCCCCCCATATGTCAACAGCTATT
AGAGTTGCAGAATAATATGACGACGATGCATCGGGCTTTTGATACACTTAAAACAAGTGTTCAAGAAGTGCGCAATCTGGTGATCAGAGATCGAGGTCGATTAGAACTCG
ATTTGAACATCGAAGTTCCTGACCATCCTGTGTCGGACGACAATAATCTAAAATCAGGGCGTGCAGACGATGCAGATGAGGACGAAACAGATCCTCGACCTTCTGTAGTA
GGTCATTGTAGCATCGATCTGACAACCGTGTCGATGGGGGGTGCGGTACCCCCCATCGAGACAACGCCGGTGGAAGTGGGTTCCAGATAA
Protein sequence
MAFVLKIQPDDHFPAAITCCSHLTKSITIIKKILSQSQIALFRQTCFGHLLDASIVFNGKLIHYFLLREVVEVRPDIISFYIIDKVVSFSRVEFNLITGLRYSVREVPED
VSSNRLRVKYLNNSRIVKCSELYATFPLMNFENDDEAVKMALFYIIELVMFSREMKQLVDMNLLSTIDNWESFNRLKNAICGKASSYKNKKVKKPETYTLYAFLYALQVW
AYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYRMIMREVFASTTANVILELVPTEMESTYMVRMLEQPNSNAPSHPSNEATQMFDRECLPKETDDEHVNVTPEANVD
QVIRSPSAGDASTSPSKGVRATCVCQTLLPPICQQLLELQNNMTTMHRAFDTLKTSVQEVRNLVIRDRGRLELDLNIEVPDHPVSDDNNLKSGRADDADEDETDPRPSVV
GHCSIDLTTVSMGGAVPPIETTPVEVGSR
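The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the record does not say which table was used, and the `translate` helper is hypothetical. Only the first 36 and last 12 nucleotides of the CDS are included here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 36 and last 12 nucleotides of the CDS sequence listed above.
cds_start = "ATGGCATTTGTGCTGAAAATCCAACCGGATGACCAT"
cds_end = "GGTTCCAGATAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first twelve residues and the last three residues of the protein sequence listed above. Running the full CDS through `translate` should give the complete 469-residue protein.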