; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022054 (gene) of Snake gourd v1 genome

Gene IDTan0022054
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUlp1-like peptidase
Genome locationLG06:66465479..66467102
RNA-Seq ExpressionTan0022054
SyntenyTan0022054
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]5.1e-5544.04Show/hide
Query:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMN
        MF QTCFG  LD  +VFNG LIH+ LLREV E R D+ISF +  K VSFG+ EFDLITGL + +  V   +   RL  +Y  +   VKCSEL   F    
Subjt:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMN

Query:  FDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKP---ETYTLYGFPYALQVWAYE
        F +D++ VK+ + Y +EL M G+E KQ +D  LL  +D WE F + +W  ++F+RT+  L+N +  K S Y+ K    P   ETY+LYGFPYA QVWAYE
Subjt:  FDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKP---ETYTLYGFPYALQVWAYE

Query:  IVSSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTVNVIAELVPTEMESTYMVRLLDQPNPNATSNPP
         +S+L++          IPR+LRWSC  S  + +L  EVF +T   V   L+ T+ +  +MVR++  P      +PP
Subjt:  IVSSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTVNVIAELVPTEMESTYMVRLLDQPNPNATSNPP

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]1.2e-5947.47Show/hide
Query:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMN
        MF QTCFG  L  ++VFNG L+H+ LLREV E + D+ISF +    VSFG+ EFDLITGLR+++  V +DV + RL + Y  +  SVKCSEL   F    
Subjt:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMN

Query:  FDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNK---KLKKPETYTLYGFPYALQVWAYE
        F+ND++AVK+A+ Y +EL M G+E K  +D +LL  +D WE F + +W  ++FERT+  L+N +  K   YK K        ETY+LY FPYA QVWAYE
Subjt:  FDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNK---KLKKPETYTLYGFPYALQVWAYE

Query:  IVSSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTVNVIAELVPTEME
         +S+L+ RVA+ +N   IPR+LRWSC  S ++ +L REVF +    V+  L  T++E
Subjt:  IVSSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTVNVIAELVPTEME

XP_031736793.1 uncharacterized protein LOC116402085 [Cucumis sativus]2.8e-4542.53Show/hide
Query:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATF-PLM
        +F  T FGHFLD +I+FNG LIHY LLREV + R D ISF I + V SFGR EF+++TGL  S  +    V ++RL  K+    + +  S+L  TF    
Subjt:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATF-PLM

Query:  NFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKPETYTLYGFPYALQVWAYEIV
          D+DD+ VK+AL Y +E+ + G++ +  VD  L    D+W +FN+ +WG LVF RT+  L+  +  + +  KNK  K    YT+ GFP ALQVWAYE +
Subjt:  NFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKPETYTLYGFPYALQVWAYEIV

Query:  SSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TVNVIAELVPTEMESTYM
         ++T      V+   IPR+LRW C  SP   +L R+VF S    +NV+ E++P E E   M
Subjt:  SSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TVNVIAELVPTEMESTYM

XP_031737642.1 uncharacterized protein LOC116402483 [Cucumis sativus]2.8e-4540.47Show/hide
Query:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATF-PLM
        +F  T FGHFLD +IVFNG LIHY LLREV + R D ISF I + V SFGR EF+++TGL  S  +  + V ++RL  K+    + +  S+L  TF    
Subjt:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATF-PLM

Query:  NFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKPETYTLYGFPYALQVWAYEIV
          D+DD+ VK+AL Y +EL + G++ +  VD  L    D+W +FN+ +WG LVF RT+  L+  +  + +  KNK  K    YT+ GFP ALQVWAYE +
Subjt:  NFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKPETYTLYGFPYALQVWAYEIV

Query:  SSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TVNVIAELVPTEMESTYM--VRLLDQPNPNATSNPPNEATQMVDRECPVEADVDEH
         ++T      V+   IPR+LRW C  SP   +L  +VF S    +NV+ E++P E E   M    L+++ +P+ T +  N      D + P EA  D++
Subjt:  SSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TVNVIAELVPTEMESTYM--VRLLDQPNPNATSNPPNEATQMVDRECPVEADVDEH

XP_031739159.1 uncharacterized protein LOC116402876 isoform X1 [Cucumis sativus]3.6e-4540.13Show/hide
Query:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATF-PLM
        +F  T FGHFLD +I+FNG LIHY LLREV + R D ISF I + V SFGR EF+++TGL  S  +    V ++RL  K+    + +  S+L  TF    
Subjt:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATF-PLM

Query:  NFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKPETYTLYGFPYALQVWAYEIV
          D+DD+ VK+AL Y +E+ + G++ +  VD  L    D+W +FN+ +WG LVF RT+  L+  +  + +  KNK  K    YT+ GFP ALQVWAYE +
Subjt:  NFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKPETYTLYGFPYALQVWAYEIV

Query:  SSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TVNVIAELVPTEMESTYMV--RLLDQPNPNATSNPPNEATQMVDRECPVEADVDEH
         ++T      V+   IPR+LRW C  SP   +L R+VF S    +NV+ E++P E E   M    L+++ +P  T +  N      D + P EA  D++
Subjt:  SSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TVNVIAELVPTEMESTYMV--RLLDQPNPNATSNPPNEATQMVDRECPVEADVDEH

TrEMBL top hitse value%identityAlignment
A0A5A7UGY3 Ulp1-like peptidase9.0e-4241.34Show/hide
Query:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMN
        +F +T FGHFLD +IVFNG LIHY LLREV +   D ISF + D V +FGR EF++ITGL     D  Q V ++RL  K+  +   V  S+L   F L  
Subjt:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMN

Query:  FDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKPETYTLYGFPYALQVWAYEIVS
          +DD+ VK+AL Y +E+ + G++ +  VD+      D+W SFN+ +WG++VF RT+  L+   L K  +   KK  + + YT+ GFP+ALQVWAYE + 
Subjt:  FDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKPETYTLYGFPYALQVWAYEIVS

Query:  SLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TVNVIAELVPTE
        ++       VN   IPR+LRW C  SP  +  + +VF S    +  + E+ P E
Subjt:  SLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TVNVIAELVPTE

A0A6J1DJX9 uncharacterized protein LOC1110207572.5e-5544.04Show/hide
Query:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMN
        MF QTCFG  LD  +VFNG LIH+ LLREV E R D+ISF +  K VSFG+ EFDLITGL + +  V   +   RL  +Y  +   VKCSEL   F    
Subjt:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMN

Query:  FDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKP---ETYTLYGFPYALQVWAYE
        F +D++ VK+ + Y +EL M G+E KQ +D  LL  +D WE F + +W  ++F+RT+  L+N +  K S Y+ K    P   ETY+LYGFPYA QVWAYE
Subjt:  FDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKP---ETYTLYGFPYALQVWAYE

Query:  IVSSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTVNVIAELVPTEMESTYMVRLLDQPNPNATSNPP
         +S+L++          IPR+LRWSC  S  + +L  EVF +T   V   L+ T+ +  +MVR++  P      +PP
Subjt:  IVSSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTVNVIAELVPTEMESTYMVRLLDQPNPNATSNPP

A0A6J1DRZ7 uncharacterized protein LOC1110238475.6e-6047.47Show/hide
Query:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMN
        MF QTCFG  L  ++VFNG L+H+ LLREV E + D+ISF +    VSFG+ EFDLITGLR+++  V +DV + RL + Y  +  SVKCSEL   F    
Subjt:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMN

Query:  FDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNK---KLKKPETYTLYGFPYALQVWAYE
        F+ND++AVK+A+ Y +EL M G+E K  +D +LL  +D WE F + +W  ++FERT+  L+N +  K   YK K        ETY+LY FPYA QVWAYE
Subjt:  FDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNK---KLKKPETYTLYGFPYALQVWAYE

Query:  IVSSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTVNVIAELVPTEME
         +S+L+ RVA+ +N   IPR+LRWSC  S ++ +L REVF +    V+  L  T++E
Subjt:  IVSSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTVNVIAELVPTEME

A0A6J1DSS5 uncharacterized protein LOC1110239691.6e-4340.15Show/hide
Query:  QTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGL-RYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMNFD
        +T FG F+D  ++F   L+HYFLLREVV+ RPD++ F I+  +V+F + EF L+TGL R S R +++ VS NRL  +Y  +   ++  E    +  + F 
Subjt:  QTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGL-RYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMNFD

Query:  NDDEAVKMALFYMMELVMFGREM-KQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNK---KLKKPETYTLYGFPYALQVWAYEI
        NDD+AVK++L Y  E+VM G+   K  VD +L   +++ + FN+ +WG  +++RT+KGL++ +  K  +YKNK     K    Y+L GFP A QVWAYEI
Subjt:  NDDEAVKMALFYMMELVMFGREM-KQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNK---KLKKPETYTLYGFPYALQVWAYEI

Query:  VSSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTVNVIAELVPTEMESTY
        + SL       ++ T +PRI R+SC  S +  +L R+VF S+ + +   LV +E E  Y
Subjt:  VSSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTVNVIAELVPTEMESTY

A0A6N0C7Z4 Ulp1-like peptidase1.8e-4540.13Show/hide
Query:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATF-PLM
        +F  T FGHFLD +I+FNG LIHY LLREV + R D ISF I + V SFGR EF+++TGL  S  +    V ++RL  K+    + +  S+L  TF    
Subjt:  MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATF-PLM

Query:  NFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKPETYTLYGFPYALQVWAYEIV
          D+DD+ VK+AL Y +E+ + G++ +  VD  L    D+W +FN+ +WG LVF RT+  L+  +  + +  KNK  K    YT+ GFP ALQVWAYE +
Subjt:  NFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKPETYTLYGFPYALQVWAYEIV

Query:  SSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TVNVIAELVPTEMESTYMV--RLLDQPNPNATSNPPNEATQMVDRECPVEADVDEH
         ++T      V+   IPR+LRW C  SP   +L R+VF S    +NV+ E++P E E   M    L+++ +P  T +  N      D + P EA  D++
Subjt:  SSLTNRVAMHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TVNVIAELVPTEMESTYMV--RLLDQPNPNATSNPPNEATQMVDRECPVEADVDEH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G31150.1 Domain of unknown function (DUF1985)7.9e-0622.05Show/hide
Query:  NGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNR------LWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMA
        +G+LIH  L R+VV  +   + F      + F   EF ++TGLR        +V  ++      +W +     R V   ++          +  + + +A
Subjt:  NGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNR------LWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMA

Query:  LFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKPETYTLYGFPYALQVWAYEIVSSLTNRV
        L  +++ V+   + +  V ++ +  +++ + F    WG+  F  T++             K KK  K +T   YGFP ALQ+  +E +  +  R+
Subjt:  LFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKPETYTLYGFPYALQVWAYEIVSSLTNRV

AT5G35050.1 Domain of unknown function (DUF1985)7.4e-0428.04Show/hide
Query:  FNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMM
        F+GQL+ + ++R++V  R D I F I +K + F  TEF L+TGL   +++    V     W   L +   +K  +  A   ++  D+++   ++A+  ++
Subjt:  FNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMM

Query:  ELVMFGR
         L +F R
Subjt:  ELVMFGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCACCAAACTTGCTTTGGACACTTTTTGGACGCTTCCATAGTTTTCAACGGTCAACTTATCCATTACTTTCTCCTACGAGAGGTCGTGGAGGTCAGACCGGATAT
AATAAGCTTCTATATAATAGACAAGGTTGTATCATTTGGTAGGACAGAGTTCGACTTGATCACTGGTCTGCGTTATAGCGTTCGGGATGTGCGACAGGATGTGTCTAGTA
ATAGGCTATGGGTCAAGTACCTGAATAACTCTAGAAGTGTGAAATGCTCGGAGTTGTATGCAACATTCCCACTGATGAACTTCGATAATGACGATGAGGCGGTGAAGATG
GCATTGTTCTACATGATGGAGCTTGTGATGTTCGGTCGCGAGATGAAACAATTGGTAGACATGAACCTTCTATCTACCATTGACAATTGGGAGTCGTTCAACAGTAAAAA
CTGGGGAAAATTGGTTTTTGAAAGGACTATGAAAGGATTGAGAAATGTTATTCTTGGCAAGGCCTCATCATACAAAAACAAGAAGTTGAAGAAACCGGAGACATACACAT
TGTATGGATTCCCTTATGCGCTGCAGGTGTGGGCTTACGAGATTGTGTCGTCCTTGACTAATCGGGTTGCGATGCACGTTAATACCACCGGTATCCCGCGTATACTAAGG
TGGTCATGTGGTACTTCACCTTCATACACGATGCTTATGCGGGAAGTGTTTGCATCCACAACGGTAAACGTCATAGCAGAACTGGTGCCAACTGAGATGGAATCAACCTA
CATGGTGCGGTTGCTAGACCAACCGAATCCGAATGCTACATCCAATCCTCCAAATGAGGCAACTCAAATGGTCGATAGAGAGTGTCCTGTGGAAGCGGATGTTGATGAAC
ATGTGGACGCCACTACTGAGGTGAATGTTGATCGATCGCCATCAGCTGGTGATGCATCAACAATTCCAAGCGGGGGAGTACGTGGGACATGTGTATGTCAAACCCTGCTC
CCCCCCATATGTCAGCAGCTATCAGAGTTGCAGAATAATATGACGACTATGCATCAGACTTTTGATACAATTAGAACAAGTGTTCAAGAGGTGCGTGATATGGTACTACA
TCTCATAAATTCACAGTCACCGAAATCAGAGTTACCGACTGAATTGAATGTAGATCGAGGTCAGTTAGAACCTGATTTGAACATCGACGTTCCTGACCAACCCCGGTCGG
AAGACAATAAACCAAAATCAGAGCGTGCATACGATACATATGATGACGAAGTAGATCATCGACCGTCTATGTTAGGCCATTGTAGCACCGATTTGACAACCGTGTCAATG
GGTGTTGCGGTACCCCCCATCGAGACAATGCCTGACGAAGTACGAGTATCAGAGCAGTCCACCATTCCCACCGTCGAACAGAATAAGAGTAATGAATGTATTTTCACACC
GAATGTTACCTGTCTCGTGATACCGAAAGAAGAGGTATGTAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTCACCAAACTTGCTTTGGACACTTTTTGGACGCTTCCATAGTTTTCAACGGTCAACTTATCCATTACTTTCTCCTACGAGAGGTCGTGGAGGTCAGACCGGATAT
AATAAGCTTCTATATAATAGACAAGGTTGTATCATTTGGTAGGACAGAGTTCGACTTGATCACTGGTCTGCGTTATAGCGTTCGGGATGTGCGACAGGATGTGTCTAGTA
ATAGGCTATGGGTCAAGTACCTGAATAACTCTAGAAGTGTGAAATGCTCGGAGTTGTATGCAACATTCCCACTGATGAACTTCGATAATGACGATGAGGCGGTGAAGATG
GCATTGTTCTACATGATGGAGCTTGTGATGTTCGGTCGCGAGATGAAACAATTGGTAGACATGAACCTTCTATCTACCATTGACAATTGGGAGTCGTTCAACAGTAAAAA
CTGGGGAAAATTGGTTTTTGAAAGGACTATGAAAGGATTGAGAAATGTTATTCTTGGCAAGGCCTCATCATACAAAAACAAGAAGTTGAAGAAACCGGAGACATACACAT
TGTATGGATTCCCTTATGCGCTGCAGGTGTGGGCTTACGAGATTGTGTCGTCCTTGACTAATCGGGTTGCGATGCACGTTAATACCACCGGTATCCCGCGTATACTAAGG
TGGTCATGTGGTACTTCACCTTCATACACGATGCTTATGCGGGAAGTGTTTGCATCCACAACGGTAAACGTCATAGCAGAACTGGTGCCAACTGAGATGGAATCAACCTA
CATGGTGCGGTTGCTAGACCAACCGAATCCGAATGCTACATCCAATCCTCCAAATGAGGCAACTCAAATGGTCGATAGAGAGTGTCCTGTGGAAGCGGATGTTGATGAAC
ATGTGGACGCCACTACTGAGGTGAATGTTGATCGATCGCCATCAGCTGGTGATGCATCAACAATTCCAAGCGGGGGAGTACGTGGGACATGTGTATGTCAAACCCTGCTC
CCCCCCATATGTCAGCAGCTATCAGAGTTGCAGAATAATATGACGACTATGCATCAGACTTTTGATACAATTAGAACAAGTGTTCAAGAGGTGCGTGATATGGTACTACA
TCTCATAAATTCACAGTCACCGAAATCAGAGTTACCGACTGAATTGAATGTAGATCGAGGTCAGTTAGAACCTGATTTGAACATCGACGTTCCTGACCAACCCCGGTCGG
AAGACAATAAACCAAAATCAGAGCGTGCATACGATACATATGATGACGAAGTAGATCATCGACCGTCTATGTTAGGCCATTGTAGCACCGATTTGACAACCGTGTCAATG
GGTGTTGCGGTACCCCCCATCGAGACAATGCCTGACGAAGTACGAGTATCAGAGCAGTCCACCATTCCCACCGTCGAACAGAATAAGAGTAATGAATGTATTTTCACACC
GAATGTTACCTGTCTCGTGATACCGAAAGAAGAGGTATGTAGGTAG
Protein sequenceShow/hide protein sequence
MFHQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGRTEFDLITGLRYSVRDVRQDVSSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKM
ALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNSKNWGKLVFERTMKGLRNVILGKASSYKNKKLKKPETYTLYGFPYALQVWAYEIVSSLTNRVAMHVNTTGIPRILR
WSCGTSPSYTMLMREVFASTTVNVIAELVPTEMESTYMVRLLDQPNPNATSNPPNEATQMVDRECPVEADVDEHVDATTEVNVDRSPSAGDASTIPSGGVRGTCVCQTLL
PPICQQLSELQNNMTTMHQTFDTIRTSVQEVRDMVLHLINSQSPKSELPTELNVDRGQLEPDLNIDVPDQPRSEDNKPKSERAYDTYDDEVDHRPSMLGHCSTDLTTVSM
GVAVPPIETMPDEVRVSEQSTIPTVEQNKSNECIFTPNVTCLVIPKEEVCR