; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012777 (gene) of Snake gourd v1 genome

Gene IDTan0012777
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUlp1-like peptidase
Genome locationLG03:54888902..54890643
RNA-Seq ExpressionTan0012777
SyntenyTan0012777
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8653184.1 hypothetical protein Csa_020146, partial [Cucumis sativus]8.5e-4838.46Show/hide
Query:  KIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGLRYSVW
        KI P  HF + ++  +HL  S   IK KL    L LFR T FGHFLD +I+FNG LIHY LLREV + R D ISF I + V SFGR EF+++TGL  S  
Subjt:  KIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGLRYSVW

Query:  DVRQDVCSNRLWVKYLNNSRSVKCSELYATF-PLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGLRNAI
        +    V ++RL  K+    + +  S+L  TF      D+DD+ VK+AL Y +E+ + G++ +  VD  L    D+W +FNN DWG LVF RT+  L+ A+
Subjt:  DVRQDVCSNRLWVKYLNNSRSVKCSELYATF-PLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGLRNAI

Query:  -----RGKAYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TANAIAELVPTEMEST
             +GK   TKT+        + +    +   VWAYE + ++T      V+   IPR+LRW C  SP   +L R+VF S     N + E++P E E  
Subjt:  -----RGKAYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TANAIAELVPTEMEST

Query:  YMV--RLLDQPNP-------NATSNPPNEATHMVDREC
         M    L+++ +P       N  S  P EA++  D +C
Subjt:  YMV--RLLDQPNP-------NATSNPPNEATHMVDREC

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]9.7e-5233.19Show/hide
Query:  MTFVLKIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGL
        M   L I  +D FPA +T  +H+ K+ T IK +L+ + L +FRQTCFG  LD  +VFNG LIH+ LLREV E R D+ISF +  K VSFG+ EFDLITGL
Subjt:  MTFVLKIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGL

Query:  RYSVWDVRQDVCSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGL
         + +  V   +   RL  +Y  +   VKCSEL   F    F +D++ VK+ + Y +EL M G+E KQ +D  LL  +D WE F N DW  ++F+RT+  L
Subjt:  RYSVWDVRQDVCSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGL

Query:  RNAIRGK--AYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTANAIAELVPTEMESTY
        +NA++ K   Y  K  +  +   T+ +        VWAYE +S+L++          IPR+LRWSC  S  + +L  EVF +T +     L+ T+ +  +
Subjt:  RNAIRGK--AYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTANAIAELVPTEMESTY

Query:  MVRLLDQPNPNATSNPPNEATHMVDRECPVE----------ADVD----EHVDITAEANVDRSPSAGDASTSLSGGSTWDMCMSNSAPPICQELSELQNN
        MVR++  P      +PP      V  + P            ADV+    E   + A A  +  PSA D       G    +  +     I + L  L N 
Subjt:  MVRLLDQPNPNATSNPPNEATHMVDRECPVE----------ADVD----EHVDITAEANVDRSPSAGDASTSLSGGSTWDMCMSNSAPPICQELSELQNN

Query:  MTTMHQAFDTIRTSVQEVRDMVLHLINSQSPKSELPTELNVDRGQLEPDLNIGVPDQPRSEDNKPKSERADDTYDDETD
        +  +         +++ ++  +  L      K + P       G   PD + G  DQ   E  KP   R     D  +D
Subjt:  MTTMHQAFDTIRTSVQEVRDMVLHLINSQSPKSELPTELNVDRGQLEPDLNIGVPDQPRSEDNKPKSERADDTYDDETD

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]6.7e-6143.43Show/hide
Query:  MTFVLKIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGL
        M   LKI  DD FPAA++  +H+ K+ + +K +L+ S L +F QTCFG  L  ++VFNG L+H+ LLREV E + D+ISF +    VSFG+ EFDLITGL
Subjt:  MTFVLKIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGL

Query:  RYSVWDVRQDVCSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGL
        R+++  V +DV + RL + Y  +  SVKCSEL   F    F+ND++AVK+A+ Y +EL M G+E K  +D +LL  +D WE F N DW  ++FERT+  L
Subjt:  RYSVWDVRQDVCSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGL

Query:  RNAIRGKAYHTKTRSRRNRRH--THCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTANAIAELVPTEME
        +NA++ K    K +   +  H  T+ +        VWAYE +S+L+ RVA  +N   IPR+LRWSC  S ++ +L REVF +  +  +  L  T++E
Subjt:  RNAIRGKAYHTKTRSRRNRRH--THCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTANAIAELVPTEME

XP_031739159.1 uncharacterized protein LOC116402876 isoform X1 [Cucumis sativus]8.5e-4838.46Show/hide
Query:  KIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGLRYSVW
        KI P  HF + ++  +HL  S   IK KL    L LFR T FGHFLD +I+FNG LIHY LLREV + R D ISF I + V SFGR EF+++TGL  S  
Subjt:  KIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGLRYSVW

Query:  DVRQDVCSNRLWVKYLNNSRSVKCSELYATF-PLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGLRNAI
        +    V ++RL  K+    + +  S+L  TF      D+DD+ VK+AL Y +E+ + G++ +  VD  L    D+W +FNN DWG LVF RT+  L+ A+
Subjt:  DVRQDVCSNRLWVKYLNNSRSVKCSELYATF-PLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGLRNAI

Query:  -----RGKAYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TANAIAELVPTEMEST
             +GK   TKT+        + +    +   VWAYE + ++T      V+   IPR+LRW C  SP   +L R+VF S     N + E++P E E  
Subjt:  -----RGKAYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TANAIAELVPTEMEST

Query:  YMV--RLLDQPNP-------NATSNPPNEATHMVDREC
         M    L+++ +P       N  S  P EA++  D +C
Subjt:  YMV--RLLDQPNP-------NATSNPPNEATHMVDREC

XP_031744032.1 uncharacterized protein LOC116404765 isoform X1 [Cucumis sativus]8.5e-4838.46Show/hide
Query:  KIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGLRYSVW
        KI P  HF + ++  +HL  S   IK KL    L LFR T FGHFLD +I+FNG LIHY LLREV + R D ISF I + V SFGR EF+++TGL  S  
Subjt:  KIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGLRYSVW

Query:  DVRQDVCSNRLWVKYLNNSRSVKCSELYATF-PLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGLRNAI
        +    V ++RL  K+    + +  S+L  TF      D+DD+ VK+AL Y +E+ + G++ +  VD  L    D+W +FNN DWG LVF RT+  L+ A+
Subjt:  DVRQDVCSNRLWVKYLNNSRSVKCSELYATF-PLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGLRNAI

Query:  -----RGKAYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TANAIAELVPTEMEST
             +GK   TKT+        + +    +   VWAYE + ++T      V+   IPR+LRW C  SP   +L R+VF S     N + E++P E E  
Subjt:  -----RGKAYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TANAIAELVPTEMEST

Query:  YMV--RLLDQPNP-------NATSNPPNEATHMVDREC
         M    L+++ +P       N  S  P EA++  D +C
Subjt:  YMV--RLLDQPNP-------NATSNPPNEATHMVDREC

TrEMBL top hitse value%identityAlignment
A0A5A7UGY3 Ulp1-like peptidase7.7e-4740.69Show/hide
Query:  KIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGLRYSVW
        KI P  HF + ++C SHL K+   IK KL    LALFR+T FGHFLD +IVFNG LIHY LLREV +   D ISF + D V +FGR EF++ITGL     
Subjt:  KIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGLRYSVW

Query:  DVRQDVCSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGLRNAIR
        D  Q V ++RL  K+  +   V  S+L   F L    +DD+ VK+AL Y +E+ + G++ +  VD+      D+W SFNN DWG++VF RT+  L+ A+ 
Subjt:  DVRQDVCSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGLRNAIR

Query:  GKAYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TANAIAELVPTE
         +    K +S + +++T  ++       VWAYE + ++       VN   IPR+LRW C  SP  +  + +VF S      A+ E+ P E
Subjt:  GKAYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TANAIAELVPTE

A0A5A7VHC7 Ulp1-like peptidase2.3e-4640.34Show/hide
Query:  KIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGLRYSVW
        KI P  HF + ++C SHL K+   IK KL    LALFR+T FGHFLD +IVFNG LIHY LLREV +   D ISF +   V +FGR EF++ITGL     
Subjt:  KIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGLRYSVW

Query:  DVRQDVCSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGLRNAIR
        D  Q V ++RL  K+  +   V  S+L   F L    +DD+ VK+AL Y +E+ + G++ +  VD+      D+W SFNN DWG++VF RT+  L+ A+ 
Subjt:  DVRQDVCSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGLRNAIR

Query:  GKAYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TANAIAELVPTE
         +    K +S + +++T  ++       VWAYE + ++       VN   IPR+LRW C  SP  +  + +VF S      A+ E+ P E
Subjt:  GKAYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TANAIAELVPTE

A0A6J1DJX9 uncharacterized protein LOC1110207574.7e-5233.19Show/hide
Query:  MTFVLKIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGL
        M   L I  +D FPA +T  +H+ K+ T IK +L+ + L +FRQTCFG  LD  +VFNG LIH+ LLREV E R D+ISF +  K VSFG+ EFDLITGL
Subjt:  MTFVLKIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGL

Query:  RYSVWDVRQDVCSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGL
         + +  V   +   RL  +Y  +   VKCSEL   F    F +D++ VK+ + Y +EL M G+E KQ +D  LL  +D WE F N DW  ++F+RT+  L
Subjt:  RYSVWDVRQDVCSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGL

Query:  RNAIRGK--AYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTANAIAELVPTEMESTY
        +NA++ K   Y  K  +  +   T+ +        VWAYE +S+L++          IPR+LRWSC  S  + +L  EVF +T +     L+ T+ +  +
Subjt:  RNAIRGK--AYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTANAIAELVPTEMESTY

Query:  MVRLLDQPNPNATSNPPNEATHMVDRECPVE----------ADVD----EHVDITAEANVDRSPSAGDASTSLSGGSTWDMCMSNSAPPICQELSELQNN
        MVR++  P      +PP      V  + P            ADV+    E   + A A  +  PSA D       G    +  +     I + L  L N 
Subjt:  MVRLLDQPNPNATSNPPNEATHMVDRECPVE----------ADVD----EHVDITAEANVDRSPSAGDASTSLSGGSTWDMCMSNSAPPICQELSELQNN

Query:  MTTMHQAFDTIRTSVQEVRDMVLHLINSQSPKSELPTELNVDRGQLEPDLNIGVPDQPRSEDNKPKSERADDTYDDETD
        +  +         +++ ++  +  L      K + P       G   PD + G  DQ   E  KP   R     D  +D
Subjt:  MTTMHQAFDTIRTSVQEVRDMVLHLINSQSPKSELPTELNVDRGQLEPDLNIGVPDQPRSEDNKPKSERADDTYDDETD

A0A6J1DRZ7 uncharacterized protein LOC1110238473.2e-6143.43Show/hide
Query:  MTFVLKIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGL
        M   LKI  DD FPAA++  +H+ K+ + +K +L+ S L +F QTCFG  L  ++VFNG L+H+ LLREV E + D+ISF +    VSFG+ EFDLITGL
Subjt:  MTFVLKIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGL

Query:  RYSVWDVRQDVCSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGL
        R+++  V +DV + RL + Y  +  SVKCSEL   F    F+ND++AVK+A+ Y +EL M G+E K  +D +LL  +D WE F N DW  ++FERT+  L
Subjt:  RYSVWDVRQDVCSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGL

Query:  RNAIRGKAYHTKTRSRRNRRH--THCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTANAIAELVPTEME
        +NA++ K    K +   +  H  T+ +        VWAYE +S+L+ RVA  +N   IPR+LRWSC  S ++ +L REVF +  +  +  L  T++E
Subjt:  RNAIRGKAYHTKTRSRRNRRH--THCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTANAIAELVPTEME

A0A6N0C7Z4 Ulp1-like peptidase4.1e-4838.46Show/hide
Query:  KIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGLRYSVW
        KI P  HF + ++  +HL  S   IK KL    L LFR T FGHFLD +I+FNG LIHY LLREV + R D ISF I + V SFGR EF+++TGL  S  
Subjt:  KIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGLRYSVW

Query:  DVRQDVCSNRLWVKYLNNSRSVKCSELYATF-PLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGLRNAI
        +    V ++RL  K+    + +  S+L  TF      D+DD+ VK+AL Y +E+ + G++ +  VD  L    D+W +FNN DWG LVF RT+  L+ A+
Subjt:  DVRQDVCSNRLWVKYLNNSRSVKCSELYATF-PLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGLRNAI

Query:  -----RGKAYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TANAIAELVPTEMEST
             +GK   TKT+        + +    +   VWAYE + ++T      V+   IPR+LRW C  SP   +L R+VF S     N + E++P E E  
Subjt:  -----RGKAYHTKTRSRRNRRHTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFAST--TANAIAELVPTEMEST

Query:  YMV--RLLDQPNP-------NATSNPPNEATHMVDREC
         M    L+++ +P       N  S  P EA++  D +C
Subjt:  YMV--RLLDQPNP-------NATSNPPNEATHMVDREC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G35050.1 Domain of unknown function (DUF1985)6.1e-0433.33Show/hide
Query:  ITIIKNKLSQSLLALFRQTCFGHFLDA-------------SIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGL
        + IIK  LS SL +    T  G  + A              + F+GQL+ + ++R++V  R D I F I +K + F   EF L+TGL
Subjt:  ITIIKNKLSQSLLALFRQTCFGHFLDA-------------SIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACATTTGTGCTCAAAATCCAACCGGATGACCATTTCCCTGCTGCAATTACTTGTTGTTCTCACCTAACGAAATCAATTACCATTATAAAAAATAAGCTAAGTCAATC
TCTACTCGCCCTCTTTCGGCAAACTTGTTTTGGACACTTTCTAGACGCTTCCATAGTTTTCAACGGTCAACTTATCCATTACTTTCTCCTACGAGAGGTCGTGGAGGTCA
GACCGGATATAATAAGCTTCTATATAATAGACAAGGTTGTATCATTTGGTAGGGAAGAGTTCGACTTGATCACAGGTTTGCGTTATAGCGTTTGGGATGTGCGACAGGAT
GTGTGTAGTAATAGGCTATGGGTTAAGTACCTGAATAACTCTAGAAGTGTGAAATGCTCGGAGTTGTATGCAACATTCCCACTGATGAATTTCGATAATGACGATGAGGC
GGTGAAGATGGCACTGTTCTACATGATGGAGCTTGTGATGTTCGGTCGCGAGATGAAACAATTGGTAGACATGAACCTTCTATCTACCATTGACAATTGGGAGTCGTTCA
ACAATAAGGACTGGGGAAAATTGGTTTTTGAAAGGACTATGAAAGGATTGAGAAATGCTATTCGTGGCAAGGCCTATCATACAAAAACAAGAAGTCGAAGAAACCGGAGA
CATACACATTGTATGGATTCCCTTATGCGCTGCAAGTATGTGTGGGCTTACGAGATTGTGTCATCCTTGACTAATCGGGTTGCGACGCACGTTAATACCACCGGTATCCC
GCGTATACTAAGGTGGTCATGTGGTACTTCACCTTCATACACGATGCTTATGCGGGAAGTGTTTGCATCCACAACGGCAAACGCCATAGCAGAATTGGTGCCAACGGAGA
TGGAATCAACCTACATGGTGCGGTTGCTAGACCAACCGAATCCTAATGCTACATCAAATCCTCCAAATGAGGCAACTCACATGGTCGATAGAGAGTGTCCTGTGGAAGCG
GATGTTGATGAACATGTGGACATCACTGCTGAGGCGAATGTTGATCGATCGCCATCAGCTGGTGATGCATCAACAAGTCTAAGCGGGGGGAGTACGTGGGACATGTGTAT
GTCAAACTCTGCTCCCCCCATATGTCAGGAGCTATCAGAGTTGCAGAATAATATGACGACTATGCATCAGGCTTTTGATACAATTAGAACAAGTGTTCAAGAGGTGCGCG
ATATGGTACTGCATCTCATAAATTCACAGTCACCTAAATCAGAGTTACCGACTGAATTGAATGTAGATCGAGGTCAGTTAGAACCTGATTTGAACATCGGCGTTCCTGAC
CAACCCCGGTCGGAAGACAATAAACCAAAATCAGAGCGGGCAGACGATACATATGATGACGAAACAGATCATCGACCGTCTATGTTAGGCCATTGTAGCACCGAGTTGAC
GACCGTGTCGATGGGTGTTTCGAAACCCACCATCGAGACAACGGCCGACGAAGTACGAGTGTCACAGCAGTCCACCGTTCCCACCGTCGAACAGAATAAGAGCAATGAAT
GTATTTTCGCACCGAATGTTACCTGTCTCGTGATACCGAAAGAAGAGGTATGTAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGACATTTGTGCTCAAAATCCAACCGGATGACCATTTCCCTGCTGCAATTACTTGTTGTTCTCACCTAACGAAATCAATTACCATTATAAAAAATAAGCTAAGTCAATC
TCTACTCGCCCTCTTTCGGCAAACTTGTTTTGGACACTTTCTAGACGCTTCCATAGTTTTCAACGGTCAACTTATCCATTACTTTCTCCTACGAGAGGTCGTGGAGGTCA
GACCGGATATAATAAGCTTCTATATAATAGACAAGGTTGTATCATTTGGTAGGGAAGAGTTCGACTTGATCACAGGTTTGCGTTATAGCGTTTGGGATGTGCGACAGGAT
GTGTGTAGTAATAGGCTATGGGTTAAGTACCTGAATAACTCTAGAAGTGTGAAATGCTCGGAGTTGTATGCAACATTCCCACTGATGAATTTCGATAATGACGATGAGGC
GGTGAAGATGGCACTGTTCTACATGATGGAGCTTGTGATGTTCGGTCGCGAGATGAAACAATTGGTAGACATGAACCTTCTATCTACCATTGACAATTGGGAGTCGTTCA
ACAATAAGGACTGGGGAAAATTGGTTTTTGAAAGGACTATGAAAGGATTGAGAAATGCTATTCGTGGCAAGGCCTATCATACAAAAACAAGAAGTCGAAGAAACCGGAGA
CATACACATTGTATGGATTCCCTTATGCGCTGCAAGTATGTGTGGGCTTACGAGATTGTGTCATCCTTGACTAATCGGGTTGCGACGCACGTTAATACCACCGGTATCCC
GCGTATACTAAGGTGGTCATGTGGTACTTCACCTTCATACACGATGCTTATGCGGGAAGTGTTTGCATCCACAACGGCAAACGCCATAGCAGAATTGGTGCCAACGGAGA
TGGAATCAACCTACATGGTGCGGTTGCTAGACCAACCGAATCCTAATGCTACATCAAATCCTCCAAATGAGGCAACTCACATGGTCGATAGAGAGTGTCCTGTGGAAGCG
GATGTTGATGAACATGTGGACATCACTGCTGAGGCGAATGTTGATCGATCGCCATCAGCTGGTGATGCATCAACAAGTCTAAGCGGGGGGAGTACGTGGGACATGTGTAT
GTCAAACTCTGCTCCCCCCATATGTCAGGAGCTATCAGAGTTGCAGAATAATATGACGACTATGCATCAGGCTTTTGATACAATTAGAACAAGTGTTCAAGAGGTGCGCG
ATATGGTACTGCATCTCATAAATTCACAGTCACCTAAATCAGAGTTACCGACTGAATTGAATGTAGATCGAGGTCAGTTAGAACCTGATTTGAACATCGGCGTTCCTGAC
CAACCCCGGTCGGAAGACAATAAACCAAAATCAGAGCGGGCAGACGATACATATGATGACGAAACAGATCATCGACCGTCTATGTTAGGCCATTGTAGCACCGAGTTGAC
GACCGTGTCGATGGGTGTTTCGAAACCCACCATCGAGACAACGGCCGACGAAGTACGAGTGTCACAGCAGTCCACCGTTCCCACCGTCGAACAGAATAAGAGCAATGAAT
GTATTTTCGCACCGAATGTTACCTGTCTCGTGATACCGAAAGAAGAGGTATGTAGGTAG
Protein sequenceShow/hide protein sequence
MTFVLKIQPDDHFPAAITCCSHLTKSITIIKNKLSQSLLALFRQTCFGHFLDASIVFNGQLIHYFLLREVVEVRPDIISFYIIDKVVSFGREEFDLITGLRYSVWDVRQD
VCSNRLWVKYLNNSRSVKCSELYATFPLMNFDNDDEAVKMALFYMMELVMFGREMKQLVDMNLLSTIDNWESFNNKDWGKLVFERTMKGLRNAIRGKAYHTKTRSRRNRR
HTHCMDSLMRCKYVWAYEIVSSLTNRVATHVNTTGIPRILRWSCGTSPSYTMLMREVFASTTANAIAELVPTEMESTYMVRLLDQPNPNATSNPPNEATHMVDRECPVEA
DVDEHVDITAEANVDRSPSAGDASTSLSGGSTWDMCMSNSAPPICQELSELQNNMTTMHQAFDTIRTSVQEVRDMVLHLINSQSPKSELPTELNVDRGQLEPDLNIGVPD
QPRSEDNKPKSERADDTYDDETDHRPSMLGHCSTELTTVSMGVSKPTIETTADEVRVSQQSTVPTVEQNKSNECIFAPNVTCLVIPKEEVCR