; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016840 (gene) of Snake gourd v1 genome

Gene IDTan0016840
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTranslation initiation factor 3 subunit I
Genome locationLG09:32233105..32235864
RNA-Seq ExpressionTan0016840
SyntenyTan0016840
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004176 - ATP-dependent peptidase activity (molecular function)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR037219 - Peptidase M41-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7023968.1 hypothetical protein SDJN02_14996, partial [Cucurbita argyrosperma subsp. argyrosperma]1.6e-7857.01Show/hide
Query:  MFFTAADCDFTRWVEFHRKIPAIGAVVSS----------PKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMD
        MFFTAADCDFT  +EFHR+IP  G V+SS           KRRRALKLVDRALSKRQYKSALS VKQLQGKP GLRAFG+AKQI+KR SAMD        
Subjt:  MFFTAADCDFTRWVEFHRKIPAIGAVVSS----------PKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMD

Query:  FVSLQPLVDSILDSVQRCLQIS----------------------------------------------------------------------LLEEIDSF
         +SLQPLVDSILDS+Q CLQIS                                                                       L EIDS 
Subjt:  FVSLQPLVDSILDSVQRCLQIS----------------------------------------------------------------------LLEEIDSF

Query:  KILGETAGIKKFHNRANKGR------ISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRS
        KIL E A I   H R NKGR      ISS  LKQFSCVILGGLVAELLVAGNSDGHLADILK+ESVL WLGLPKS+AD L KWAAMNTA +MSRH ETRS
Subjt:  KILGETAGIKKFHNRANKGR------ISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRS

Query:  KLAEAMALGKPIGLCIDIIENCLQGKEV
         LA+ MALGK IG CID IENCLQG E+
Subjt:  KLAEAMALGKPIGLCIDIIENCLQGKEV

XP_004135797.2 uncharacterized protein LOC101213254 isoform X2 [Cucumis sativus]6.7e-8559.18Show/hide
Query:  MFFTAADCDFTRWVEFHRKIPAIGAVVSSPKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMDFVSLQPLVDS
        MF T A  DFT  +EFH ++P  G VVSS KRRRALKLVDRALSKRQYKSA+S VKQLQGKP GLR FG+AKQI K+   +DE E++RMD +SLQPLVDS
Subjt:  MFFTAADCDFTRWVEFHRKIPAIGAVVSSPKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMDFVSLQPLVDS

Query:  ILDSVQRCLQISLLE-----------------------------------------------------------------------------EIDSFKIL
        ILDSVQ+CLQISLLE                                                                             EIDS KIL
Subjt:  ILDSVQRCLQISLLE-----------------------------------------------------------------------------EIDSFKIL

Query:  GETAGIKKFHNRANKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALG
        GE A I+ F+NRANKG ISSKTL QFSCV LGGLVAELLVAGNSDGHLADILK+ SVL WLGLPKSEADL L+WAA NTA +MSRHCETRS+LAEAMAL 
Subjt:  GETAGIKKFHNRANKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALG

Query:  KPIGLCIDIIENCLQG
        KPIGLCID IENCL+G
Subjt:  KPIGLCIDIIENCLQG

XP_008450723.1 PREDICTED: uncharacterized protein LOC103492218 [Cucumis melo]1.1e-8257.91Show/hide
Query:  MFFTAADCDFTRWVEFHRKIPAIGAVVSSPKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMDFVSLQPLVDS
        MF TAA  DFT  +EFHR++P  G VVSS +RRRALKLVDRALSKRQYKSA+S VKQLQGKP GLR FG+AKQI KR   +DE E++ MD +SLQPLVDS
Subjt:  MFFTAADCDFTRWVEFHRKIPAIGAVVSSPKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMDFVSLQPLVDS

Query:  ILDSVQRCLQISLLE-----------------------------------------------------------------------------EIDSFKIL
        ILDSVQ+CLQIS LE                                                                             EIDS KIL
Subjt:  ILDSVQRCLQISLLE-----------------------------------------------------------------------------EIDSFKIL

Query:  GETAGIKKFHNRANKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALG
        G+ A IKKF+ RANKG ISSKTL QFSCV LGGLVAELLVAGNSDGHLADILK+ SVL W GLPKSEADL L+WAA NTA +MSRHCETR +LAEAM L 
Subjt:  GETAGIKKFHNRANKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALG

Query:  KPIGLCIDIIENCLQG
        KPIGLCI+ IENCL+G
Subjt:  KPIGLCIDIIENCLQG

XP_022968755.1 uncharacterized protein LOC111467900 isoform X1 [Cucurbita maxima]5.9e-8157.32Show/hide
Query:  MFFTAADCDFTRWVEFHRKIPAIGAVVSS----------PKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMD
        MFFTAADCDFT  +EFHR+IP  G V+SS           KRRRALKLVDRALSKRQYKSALS VKQLQGKP GLRAFG+AKQI+KR SAMDE EL+  D
Subjt:  MFFTAADCDFTRWVEFHRKIPAIGAVVSS----------PKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMD

Query:  FVSLQPLVDSILDSVQRCLQIS----------------------------------------------------------------------LLEEIDSF
         +SLQPLVDSILDS+Q CLQIS                                                                       L +IDS 
Subjt:  FVSLQPLVDSILDSVQRCLQIS----------------------------------------------------------------------LLEEIDSF

Query:  KILGETAGIKKFHNRANKGR------ISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRS
        KIL E A IK  H R NKGR      IS   L QFSCVILGGLVAELLVAGNSDGHLADILK+ESVL WLGLPKS+AD  LKWAAMNTA +MSRH ETR 
Subjt:  KILGETAGIKKFHNRANKGR------ISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRS

Query:  KLAEAMALGKPIGLCIDIIENCLQGKEV
         LA+ MALGK IG CID IENCLQG E+
Subjt:  KLAEAMALGKPIGLCIDIIENCLQGKEV

XP_038879283.1 uncharacterized protein LOC120071224 isoform X1 [Benincasa hispida]1.4e-8759.45Show/hide
Query:  MFFTAADCDFTRWVEFHRKIPAIGAVVSS----------PKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMD
        MFFTAA  DFT  +EFHR+IP  G V+SS           KRRRALKLVDRALSKRQYKSALS VKQLQGKP GLRAFG+AKQI KR S MDEPEL+R D
Subjt:  MFFTAADCDFTRWVEFHRKIPAIGAVVSS----------PKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMD

Query:  FVSLQPLVDSILDSVQRCLQISLLE---------------------------------------------------------------------------
         ++LQPLV SILDS+Q+CLQISLLE                                                                           
Subjt:  FVSLQPLVDSILDSVQRCLQISLLE---------------------------------------------------------------------------

Query:  -EIDSFKILGETAGIKKFHNRANKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRS
         EIDS KILGE A I+ FHNRAN+GRISSKTL QFSCV LGGLVAELLVAGNSDGHLADILK+ SVL WLG  KSEAD+ LKWAA NTA +MSRHCETRS
Subjt:  -EIDSFKILGETAGIKKFHNRANKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRS

Query:  KLAEAMALGKPIGLCIDIIENCLQGKEV
        +LAEAMALGKPIGLCID IENCLQG E+
Subjt:  KLAEAMALGKPIGLCIDIIENCLQGKEV

TrEMBL top hitse value%identityAlignment
A0A1S3BP83 uncharacterized protein LOC1034922185.2e-8357.91Show/hide
Query:  MFFTAADCDFTRWVEFHRKIPAIGAVVSSPKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMDFVSLQPLVDS
        MF TAA  DFT  +EFHR++P  G VVSS +RRRALKLVDRALSKRQYKSA+S VKQLQGKP GLR FG+AKQI KR   +DE E++ MD +SLQPLVDS
Subjt:  MFFTAADCDFTRWVEFHRKIPAIGAVVSSPKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMDFVSLQPLVDS

Query:  ILDSVQRCLQISLLE-----------------------------------------------------------------------------EIDSFKIL
        ILDSVQ+CLQIS LE                                                                             EIDS KIL
Subjt:  ILDSVQRCLQISLLE-----------------------------------------------------------------------------EIDSFKIL

Query:  GETAGIKKFHNRANKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALG
        G+ A IKKF+ RANKG ISSKTL QFSCV LGGLVAELLVAGNSDGHLADILK+ SVL W GLPKSEADL L+WAA NTA +MSRHCETR +LAEAM L 
Subjt:  GETAGIKKFHNRANKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALG

Query:  KPIGLCIDIIENCLQG
        KPIGLCI+ IENCL+G
Subjt:  KPIGLCIDIIENCLQG

A0A6J1DM53 uncharacterized protein LOC1110218383.9e-7853.45Show/hide
Query:  MFFTAADCDFTRWVEFHRKIPA-IGA-VVSSPKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMDFVSLQPLV
        MFFTAAD +FT  +EFHR+IPA +G     + KRRRALKLVDRALSKRQYK+ALS VKQLQGKP GLRAFG+AKQI+K LS++ E EL+  + +SLQPLV
Subjt:  MFFTAADCDFTRWVEFHRKIPA-IGA-VVSSPKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMDFVSLQPLV

Query:  DSILDSVQRCLQISLLE---------------------------------------------------------------------------EIDSFKIL
        DSILDS+Q+C QISLL+                                                                           EIDSFKIL
Subjt:  DSILDSVQRCLQISLLE---------------------------------------------------------------------------EIDSFKIL

Query:  GETAGIKKFHNR-ANKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILK----------------------------MESVLRWLGLPKSEADLL
         E A ++KF NR AN GRIS KTLKQFSCV LGGLVAELLVAGNSDGHLADILK                            +ESVLRWLGL K+ ADL 
Subjt:  GETAGIKKFHNR-ANKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILK----------------------------MESVLRWLGLPKSEADLL

Query:  LKWAAMNTACVMSRHCETRSKLAEAMALGKPIGLCIDIIENCLQGKEV
        LKWAA NT  V+SRHCETRS+LAEAMALGKPIG+CID IENCLQG+E+
Subjt:  LKWAAMNTACVMSRHCETRSKLAEAMALGKPIGLCIDIIENCLQGKEV

A0A6J1HDU1 uncharacterized protein LOC1114619601.3e-7857.01Show/hide
Query:  MFFTAADCDFTRWVEFHRKIPAIGAVVSS----------PKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMD
        MFFTAADCDFT  +EFHR+IP  G V+SS           KRRRALKLVDRALSKRQYKSALS VKQLQGKP GLRAFG+AKQI+KR SAMD        
Subjt:  MFFTAADCDFTRWVEFHRKIPAIGAVVSS----------PKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMD

Query:  FVSLQPLVDSILDSVQRCLQIS----------------------------------------------------------------------LLEEIDSF
         +SLQPLVDSILDS+Q CLQIS                                                                       L EIDS 
Subjt:  FVSLQPLVDSILDSVQRCLQIS----------------------------------------------------------------------LLEEIDSF

Query:  KILGETAGIKKFHNRANKGR------ISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRS
        KIL E A I   H R NKGR      ISS  LKQFSCVILGGLVAELLVAGNSDGHLADILK+ESVL WLGLPKS+AD L KWAAMNTA +MSRH ETRS
Subjt:  KILGETAGIKKFHNRANKGR------ISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRS

Query:  KLAEAMALGKPIGLCIDIIENCLQGKEV
         LA+ MALGK IG CID IENCLQG E+
Subjt:  KLAEAMALGKPIGLCIDIIENCLQGKEV

A0A6J1HUE1 uncharacterized protein LOC111467900 isoform X21.9e-7758.58Show/hide
Query:  MFFTAADCDFTRWVEFHRKIPAIGAVVSS----------PKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMD
        MFFTAADCDFT  +EFHR+IP  G V+SS           KRRRALKLVDRALSKRQYKSALS VKQLQGKP GLRAFG+AKQI+KR SAMDE EL+  D
Subjt:  MFFTAADCDFTRWVEFHRKIPAIGAVVSS----------PKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMD

Query:  FVSLQPLVDSILDSVQRCLQISL--LE-----------------------------------------EIDSFKILG--------------ETAGIKKFH
         +SLQPLVDSILDS+Q CLQIS   LE                                         E+ S + L               E  G +   
Subjt:  FVSLQPLVDSILDSVQRCLQISL--LE-----------------------------------------EIDSFKILG--------------ETAGIKKFH

Query:  NRANKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALGKPIGLCIDII
         + NKG IS   L QFSCVILGGLVAELLVAGNSDGHLADILK+ESVL WLGLPKS+AD  LKWAAMNTA +MSRH ETR  LA+ MALGK IG CID I
Subjt:  NRANKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALGKPIGLCIDII

Query:  ENCLQGKEV
        ENCLQG E+
Subjt:  ENCLQGKEV

A0A6J1HY40 uncharacterized protein LOC111467900 isoform X12.9e-8157.32Show/hide
Query:  MFFTAADCDFTRWVEFHRKIPAIGAVVSS----------PKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMD
        MFFTAADCDFT  +EFHR+IP  G V+SS           KRRRALKLVDRALSKRQYKSALS VKQLQGKP GLRAFG+AKQI+KR SAMDE EL+  D
Subjt:  MFFTAADCDFTRWVEFHRKIPAIGAVVSS----------PKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMD

Query:  FVSLQPLVDSILDSVQRCLQIS----------------------------------------------------------------------LLEEIDSF
         +SLQPLVDSILDS+Q CLQIS                                                                       L +IDS 
Subjt:  FVSLQPLVDSILDSVQRCLQIS----------------------------------------------------------------------LLEEIDSF

Query:  KILGETAGIKKFHNRANKGR------ISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRS
        KIL E A IK  H R NKGR      IS   L QFSCVILGGLVAELLVAGNSDGHLADILK+ESVL WLGLPKS+AD  LKWAAMNTA +MSRH ETR 
Subjt:  KILGETAGIKKFHNRANKGR------ISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRS

Query:  KLAEAMALGKPIGLCIDIIENCLQGKEV
         LA+ MALGK IG CID IENCLQG E+
Subjt:  KLAEAMALGKPIGLCIDIIENCLQGKEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G54680.1 unknown protein3.2e-2449.06Show/hide
Query:  NKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALGKPIGLCIDIIENC
        N+G ISSKTL  FSCVILGG+V E ++ G S+G  +DI+K+  VLRWLG  +SE +  +KWA  NT  ++  H E R  LAE MA  KPI  CI+ IE+ 
Subjt:  NKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALGKPIGLCIDIIENC

Query:  LQGKEV
        +   ++
Subjt:  LQGKEV

AT1G54680.2 unknown protein3.2e-2449.06Show/hide
Query:  NKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALGKPIGLCIDIIENC
        N+G ISSKTL  FSCVILGG+V E ++ G S+G  +DI+K+  VLRWLG  +SE +  +KWA  NT  ++  H E R  LAE MA  KPI  CI+ IE+ 
Subjt:  NKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALGKPIGLCIDIIENC

Query:  LQGKEV
        +   ++
Subjt:  LQGKEV

AT1G54680.3 unknown protein3.2e-2449.06Show/hide
Query:  NKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALGKPIGLCIDIIENC
        N+G ISSKTL  FSCVILGG+V E ++ G S+G  +DI+K+  VLRWLG  +SE +  +KWA  NT  ++  H E R  LAE MA  KPI  CI+ IE+ 
Subjt:  NKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALGKPIGLCIDIIENC

Query:  LQGKEV
        +   ++
Subjt:  LQGKEV

AT5G27290.1 unknown protein7.3e-2928.33Show/hide
Query:  RRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMDFVSLQPLVDSILDSVQRCLQISLLE---------EIDSFKI
        RR+AL+ VD  LS    ++ALS VK LQGKPDGLR FG+A+Q+ +RL  ++E +L+ ++  SL    D+ L S++R LQI+ +          ++ S ++
Subjt:  RRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMDFVSLQPLVDSILDSVQRCLQISLLE---------EIDSFKI

Query:  LGETAGI----------------------------KKFHNR----------------------------------------------------ANKGRIS
           T G                             +++HNR                                                     N G++S
Subjt:  LGETAGI----------------------------KKFHNR----------------------------------------------------ANKGRIS

Query:  SKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALGKPIGLCIDIIENCLQGKEV
        +  L +FSC+ L G+  E L+ G ++G L DI K++ +++ LG  + +AD  ++W+ +NT  ++ RH   RSKLA+AM+ G+ +G CI IIE+ +   ++
Subjt:  SKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAMALGKPIGLCIDIIENCLQGKEV

AT5G27290.2 unknown protein1.6e-1248.15Show/hide
Query:  RRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMDFVSLQPLVDSILDSVQRCLQIS
        RR+AL+ VD  LS    ++ALS VK LQGKPDGLR FG+A+Q+ +RL  ++E +L+ ++  SL    D+ L S++R LQI+
Subjt:  RRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMDFVSLQPLVDSILDSVQRCLQIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTTCACCGCGGCTGATTGCGATTTTACCCGCTGGGTTGAGTTTCATCGGAAGATTCCGGCGATCGGCGCCGTCGTATCGTCGCCGAAACGACGTCGTGCTTTGAA
GCTTGTGGATCGAGCACTCTCAAAGCGTCAATACAAATCCGCTCTCTCGTTTGTTAAGCAGTTGCAGGGGAAACCTGATGGCCTTCGTGCTTTCGGTTCCGCCAAACAGA
TAAGCAAGAGGCTTTCAGCAATGGACGAACCAGAGCTCGATAGAATGGACTTTGTATCCCTCCAACCATTGGTGGATTCGATTCTGGATTCAGTTCAACGATGTCTTCAG
ATTTCTTTACTTGAGGAGATTGATTCATTTAAGATTTTGGGTGAAACTGCTGGTATCAAAAAGTTTCATAACAGGGCAAATAAAGGCAGAATTTCCTCAAAGACATTGAA
GCAGTTTTCATGTGTAATATTAGGAGGTTTAGTGGCTGAACTTCTGGTTGCTGGAAATTCTGATGGACATTTAGCAGATATACTCAAAATGGAGAGTGTTCTTAGATGGC
TTGGCCTTCCAAAGTCTGAAGCTGATCTTCTTTTAAAATGGGCTGCAATGAATACAGCATGCGTAATGTCCCGCCATTGCGAAACAAGATCAAAACTTGCAGAGGCCATG
GCGTTGGGGAAACCGATCGGGCTCTGTATCGACATAATCGAAAACTGTTTGCAGGGAAAGGAGGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCTTCACCGCGGCTGATTGCGATTTTACCCGCTGGGTTGAGTTTCATCGGAAGATTCCGGCGATCGGCGCCGTCGTATCGTCGCCGAAACGACGTCGTGCTTTGAA
GCTTGTGGATCGAGCACTCTCAAAGCGTCAATACAAATCCGCTCTCTCGTTTGTTAAGCAGTTGCAGGGGAAACCTGATGGCCTTCGTGCTTTCGGTTCCGCCAAACAGA
TAAGCAAGAGGCTTTCAGCAATGGACGAACCAGAGCTCGATAGAATGGACTTTGTATCCCTCCAACCATTGGTGGATTCGATTCTGGATTCAGTTCAACGATGTCTTCAG
ATTTCTTTACTTGAGGAGATTGATTCATTTAAGATTTTGGGTGAAACTGCTGGTATCAAAAAGTTTCATAACAGGGCAAATAAAGGCAGAATTTCCTCAAAGACATTGAA
GCAGTTTTCATGTGTAATATTAGGAGGTTTAGTGGCTGAACTTCTGGTTGCTGGAAATTCTGATGGACATTTAGCAGATATACTCAAAATGGAGAGTGTTCTTAGATGGC
TTGGCCTTCCAAAGTCTGAAGCTGATCTTCTTTTAAAATGGGCTGCAATGAATACAGCATGCGTAATGTCCCGCCATTGCGAAACAAGATCAAAACTTGCAGAGGCCATG
GCGTTGGGGAAACCGATCGGGCTCTGTATCGACATAATCGAAAACTGTTTGCAGGGAAAGGAGGTATAG
Protein sequenceShow/hide protein sequence
MFFTAADCDFTRWVEFHRKIPAIGAVVSSPKRRRALKLVDRALSKRQYKSALSFVKQLQGKPDGLRAFGSAKQISKRLSAMDEPELDRMDFVSLQPLVDSILDSVQRCLQ
ISLLEEIDSFKILGETAGIKKFHNRANKGRISSKTLKQFSCVILGGLVAELLVAGNSDGHLADILKMESVLRWLGLPKSEADLLLKWAAMNTACVMSRHCETRSKLAEAM
ALGKPIGLCIDIIENCLQGKEV