CuGenDBv2

Sgr022779 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr022779
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: transcription initiation factor TFIID subunit 7-like
Genome location: tig00000589:1703214..1704277
RNA-Seq Expression: Sgr022779
Synteny: Sgr022779
Gene Ontology terms: NA
InterPro domains: IPR012881 - Protein of unknown function DUF1685
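
The genome location field above follows a `contig:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span can be parsed and measured as follows. The function names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'contig:start..end' string into contig name, start, and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


contig, start, end = parse_location("tig00000589:1703214..1704277")
print(contig, start, end, span_length(start, end))
# prints: tig00000589 1703214 1704277 1064
```

Under that assumption the gene spans 1064 bp on the contig, which is longer than the 720-nt CDS listed under Sequences.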


Homology
GenBank top hits | e value | % identity | Alignment
XP_004137815.3 uncharacterized protein LOC101215662 [Cucumis sativus] | 1.7e-72 | 66.8
Query:  MAMNT-NTLCLVSAMDRLWYHQIILC-SDPL-SSHVPNL---DTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYK
        MAMNT NTLCLVSAMDRLWYHQIILC SDPL +SH PNL    ++FPFT F       P  P SPL+D+TIL SS SS   SSD+ISL SQE+ SN+E K
Subjt:  MAMNT-NTLCLVSAMDRLWYHQIILC-SDPL-SSHVPNL---DTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYK

Query:  EKEVGRRE-STQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEEE---------ENDDED-
         K+  +RE S  ++ N LK SVGRKLNKS SC+SLGELELEEVKGFMDLGFEFKRE+LSPQMV L+PGLQRL T  NKQ LE++         ENDD+D 
Subjt:  EKEVGRRE-STQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEEE---------ENDDED-

Query:  ----KRAIARPYLSEAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ
            KR IARPYLSEAW I+RPNSPL++LR+PKVSST DMKKHL+ WAKTVA E+Q
Subjt:  ----KRAIARPYLSEAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ

XP_008442668.1 PREDICTED: putative uncharacterized protein YGR160W [Cucumis melo] | 1.3e-75 | 67.06
Query:  MAMNT-NTLCLVSAMDRLWYHQIILCSDPLSSHVPNL---DTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEK
        MAMNT NTLCLVSAMDRLWYHQIILCSDP +SH PN     ++FPFT F       P  P SPL+D+TI L   SS S SSD+ISL SQE  +N+E K+K
Subjt:  MAMNT-NTLCLVSAMDRLWYHQIILCSDPLSSHVPNL---DTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEK

Query:  EVGRRESTQ-KNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEE---------EENDDED---
        +  +RES++ ++ NNLK SVGRKLNKS SC+SLGELELEEVKGFMDLGFEFKRE+LSPQMV L+PGLQRL T TNKQ LEE         +ENDD+D   
Subjt:  EVGRRESTQ-KNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEE---------EENDDED---

Query:  ---KRAIARPYLSEAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ
           KR IARPYLSEAW I+RPNSPL+NLR+PKVSST DMKKHL+ WAKTVA E+Q
Subjt:  ---KRAIARPYLSEAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ

XP_022145659.1 uncharacterized protein LOC111015056 [Momordica charantia] | 9.9e-89 | 77.55
Query:  MAMNTNTLCLVSAMDRLWYHQIILCSDPL-SSHVPNLDTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSS-DDISLDSQESCSNDEYKEKEV
        MAMNT TLCLVSAMDRLWYHQIIL SDPL SSH+PN D T PFTKFPSCPSPS     SPL +ETI+ SS S  SVSS +DISLDS E CSND+ KEKEV
Subjt:  MAMNTNTLCLVSAMDRLWYHQIILCSDPL-SSHVPNLDTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSS-DDISLDSQESCSNDEYKEKEV

Query:  GRRESTQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEEEE----NDDED-KRAIARPYLS
         +REST+K PNNLK SVG KLNKS SCRSLGELELEEVKGF+DLGFEFKRENL+PQMVTLLPGLQRLG   NK+K  EEE    NDDED KR  +RPYLS
Subjt:  GRRESTQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEEEE----NDDED-KRAIARPYLS

Query:  EAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQEES
        EAWTIKRPNSPL+ LR+ KVSST DMKKHLKFWAKTVASE+Q+ES
Subjt:  EAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQEES

XP_022983601.1 uncharacterized protein LOC111482158 isoform X2 [Cucurbita maxima] | 1.8e-58 | 58.37
Query:  NTNTLCLVSAMDRLWYHQIILCS-DPLSSHVPNLDTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEKEVGRRE
        NTNTLCLVSAMDRLW+HQIIL S  P  SH   L  TFPF+ FP                     SSLSS  +  DD SL SQE  SND  K K+ G+ E
Subjt:  NTNTLCLVSAMDRLWYHQIILCS-DPLSSHVPNLDTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEKEVGRRE

Query:  STQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEEEENDDEDKRAIARPYLSEAWTIKRPN
        + +++  + + ++ +KLNK++SC+SLGELE+EEVKGFMDLGF+F+ ENLSPQMV L+PGLQR  T  +KQ LE++++DD+ KR IARPYLSEAWTI RPN
Subjt:  STQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEEEENDDEDKRAIARPYLSEAWTIKRPN

Query:  SPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ
        SPL+ LR+PKVSST DMKK LK WA+TVA E+Q
Subjt:  SPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ

XP_038903410.1 uncharacterized protein LOC120090009 isoform X1 [Benincasa hispida] | 2.3e-85 | 74.18
Query:  MAMNTNTLCLVSAMDRLWYHQIILCSDPLSSHVPNL--DTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEKEV
        MAMNTNTLCLVS MDRLWYHQIIL SDPLSSH+PN    ++F FT FPS PSPSP LPFSPL+D++IL SS  SPSVSSD+ISL SQ+  SNDE K K+ 
Subjt:  MAMNTNTLCLVSAMDRLWYHQIILCSDPLSSHVPNL--DTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEKEV

Query:  GRRESTQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGT-YTNKQKLEEE------ENDDEDKRAIARPY
        G++E ++++ NNLKLSVG KLNKS SC+SLGELELEEVKGFMDLGFEFK+ENLSP+MV LLPGLQRL T   NKQ LEEE      ENDD+ KR IARPY
Subjt:  GRRESTQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGT-YTNKQKLEEE------ENDDEDKRAIARPY

Query:  LSEAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ
        LSEAWTIKR NSPL+NLR+PKVSST DMKKHLK WAKTVA E+Q
Subjt:  LSEAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ

TrEMBL top hits | e value | % identity | Alignment
A0A1S3B6W6 Uncharacterized protein | 6.1e-76 | 67.06
Query:  MAMNT-NTLCLVSAMDRLWYHQIILCSDPLSSHVPNL---DTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEK
        MAMNT NTLCLVSAMDRLWYHQIILCSDP +SH PN     ++FPFT F       P  P SPL+D+TI L   SS S SSD+ISL SQE  +N+E K+K
Subjt:  MAMNT-NTLCLVSAMDRLWYHQIILCSDPLSSHVPNL---DTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEK

Query:  EVGRRESTQ-KNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEE---------EENDDED---
        +  +RES++ ++ NNLK SVGRKLNKS SC+SLGELELEEVKGFMDLGFEFKRE+LSPQMV L+PGLQRL T TNKQ LEE         +ENDD+D   
Subjt:  EVGRRESTQ-KNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEE---------EENDDED---

Query:  ---KRAIARPYLSEAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ
           KR IARPYLSEAW I+RPNSPL+NLR+PKVSST DMKKHL+ WAKTVA E+Q
Subjt:  ---KRAIARPYLSEAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ

A0A6J1CVW5 uncharacterized protein LOC111015056 | 2.8e-89 | 77.96
Query:  MAMNTNTLCLVSAMDRLWYHQIILCSDPL-SSHVPNLDTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSS-DDISLDSQESCSNDEYKEKEV
        MAMNT TLCLVSAMDRLWYHQIIL SDPL SSH+PN D T PFTKFPSCPSPS     SPL +ETI+ SS S  SVSS DDISLDS E CSND+ KEKEV
Subjt:  MAMNTNTLCLVSAMDRLWYHQIILCSDPL-SSHVPNLDTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSS-DDISLDSQESCSNDEYKEKEV

Query:  GRRESTQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEEEE----NDDED-KRAIARPYLS
         +REST+K PNNLK SVG KLNKS SCRSLGELELEEVKGF+DLGFEFKRENL+PQMVTLLPGLQRLG   NK+K  EEE    NDDED KR  +RPYLS
Subjt:  GRRESTQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEEEE----NDDED-KRAIARPYLS

Query:  EAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQEES
        EAWTIKRPNSPL+ LR+ KVSST DMKKHLKFWAKTVASE+Q+ES
Subjt:  EAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQEES

A0A6J1F521 uncharacterized protein LOC111442189 | 2.2e-57 | 57.26
Query:  NTNTLCLVSAMDRLWYHQIILCSD-PLSSHVPNLDTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEKEVGRRE
        NT T CLVSAMDRLW+HQIIL S  P +SH   L  TFPF+ FP                     SSLSS  +S DD SL S E    +  K K+ G+ E
Subjt:  NTNTLCLVSAMDRLWYHQIILCSD-PLSSHVPNLDTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEKEVGRRE

Query:  STQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEE-EENDDEDKRAIARPYLSEAWTIKRP
        S Q++ ++ + ++  KLNKS+SC+SLGELELEEVKGFMDLGF+F+ ENLSPQM+ L+PGLQR     +KQ L++ +ENDD+ KR IARPYLSEAWTI RP
Subjt:  STQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEE-EENDDEDKRAIARPYLSEAWTIKRP

Query:  NSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ
        NSPL+NLR+PK+SST DMKKHL+ WA TVA E+Q
Subjt:  NSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ

A0A6J1J2S9 uncharacterized protein LOC111482158 isoform X2 | 8.8e-59 | 58.37
Query:  NTNTLCLVSAMDRLWYHQIILCS-DPLSSHVPNLDTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEKEVGRRE
        NTNTLCLVSAMDRLW+HQIIL S  P  SH   L  TFPF+ FP                     SSLSS  +  DD SL SQE  SND  K K+ G+ E
Subjt:  NTNTLCLVSAMDRLWYHQIILCS-DPLSSHVPNLDTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEKEVGRRE

Query:  STQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEEEENDDEDKRAIARPYLSEAWTIKRPN
        + +++  + + ++ +KLNK++SC+SLGELE+EEVKGFMDLGF+F+ ENLSPQMV L+PGLQR  T  +KQ LE++++DD+ KR IARPYLSEAWTI RPN
Subjt:  STQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEEEENDDEDKRAIARPYLSEAWTIKRPN

Query:  SPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ
        SPL+ LR+PKVSST DMKK LK WA+TVA E+Q
Subjt:  SPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ

A0A6J1J6B4 uncharacterized protein LOC111482158 isoform X1 | 1.5e-58 | 56.61
Query:  NTNTLCLVSAMDRLWYHQIILCS-DPLSSHVPNLDTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEKEVGRRE
        NTNTLCLVSAMDRLW+HQIIL S  P  SH   L  TFPF+ FP                     SSLSS  +  DD SL SQE  SND  K K+ G+ E
Subjt:  NTNTLCLVSAMDRLWYHQIILCS-DPLSSHVPNLDTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEKEVGRRE

Query:  STQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEEEENDDED---------KRAIARPYLS
        + +++  + + ++ +KLNK++SC+SLGELE+EEVKGFMDLGF+F+ ENLSPQMV L+PGLQR  T  +KQ LE++++DD+D         KR IARPYLS
Subjt:  STQKNPNNLKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEEEENDDED---------KRAIARPYLS

Query:  EAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ
        EAWTI RPNSPL+ LR+PKVSST DMKK LK WA+TVA E+Q
Subjt:  EAWTIKRPNSPLINLRVPKVSSTLDMKKHLKFWAKTVASELQ

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT2G31560.1 Protein of unknown function (DUF1685) | 2.1e-04 | 30.17
Query:  RSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKL----------EEEENDDEDKRAIARPYLSEAWTIKRPNSPLINLRVPKVSS
        +SL + +LEE+KG +DLGF F  + + P++   LP L+    Y+  QK            +EE+D         P  +  W I  P              
Subjt:  RSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKL----------EEEENDDEDKRAIARPYLSEAWTIKRPNSPLINLRVPKVSS

Query:  TLDMKKHLKFWAKTVA
          D+K  LK+WA+TVA
Subjt:  TLDMKKHLKFWAKTVA

AT2G31560.2 Protein of unknown function (DUF1685) | 2.1e-04 | 30.17
Query:  RSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKL----------EEEENDDEDKRAIARPYLSEAWTIKRPNSPLINLRVPKVSS
        +SL + +LEE+KG +DLGF F  + + P++   LP L+    Y+  QK            +EE+D         P  +  W I  P              
Subjt:  RSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKL----------EEEENDDEDKRAIARPYLSEAWTIKRPNSPLINLRVPKVSS

Query:  TLDMKKHLKFWAKTVA
          D+K  LK+WA+TVA
Subjt:  TLDMKKHLKFWAKTVA

AT2G42760.1 unknown protein | 2.7e-15 | 34.65
Query:  DETILLSSLSSP--SVSSDDISLDS-------------QESCSNDEYKEKEVGRRESTQKNPNNLKLSVGRKLN-KSISCRSLGELELEEVKGFMDLGFE
        DET++ +S  +   S SSDD+ L               Q   S  E     +  RE         +    +K N ++   +S+ +LE EE+KGFMDLGF 
Subjt:  DETILLSSLSSP--SVSSDDISLDS-------------QESCSNDEYKEKEVGRRESTQKNPNNLKLSVGRKLN-KSISCRSLGELELEEVKGFMDLGFE

Query:  FKR-ENLSPQMVTLLPGLQRLGTYTNKQKLEEEENDDEDK---RAIARPYLSEAWTI------KRPNSPLINLRV--PKVSSTLDMKKHLKFWAKTVASE
        F   ++    +V++LPGLQRL    +    EEEE ++EDK      ARPYLSEAW        K+  +P I  RV  P  +S +D+K +L+ WA  VAS 
Subjt:  FKR-ENLSPQMVTLLPGLQRLGTYTNKQKLEEEENDDEDK---RAIARPYLSEAWTI------KRPNSPLINLRV--PKVSSTLDMKKHLKFWAKTVASE

Query:  LQ
        ++
Subjt:  LQ

AT2G43340.1 Protein of unknown function (DUF1685) | 1.9e-05 | 27.37
Query:  SSLSSPSVSSDDISLDSQESCSNDEYKEKEVG----------RRESTQKNPNNLKLS---VGRKLNKSIS-CRSLGELELEEVKGFMDLGFEFKRENLSP
        SS  +   S  +IS  S  SCS  E +E+E+           + +  +K  +N+ L    V   +N  +   +SL + +LEE+KG +DLGF F  E + P
Subjt:  SSLSSPSVSSDDISLDSQESCSNDEYKEKEVG----------RRESTQKNPNNLKLS---VGRKLNKSIS-CRSLGELELEEVKGFMDLGFEFKRENLSP

Query:  QMVTLLPGLQRLGTYTNKQKLEEEENDDEDKRAIARPYLSEAWTIKRPNSPLINLRVPKVSSTL-DMKKHLKFWAKTVA
        ++   LP L+    Y+  QK  ++++      +  +    ++  +  P SP+ + ++        D+K  LKFWA+ VA
Subjt:  QMVTLLPGLQRLGTYTNKQKLEEEENDDEDKRAIARPYLSEAWTIKRPNSPLINLRVPKVSSTL-DMKKHLKFWAKTVA


Sequences
CDS sequence
ATGGCTATGAACACTAATACTTTATGTCTAGTTTCAGCTATGGATCGCCTTTGGTACCACCAAATCATTCTTTGCTCAGATCCATTGAGTTCTCATGTTCCCAATCTTGA
CACAACTTTTCCTTTCACGAAGTTTCCTTCTTGCCCATCTCCCTCTCCCCAACTCCCCTTTTCACCCCTAATAGATGAAACTATTCTTCTGTCCTCTTTGTCTTCTCCAT
CGGTTTCCTCTGATGACATCTCCCTCGACTCACAGGAAAGTTGTAGTAATGATGAATACAAGGAGAAAGAAGTTGGGAGAAGAGAGTCAACTCAAAAAAATCCCAACAAT
CTCAAACTTTCAGTGGGGAGAAAATTGAACAAATCTATAAGTTGTAGAAGCTTGGGAGAGTTGGAACTTGAAGAAGTTAAGGGGTTTATGGATTTAGGGTTTGAATTCAA
GAGAGAAAATTTGAGCCCTCAAATGGTGACGTTGTTGCCTGGTTTGCAAAGGCTTGGAACATACACAAACAAACAGAAACTTGAAGAAGAAGAAAATGATGATGAAGATA
AGAGAGCTATAGCGAGGCCATATCTTTCAGAGGCATGGACAATAAAAAGACCGAATTCTCCTCTTATAAATCTAAGGGTGCCAAAGGTTTCTTCGACCTTAGACATGAAG
AAACATCTCAAGTTCTGGGCTAAGACTGTTGCATCTGAACTTCAAGAAGAATCTTCTTAA
mRNA sequence
ATGGCTATGAACACTAATACTTTATGTCTAGTTTCAGCTATGGATCGCCTTTGGTACCACCAAATCATTCTTTGCTCAGATCCATTGAGTTCTCATGTTCCCAATCTTGA
CACAACTTTTCCTTTCACGAAGTTTCCTTCTTGCCCATCTCCCTCTCCCCAACTCCCCTTTTCACCCCTAATAGATGAAACTATTCTTCTGTCCTCTTTGTCTTCTCCAT
CGGTTTCCTCTGATGACATCTCCCTCGACTCACAGGAAAGTTGTAGTAATGATGAATACAAGGAGAAAGAAGTTGGGAGAAGAGAGTCAACTCAAAAAAATCCCAACAAT
CTCAAACTTTCAGTGGGGAGAAAATTGAACAAATCTATAAGTTGTAGAAGCTTGGGAGAGTTGGAACTTGAAGAAGTTAAGGGGTTTATGGATTTAGGGTTTGAATTCAA
GAGAGAAAATTTGAGCCCTCAAATGGTGACGTTGTTGCCTGGTTTGCAAAGGCTTGGAACATACACAAACAAACAGAAACTTGAAGAAGAAGAAAATGATGATGAAGATA
AGAGAGCTATAGCGAGGCCATATCTTTCAGAGGCATGGACAATAAAAAGACCGAATTCTCCTCTTATAAATCTAAGGGTGCCAAAGGTTTCTTCGACCTTAGACATGAAG
AAACATCTCAAGTTCTGGGCTAAGACTGTTGCATCTGAACTTCAAGAAGAATCTTCTTAA
Protein sequence
MAMNTNTLCLVSAMDRLWYHQIILCSDPLSSHVPNLDTTFPFTKFPSCPSPSPQLPFSPLIDETILLSSLSSPSVSSDDISLDSQESCSNDEYKEKEVGRRESTQKNPNN
LKLSVGRKLNKSISCRSLGELELEEVKGFMDLGFEFKRENLSPQMVTLLPGLQRLGTYTNKQKLEEEENDDEDKRAIARPYLSEAWTIKRPNSPLINLRVPKVSSTLDMK
KHLKFWAKTVASELQEESS
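
The protein sequence above corresponds to the CDS read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code (the record does not say which table was used). The names `CODON_TABLE` and `translate` are hypothetical, and the two input strings are the first 30 nt and the last 60 nt of the CDS listed above.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt of the CDS
print(translate("ATGGCTATGAACACTAATACTTTATGTCTA"))
# prints: MAMNTNTLCL

# Last 60 nt of the CDS (includes the TAA stop codon)
print(translate("AAACATCTCAAGTTCTGGGCTAAGACTGTTGCATCTGAACTTCAAGAAGAATCTTCTTAA"))
# prints: KHLKFWAKTVASELQEESS
```

Both outputs match the start and end of the listed protein sequence, so the CDS and protein fields agree under the standard code.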