CuGenDBv2

Sgr020039 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr020039
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: 2S albumin
Genome location: tig00153446:1264671..1267024
RNA-Seq Expression: Sgr020039
Synteny: Sgr020039
Gene Ontology terms:
  GO:0043086 - negative regulation of catalytic activity (biological process)
  GO:0000322 - storage vacuole (cellular component)
  GO:0033095 - aleurone grain (cellular component)
  GO:0015066 - alpha-amylase inhibitor activity (molecular function)
  GO:0045735 - nutrient reservoir activity (molecular function)
InterPro domains:
  IPR016140 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain
  IPR036312 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain superfamily
  IPR044723 - AAI/SS protein, conserved domain
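
The genome location above uses a `contig:start..end` layout. A minimal sketch of how such a string could be split and the gene span computed, assuming 1-based inclusive coordinates (an assumption, not stated on this page); the helper name `parse_location` is hypothetical:

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

contig, start, end = parse_location("tig00153446:1264671..1267024")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(contig, span)
```

Under that assumption the span covers the whole gene model, so it can be longer than the CDS listed under Sequences.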


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6600761.1 hypothetical protein SDJN03_05994, partial [Cucurbita argyrosperma subsp. sororia] | e value: 7.3e-44 | % identity: 74.81
Query:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE
        MARLT IIAL AVALLVADAYAYRTTITTVEV+E+ +G ++RC QM AREEL SC+ YL Q SR VLQMRGI+N W REGG  DECCREL+NVDEECRC+
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE

Query:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI
        +LEEI   EQR+ R QEGRQMLQ+ARNLPS+
Subjt:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI

Q39649.1 RecName: Full=2S albumin; Contains: RecName: Full=2S albumin small chain; Contains: RecName: Full=2S albumin large chain; Flags: Precursor [Cucurbita maxima] | e value: 4.6e-46 | % identity: 77.86
Query:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE
        MARLT IIAL AVALLVADAYAYRTTITTVEV+E+ QGR++RC QM AREEL SC+ YL Q SR VLQMRGI+N W REGG  DECCRELKNVDEECRC+
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE

Query:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI
        +LEEIA  EQR+ R QEGRQMLQ+ARNLPS+
Subjt:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI

XP_022942593.1 2S albumin [Cucurbita moschata] | e value: 1.2e-43 | % identity: 74.81
Query:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREG-GLDECCRELKNVDEECRCE
        MARLT IIAL AVALLVADAYAYRTTITTVEV+E+ +G ++RC QM AREEL SC+ YL Q SR VLQMRGI+N W REG   DECCREL+NVDEECRC+
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREG-GLDECCRELKNVDEECRCE

Query:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI
        +LEEIA  EQR+ R QEGRQMLQ+ARNLPS+
Subjt:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI

XP_022993226.1 2S albumin [Cucurbita maxima] | e value: 1.0e-45 | % identity: 77.1
Query:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE
        MARLT IIAL AVALLVADAYAYRTTITTVEV+E+ QGR++RC QM AREEL SC+ YL Q SR VLQMRGI+N W REGG  DECCREL+NVDEECRC+
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE

Query:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI
        +LEEIA  EQR+ R QEGRQMLQ+ARNLPS+
Subjt:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI

XP_023545481.1 2S albumin [Cucurbita pepo subsp. pepo] | e value: 2.5e-44 | % identity: 75.57
Query:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE
        MARLT IIAL AVALLVADAYAYRTTITTVEV+E+ +G ++RC QM AREEL SC+ YL Q SR VLQMRGI+N W REGG  DECCREL+NVDEECRC+
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE

Query:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI
        +LEEIA  EQR+ R QEGRQMLQ+ARNLPS+
Subjt:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0U1Z284 Alpha-amylase inhibitor | e value: 7.9e-44 | % identity: 75.57
Query:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE
        MARLTGIIALLAVALLV +AYAYRTTITTVEVD+DNQGRQQRC QM ARE+L SC+ YL Q SR VL MRGIDN   +EGG  DECC EL+NVDE+CRC+
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE

Query:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI
        LLEEIA  EQR+ R QEGRQ+L +ARNLPS+
Subjt:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI

A0A6J1EW73 2S albumin-like | e value: 1.5e-39 | % identity: 72.52
Query:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGGL-DECCRELKNVDEECRCE
        MARLTGIIAL+ +A+LV+DAYAYR TITTVEV++DN+ R +RC QMRA EE+ SC  YLTQ SR VLQMRGI+NQ  REG + DECCREL+NVDE+CRCE
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGGL-DECCRELKNVDEECRCE

Query:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI
        LLE+IA  E RKGR QE RQM QRARNLPS+
Subjt:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI

A0A6J1FV75 2S albumin | e value: 6.0e-44 | % identity: 74.81
Query:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREG-GLDECCRELKNVDEECRCE
        MARLT IIAL AVALLVADAYAYRTTITTVEV+E+ +G ++RC QM AREEL SC+ YL Q SR VLQMRGI+N W REG   DECCREL+NVDEECRC+
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREG-GLDECCRELKNVDEECRCE

Query:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI
        +LEEIA  EQR+ R QEGRQMLQ+ARNLPS+
Subjt:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI

A0A6J1JVR2 2S albumin | e value: 4.9e-46 | % identity: 77.1
Query:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE
        MARLT IIAL AVALLVADAYAYRTTITTVEV+E+ QGR++RC QM AREEL SC+ YL Q SR VLQMRGI+N W REGG  DECCREL+NVDEECRC+
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE

Query:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI
        +LEEIA  EQR+ R QEGRQMLQ+ARNLPS+
Subjt:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI

Q8L694 2S albumin-like | e value: 1.4e-37 | % identity: 64.12
Query:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGI-DNQWGREGGLDECCRELKNVDEECRCE
        MARL+ ++ LLAVALL+ D YAYRTTITTVEVDEDNQGR +RCH +R RE+L SC+S+L Q SRG L+M+G+ +NQW R+ GL+ECCR+L+NV+E+CRC+
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGI-DNQWGREGGLDECCRELKNVDEECRCE

Query:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI
         L+EIA   QR+ R QEG QMLQ+AR LP++
Subjt:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI

SwissProt top hits (columns: hit, e value, % identity, alignment)
P01089 2S albumin | e value: 6.7e-16 | % identity: 37.89
Query:  MARLTGIIALLAVAL-LVADA-YAYRTTITTVEVDEDNQGRQ----QRCHQMRAREELPSCQSYLTQSS------RGVLQMRGIDNQWGREGGLDECCRE
        MA+L   IAL++V L ++A+A +AYRTTITT+E+DE    R+    Q+C Q   R++L SC+ YL QSS        VL+M G +NQ      L +CC +
Subjt:  MARLTGIIALLAVAL-LVADA-YAYRTTITTVEVDEDNQGRQ----QRCHQMRAREELPSCQSYLTQSS------RGVLQMRGIDNQWGREGGLDECCRE

Query:  LKNVDEECRCELLEEIAYGEQRKGR--AQEGRQMLQRARNLPSIVCVIECTTRITNESSLQ
        +K V +EC+CE ++ IA  + ++G+   +E  ++ QRA  + S  C + C  +     S Q
Subjt:  LKNVDEECRCELLEEIAYGEQRKGR--AQEGRQMLQRARNLPSIVCVIECTTRITNESSLQ

P04403 2S sulfur-rich seed storage protein 1 | e value: 1.4e-08 | % identity: 33.09
Query:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQW----GREGGLDECCRELKNVDEEC
        MA+++   A L V + +  A A+R T+TT  V+E+NQ  ++   QM+ ++ L  C+ Y+ Q      QM     Q     G E  + ECC +L+ +DE C
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQW----GREGGLDECCRELKNVDEEC

Query:  RCELLEEI---AYGEQRKGRAQEGRQMLQRARNLPS
        RCE L  +      E+ + R ++ R+M++ A N+PS
Subjt:  RCELLEEI---AYGEQRKGRAQEGRQMLQRARNLPS

P0C8Y8 2S sulfur-rich seed storage protein 2 | e value: 1.6e-09 | % identity: 35.77
Query:  MARLTGIIALLAVALLVADAYAYRTTITTV---EVDEDNQGR-QQRC-HQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGGLDECCRELKNVDEE
        MA+++ + A L   L++  A A+RTT+TT    E +E+ +GR +Q+C  QM  +++L  C+ YL Q             + G E  LDECC +L+ +DE 
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTV---EVDEDNQGR-QQRC-HQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGGLDECCRELKNVDEE

Query:  CRCELLEEIAYGEQRKGRAQEGRQM---LQRARNLPS
        CRCE L  +    QR+    +G QM   +++A NL S
Subjt:  CRCELLEEIAYGEQRKGRAQEGRQM---LQRARNLPS

P93198 2S albumin seed storage protein (Fragment) | e value: 4.5e-12 | % identity: 39.37
Query:  ALLAVALLVADAYAYRTTITTVEVDEDNQGRQQR---C-HQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGGLDECCRELKNVDEECRCELLEEI
        ALL   L VA+A A+RTTITT+E+DED    ++R   C  Q++ ++ L  CQ YL Q SR      G D    R+    +CC++L  +DE+C+CE L ++
Subjt:  ALLAVALLVADAYAYRTTITTVEVDEDNQGRQQR---C-HQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGGLDECCRELKNVDEECRCELLEEI

Query:  AYGEQRKG--RAQEGRQMLQRARNLPS
           +Q++   R +E  +M+Q AR+LP+
Subjt:  AYGEQRKG--RAQEGRQMLQRARNLPS

Q39649 2S albumin | e value: 6.0e-49 | % identity: 77.86
Query:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE
        MARLT IIAL AVALLVADAYAYRTTITTVEV+E+ QGR++RC QM AREEL SC+ YL Q SR VLQMRGI+N W REGG  DECCRELKNVDEECRC+
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGG-LDECCRELKNVDEECRCE

Query:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI
        +LEEIA  EQR+ R QEGRQMLQ+ARNLPS+
Subjt:  LLEEIAYGEQRKGRAQEGRQMLQRARNLPSI

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT5G54740.1 seed storage albumin 5 | e value: 1.4e-05 | % identity: 25.5
Query:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRC-HQMRAREELPSCQSYLTQSSRG--------------VLQMRGIDNQWG--REGGLD
        MA+L  + A LA+ +L+A+A  YRT +   E D+ +  +Q +C  +    ++L  C+ ++ + ++                + +   +N  G  ++  L 
Subjt:  MARLTGIIALLAVALLVADAYAYRTTITTVEVDEDNQGRQQRC-HQMRAREELPSCQSYLTQSSRG--------------VLQMRGIDNQWG--REGGLD

Query:  ECCRELKNVDEECRCELLEEIAYGEQRKGR--AQEGRQMLQRARNLPSI
         CC EL+ VD+ C C  L++ A   + +G    Q+ + + Q A+NLP++
Subjt:  ECCRELKNVDEECRCELLEEIAYGEQRKGR--AQEGRQMLQRARNLPSI
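
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. A rough sketch of how a % identity figure could be derived from such a line follows. It assumes identity is counted over the full aligned length, which is the usual BLAST convention but is not confirmed by this page. The function name and the toy midline are illustrative only.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent of aligned columns whose midline character is a letter."""
    # Letters mark identical residues; '+' and spaces are not identities.
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / aligned_length, 2)

# Toy midline with 4 identities across 7 aligned columns (not taken from this page).
print(percent_identity("MA+L  K", 7))  # 57.14
```

The aligned length is passed explicitly because trailing spaces on a midline are easily lost when the page is copied as text.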


Sequences
CDS sequence
ATGCGGCTTCTCATCGGTGCATGGCTACCCTCACGTGGAAGCTTAAAAACGTCGTCTGAGCCCACCATGGCTACACCTCCACATCTCCGGGACCCGCAAGACAGCCACTT
CTTCCACGTTCCCTACATGCACCACTCCCCACTTCCTATAAATAAACACACACCCCACCATTTTCATTTCACTCACTCAACACTCCTTTTCCCACGTCCCTATTTCCAGC
AAAAGCCTTTAACAATGGCCAGACTCACAGGTATCATTGCTCTCTTGGCGGTGGCACTGCTGGTTGCAGATGCCTACGCCTACCGCACCACCATCACCACCGTGGAGGTG
GACGAAGACAACCAAGGGCGGCAGCAGCGGTGCCACCAGATGAGGGCCCGGGAGGAGCTCCCCAGCTGCCAGAGCTACCTGACGCAGAGCAGCAGAGGCGTTTTGCAGAT
GCGGGGAATCGACAACCAGTGGGGGAGAGAGGGGGGCTTGGATGAGTGTTGCCGAGAGCTCAAGAATGTGGATGAGGAATGCAGGTGCGAGCTCTTGGAGGAGATTGCTT
ATGGGGAACAGAGGAAGGGACGAGCTCAAGAAGGAAGGCAGATGCTACAGAGAGCGAGGAACTTGCCATCCATAGTTTGTGTTATTGAGTGCACAACTCGCATTACAAAT
GAGAGTTCTCTTCAGAATAACAAACCGAAAAGAAGCATCAGATGGGAAAAGAGATCATTTATTTGTAATTGTGTTCAACAACAGAACCGTAACAAAGACTCATTGGGATA
CAAGAAGACGGGCTATACAACTATTAACATGAAGGAAAACGTTACACGGCTAAACGATAAAAACCATAAGCAGACACGACCCACACAGATTATAGGAATAGAGATTATTA
TACGGAAGGGCAGTTATAATAATACTGCAGTTCAGAGATCAGTTTTGCAAACCGCCGCGAAGGCGAAGGACAAGAGTCCTGCCGTCCTCCAGTTGCTTTCCAGCGAAAAT
CAGCCTCTGCTGGTCTGGTGGGATGCCTTCTTTATCTTGAATCTTAGCCTTAACGTTATCAATGTCGGCAAGCGTCCGTCCATCTTCGAGCTGCTTGCCTGCGAAAATCA
ACCTTTGCTGGTCTGGGGGTATACCTTCCTTGTCTTGAATCTTAGCTTTGACATTGTCGATTGTGTCAGAGCTTTCAACCTCCAATACGAAGCACCAAATGCAATGTTGA
TTCCTTCTGAATATTGTAGTCCGCCAGAGTGCGACCGTCTTCGAGTTGTTTGCCAGCGAAAATCAGCCTCTGCTGGTCCGGGTTTTCACGAAGATTTGCATTCCACCACG
CAAACGGAGCACCAAGTGGAGGGTCGACTCTTTTTGTATGTTGTAGTCGGCGAGGGTTCGGCCATCCTCGAGCTGCTTTCCAGCGAAGATAAGCCTCTGCTGGTCCGGTG
GGATTCCTTCTTTATCCTGAATTTTGGCCTTTACATTGTCAATAGTATCAGAACTTTCGACCTCAAGGGTAATTGTTTTGCCGAGAATAACGGTGAAAAGCAACTACTCA
GAAACCAAACCATAATCGTCATCTTCCTCTAA
mRNA sequence
ATGCGGCTTCTCATCGGTGCATGGCTACCCTCACGTGGAAGCTTAAAAACGTCGTCTGAGCCCACCATGGCTACACCTCCACATCTCCGGGACCCGCAAGACAGCCACTT
CTTCCACGTTCCCTACATGCACCACTCCCCACTTCCTATAAATAAACACACACCCCACCATTTTCATTTCACTCACTCAACACTCCTTTTCCCACGTCCCTATTTCCAGC
AAAAGCCTTTAACAATGGCCAGACTCACAGGTATCATTGCTCTCTTGGCGGTGGCACTGCTGGTTGCAGATGCCTACGCCTACCGCACCACCATCACCACCGTGGAGGTG
GACGAAGACAACCAAGGGCGGCAGCAGCGGTGCCACCAGATGAGGGCCCGGGAGGAGCTCCCCAGCTGCCAGAGCTACCTGACGCAGAGCAGCAGAGGCGTTTTGCAGAT
GCGGGGAATCGACAACCAGTGGGGGAGAGAGGGGGGCTTGGATGAGTGTTGCCGAGAGCTCAAGAATGTGGATGAGGAATGCAGGTGCGAGCTCTTGGAGGAGATTGCTT
ATGGGGAACAGAGGAAGGGACGAGCTCAAGAAGGAAGGCAGATGCTACAGAGAGCGAGGAACTTGCCATCCATAGTTTGTGTTATTGAGTGCACAACTCGCATTACAAAT
GAGAGTTCTCTTCAGAATAACAAACCGAAAAGAAGCATCAGATGGGAAAAGAGATCATTTATTTGTAATTGTGTTCAACAACAGAACCGTAACAAAGACTCATTGGGATA
CAAGAAGACGGGCTATACAACTATTAACATGAAGGAAAACGTTACACGGCTAAACGATAAAAACCATAAGCAGACACGACCCACACAGATTATAGGAATAGAGATTATTA
TACGGAAGGGCAGTTATAATAATACTGCAGTTCAGAGATCAGTTTTGCAAACCGCCGCGAAGGCGAAGGACAAGAGTCCTGCCGTCCTCCAGTTGCTTTCCAGCGAAAAT
CAGCCTCTGCTGGTCTGGTGGGATGCCTTCTTTATCTTGAATCTTAGCCTTAACGTTATCAATGTCGGCAAGCGTCCGTCCATCTTCGAGCTGCTTGCCTGCGAAAATCA
ACCTTTGCTGGTCTGGGGGTATACCTTCCTTGTCTTGAATCTTAGCTTTGACATTGTCGATTGTGTCAGAGCTTTCAACCTCCAATACGAAGCACCAAATGCAATGTTGA
TTCCTTCTGAATATTGTAGTCCGCCAGAGTGCGACCGTCTTCGAGTTGTTTGCCAGCGAAAATCAGCCTCTGCTGGTCCGGGTTTTCACGAAGATTTGCATTCCACCACG
CAAACGGAGCACCAAGTGGAGGGTCGACTCTTTTTGTATGTTGTAGTCGGCGAGGGTTCGGCCATCCTCGAGCTGCTTTCCAGCGAAGATAAGCCTCTGCTGGTCCGGTG
GGATTCCTTCTTTATCCTGAATTTTGGCCTTTACATTGTCAATAGTATCAGAACTTTCGACCTCAAGGGTAATTGTTTTGCCGAGAATAACGGTGAAAAGCAACTACTCA
GAAACCAAACCATAATCGTCATCTTCCTCTAA
Protein sequence
MRLLIGAWLPSRGSLKTSSEPTMATPPHLRDPQDSHFFHVPYMHHSPLPINKHTPHHFHFTHSTLLFPRPYFQQKPLTMARLTGIIALLAVALLVADAYAYRTTITTVEV
DEDNQGRQQRCHQMRAREELPSCQSYLTQSSRGVLQMRGIDNQWGREGGLDECCRELKNVDEECRCELLEEIAYGEQRKGRAQEGRQMLQRARNLPSIVCVIECTTRITN
ESSLQNNKPKRSIRWEKRSFICNCVQQQNRNKDSLGYKKTGYTTINMKENVTRLNDKNHKQTRPTQIIGIEIIIRKGSYNNTAVQRSVLQTAAKAKDKSPAVLQLLSSEN
QPLLVWWDAFFILNLSLNVINVGKRPSIFELLACENQPLLVWGYTFLVLNLSFDIVDCVRAFNLQYEAPNAMLIPSEYCSPPECDRLRVVCQRKSASAGPGFHEDLHSTT
QTEHQVEGRLFLYVVVGEGSAILELLSSEDKPLLVRWDSFFILNFGLYIVNSIRTFDLKGNCFAENNGEKQLLRNQTIIVIFL
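
The protein sequence corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that check, assuming the standard genetic code and using only the first 30 bases of the CDS listed above; the `translate` helper is hypothetical and not part of any CuGenDB tooling:

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS sequence listed above.
print(translate("ATGCGGCTTCTCATCGGTGCATGGCTACCC"))  # MRLLIGAWLP
```

The output matches the first ten residues of the protein sequence. Passing the full CDS, with line breaks removed, would allow the same comparison over the whole record.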