; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019055 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019055
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr5:38117734..38122018
RNA-Seq ExpressionLag0019055
SyntenyLag0019055
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]2.5e-5046.39Show/hide
Query:  FSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIEINLLYE
        FS+PPLNQ+LN++  +KLDR N+LLWK LALPIL+ YKLEGHL G  PCPS ++      L+ SS+   V    T   A A+  A SS +P I +N L+E
Subjt:  FSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIEINLLYE

Query:  SWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQLRAEEDY
         W+  D LLLGWLYNSMTP+VA             + L                                    GF N +DLW+A Q  FGVQ RAEED+
Subjt:  SWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQLRAEEDY

Query:  LRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGR
        LRQ+ Q + KG+ KM +YL ++K + DNLGQ GSPV  R+LISQVLLGLDE +N V+ +IQG+
Subjt:  LRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGR

TYJ96311.1 uncharacterized protein E5676_scaffold1970G00140 [Cucumis melo var. makuwa]8.6e-4345.08Show/hide
Query:  FSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIEINLLYE
        FS+PPLNQ+LN++  +KLDR N+LLWK LALPIL+ YKLEGHL G  PCPS ++      L+ SS+   V    T   A A+  A SS +P I +N L+E
Subjt:  FSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIEINLLYE

Query:  SWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQLRAEEDY
         W+  D LLLGWLYNSMTP+VA             + L                                    GF N +DLW+A Q  FGVQ RAEED+
Subjt:  SWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQLRAEEDY

Query:  LRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQ
        LRQ+ Q + KG+ KM +YL ++K + DNLGQ GSPV  R+LISQ
Subjt:  LRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQ

XP_022148963.1 uncharacterized protein LOC111017501 [Momordica charantia]1.8e-4055.43Show/hide
Query:  AADSSSSPTIE--INLLYESWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKD
        +A SSSS   E  IN LYESW+  DQLLLGWLYNSMTPEVA                        ++                          G+ENA D
Subjt:  AADSSSSPTIE--INLLYESWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKD

Query:  LWEAIQKMFGVQLRAEEDYLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSG
        LW AIQ++FGVQ +AEEDYLRQVFQQ+ KGSLKM+D+LR++K+HADNLGQAGSPV TRSLISQVLLGLDEE+NPVVA IQG+ G
Subjt:  LWEAIQKMFGVQLRAEEDYLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSG

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]2.3e-5146.1Show/hide
Query:  VAVGPNFSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIE
        V  G  F+SPPLNQLLN++T IK+DRGNFLLW+NLALPILRSYKL  +L G KPCP  +L        P+     +  G+TS          S SSPT  
Subjt:  VAVGPNFSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIE

Query:  INLLYESWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQL
        +N  YE+W+ VD+LLLGWLYNSM  +VA                                                    GF  +++LW A+Q++FGVQ 
Subjt:  INLLYESWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQL

Query:  RAEEDYLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGR
        RAE DYL+QVFQQ+ KGSL+M +YL+++K+HADNL  AGS VS R L+SQVL GLDEE+NP+V  +QG+
Subjt:  RAEEDYLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGR

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida]5.6e-5047.66Show/hide
Query:  TIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAA--ANGLNPSSAQAEVG---AGTTSFVASASDAADSSSSPTIEINLLYESWLAVDQL
        T IKLD+ N+LLW+NLALPILRSY+LEGHL G  PCP ++  A   +    P   +A +G   +G  S          S+SSP +++N  YES   VDQL
Subjt:  TIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAA--ANGLNPSSAQAEVG---AGTTSFVASASDAADSSSSPTIEINLLYESWLAVDQL

Query:  LLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQLRAEEDYLRQVFQQS
        LLGWLYN MT EVA                                                    G+EN K LW AIQ++FG+Q RA EDYLRQVFQQ+
Subjt:  LLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQLRAEEDYLRQVFQQS

Query:  SKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRS
         KG++KM +YLR++K H+DNLG  GSPV TR+L+SQVLLGLDEEFNP VA IQGRS
Subjt:  SKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRS

TrEMBL top hitse value%identityAlignment
A0A5A7SIT7 Uncharacterized protein1.2e-5046.39Show/hide
Query:  FSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIEINLLYE
        FS+PPLNQ+LN++  +KLDR N+LLWK LALPIL+ YKLEGHL G  PCPS ++      L+ SS+   V    T   A A+  A SS +P I +N L+E
Subjt:  FSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIEINLLYE

Query:  SWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQLRAEEDY
         W+  D LLLGWLYNSMTP+VA             + L                                    GF N +DLW+A Q  FGVQ RAEED+
Subjt:  SWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQLRAEEDY

Query:  LRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGR
        LRQ+ Q + KG+ KM +YL ++K + DNLGQ GSPV  R+LISQVLLGLDE +N V+ +IQG+
Subjt:  LRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGR

A0A5A7VPY0 Uncharacterized protein4.0e-3841.8Show/hide
Query:  NFSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIEINLLY
        +F++P LNQ+LN++T IKLDRGN+LLWK LALPIL+SYKL  HL G  PC  K +        P+ +  E  AG        S    SSS+  + +N  Y
Subjt:  NFSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIEINLLY

Query:  ESWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQLRAEED
        E W+  D LLLGWLYNSMTPEV              + L                                    GF NAKDLWEA Q +FG+Q RA+ED
Subjt:  ESWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQLRAEED

Query:  YLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLIS
        +L Q FQ + KG+L M +YLR +KN+ +NLGQA S V + +++S
Subjt:  YLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLIS

A0A5D3BCH9 Uncharacterized protein4.2e-4345.08Show/hide
Query:  FSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIEINLLYE
        FS+PPLNQ+LN++  +KLDR N+LLWK LALPIL+ YKLEGHL G  PCPS ++      L+ SS+   V    T   A A+  A SS +P I +N L+E
Subjt:  FSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIEINLLYE

Query:  SWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQLRAEEDY
         W+  D LLLGWLYNSMTP+VA             + L                                    GF N +DLW+A Q  FGVQ RAEED+
Subjt:  SWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQLRAEEDY

Query:  LRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQ
        LRQ+ Q + KG+ KM +YL ++K + DNLGQ GSPV  R+LISQ
Subjt:  LRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQ

A0A6J1D5J0 uncharacterized protein LOC1110175018.7e-4155.43Show/hide
Query:  AADSSSSPTIE--INLLYESWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKD
        +A SSSS   E  IN LYESW+  DQLLLGWLYNSMTPEVA                        ++                          G+ENA D
Subjt:  AADSSSSPTIE--INLLYESWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKD

Query:  LWEAIQKMFGVQLRAEEDYLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSG
        LW AIQ++FGVQ +AEEDYLRQVFQQ+ KGSLKM+D+LR++K+HADNLGQAGSPV TRSLISQVLLGLDEE+NPVVA IQG+ G
Subjt:  LWEAIQKMFGVQLRAEEDYLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSG

A0A6J1DCW4 uncharacterized protein LOC1110195981.1e-5146.1Show/hide
Query:  VAVGPNFSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIE
        V  G  F+SPPLNQLLN++T IK+DRGNFLLW+NLALPILRSYKL  +L G KPCP  +L        P+     +  G+TS          S SSPT  
Subjt:  VAVGPNFSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTIE

Query:  INLLYESWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQL
        +N  YE+W+ VD+LLLGWLYNSM  +VA                                                    GF  +++LW A+Q++FGVQ 
Subjt:  INLLYESWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQL

Query:  RAEEDYLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGR
        RAE DYL+QVFQQ+ KGSL+M +YL+++K+HADNL  AGS VS R L+SQVL GLDEE+NP+V  +QG+
Subjt:  RAEEDYLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.6e-0529.41Show/hide
Query:  AKDLWEAIQKMFGVQLRAEEDYLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGR
        ++D+W  I+  F     A    L    +    G ++++DY R +K  AD+L     PV+ R+L+  VL GL+ +F+ ++ +I+ R
Subjt:  AKDLWEAIQKMFGVQLRAEEDYLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGR

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.9e-0425.58Show/hide
Query:  AKDLWEAIQKMFGVQLRAEEDYLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRS
        A+DLW +++ +F     A         + ++   L + +Y + +K+ +D L    SP+S R L+  +L GL E+++ ++ +I+ +S
Subjt:  AKDLWEAIQKMFGVQLRAEEDYLRQVFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGGCATCAACTCCACCGTAATTGTCATCGTAGCTGTCGGACCAAATTTCAGCAGCCCTCCACTTAATCAGCTCTTGAATAAGGTTACGATTATAAAGTTAGATCG
TGGAAATTTTCTTCTGTGGAAGAATCTTGCCCTACCAATCCTTCGGAGTTACAAACTCGAAGGTCATCTCCCGGGAACGAAGCCTTGTCCTTCGAAATACCTTCAAGCTG
CTGCAAATGGATTAAATCCAAGTTCCGCACAAGCTGAAGTTGGAGCTGGAACCACCAGTTTTGTAGCCAGTGCAAGTGATGCAGCTGACTCCTCCTCCTCCCCTACCATT
GAAATCAATCTGTTGTATGAATCTTGGTTAGCAGTAGATCAACTGTTATTGGGTTGGTTGTACAACTCAATGACCCCTGAGGTTGCTGAAGGGAATAAAAGTCCCCACGC
AGCGGAAGCGCATCGATTGGACCTTACGCCTCGGGAAGGAGTGAATTCCATCTTGTACGATTATGTTCTCAGCTCCCCATTTGGTCTTGTCTCCAAAGTGGGCATAAAAC
CCAACAGTTGCTACACAGGTGATGGGTTCGAAAATGCCAAGGACCTGTGGGAGGCAATACAAAAGATGTTTGGAGTACAATTGAGGGCAGAAGAAGACTATCTTCGCCAG
GTATTTCAACAGTCTAGCAAAGGTTCTTTAAAAATGTCTGATTATTTGAGAATTATAAAAAACCACGCTGACAATTTGGGCCAAGCTGGGAGCCCTGTTAGCACAAGGTC
ATTAATTTCTCAGGTTCTTCTGGGATTGGATGAAGAATTCAATCCGGTTGTGGCTATGATTCAAGGGCGATCAGGTGGCTTCGGTTGGATTATTCGCAAGGAGGACGACA
TGCCGGTGAGGATTGAACTTGATGCCTTACAAATCGTCAATCTTCTCAGTGGTCAGGACCAAGATGATACGAAGGTAGAGAAATTCATTTTTGAAGCTAAAAACCTAATC
TCCAACTATTATGTTGATTTCATCGTTCATATTCACAGAAGACACAATGTGATGGTCCACACATTGGCCCAAACCACGTGTAATTCAAATTCCTCAGTCCTCCGATTCTC
TCTCTTGTGTGTGGATGTGTCTCACTCGAGGGCTGGAACGCTTGATCTCGAGTCTGAAGCTTTAATGGATTCTTGGATCTTGAAGTCTTCTGATCTTGAAGGAGTCTTCA
ATCTTCAAGAAGTGTTTAGCACTTTGAAACTTATGGATTTCAAGGAGTCTTCAGTCTTCAGTCTTCAATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCGGCATCAACTCCACCGTAATTGTCATCGTAGCTGTCGGACCAAATTTCAGCAGCCCTCCACTTAATCAGCTCTTGAATAAGGTTACGATTATAAAGTTAGATCG
TGGAAATTTTCTTCTGTGGAAGAATCTTGCCCTACCAATCCTTCGGAGTTACAAACTCGAAGGTCATCTCCCGGGAACGAAGCCTTGTCCTTCGAAATACCTTCAAGCTG
CTGCAAATGGATTAAATCCAAGTTCCGCACAAGCTGAAGTTGGAGCTGGAACCACCAGTTTTGTAGCCAGTGCAAGTGATGCAGCTGACTCCTCCTCCTCCCCTACCATT
GAAATCAATCTGTTGTATGAATCTTGGTTAGCAGTAGATCAACTGTTATTGGGTTGGTTGTACAACTCAATGACCCCTGAGGTTGCTGAAGGGAATAAAAGTCCCCACGC
AGCGGAAGCGCATCGATTGGACCTTACGCCTCGGGAAGGAGTGAATTCCATCTTGTACGATTATGTTCTCAGCTCCCCATTTGGTCTTGTCTCCAAAGTGGGCATAAAAC
CCAACAGTTGCTACACAGGTGATGGGTTCGAAAATGCCAAGGACCTGTGGGAGGCAATACAAAAGATGTTTGGAGTACAATTGAGGGCAGAAGAAGACTATCTTCGCCAG
GTATTTCAACAGTCTAGCAAAGGTTCTTTAAAAATGTCTGATTATTTGAGAATTATAAAAAACCACGCTGACAATTTGGGCCAAGCTGGGAGCCCTGTTAGCACAAGGTC
ATTAATTTCTCAGGTTCTTCTGGGATTGGATGAAGAATTCAATCCGGTTGTGGCTATGATTCAAGGGCGATCAGGTGGCTTCGGTTGGATTATTCGCAAGGAGGACGACA
TGCCGGTGAGGATTGAACTTGATGCCTTACAAATCGTCAATCTTCTCAGTGGTCAGGACCAAGATGATACGAAGGTAGAGAAATTCATTTTTGAAGCTAAAAACCTAATC
TCCAACTATTATGTTGATTTCATCGTTCATATTCACAGAAGACACAATGTGATGGTCCACACATTGGCCCAAACCACGTGTAATTCAAATTCCTCAGTCCTCCGATTCTC
TCTCTTGTGTGTGGATGTGTCTCACTCGAGGGCTGGAACGCTTGATCTCGAGTCTGAAGCTTTAATGGATTCTTGGATCTTGAAGTCTTCTGATCTTGAAGGAGTCTTCA
ATCTTCAAGAAGTGTTTAGCACTTTGAAACTTATGGATTTCAAGGAGTCTTCAGTCTTCAGTCTTCAATCTTGA
Protein sequenceShow/hide protein sequence
MTGINSTVIVIVAVGPNFSSPPLNQLLNKVTIIKLDRGNFLLWKNLALPILRSYKLEGHLPGTKPCPSKYLQAAANGLNPSSAQAEVGAGTTSFVASASDAADSSSSPTI
EINLLYESWLAVDQLLLGWLYNSMTPEVAEGNKSPHAAEAHRLDLTPREGVNSILYDYVLSSPFGLVSKVGIKPNSCYTGDGFENAKDLWEAIQKMFGVQLRAEEDYLRQ
VFQQSSKGSLKMSDYLRIIKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGGFGWIIRKEDDMPVRIELDALQIVNLLSGQDQDDTKVEKFIFEAKNLI
SNYYVDFIVHIHRRHNVMVHTLAQTTCNSNSSVLRFSLLCVDVSHSRAGTLDLESEALMDSWILKSSDLEGVFNLQEVFSTLKLMDFKESSVFSLQS