; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021455 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021455
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationchr7:7903336..7905506
RNA-Seq ExpressionLag0021455
SyntenyLag0021455
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134498.1 protein MODIFYING WALL LIGNIN-1 [Cucumis sativus]1.8e-12687.73Show/hide
Query:  MGRR-KSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDDNNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVV
        MGRR K+MAVTH+DL PS +S ELGSKMGTFL+ILT++CGLCCFILCLIAE+TRSQVIWMG D+NNK  EKR+CSYSGSGKTPLLCTASAFLGMAVMMVV
Subjt:  MGRR-KSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDDNNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVV

Query:  QHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLYVLIAVSKS PPALIAWDPSFATSK LTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
Subjt:  QHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRMFEEQENVRREVLESYHIHGSPPR---SPPVQPMPPIAREDPVIRHG--HQQETPFLFLLPSTAAFCKLS
        TAVRAQRMFE+QENVRREVLESYHIH SPPR   SPP+QPMPPIAREDPVIRH   HQ+  PF  LL STA FCKLS
Subjt:  TAVRAQRMFEEQENVRREVLESYHIHGSPPR---SPPVQPMPPIAREDPVIRHG--HQQETPFLFLLPSTAAFCKLS

XP_022934645.1 uncharacterized protein LOC111441776 [Cucurbita moschata]1.5e-12888.52Show/hide
Query:  MGRRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDD-NNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVV
        MGR K+MAVTHEDL PSRRS ELGSKMGTFL+ILT++CGLCCFILCL+AESTRSQVIW GRD+ NNK GEKR C YSGSGKTPL+CTASAFLGMAVMMVV
Subjt:  MGRRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDD-NNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVV

Query:  QHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLYVLIAVSKSPPPALIAWDPSFATSK LTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYM
Subjt:  QHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRMFEEQENVRREVLESYHIHGSPPRSPPVQPMPPIAREDPVIRHGHQQETPFLFLLPSTAAFCK
        TAVRAQR+FE+Q NVRREVLESYHIH SPPRSPPVQPMPPIAREDPVIRH H  E+PFLFLLPS+AAFCK
Subjt:  TAVRAQRMFEEQENVRREVLESYHIHGSPPRSPPVQPMPPIAREDPVIRHGHQQETPFLFLLPSTAAFCK

XP_022972325.1 uncharacterized protein LOC111470898 [Cucurbita maxima]1.3e-12988.89Show/hide
Query:  MGRRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDD-NNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVV
        MGRRK+MAVTHEDL PSRRS ELGSKMGTFL+ILT++CGLCCFILCL+AESTRSQVIWMGRD+ NNK GEKR C YSGSGKTPL+CTASAFLGMAVMMVV
Subjt:  MGRRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDD-NNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVV

Query:  QHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLYVLIAVSKSPPPALIAWDPSFATSK LTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYM
Subjt:  QHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRMFEEQENVRREVLESYHIHGSPPRSPPVQPMPPIAREDPVIRHGHQQETPFLFLLPSTAAFCK
        TAVRAQR+FE+Q NVRREVLESYHIH SPPRSPP+QPMPPIAREDPVIRH H  E+PFLFLLPS+AAFCK
Subjt:  TAVRAQRMFEEQENVRREVLESYHIHGSPPRSPPVQPMPPIAREDPVIRHGHQQETPFLFLLPSTAAFCK

XP_023540160.1 uncharacterized protein LOC111800614 [Cucurbita pepo subsp. pepo]3.3e-12888.15Show/hide
Query:  MGRRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDD-NNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVV
        MGR K+MAVTHEDL PSRRS ELGSKMGTFL+ILT++CGLCCFILCL+AESTRSQVIW GRD+ NNK GEKR C YSGSG+TPL+CTASAFLGMAVMMVV
Subjt:  MGRRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDD-NNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVV

Query:  QHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLYVLIAVSKSPPPALIAWDPSFATSK LTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYM
Subjt:  QHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRMFEEQENVRREVLESYHIHGSPPRSPPVQPMPPIAREDPVIRHGHQQETPFLFLLPSTAAFCK
        TAVRAQR+FE+Q NVRREVLESYHIH SPPRSPPVQPMPPIAREDPVIRH H  E+PFLFLLPS+AAFCK
Subjt:  TAVRAQRMFEEQENVRREVLESYHIHGSPPRSPPVQPMPPIAREDPVIRHGHQQETPFLFLLPSTAAFCK

XP_038903084.1 uncharacterized protein LOC120089763 [Benincasa hispida]8.6e-12990.44Show/hide
Query:  RRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDDNNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVVQHL
        RRK+MAVTH+DL PS RS ELGSKMGTFLMILT+ICGLCCFILCLIAESTRSQVIWMG D+NNK G KR CSYSGSGKTPLLCTASAFLGMAVMMVVQHL
Subjt:  RRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDDNNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVVQHL

Query:  YVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
        YVLIAVSKSPPPALIAWDPSFATSK LTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS+PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV
Subjt:  YVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAV

Query:  RAQRMFEEQENVRREVLESYHIHGSPPR--SPPVQPMPPIAREDPVIRHG-HQQETPFLFLLPSTAAFCKLS
        RAQR+FEEQENVRREVLESYHIH SPPR  SPP+QPMPPIAREDPVIRH  H QE PF  LL STA FCKLS
Subjt:  RAQRMFEEQENVRREVLESYHIHGSPPR--SPPVQPMPPIAREDPVIRHG-HQQETPFLFLLPSTAAFCKLS

TrEMBL top hitse value%identityAlignment
A0A0A0L805 Uncharacterized protein2.8e-12588.15Show/hide
Query:  MAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDDNNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLI
        MAVTH+DL PS +S ELGSKMGTFL+ILT++CGLCCFILCLIAE+TRSQVIWMG D+NNK  EKR+CSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLI
Subjt:  MAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDDNNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLI

Query:  AVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQR
        AVSKS PPALIAWDPSFATSK LTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQR
Subjt:  AVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQR

Query:  MFEEQENVRREVLESYHIHGSPPR---SPPVQPMPPIAREDPVIRHG--HQQETPFLFLLPSTAAFCKLS
        MFE+QENVRREVLESYHIH SPPR   SPP+QPMPPIAREDPVIRH   HQ+  PF  LL STA FCKLS
Subjt:  MFEEQENVRREVLESYHIHGSPPR---SPPVQPMPPIAREDPVIRHG--HQQETPFLFLLPSTAAFCKLS

A0A1S3AY82 uncharacterized protein LOC1034838795.3e-12485.36Show/hide
Query:  MGRRKS--MAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDDNNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMV
        MGRR+    AVTH+DL PS +S ELGSK+GTFL+ILT++CGLCCFILCLIAESTRSQ IWMG D+NNK  EKR+CSYSGSGKTPLLCTASAFLGMAVMMV
Subjt:  MGRRKS--MAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDDNNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMV

Query:  VQHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLY
        VQHLYVLIAVSKS PPALIAWDPSFATSK LTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLY
Subjt:  VQHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLY

Query:  MTAVRAQRMFEEQENVRREVLESYHIHGSPPR-----SPPVQPMPPIAREDPVIRHG--HQQETPFLFLLPSTAAFCKLS
        MTAVRAQRMFE+QENVRREVLESYHIH SPPR     SPP+QPMPPIAREDPVIRH   HQ   PF  LL STA FCKLS
Subjt:  MTAVRAQRMFEEQENVRREVLESYHIHGSPPR-----SPPVQPMPPIAREDPVIRHG--HQQETPFLFLLPSTAAFCKLS

A0A5A7U7S1 Uncharacterized protein2.0e-12386.72Show/hide
Query:  AVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDDNNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIA
        AVTH+DL PS +S ELGSK+GTFL+ILT++CGLCCFILCLIAESTRSQ IWMG D+NNK  EKR+CSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIA
Subjt:  AVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDDNNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIA

Query:  VSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRM
        VSKS PPALIAWDPSFATSK LTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWS PKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRM
Subjt:  VSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRM

Query:  FEEQENVRREVLESYHIHGSPPR-----SPPVQPMPPIAREDPVIRHGHQQE--TPFLFLLPSTAAFCKLS
        FE+QENVRREVLESYHIH SPPR     SPP+QPMPPIAREDPVIRH H Q+   PF  LL STA FCKLS
Subjt:  FEEQENVRREVLESYHIHGSPPR-----SPPVQPMPPIAREDPVIRHGHQQE--TPFLFLLPSTAAFCKLS

A0A6J1F8A9 uncharacterized protein LOC1114417767.1e-12988.52Show/hide
Query:  MGRRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDD-NNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVV
        MGR K+MAVTHEDL PSRRS ELGSKMGTFL+ILT++CGLCCFILCL+AESTRSQVIW GRD+ NNK GEKR C YSGSGKTPL+CTASAFLGMAVMMVV
Subjt:  MGRRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDD-NNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVV

Query:  QHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLYVLIAVSKSPPPALIAWDPSFATSK LTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYM
Subjt:  QHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRMFEEQENVRREVLESYHIHGSPPRSPPVQPMPPIAREDPVIRHGHQQETPFLFLLPSTAAFCK
        TAVRAQR+FE+Q NVRREVLESYHIH SPPRSPPVQPMPPIAREDPVIRH H  E+PFLFLLPS+AAFCK
Subjt:  TAVRAQRMFEEQENVRREVLESYHIHGSPPRSPPVQPMPPIAREDPVIRHGHQQETPFLFLLPSTAAFCK

A0A6J1I898 uncharacterized protein LOC1114708986.4e-13088.89Show/hide
Query:  MGRRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDD-NNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVV
        MGRRK+MAVTHEDL PSRRS ELGSKMGTFL+ILT++CGLCCFILCL+AESTRSQVIWMGRD+ NNK GEKR C YSGSGKTPL+CTASAFLGMAVMMVV
Subjt:  MGRRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDD-NNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVV

Query:  QHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        QHLYVLIAVSKSPPPALIAWDPSFATSK LTFQAAFFFVSTWISF+VGEILLLIGLSVESGHL +W +PKESCLVIKEGLFSAAGVF+LATVFLAAGLYM
Subjt:  QHLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

Query:  TAVRAQRMFEEQENVRREVLESYHIHGSPPRSPPVQPMPPIAREDPVIRHGHQQETPFLFLLPSTAAFCK
        TAVRAQR+FE+Q NVRREVLESYHIH SPPRSPP+QPMPPIAREDPVIRH H  E+PFLFLLPS+AAFCK
Subjt:  TAVRAQRMFEEQENVRREVLESYHIHGSPPRSPPVQPMPPIAREDPVIRHGHQQETPFLFLLPSTAAFCK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13380.1 Protein of unknown function (DUF1218)2.4e-0432.81Show/hide
Query:  FFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM
        +F+S+W++F V E  ++ G +  + H    S+   SC  +++G+F A  VF +AT+ L    YM
Subjt:  FFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYM

AT5G49320.1 Protein of unknown function (DUF1218)3.9e-7156.79Show/hide
Query:  MGRRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDDNNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVVQ
        M R +  AVTH+DL P+ ++ +L SK G F+ +LT+I GL CF+LCL AE+TRSQ  W            + C Y+GSGKTPLLC A AF+G+AV MV  
Subjt:  MGRRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDDNNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVVQ

Query:  HLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT
        H+Y+LIAV+ SP   L+ WDP    +K LTFQAAFFFVSTW+ F VGE+LLL+ LSVESGHL NWS PK SCLVI++GLFSAAGVF L TVFLA GLY+T
Subjt:  HLYVLIAVSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMT

Query:  AVRAQRMFEEQENVRREVLESYHIHGSPPRSPPVQPMPPIARE
        A++A R+ ++ EN  RE++E+  ++ SPPRS P   M  +ARE
Subjt:  AVRAQRMFEEQENVRREVLESYHIHGSPPRSPPVQPMPPIARE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGTGATGGGAAGGAGAAAGAGTATGGCGGTGACACATGAGGATCTTCAGCCAAGTCGAAGGAGCTATGAATTGGGCAGTAAAATGGGGACTTTTCTCATGATTTT
GACCCTCATTTGTGGGCTTTGTTGCTTCATTCTTTGCCTCATTGCCGAGTCGACTCGTTCTCAGGTGATATGGATGGGTAGAGACGACAATAATAAAGGCGGAGAAAAGA
GACAATGCTCGTACAGCGGTAGCGGGAAGACGCCGCTGCTATGCACGGCGAGCGCGTTTCTCGGGATGGCGGTGATGATGGTGGTGCAACATCTGTATGTGTTGATTGCA
GTGAGTAAGTCGCCGCCTCCTGCTCTCATTGCTTGGGACCCTTCTTTTGCCACTTCCAAATATCTAACCTTTCAAGCTGCTTTCTTCTTCGTTTCAACATGGATAAGTTT
TGCAGTGGGAGAAATTTTGTTGTTGATTGGACTGAGCGTAGAATCAGGCCATCTCAACAACTGGTCCACTCCAAAAGAGAGCTGCTTGGTGATCAAAGAAGGCTTATTTT
CAGCCGCCGGAGTTTTTCAATTGGCTACAGTGTTCCTCGCCGCCGGCCTCTACATGACGGCGGTCCGAGCACAGAGAATGTTTGAAGAGCAAGAAAACGTGAGAAGAGAG
GTGCTGGAAAGTTACCATATCCACGGTTCGCCGCCCCGGTCGCCGCCGGTGCAGCCGATGCCGCCCATTGCAAGAGAGGACCCTGTAATCAGACATGGTCACCAACAAGA
GACTCCCTTTTTGTTCTTGCTGCCATCTACCGCCGCTTTCTGCAAACTGTCTCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGTGATGGGAAGGAGAAAGAGTATGGCGGTGACACATGAGGATCTTCAGCCAAGTCGAAGGAGCTATGAATTGGGCAGTAAAATGGGGACTTTTCTCATGATTTT
GACCCTCATTTGTGGGCTTTGTTGCTTCATTCTTTGCCTCATTGCCGAGTCGACTCGTTCTCAGGTGATATGGATGGGTAGAGACGACAATAATAAAGGCGGAGAAAAGA
GACAATGCTCGTACAGCGGTAGCGGGAAGACGCCGCTGCTATGCACGGCGAGCGCGTTTCTCGGGATGGCGGTGATGATGGTGGTGCAACATCTGTATGTGTTGATTGCA
GTGAGTAAGTCGCCGCCTCCTGCTCTCATTGCTTGGGACCCTTCTTTTGCCACTTCCAAATATCTAACCTTTCAAGCTGCTTTCTTCTTCGTTTCAACATGGATAAGTTT
TGCAGTGGGAGAAATTTTGTTGTTGATTGGACTGAGCGTAGAATCAGGCCATCTCAACAACTGGTCCACTCCAAAAGAGAGCTGCTTGGTGATCAAAGAAGGCTTATTTT
CAGCCGCCGGAGTTTTTCAATTGGCTACAGTGTTCCTCGCCGCCGGCCTCTACATGACGGCGGTCCGAGCACAGAGAATGTTTGAAGAGCAAGAAAACGTGAGAAGAGAG
GTGCTGGAAAGTTACCATATCCACGGTTCGCCGCCCCGGTCGCCGCCGGTGCAGCCGATGCCGCCCATTGCAAGAGAGGACCCTGTAATCAGACATGGTCACCAACAAGA
GACTCCCTTTTTGTTCTTGCTGCCATCTACCGCCGCTTTCTGCAAACTGTCTCATTAA
Protein sequenceShow/hide protein sequence
MKVMGRRKSMAVTHEDLQPSRRSYELGSKMGTFLMILTLICGLCCFILCLIAESTRSQVIWMGRDDNNKGGEKRQCSYSGSGKTPLLCTASAFLGMAVMMVVQHLYVLIA
VSKSPPPALIAWDPSFATSKYLTFQAAFFFVSTWISFAVGEILLLIGLSVESGHLNNWSTPKESCLVIKEGLFSAAGVFQLATVFLAAGLYMTAVRAQRMFEEQENVRRE
VLESYHIHGSPPRSPPVQPMPPIAREDPVIRHGHQQETPFLFLLPSTAAFCKLSH