CuGenDBv2

Lag0003989 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0003989
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr6:221309..226773
RNA-Seq Expression: Lag0003989
Synteny: Lag0003989
Gene Ontology terms: NA
InterPro domains: NA
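
The genome location field packs a chromosome name and two coordinates into one string. A minimal sketch of how it could be split and measured follows. The names `Locus` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'chrom:start..end' string such as 'chr6:221309..226773'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Locus(chrom, int(start), int(end))


locus = parse_location("chr6:221309..226773")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the locus spans 5465 bp on chr6.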


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039966.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 1.9e-44 | % identity: 31.19
Query:  FSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD-----------------------------GFVSNRQILDASLIASELIDDWKTT
        F++EE++ A+     NKS GPDGFT EF+K +W+ +K+++  I  D                              FV  RQI+DA L+A+E ID W+  
Subjt:  FSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD-----------------------------GFVSNRQILDASLIASELIDDWKTT

Query:  NKKGVVGVVIKLDLERAFDKVDWNFLDAVLQAKRF----------------------GR-------------GDPLLPFLFILVFDCLSRLLTYSAQLGK
          K + G VIKLD+E+AFDK++W F+D +L  K +                      GR             GDP+ PF+F+L  D +SRLL    +  K
Subjt:  NKKGVVGVVIKLDLERAFDKVDWNFLDAVLQAKRF----------------------GR-------------GDPLLPFLFILVFDCLSRLLTYSAQLGK

Query:  ITTHPIGNSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLPLGGSKDDGGL-
        I    +  ++NLTHL FA+D LLF   D   + NL +II +F+LA GL+IN  KS +  INV     E + S +G      P++YLG+PLGG +      
Subjt:  ITTHPIGNSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLPLGGSKDDGGL-

Query:  ----HNINWKTTQLPHIMGGLGIT------IEMDYVG------GQNQSQEDRQNLFGSICSTIDIVAWDMRLRRNLNEEEVLEWASL-HLLSSVILRNSQ
              IN K T   + M   G           D+ G      G+++     ++++ +      ++ WD+  RR + + E   WA L + L+     N +
Subjt:  ----HNINWKTTQLPHIMGGLGIT------IEMDYVG------GQNQSQEDRQNLFGSICSTIDIVAWDMRLRRNLNEEEVLEWASL-HLLSSVILRNSQ

Query:  DSWSWPLDSSHAFIVRSLRE
        DS +W L+S   + V S+++
Subjt:  DSWSWPLDSSHAFIVRSLRE

RVW38475.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | e value: 2.0e-41 | % identity: 28.51
Query:  LETPFSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD-----------------------------------GFVSNRQILDASLIA
        L++PF++EE+  A+  L  +K+ GPDGFT   F+  W  IK+D++ ++SD                                    FV  RQILDA LIA
Subjt:  LETPFSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD-----------------------------------GFVSNRQILDASLIA

Query:  SELIDDWKTTNKKGVVGVVIKLDLERAFDKVDWNFLDAVLQAKRFG-----------------------------------RGDPLLPFLFILVFDCLSR
        +E++D+     + G  GVV K+D E+A+D V W+FLD VL+ K F                                    +GDPL PFLF LV D LSR
Subjt:  SELIDDWKTTNKKGVVGVVIKLDLERAFDKVDWNFLDAVLQAKRFG-----------------------------------RGDPLLPFLFILVFDCLSR

Query:  LLTYSAQLGKITTHPIG-NSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLP
        +L  + +   +    +G N   ++HL+FA+DT+ FS      +  L  ++ VF    GL +N  KS + GIN+D   +  L     CK   WP+ YLGLP
Subjt:  LLTYSAQLGKITTHPIG-NSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLP

Query:  LGGSKDDGGL-------------HNINWKTTQLPHIMGGLGITIEMDYVGGQNQSQE------------------------DRQNLFGSICSTIDIVAWD
        LGG+    G              H   WK   +  +  G  +     YV G  +                           D+     S+       +W+
Subjt:  LGGSKDDGGL-------------HNINWKTTQLPHIMGGLGITIEMDYVGGQNQSQE------------------------DRQNLFGSICSTIDIVAWD

Query:  MRLRRNLNEEEV--LEWASLHLLSSVILRNSQDSWSWPLDSSHAFIVRSLREDSVHSKGPNRNNLYSMIW
        +  RRNL+  E+  LE     L    +  +  D+  WPL SS  F V+S       S G  +N     +W
Subjt:  MRLRRNLNEEEV--LEWASLHLLSSVILRNSQDSWSWPLDSSHAFIVRSLREDSVHSKGPNRNNLYSMIW

RVW90164.1 putative mitochondrial protein [Vitis vinifera] | e value: 3.0e-42 | % identity: 38.77
Query:  LETPFSKEEVYLAVQTLGTNKSLGPDGFT----DEFFKHSWTTIKQDVMAILSDGFVSNRQILDASLIASELIDDWKTTNKKGVVGVVIKLDLERAFDKV
        L++PF++ E++ A+  L  +K+ GPDGFT     +        + Q+ +      FV  RQILDA LIA+E++D+ K   + G  GVV K+D E+A++ V
Subjt:  LETPFSKEEVYLAVQTLGTNKSLGPDGFT----DEFFKHSWTTIKQDVMAILSDGFVSNRQILDASLIASELIDDWKTTNKKGVVGVVIKLDLERAFDKV

Query:  DWNFLDAVLQAKRFG--RGDPLLPFLFILVFDCLSRLLTYSAQLGKITTHPIG-NSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYA
         W+FLD VL+ K  G  +GDPL PFLF +V D LSR+L  + +   +    +G N   +THL+FA+DT+LF++     +  L  ++ VF    GL +N  
Subjt:  DWNFLDAVLQAKRFG--RGDPLLPFLFILVFDCLSRLLTYSAQLGKITTHPIG-NSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYA

Query:  KSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLPLGGSKD----------DGGL-HNINWKTTQLPHIMGGLGI
        KS L GIN+D   +  L     CK  DWP+ YLGLPLGG  +          +G   H + W+    P I+GGLGI
Subjt:  KSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLPLGGSKD----------DGGL-HNINWKTTQLPHIMGGLGI

RVW96282.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | e value: 6.9e-39 | % identity: 35.84
Query:  LETPFSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD--------------------------------------GFVSNRQILDAS
        L++PF++ E++ A+  L  +K+ GPDGFT   F+  W  IK+D++ + ++                                       FV  RQILDA 
Subjt:  LETPFSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD--------------------------------------GFVSNRQILDAS

Query:  LIASELIDDWKTTNKKGVVGVVIKLDLERAFDKVDWNFLDAVLQ-------AKRFGRGDPLLPFLFILVFDCLSRLLTYSAQLGKITTHPIG-NSLNLTH
        LI +E++D+ K   + G  GVV K+D E+A+D V W+FLD VL+       A+   +GDPL PFLF +V D LSR+L  + +   +    +G N   ++H
Subjt:  LIASELIDDWKTTNKKGVVGVVIKLDLERAFDKVDWNFLDAVLQ-------AKRFGRGDPLLPFLFILVFDCLSRLLTYSAQLGKITTHPIG-NSLNLTH

Query:  LRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLPLGGS
        L+FA+DT+LF++     V  L  ++ VF    GL +N  KS L GIN+D   +  L     CK  DWP+ YLGLPLGG+
Subjt:  LRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLPLGGS

TYK24536.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 2.1e-43 | % identity: 30.39
Query:  FSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD----------------------------------------GFVSNRQILDASLI
        F++EE++ A+     NKS GPDGFT EF+K +W+ +K+++  I  D                                         FV  RQI+DA L+
Subjt:  FSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD----------------------------------------GFVSNRQILDASLI

Query:  ASELIDDWKTTNKKGVVGVVIKLDLERAFDKVDWNFLDAVLQAKRF----------------------GR-------------GDPLLPFLFILVFDCLS
        A+E ID W+    K + G VIKLD+E+AFDK++W F+D +L  K +                      GR             GDP+ PF+F+L  D +S
Subjt:  ASELIDDWKTTNKKGVVGVVIKLDLERAFDKVDWNFLDAVLQAKRF----------------------GR-------------GDPLLPFLFILVFDCLS

Query:  RLLTYSAQLGKITTHPIGNSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLP
        RLL    +  KI    +  ++NLTHL FA+D LLF   D   + NL +II +F+LA GL+IN  KS +  INV     E + S +G      P++YLG+P
Subjt:  RLLTYSAQLGKITTHPIGNSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLP

Query:  LGGSKDDGGL-----HNINWKTTQLPHIMGGLGIT------IEMDYVG------GQNQSQEDRQNLFGSICSTIDIVAWDMRLRRNLNEEEVLEWASL-H
        LGG +            IN K T   + M   G           D+ G      G+++     ++++ +      ++ WD+  RR + + E   WA L +
Subjt:  LGGSKDDGGL-----HNINWKTTQLPHIMGGLGIT------IEMDYVG------GQNQSQEDRQNLFGSICSTIDIVAWDMRLRRNLNEEEVLEWASL-H

Query:  LLSSVILRNSQDSWSWPLDSSHAFIVRSLRE
         L+     N +DS +W L+S   + V S+++
Subjt:  LLSSVILRNSQDSWSWPLDSSHAFIVRSLRE

TrEMBL top hits (e value, % identity, alignment)
A0A438DSJ8 Transposon TX1 uncharacterized 149 kDa protein | e value: 9.5e-42 | % identity: 28.51
Query:  LETPFSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD-----------------------------------GFVSNRQILDASLIA
        L++PF++EE+  A+  L  +K+ GPDGFT   F+  W  IK+D++ ++SD                                    FV  RQILDA LIA
Subjt:  LETPFSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD-----------------------------------GFVSNRQILDASLIA

Query:  SELIDDWKTTNKKGVVGVVIKLDLERAFDKVDWNFLDAVLQAKRFG-----------------------------------RGDPLLPFLFILVFDCLSR
        +E++D+     + G  GVV K+D E+A+D V W+FLD VL+ K F                                    +GDPL PFLF LV D LSR
Subjt:  SELIDDWKTTNKKGVVGVVIKLDLERAFDKVDWNFLDAVLQAKRFG-----------------------------------RGDPLLPFLFILVFDCLSR

Query:  LLTYSAQLGKITTHPIG-NSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLP
        +L  + +   +    +G N   ++HL+FA+DT+ FS      +  L  ++ VF    GL +N  KS + GIN+D   +  L     CK   WP+ YLGLP
Subjt:  LLTYSAQLGKITTHPIG-NSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLP

Query:  LGGSKDDGGL-------------HNINWKTTQLPHIMGGLGITIEMDYVGGQNQSQE------------------------DRQNLFGSICSTIDIVAWD
        LGG+    G              H   WK   +  +  G  +     YV G  +                           D+     S+       +W+
Subjt:  LGGSKDDGGL-------------HNINWKTTQLPHIMGGLGITIEMDYVGGQNQSQE------------------------DRQNLFGSICSTIDIVAWD

Query:  MRLRRNLNEEEV--LEWASLHLLSSVILRNSQDSWSWPLDSSHAFIVRSLREDSVHSKGPNRNNLYSMIW
        +  RRNL+  E+  LE     L    +  +  D+  WPL SS  F V+S       S G  +N     +W
Subjt:  MRLRRNLNEEEV--LEWASLHLLSSVILRNSQDSWSWPLDSSHAFIVRSLREDSVHSKGPNRNNLYSMIW

A0A438F983 Putative ribonuclease H protein | e value: 3.4e-39 | % identity: 37.17
Query:  LETPFSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSDGFVSNRQILDASLIASELIDDWKTTNKKGVVGVVIKLDLERAFDKVDWNF
        L++PF++EE+  A+  L  +K+ GPDGFT   F+  W  IK+D++ +    FV  RQILDA LIA+E++D+     + G  GVV K+D E+A+D V W+F
Subjt:  LETPFSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSDGFVSNRQILDASLIASELIDDWKTTNKKGVVGVVIKLDLERAFDKVDWNF

Query:  LDAVLQAKRFG-----------------------------------RGDPLLPFLFILVFDCLSRLLTYSAQLGKITTHPIG-NSLNLTHLRFANDTLLF
        LD VL+ K F                                    +GDPL PFLF LV D LSR+L  + +   +    +G N   ++HL+FA+DT+ F
Subjt:  LDAVLQAKRFG-----------------------------------RGDPLLPFLFILVFDCLSRLLTYSAQLGKITTHPIG-NSLNLTHLRFANDTLLF

Query:  STLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLPLGGS
        S      +  L  ++ VF    GL +N  KS + GIN+D   +  L     CK   WP+ YLGLPLGG+
Subjt:  STLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLPLGGS

A0A438I0B4 Putative mitochondrial protein | e value: 1.5e-42 | % identity: 38.77
Query:  LETPFSKEEVYLAVQTLGTNKSLGPDGFT----DEFFKHSWTTIKQDVMAILSDGFVSNRQILDASLIASELIDDWKTTNKKGVVGVVIKLDLERAFDKV
        L++PF++ E++ A+  L  +K+ GPDGFT     +        + Q+ +      FV  RQILDA LIA+E++D+ K   + G  GVV K+D E+A++ V
Subjt:  LETPFSKEEVYLAVQTLGTNKSLGPDGFT----DEFFKHSWTTIKQDVMAILSDGFVSNRQILDASLIASELIDDWKTTNKKGVVGVVIKLDLERAFDKV

Query:  DWNFLDAVLQAKRFG--RGDPLLPFLFILVFDCLSRLLTYSAQLGKITTHPIG-NSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYA
         W+FLD VL+ K  G  +GDPL PFLF +V D LSR+L  + +   +    +G N   +THL+FA+DT+LF++     +  L  ++ VF    GL +N  
Subjt:  DWNFLDAVLQAKRFG--RGDPLLPFLFILVFDCLSRLLTYSAQLGKITTHPIG-NSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYA

Query:  KSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLPLGGSKD----------DGGL-HNINWKTTQLPHIMGGLGI
        KS L GIN+D   +  L     CK  DWP+ YLGLPLGG  +          +G   H + W+    P I+GGLGI
Subjt:  KSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLPLGGSKD----------DGGL-HNINWKTTQLPHIMGGLGI

A0A5A7TCY9 LINE-1 retrotransposable element ORF2 protein | e value: 9.1e-45 | % identity: 31.19
Query:  FSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD-----------------------------GFVSNRQILDASLIASELIDDWKTT
        F++EE++ A+     NKS GPDGFT EF+K +W+ +K+++  I  D                              FV  RQI+DA L+A+E ID W+  
Subjt:  FSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD-----------------------------GFVSNRQILDASLIASELIDDWKTT

Query:  NKKGVVGVVIKLDLERAFDKVDWNFLDAVLQAKRF----------------------GR-------------GDPLLPFLFILVFDCLSRLLTYSAQLGK
          K + G VIKLD+E+AFDK++W F+D +L  K +                      GR             GDP+ PF+F+L  D +SRLL    +  K
Subjt:  NKKGVVGVVIKLDLERAFDKVDWNFLDAVLQAKRF----------------------GR-------------GDPLLPFLFILVFDCLSRLLTYSAQLGK

Query:  ITTHPIGNSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLPLGGSKDDGGL-
        I    +  ++NLTHL FA+D LLF   D   + NL +II +F+LA GL+IN  KS +  INV     E + S +G      P++YLG+PLGG +      
Subjt:  ITTHPIGNSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLPLGGSKDDGGL-

Query:  ----HNINWKTTQLPHIMGGLGIT------IEMDYVG------GQNQSQEDRQNLFGSICSTIDIVAWDMRLRRNLNEEEVLEWASL-HLLSSVILRNSQ
              IN K T   + M   G           D+ G      G+++     ++++ +      ++ WD+  RR + + E   WA L + L+     N +
Subjt:  ----HNINWKTTQLPHIMGGLGIT------IEMDYVG------GQNQSQEDRQNLFGSICSTIDIVAWDMRLRRNLNEEEVLEWASL-HLLSSVILRNSQ

Query:  DSWSWPLDSSHAFIVRSLRE
        DS +W L+S   + V S+++
Subjt:  DSWSWPLDSSHAFIVRSLRE

A0A5D3DLM2 LINE-1 retrotransposable element ORF2 protein | e value: 1.0e-43 | % identity: 30.39
Query:  FSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD----------------------------------------GFVSNRQILDASLI
        F++EE++ A+     NKS GPDGFT EF+K +W+ +K+++  I  D                                         FV  RQI+DA L+
Subjt:  FSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSD----------------------------------------GFVSNRQILDASLI

Query:  ASELIDDWKTTNKKGVVGVVIKLDLERAFDKVDWNFLDAVLQAKRF----------------------GR-------------GDPLLPFLFILVFDCLS
        A+E ID W+    K + G VIKLD+E+AFDK++W F+D +L  K +                      GR             GDP+ PF+F+L  D +S
Subjt:  ASELIDDWKTTNKKGVVGVVIKLDLERAFDKVDWNFLDAVLQAKRF----------------------GR-------------GDPLLPFLFILVFDCLS

Query:  RLLTYSAQLGKITTHPIGNSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLP
        RLL    +  KI    +  ++NLTHL FA+D LLF   D   + NL +II +F+LA GL+IN  KS +  INV     E + S +G      P++YLG+P
Subjt:  RLLTYSAQLGKITTHPIGNSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLNINYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLP

Query:  LGGSKDDGGL-----HNINWKTTQLPHIMGGLGIT------IEMDYVG------GQNQSQEDRQNLFGSICSTIDIVAWDMRLRRNLNEEEVLEWASL-H
        LGG +            IN K T   + M   G           D+ G      G+++     ++++ +      ++ WD+  RR + + E   WA L +
Subjt:  LGGSKDDGGL-----HNINWKTTQLPHIMGGLGIT------IEMDYVG------GQNQSQEDRQNLFGSICSTIDIVAWDMRLRRNLNEEEVLEWASL-H

Query:  LLSSVILRNSQDSWSWPLDSSHAFIVRSLRE
         L+     N +DS +W L+S   + V S+++
Subjt:  LLSSVILRNSQDSWSWPLDSSHAFIVRSLRE

SwissProt top hits (e value, % identity, alignment)
P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 1.8e-05 | % identity: 34.21
Query:  CDEFWQELHDLAGMGRDRWIHGVSLETPFSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSDGF
        C+E W  L  ++   ++R      LETP + +E+  A++ +  NKS G DG T EFF+  W T+  D   +L++ F
Subjt:  CDEFWQELHDLAGMGRDRWIHGVSLETPFSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSDGF

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein | e value: 7.7e-04 | % identity: 40.43
Query:  SKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSDGF
        S +E+  AV  +  NK+ GPD FT EFF  SW  +K   +A + + F
Subjt:  SKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSDGF
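
The % identity values can be related to the alignment text itself. The sketch below assumes the usual BLAST-style convention for the unlabeled middle line (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and divides by the aligned length including gaps. The function name `percent_identity` is hypothetical. Only the short Arabidopsis alignment above is used; other hits were not checked and may be computed on a different basis.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style midline.

    Assumption: letters in the midline mark identical positions, and the
    denominator is the aligned query length (gap characters included).
    """
    if not query:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Query and midline copied from the AT1G43760.1 alignment.
query = "SKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSDGF"
midline = "S +E+  AV  +  NK+ GPD FT EFF  SW  +K   +A + + F"

print(f"{percent_identity(query, midline):.2f}")
```

For this alignment the midline has 19 identical positions over 47 aligned residues, which agrees with the 40.43 listed for AT1G43760.1.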


Sequences
CDS sequence
ATGGCTCAACTAGGAATGCGTTGTGATGAATTCTGGCAAGAGTTACATGATTTAGCTGGAATGGGGCGTGATCGATGGATCCATGGAGTTTCTTTGGAAACTCCTTTCTC
GAAAGAAGAGGTCTATCTTGCTGTCCAAACTTTGGGAACAAATAAGTCTCTTGGTCCTGATGGATTTACTGATGAATTCTTTAAGCACTCATGGACCACCATTAAACAAG
ATGTTATGGCAATTCTATCAGATGGCTTTGTGTCTAATCGCCAAATCCTGGATGCCTCTTTGATTGCAAGTGAATTGATTGATGATTGGAAAACCACTAATAAGAAAGGA
GTGGTAGGAGTGGTAATCAAGCTGGACTTGGAAAGGGCTTTTGATAAAGTTGATTGGAATTTTCTGGATGCGGTCCTACAAGCTAAAAGATTTGGACGAGGGGATCCACT
ATTGCCTTTTCTCTTTATATTGGTATTTGATTGTCTAAGCAGATTATTGACTTATAGTGCTCAGTTGGGTAAGATAACAACACATCCTATTGGTAACTCTTTGAACTTGA
CTCATTTACGATTTGCGAATGATACCCTGTTATTTTCTACCCTTGATTCTTTTATTGTGGATAATCTCTTTGATATTATAAAAGTCTTTGAGCTAGCATTTGGTTTGAAC
ATCAATTATGCCAAAAGTGAATTGTTGGGGATAAACGTTGATGACCTGGAAATAGAGGCTTTGACTTCAAATTTTGGTTGTAAACCGGGAGATTGGCCGGTGTCATATCT
TGGCCTTCCTTTAGGAGGTTCGAAGGATGATGGAGGTTTACATAACATTAACTGGAAGACGACTCAACTTCCCCATATTATGGGAGGCCTTGGTATTACAATAGAGATGG
ATTATGTTGGTGGCCAAAACCAATCACAGGAGGATCGTCAAAATCTCTTTGGATCTATTTGTAGTACCATTGATATTGTTGCTTGGGATATGAGGCTTCGTCGAAACCTC
AATGAGGAGGAAGTTCTGGAATGGGCTAGTCTACATCTTCTTTCATCGGTCATTTTAAGGAATTCTCAAGATTCTTGGTCTTGGCCTCTCGATTCCTCTCATGCCTTTAT
AGTTCGATCTTTGAGGGAAGATTCAGTCCATTCAAAAGGTCCAAATAGAAACAATCTTTATTCGATGATTTGGATGGACAAGAGGAGGACTTGGAGTGGATACCCCATCA
AGGCCACTGAGAGTGGAGATTGGAAAGGGAAACAACTAGTTGAACTAGTTGGAAAACTTGAAGAGAGCAGTTCGATTCTTTCTAACATGGAGTTTGACATGGAGTTAGAG
AATGTTAGCAAGAAGATGAGAGAGTTGTGGAATATCTTCAGCGAAGCAACTACGAAGTTGGAACGAAGCGTTGGAAAAATAAAGCAGAATTTCGAAGAATTGACAGAAGA
GTTTTCAGCAATGAAAAGAGAGTTTAGATTGGCCAAAGAAAAGGGGCAGAAATCGAGAAATCGATACAAGGCCAGAAAAAAAAGCAAAGTTGTCCAAGACTACCATGAAT
AA
mRNA sequence
ATGGCTCAACTAGGAATGCGTTGTGATGAATTCTGGCAAGAGTTACATGATTTAGCTGGAATGGGGCGTGATCGATGGATCCATGGAGTTTCTTTGGAAACTCCTTTCTC
GAAAGAAGAGGTCTATCTTGCTGTCCAAACTTTGGGAACAAATAAGTCTCTTGGTCCTGATGGATTTACTGATGAATTCTTTAAGCACTCATGGACCACCATTAAACAAG
ATGTTATGGCAATTCTATCAGATGGCTTTGTGTCTAATCGCCAAATCCTGGATGCCTCTTTGATTGCAAGTGAATTGATTGATGATTGGAAAACCACTAATAAGAAAGGA
GTGGTAGGAGTGGTAATCAAGCTGGACTTGGAAAGGGCTTTTGATAAAGTTGATTGGAATTTTCTGGATGCGGTCCTACAAGCTAAAAGATTTGGACGAGGGGATCCACT
ATTGCCTTTTCTCTTTATATTGGTATTTGATTGTCTAAGCAGATTATTGACTTATAGTGCTCAGTTGGGTAAGATAACAACACATCCTATTGGTAACTCTTTGAACTTGA
CTCATTTACGATTTGCGAATGATACCCTGTTATTTTCTACCCTTGATTCTTTTATTGTGGATAATCTCTTTGATATTATAAAAGTCTTTGAGCTAGCATTTGGTTTGAAC
ATCAATTATGCCAAAAGTGAATTGTTGGGGATAAACGTTGATGACCTGGAAATAGAGGCTTTGACTTCAAATTTTGGTTGTAAACCGGGAGATTGGCCGGTGTCATATCT
TGGCCTTCCTTTAGGAGGTTCGAAGGATGATGGAGGTTTACATAACATTAACTGGAAGACGACTCAACTTCCCCATATTATGGGAGGCCTTGGTATTACAATAGAGATGG
ATTATGTTGGTGGCCAAAACCAATCACAGGAGGATCGTCAAAATCTCTTTGGATCTATTTGTAGTACCATTGATATTGTTGCTTGGGATATGAGGCTTCGTCGAAACCTC
AATGAGGAGGAAGTTCTGGAATGGGCTAGTCTACATCTTCTTTCATCGGTCATTTTAAGGAATTCTCAAGATTCTTGGTCTTGGCCTCTCGATTCCTCTCATGCCTTTAT
AGTTCGATCTTTGAGGGAAGATTCAGTCCATTCAAAAGGTCCAAATAGAAACAATCTTTATTCGATGATTTGGATGGACAAGAGGAGGACTTGGAGTGGATACCCCATCA
AGGCCACTGAGAGTGGAGATTGGAAAGGGAAACAACTAGTTGAACTAGTTGGAAAACTTGAAGAGAGCAGTTCGATTCTTTCTAACATGGAGTTTGACATGGAGTTAGAG
AATGTTAGCAAGAAGATGAGAGAGTTGTGGAATATCTTCAGCGAAGCAACTACGAAGTTGGAACGAAGCGTTGGAAAAATAAAGCAGAATTTCGAAGAATTGACAGAAGA
GTTTTCAGCAATGAAAAGAGAGTTTAGATTGGCCAAAGAAAAGGGGCAGAAATCGAGAAATCGATACAAGGCCAGAAAAAAAAGCAAAGTTGTCCAAGACTACCATGAAT
AA
Protein sequence
MAQLGMRCDEFWQELHDLAGMGRDRWIHGVSLETPFSKEEVYLAVQTLGTNKSLGPDGFTDEFFKHSWTTIKQDVMAILSDGFVSNRQILDASLIASELIDDWKTTNKKG
VVGVVIKLDLERAFDKVDWNFLDAVLQAKRFGRGDPLLPFLFILVFDCLSRLLTYSAQLGKITTHPIGNSLNLTHLRFANDTLLFSTLDSFIVDNLFDIIKVFELAFGLN
INYAKSELLGINVDDLEIEALTSNFGCKPGDWPVSYLGLPLGGSKDDGGLHNINWKTTQLPHIMGGLGITIEMDYVGGQNQSQEDRQNLFGSICSTIDIVAWDMRLRRNL
NEEEVLEWASLHLLSSVILRNSQDSWSWPLDSSHAFIVRSLREDSVHSKGPNRNNLYSMIWMDKRRTWSGYPIKATESGDWKGKQLVELVGKLEESSSILSNMEFDMELE
NVSKKMRELWNIFSEATTKLERSVGKIKQNFEELTEEFSAMKREFRLAKEKGQKSRNRYKARKKSKVVQDYHE
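
The CDS and protein sequences above can be cross-checked by translating one into the other. The following is a minimal sketch that assumes the standard nuclear genetic code and reading frame 1. The `translate` helper is hypothetical and is not part of any CuGenDB tooling. Only the first and last few codons of the CDS are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 11 codons and last 5 codons of the CDS listed above.
cds_start = "ATGGCTCAACTAGGAATGCGTTGTGATGAATTC"
cds_end = "GACTACCATGAATAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MAQLGMRCDEF` and `DYHE`, matching the beginning and end of the protein sequence above. Pasting the full CDS into `translate` would be the complete check.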