; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008074 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008074
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr9:11441708..11447046
RNA-Seq ExpressionLag0008074
SyntenyLag0008074
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG69438.1 hypothetical protein EZV62_004373 [Acer yangbiense]2.0e-3531.14Show/hide
Query:  SDEAASAAAPPPLVEGATVDPQALSKALAPVF----DLRLRKLEIPVFSGENPYEWLHQVERYF------------------------------ERRPID
        S +   + +  P V G T+     S+  + VF    D R RKLE+PVF G NP  W+ + E YF                              ++RP+ 
Subjt:  SDEAASAAAPPPLVEGATVDPQALSKALAPVF----DLRLRKLEIPVFSGENPYEWLHQVERYF------------------------------ERRPID

Query:  CWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEK
         W++ + L+LK+F  T+EGSLHE+F AL+Q  TV EYR KF ++  PL+N+ +     +FI+GL  +I+ +LRV   + L   M++AQ++E +++     
Subjt:  CWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEK

Query:  YFGASAGPKPTTRVSGSVGQSKSPSYGAGSATRTVSINPSKLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVV--
                  T +++G    +K   Y    ++ T +  P+  +    P     P    +++TD+ELQ KR  GLCYRC+E++ PGH+CKKKEL V++   
Subjt:  YFGASAGPKPTTRVSGSVGQSKSPSYGAGSATRTVSINPSKLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVV--

Query:  ---QEEDRDSDFPMEDVAMQTAELAENGASDEVA
           +EE  ++   +++  ++ AE++E   + EV+
Subjt:  ---QEEDRDSDFPMEDVAMQTAELAENGASDEVA

XP_010270441.1 PREDICTED: uncharacterized protein LOC104606771 [Nelumbo nucifera]6.3e-3739.36Show/hide
Query:  RKLEIPVFSGENPYEWLHQVERYFE------------------------------RRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREK
        RKLE+P+F GENP  WL + ERYFE                              RRP   W +F+ LLL+RF  T+EG+L E+  +L Q +TV EYR  
Subjt:  RKLEIPVFSGENPYEWLHQVERYFE------------------------------RRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREK

Query:  FEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEKYFGASAGPKPTTRVSGSVGQSKSPSYGAGSATRTVSINPS
        FE  S PL +L E+ LE  F++GL  DI+ +LR  + VGL   M  AQK+E++       Y  +S    P T +SG    S+       + TR     P+
Subjt:  FEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEKYFGASAGPKPTTRVSGSVGQSKSPSYGAGSATRTVSINPS

Query:  KLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVVQE--EDRDSDFPMEDVAMQTAELA
            S+ P    PP   FKKMTD E+QQKR +GLC+RC+E++ PGHRC +K LQV+ VQ+  E+  SD    D      E+A
Subjt:  KLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVVQE--EDRDSDFPMEDVAMQTAELA

XP_015388504.1 uncharacterized protein LOC107178162 [Citrus sinensis]5.3e-3637.28Show/hide
Query:  DLRLRKLEIPVFSGENPYEWLHQVERYF------------------------------ERRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAE
        D R+RKL++P+F GE+ Y W+++ ERYF                              +R+P+  W +F+  LL+RF  T+EG LHE+FFAL Q+ TV E
Subjt:  DLRLRKLEIPVFSGENPYEWLHQVERYF------------------------------ERRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAE

Query:  YREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDE-ILHFEEKYFGASAGPKPTTRVSGSVGQS--------KSPSYG
        YREKFE  SG L  L EA LEG FI GLK +I+  LR+ +  GL  +ME+AQ +ED+  +    K        + +T + G   Q+         +PS G
Subjt:  YREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDE-ILHFEEKYFGASAGPKPTTRVSGSVGQS--------KSPSYG

Query:  AGSATRTVSINPSKLNAS-ARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVVQEEDRDS
               +  +  +   S AR   T      FK++T+ E+Q KR +G+C+RC+E+F PGHRCK K LQV+ V +E+ ++
Subjt:  AGSATRTVSINPSKLNAS-ARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVVQEEDRDS

XP_022848903.1 uncharacterized protein LOC111371244 [Olea europaea var. sylvestris]2.2e-3733.9Show/hide
Query:  STWKDQP---KRSDEAASAAAPPPLVEGAT--VDPQALSKALAPV--FDLRLRKLEIPVFSGENPYEWLHQVERYFE-----------------------
        S W+ Q    K    A S++   P VE  +   +P   +K L      D R R+LE+PVF GENP  WL + ERYF                        
Subjt:  STWKDQP---KRSDEAASAAAPPPLVEGAT--VDPQALSKALAPV--FDLRLRKLEIPVFSGENPYEWLHQVERYFE-----------------------

Query:  -------RRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQ
               R+    W+D +  LL RF P++EG+  E+F AL+Q+ TV +Y   FE  + PL  + E  LEG FI+GLK  I+ ++R+ K  GLD+ ME+AQ
Subjt:  -------RRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQ

Query:  KLED--EILHFEEKYFGASAGPKPTTRVSGSVGQSKSPSY---GAGSATRTVSINPSKLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFF
        ++ED  E++   +  +G       +   S   G SK P+    G+ S      +N   + +S R          FKK++D ELQ KRERGLCYRC+E+F 
Subjt:  KLED--EILHFEEKYFGASAGPKPTTRVSGSVGQSKSPSY---GAGSATRTVSINPSKLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFF

Query:  PGHRCKKKELQVMVVQEEDRDSDF---PMEDVAMQTAELAENGASDEVAQDQPR
        PGH+C+ KEL V+VVQ E+   +     +++   +  E+ E   +  V  + P+
Subjt:  PGHRCKKKELQVMVVQEEDRDSDF---PMEDVAMQTAELAENGASDEVAQDQPR

XP_038904464.1 uncharacterized protein LOC120090832 [Benincasa hispida]3.9e-3935.29Show/hide
Query:  VFDLRLRKLEIPVFS---GENPYEWLHQVERYF------------------------------ERRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQD
        +FD+RLRKLEIP+F    GE+P  W H+VERYF                              ER P++ W  F+  LL RFLP KE     +F  LKQD
Subjt:  VFDLRLRKLEIPVFS---GENPYEWLHQVERYF------------------------------ERRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQD

Query:  STVAEYREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEKYFGASAGPKPTTRVSGSVGQS-------KSP
         +V  YR +FE   G L++L +  LE KF+ GLK+DI++++R+ K +GL   M +AQ +ED      +K+ G +  P  +T  S + G +        +P
Subjt:  STVAEYREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEKYFGASAGPKPTTRVSGSVGQS-------KSP

Query:  SYGAGSATRTVSINPSKLNASARPETT----KPPDF---PFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVVQEEDRDSD
        +    S  RT+S+NP+    +     T        F    FK+++D ++Q +R++GLCY+CEE++ PGHRCK+KEL +++   E+  ++
Subjt:  SYGAGSATRTVSINPSKLNASARPETT----KPPDF---PFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVVQEEDRDSD

TrEMBL top hitse value%identityAlignment
A0A1S8ADV7 Aminoacyl-tRNA ligase7.5e-3634.97Show/hide
Query:  DLRLRKLEIPVFSGENPYEWLHQVERYF------------------------------ERRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAE
        D R+RKL++P+F GE+ Y W+++ ERYF                              +R+P+  W +F+  LL+RF  T+EG LHE+FFAL Q+ TV E
Subjt:  DLRLRKLEIPVFSGENPYEWLHQVERYF------------------------------ERRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAE

Query:  YREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDE----------ILHFEEKYFGASAGPKPTTR------VSGSVGQ
        YREKFE  SG L  L EA LEG F+ GLK +I+  LR+ +  GL  +ME+AQ +ED+          ++ F  +      G K  T        +   G 
Subjt:  YREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDE----------ILHFEEKYFGASAGPKPTTR------VSGSVGQ

Query:  SKSPSYGAGSATRTVSINPSKLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVVQEEDRDSD
         K    G    ++   ++ ++L  +            FK++T+ E+Q KR +G+C+RC+E+F PGHRCK K LQV+ V +E+   +
Subjt:  SKSPSYGAGSATRTVSINPSKLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVVQEEDRDSD

A0A1U8ARD0 uncharacterized protein LOC1046067713.0e-3739.36Show/hide
Query:  RKLEIPVFSGENPYEWLHQVERYFE------------------------------RRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREK
        RKLE+P+F GENP  WL + ERYFE                              RRP   W +F+ LLL+RF  T+EG+L E+  +L Q +TV EYR  
Subjt:  RKLEIPVFSGENPYEWLHQVERYFE------------------------------RRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREK

Query:  FEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEKYFGASAGPKPTTRVSGSVGQSKSPSYGAGSATRTVSINPS
        FE  S PL +L E+ LE  F++GL  DI+ +LR  + VGL   M  AQK+E++       Y  +S    P T +SG    S+       + TR     P+
Subjt:  FEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEKYFGASAGPKPTTRVSGSVGQSKSPSYGAGSATRTVSINPS

Query:  KLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVVQE--EDRDSDFPMEDVAMQTAELA
            S+ P    PP   FKKMTD E+QQKR +GLC+RC+E++ PGHRC +K LQV+ VQ+  E+  SD    D      E+A
Subjt:  KLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVVQE--EDRDSDFPMEDVAMQTAELA

A0A2I0X132 Putative mitochondrial protein3.7e-3534.7Show/hide
Query:  LRKLEIPVFSGENPYEWLHQVERYF------------------------------ERRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYRE
        +RKL++P+F GE+ Y W+ + ERYF                              ER+   CW +FR + L RF P KEG+ HE+FFAL Q  TV+ YR+
Subjt:  LRKLEIPVFSGENPYEWLHQVERYF------------------------------ERRPIDCWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYRE

Query:  KFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEK---YFGASAGPKPTTRVSGSVGQSKSPSYGAGSATRTVS
        +FE  S  L  + +  LEG F+ GLK  I+  +R AK   L  T+E+A+ +ED +   + +   +FG         +V+   G +K    GAG       
Subjt:  KFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEK---YFGASAGPKPTTRVSGSVGQSKSPSYGAGSATRTVS

Query:  INPSKLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVVQEEDRD
                    + + P    F+++T+AEL+ KR +GLCYRC+E+F PGHRCK K LQV++V++ + +
Subjt:  INPSKLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVVQEEDRD

A0A5C7HSW3 Chromo domain-containing protein4.9e-3531.66Show/hide
Query:  SDEAASAAAPPPLVEGATVDPQALSKALAPVF----DLRLRKLEIPVFSGENPYEWLHQVERYF------------------------------ERRPID
        S +   + +  P V G  +     S+  + VF    D R RKLE+PVF G NP  W+ + ERYF                              ++RP+ 
Subjt:  SDEAASAAAPPPLVEGATVDPQALSKALAPVF----DLRLRKLEIPVFSGENPYEWLHQVERYF------------------------------ERRPID

Query:  CWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEK
         W++ + L+LK+F  T+EGSLHE+F AL+Q  TV EYR KF ++   L+N+ +     +FI+GL  +I+ +LRV   + LD  M++AQ++E +++     
Subjt:  CWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEK

Query:  YFGASAGPKPTTRVSGSVGQSKSPSYGAGSATRTVSINPSKLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVV--
                  T +++G    +K   Y    ++ T +  P+  +    P  +  P    +++TD+ELQ KR  GLCYRC+E++ PGH+CKKKEL V++   
Subjt:  YFGASAGPKPTTRVSGSVGQSKSPSYGAGSATRTVSINPSKLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVV--

Query:  ---QEEDRDSDFPMEDVAMQTAELAE----NGASDEVA
           +EE  ++   + +  ++ AE++E    +GAS+ VA
Subjt:  ---QEEDRDSDFPMEDVAMQTAELAE----NGASDEVA

A0A5C7IJS7 Uncharacterized protein9.8e-3631.14Show/hide
Query:  SDEAASAAAPPPLVEGATVDPQALSKALAPVF----DLRLRKLEIPVFSGENPYEWLHQVERYF------------------------------ERRPID
        S +   + +  P V G T+     S+  + VF    D R RKLE+PVF G NP  W+ + E YF                              ++RP+ 
Subjt:  SDEAASAAAPPPLVEGATVDPQALSKALAPVF----DLRLRKLEIPVFSGENPYEWLHQVERYF------------------------------ERRPID

Query:  CWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEK
         W++ + L+LK+F  T+EGSLHE+F AL+Q  TV EYR KF ++  PL+N+ +     +FI+GL  +I+ +LRV   + L   M++AQ++E +++     
Subjt:  CWDDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEK

Query:  YFGASAGPKPTTRVSGSVGQSKSPSYGAGSATRTVSINPSKLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVV--
                  T +++G    +K   Y    ++ T +  P+  +    P     P    +++TD+ELQ KR  GLCYRC+E++ PGH+CKKKEL V++   
Subjt:  YFGASAGPKPTTRVSGSVGQSKSPSYGAGSATRTVSINPSKLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVV--

Query:  ---QEEDRDSDFPMEDVAMQTAELAENGASDEVA
           +EE  ++   +++  ++ AE++E   + EV+
Subjt:  ---QEEDRDSDFPMEDVAMQTAELAENGASDEVA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding3.0e-0522.63Show/hide
Query:  KDSTWKDQPKRSDEA-ASAAAPPPLVEGATVDPQALSKALAPVFDLRLRKL--EIPVFSGENPY------------------EWLHQVERYFERRPIDCW
        +D+ WK Q K+S  A       PPLV+ +  + +     ++   D  LR+       + GEN                    +W   ++  +++     W
Subjt:  KDSTWKDQPKRSDEA-ASAAAPPPLVEGATVDPQALSKALAPVFDLRLRKL--EIPVFSGENPY------------------EWLHQVERYFERRPIDCW

Query:  DDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLED
         +F+ ++ +    T + +    +  ++Q+ +V EYRE+FE        L    LE  F+ GL+  ++  +R  K  G+   M+ AQ LE+
Subjt:  DDFRRLLLKRFLPTKEGSLHERFFALKQDSTVAEYREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGCAACAGCTCTTACAAGCGCTCGGACGCTGGATGATGACGTCCCTAGGACGGAGCAGTCCCAAGAAGTGGGCACGTCGTCTGGTGCCAAAAGAAGCATGAAGAG
GAAGTGTTTGCACCTAGACAGAAAGTCAGAAGGTCTCTGTCAGGGACCAATAGCTCCTAAGATGGGGAGTCAATGCGCCCCTCGCTCAAGCGTAAGTTGTTGTCACGACA
AAGTTACCTTGGTTGGTGAAGTTGTTGATAAGCTCCTGCAGGTTGGAGTCGTGTTTAAGTCGTTGATGGTAGGTTCTAAAGTGCCTAACTCAAGGTTTAAGCTGACATTG
ACAACCTATGTGGTGATTCTTCGCTTGTTGAAAAGTAGCTTAATCGGGGTTCATTGGACATATGAGATCTTGGAGCGCATCGAGCCTGTTGCTTACAGGTTGGCGTTATC
ACCGACCATGTCGACTGTGCATGACTCTGTCAAGGTCCTAACCAATAAAGCAAAGGTTCTGAGAAGCAAGACCATTGTTTTGGTAGAAGTGCTATGGGTGAACCATAGAG
CAGAAGAAGCTACCTGGGAAACCGAGAAAGCCCTCCCTTGCCGGAATCGAGCTTCAGTTAGCCGCCGCCGCCTTCCTCTTTCAGCCGCGCCGCCGCCTCCCCTGCCGAAG
CTTCAGCCGCCGTCGCGCCTGAATCGTGGAAGTCGCCCCTCTATCTCGCGAATCGCTCTCTCTGTCCGTGGGTTGTTCGCCATGGCTCCCTCCCTCTCTTCGCTCGGAGT
CTCTTTTCCTCGCGTTTTCGCCTCTGTCCAGCTTCTATTTCAGGTGTTTTCGAGGTCATTTGACGTCTTTGCGCCGTCTAAGTGTTCGATTGAGTTCGAAACACTTCGAC
TCGAATACCCACTGCCCAAGGAGCGTTCTAGCACACTGTTCGAGGGTGTAGGCACTTTTTGTGTGGTGAGTAACTGGGGCAAGGGGAGAGGACTAGGAAGCTTTCGAATT
CTTCCTAAGGGCGGAAAAGACTCCACCTGGAAGGACCAACCGAAACGCTCAGATGAAGCGGCCAGTGCAGCGGCACCTCCGCCTCTGGTAGAGGGTGCGACGGTGGATCC
GCAGGCCTTGTCCAAAGCGTTAGCTCCGGTGTTTGACTTACGACTTCGGAAGTTGGAGATTCCGGTGTTCTCTGGTGAGAATCCATATGAATGGCTTCACCAGGTCGAAC
GGTACTTTGAGCGTCGACCGATTGATTGCTGGGACGACTTCCGGCGACTTCTGTTAAAACGATTCTTGCCGACTAAGGAAGGGAGTCTTCATGAGCGTTTTTTCGCTTTG
AAACAAGATTCCACGGTGGCCGAATATCGGGAGAAGTTCGAAGATTATTCGGGACCTTTGGAGAATTTGGACGAGGCGACTTTGGAGGGGAAGTTCATCGATGGGTTGAA
GGATGACATCAAGATGAAGCTCCGGGTGGCTAAATCGGTTGGGCTGGACATGACGATGGAGATAGCCCAGAAATTGGAGGACGAGATCCTTCATTTCGAAGAGAAGTATT
TTGGGGCCTCTGCTGGCCCAAAGCCCACGACCCGTGTGTCTGGTTCGGTGGGACAGTCCAAGTCTCCATCCTATGGGGCTGGATCCGCCACCCGGACTGTGTCCATTAAC
CCGAGCAAGTTGAACGCGTCGGCCCGACCCGAGACAACAAAACCACCTGATTTTCCGTTTAAGAAGATGACTGATGCCGAGCTTCAACAGAAACGGGAACGAGGTCTTTG
TTATCGCTGTGAAGAAAGGTTTTTCCCGGGGCACCGTTGTAAGAAGAAGGAACTCCAGGTGATGGTGGTGCAGGAAGAAGACCGGGATTCTGATTTCCCTATGGAGGATG
TGGCCATGCAAACCGCGGAGTTGGCCGAGAACGGTGCGAGCGATGAAGTTGCTCAAGATCAACCGAGGTGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCGCAACAGCTCTTACAAGCGCTCGGACGCTGGATGATGACGTCCCTAGGACGGAGCAGTCCCAAGAAGTGGGCACGTCGTCTGGTGCCAAAAGAAGCATGAAGAG
GAAGTGTTTGCACCTAGACAGAAAGTCAGAAGGTCTCTGTCAGGGACCAATAGCTCCTAAGATGGGGAGTCAATGCGCCCCTCGCTCAAGCGTAAGTTGTTGTCACGACA
AAGTTACCTTGGTTGGTGAAGTTGTTGATAAGCTCCTGCAGGTTGGAGTCGTGTTTAAGTCGTTGATGGTAGGTTCTAAAGTGCCTAACTCAAGGTTTAAGCTGACATTG
ACAACCTATGTGGTGATTCTTCGCTTGTTGAAAAGTAGCTTAATCGGGGTTCATTGGACATATGAGATCTTGGAGCGCATCGAGCCTGTTGCTTACAGGTTGGCGTTATC
ACCGACCATGTCGACTGTGCATGACTCTGTCAAGGTCCTAACCAATAAAGCAAAGGTTCTGAGAAGCAAGACCATTGTTTTGGTAGAAGTGCTATGGGTGAACCATAGAG
CAGAAGAAGCTACCTGGGAAACCGAGAAAGCCCTCCCTTGCCGGAATCGAGCTTCAGTTAGCCGCCGCCGCCTTCCTCTTTCAGCCGCGCCGCCGCCTCCCCTGCCGAAG
CTTCAGCCGCCGTCGCGCCTGAATCGTGGAAGTCGCCCCTCTATCTCGCGAATCGCTCTCTCTGTCCGTGGGTTGTTCGCCATGGCTCCCTCCCTCTCTTCGCTCGGAGT
CTCTTTTCCTCGCGTTTTCGCCTCTGTCCAGCTTCTATTTCAGGTGTTTTCGAGGTCATTTGACGTCTTTGCGCCGTCTAAGTGTTCGATTGAGTTCGAAACACTTCGAC
TCGAATACCCACTGCCCAAGGAGCGTTCTAGCACACTGTTCGAGGGTGTAGGCACTTTTTGTGTGGTGAGTAACTGGGGCAAGGGGAGAGGACTAGGAAGCTTTCGAATT
CTTCCTAAGGGCGGAAAAGACTCCACCTGGAAGGACCAACCGAAACGCTCAGATGAAGCGGCCAGTGCAGCGGCACCTCCGCCTCTGGTAGAGGGTGCGACGGTGGATCC
GCAGGCCTTGTCCAAAGCGTTAGCTCCGGTGTTTGACTTACGACTTCGGAAGTTGGAGATTCCGGTGTTCTCTGGTGAGAATCCATATGAATGGCTTCACCAGGTCGAAC
GGTACTTTGAGCGTCGACCGATTGATTGCTGGGACGACTTCCGGCGACTTCTGTTAAAACGATTCTTGCCGACTAAGGAAGGGAGTCTTCATGAGCGTTTTTTCGCTTTG
AAACAAGATTCCACGGTGGCCGAATATCGGGAGAAGTTCGAAGATTATTCGGGACCTTTGGAGAATTTGGACGAGGCGACTTTGGAGGGGAAGTTCATCGATGGGTTGAA
GGATGACATCAAGATGAAGCTCCGGGTGGCTAAATCGGTTGGGCTGGACATGACGATGGAGATAGCCCAGAAATTGGAGGACGAGATCCTTCATTTCGAAGAGAAGTATT
TTGGGGCCTCTGCTGGCCCAAAGCCCACGACCCGTGTGTCTGGTTCGGTGGGACAGTCCAAGTCTCCATCCTATGGGGCTGGATCCGCCACCCGGACTGTGTCCATTAAC
CCGAGCAAGTTGAACGCGTCGGCCCGACCCGAGACAACAAAACCACCTGATTTTCCGTTTAAGAAGATGACTGATGCCGAGCTTCAACAGAAACGGGAACGAGGTCTTTG
TTATCGCTGTGAAGAAAGGTTTTTCCCGGGGCACCGTTGTAAGAAGAAGGAACTCCAGGTGATGGTGGTGCAGGAAGAAGACCGGGATTCTGATTTCCCTATGGAGGATG
TGGCCATGCAAACCGCGGAGTTGGCCGAGAACGGTGCGAGCGATGAAGTTGCTCAAGATCAACCGAGGTGGTAG
Protein sequenceShow/hide protein sequence
MFATALTSARTLDDDVPRTEQSQEVGTSSGAKRSMKRKCLHLDRKSEGLCQGPIAPKMGSQCAPRSSVSCCHDKVTLVGEVVDKLLQVGVVFKSLMVGSKVPNSRFKLTL
TTYVVILRLLKSSLIGVHWTYEILERIEPVAYRLALSPTMSTVHDSVKVLTNKAKVLRSKTIVLVEVLWVNHRAEEATWETEKALPCRNRASVSRRRLPLSAAPPPPLPK
LQPPSRLNRGSRPSISRIALSVRGLFAMAPSLSSLGVSFPRVFASVQLLFQVFSRSFDVFAPSKCSIEFETLRLEYPLPKERSSTLFEGVGTFCVVSNWGKGRGLGSFRI
LPKGGKDSTWKDQPKRSDEAASAAAPPPLVEGATVDPQALSKALAPVFDLRLRKLEIPVFSGENPYEWLHQVERYFERRPIDCWDDFRRLLLKRFLPTKEGSLHERFFAL
KQDSTVAEYREKFEDYSGPLENLDEATLEGKFIDGLKDDIKMKLRVAKSVGLDMTMEIAQKLEDEILHFEEKYFGASAGPKPTTRVSGSVGQSKSPSYGAGSATRTVSIN
PSKLNASARPETTKPPDFPFKKMTDAELQQKRERGLCYRCEERFFPGHRCKKKELQVMVVQEEDRDSDFPMEDVAMQTAELAENGASDEVAQDQPRW