CuGenDBv2

Lag0018325 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018325
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr5:23051057..23056038
RNA-Seq Expression: Lag0018325
Synteny: Lag0018325
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
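The genome location field uses a `chromosome:start..end` string. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr5:23051057..23056038")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 4982 bp on chr5; with a half-open convention the figure would be one less.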


Homology
GenBank top hits (columns: e value, % identity, alignment)
WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002] (e value: 1.9e-71; identity: 44.27%)
Query:  EEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGT---------------------DKAQDWLQSITP
        E P+ IRDYFQP     Q GI+  PIN NNFELK GLIQMAR+ A+RG   EDP+ HL+SFL+ICGT                     D+A+DWL++I P
Subjt:  EEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGT---------------------DKAQDWLQSITP

Query:  GSITTWDALVQAFLKKFFPPAK------------------------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKTVENARTL
         SITTW+ L QAFL K+FPPAK                                           +QLFYNGL  ST++I+DA A G++ SK  + A T+
Subjt:  GSITTWDALVQAFLKKFFPPAK------------------------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKTVENARTL

Query:  LEDMATNSYQWPSKRSAPK-KIVAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQ----SIESTAALASR-SQEETIEQVQYVSNFNSRGYNNNSTPTH
        LED+AT SY WP +R++P     AG++EVD+V++L+AQM SL NA  K    G AQ    SI S AALAS        E   YV   + R Y +   PTH
Subjt:  LEDMATNSYQWPSKRSAPK-KIVAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQ----SIESTAALASR-SQEETIEQVQYVSNFNSRGYNNNSTPTH

Query:  YHPNNRNHENFSYAKTKNVLN-PLGF-APQTQENKKLEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQLE
        YHPN RNHENFSYA  KNVL  P GF      +   LED++  F+ ES +RTT LE +V AI +TV      ++N+E QL Q++
Subjt:  YHPNNRNHENFSYAKTKNVLN-PLGF-APQTQENKKLEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQLE

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber] (e value: 4.5e-60; identity: 40.16%)
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGT---------------------DKAQD
        MA+     +PR ++DY +P+     SGI    INANNFELK  LI M +   + GSP +DPN HL  FL+IC T                     DKA+ 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGT---------------------DKAQD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAK------------------------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKT
        WLQS+ PGSIT+W  + + FL KFFPPAK                                           VQ+FYNGL   T TIVDA + GTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAK------------------------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKT

Query:  VENARTLLEDMATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIESTAA--LASRSQEETIEQVQYVSNFNSRGYNNNST
         E A +LLE+MA+N+YQWP++R+  KK VAG+ E++  +AL AQ+ SL++           Q  E  AA  +     E + EQVQY++N N   Y  N  
Subjt:  VENARTLLEDMATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIESTAA--LASRSQEETIEQVQYVSNFNSRGYNNNST

Query:  PTHYHPNNRNHENFSYAKTKNVLN-PLGFAPQTQENK-KLEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQL
        P +YHP  RNHENFSY  TKNVL  P GF  Q  E K  LED + +F+ E+     K +  +  I T  +   AT+KN+E Q+GQL
Subjt:  PTHYHPNNRNHENFSYAKTKNVLN-PLGFAPQTQENK-KLEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQL

XP_023881727.1 uncharacterized protein LOC111994101 [Quercus suber] (e value: 1.3e-59; identity: 42.82%)
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGT---------------------DKAQD
        MA+     +PR ++DY +P+     SGI    INANNFELK  LI M +   + GSP +DPN HL  FL+IC T                     DKA+ 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGT---------------------DKAQD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAKMVQL-----------FYNGLTPSTETIVDAVASGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKIVAG
        WLQS+ PGSIT+W  + +  L KFFP AK  QL           F +       TIVDA + GTL+SKT E A +LLE+MA+N+YQWP++R+  KK VAG
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAKMVQL-----------FYNGLTPSTETIVDAVASGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKIVAG

Query:  VFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIESTAA--LASRSQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYAKTKNVLNPL-GFAP
        + E++  +AL AQ+ SL++           QS E  AA  +     E + EQVQY++N N   Y  N  P +YHP  RNHENFSY  TKNVL PL GF  
Subjt:  VFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIESTAA--LASRSQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYAKTKNVLNPL-GFAP

Query:  QTQENK-KLEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQL
        Q  E K  LED + +F+ E+  R  K +  +  I T  +   ATIKN+E Q+GQL
Subjt:  QTQENK-KLEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQL

XP_023903214.1 uncharacterized protein LOC112015077 [Quercus suber] (e value: 4.0e-61; identity: 40.67%)
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGT---------------------DKAQD
        MA+     +PR ++DY +P+     SGI    INANNFELK  LI M +   + GSP +DPN HL  FL+IC T                     DKA+ 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGT---------------------DKAQD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAK------------------------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKT
        WLQS+ PGSIT+W  + + FL KFFPPAK                                           VQ+FYNGL   T TIVDA + GTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAK------------------------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKT

Query:  VENARTLLEDMATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIESTAA--LASRSQEETIEQVQYVSNFNSRGYNNNST
         E A +LLE+MA+N+YQWP++R+  KK VAG+ E++  +AL AQ+ SL++           QS E  AA  +     E + EQVQY++N N   Y  N  
Subjt:  VENARTLLEDMATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIESTAA--LASRSQEETIEQVQYVSNFNSRGYNNNST

Query:  PTHYHPNNRNHENFSYAKTKNVLN-PLGFAPQTQENK-KLEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQL
        P +YHP  RNHENFSY  TKNVL  P GF  Q  E K  LED + +F+ E+  R  K +  +  I T  +   AT+KN+E Q+GQL
Subjt:  PTHYHPNNRNHENFSYAKTKNVLN-PLGFAPQTQENK-KLEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQL

XP_023929660.1 uncharacterized protein LOC112040975 [Quercus suber] (e value: 1.7e-59; identity: 39.64%)
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICG---------------------TDKAQD
        MA+     +PR ++DY +P+     SGI +  INANNFEL   LI M +   + GSP +DPN HL  FL+IC                       DKA+ 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICG---------------------TDKAQD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAK------------------------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKT
        WLQS+ PGSIT+W  + + FL KFFPPAK                                           VQ+FYNGL   T TIVDA + GTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAK------------------------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKT

Query:  VENARTLLEDMATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIESTAA--LASRSQEETIEQVQYVSNFNSRGYNNNST
         E A +LLE+MA+N YQWP++R+  KK VAG+ E++  +AL AQ+ SL++           QS+E  AA  +     E + E VQY++N N   Y+ N  
Subjt:  VENARTLLEDMATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIESTAA--LASRSQEETIEQVQYVSNFNSRGYNNNST

Query:  PTHYHPNNRNHENFSYAKTKNVLN-PLGFAPQTQENK-KLEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQL
        P +YHP  RNHENFSY  TKNVL  P GF  Q  E K  LED + +F+ E+  R  K +  +  I T  +   A +KN+E Q+GQL
Subjt:  PTHYHPNNRNHENFSYAKTKNVLN-PLGFAPQTQENK-KLEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQL

TrEMBL top hits (columns: e value, % identity, alignment)
A0A2I4E1Q5 uncharacterized protein LOC108985472 (e value: 3.7e-44; identity: 43.62%)
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTDKAQ----DWLQ-SITPGSITTWDA-
        MAD     +PR ++DY +PV     SGI    INANNFELK  LI M +   + GSP +DPN HL  FL+IC T K      D ++  + P S+      
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTDKAQ----DWLQ-SITPGSITTWDA-

Query:  LVQAFLKKFFPPAKMVQLFYNGLTPSTETIVDAVASGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMTSLANAFMKFLG
        L++   +   P    VQ+FYNGL   T TIVDAV+ GTL+SKT E A  LLE+M +N+YQWP +RS  KK VAG+ E++ ++AL AQ+ +L++  +    
Subjt:  LVQAFLKKFFPPAKMVQLFYNGLTPSTETIVDAVASGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMTSLANAFMKFLG

Query:  TGSAQSIE--STAALASRSQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYAKTKNVLNPL---GFAPQTQENK
            QS E  + A++   S E   EQVQY++N N   Y  N  P +YH   RNHEN SY+ TKNVL P    GF  Q  E K
Subjt:  TGSAQSIE--STAALASRSQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYAKTKNVLNPL---GFAPQTQENK

A0A2I4G4Q3 uncharacterized protein LOC109004712 (e value: 3.5e-42; identity: 37.61%)
Query:  MARDCAYRGSPTEDPNSHLKSFLDICGT---------------------DKAQDWLQSITPGSITTWDALVQAFLKKFFPPAK-----------------
        M +   + GSP +DPN HL  FL+IC T                     D+A+ WLQS+ P SIT+W  + + F  KFFPPAK                 
Subjt:  MARDCAYRGSPTEDPNSHLKSFLDICGT---------------------DKAQDWLQSITPGSITTWDALVQAFLKKFFPPAK-----------------

Query:  -------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMT
                                  VQ+FYNGL   T TIVD  + GTL+ KT+E A  LLE+MA+N+YQWP +R+  KK VA + E++ ++AL AQ+ 
Subjt:  -------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMT

Query:  SLANAFMKFLGTGSAQSIESTAA--LASRSQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYAKTKNVLNPL---GFAPQTQENK-KLEDLI
        +L++           QS E   A  +   S E + EQVQY++N N   Y  N  P +YHP  +NHEN SY  TKNVL P    GF  Q+ E K  LED +
Subjt:  SLANAFMKFLGTGSAQSIESTAA--LASRSQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYAKTKNVLNPL---GFAPQTQENK-KLEDLI

Query:  GAFIAESSNRTTKLEEAVIAINTTVNGHSATI-KNIETQLGQL
         +FI E++ R  K +  +  I T  +   A I KNIE Q+GQL
Subjt:  GAFIAESSNRTTKLEEAVIAINTTVNGHSATI-KNIETQLGQL

A0A6J1DU19 uncharacterized protein LOC111024361 (e value: 7.5e-45; identity: 35.96%)
Query:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTDKAQDWLQS-----ITPGSITTWDALVQAFLKKFFPPA
        IRDY QP F     GI+  PINANN ELK GLIQM R+  +RG+ TEDPN+HL  FLD+CGT K    +       + P S+   + +VQAFL  FFPPA
Subjt:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTDKAQDWLQS-----ITPGSITTWDALVQAFLKKFFPPA

Query:  K------------------------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKI
        K                                           +Q+FYNGL   T TI+DA A GTLLS+T ENA  LL+DMA NS+QWPS+RS  KK 
Subjt:  K------------------------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKI

Query:  VAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIESTAALASRS-QEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYAKTKNVLNPLGFA
        VAG++E+D++S+L+AQ+ +L NA  K  G G++ S E  AA  + S  E TIEQ Q+ S                HP                       
Subjt:  VAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIESTAALASRS-QEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYAKTKNVLNPLGFA

Query:  PQTQENKKLEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQLEEAEEEPESEDYDT-----PTGEAEEDTSYDEDEKPEPEPPIPSP
           ++   LEDL+GAFI E  +R +++E  V  +   + G++ +IKN+E Q+GQ+       +   + +     P    +  T     E  EPE      
Subjt:  PQTQENKKLEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQLEEAEEEPESEDYDT-----PTGEAEEDTSYDEDEKPEPEPPIPSP

Query:  PLMALE
        P++  E
Subjt:  PLMALE

A0A6P6XAQ1 Reverse transcriptase (e value: 5.4e-43; identity: 34.24%)
Query:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGT---------------------DKAQDWLQSITPGSI
        R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y G+ TEDPNSHL +FL+IC T                     DKA+ WLQS  P + 
Subjt:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGT---------------------DKAQDWLQSITPGSI

Query:  TTWDALVQAFLKKFFPPAK------------------------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKTVENARTLLED
        TTWD L +AFL KFFPP K                                          +VQ FYNGLT  T+T VDA A G L+ KT E A+ L+E+
Subjt:  TTWDALVQAFLKKFFPPAK------------------------------------------MVQLFYNGLTPSTETIVDAVASGTLLSKTVENARTLLED

Query:  MATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIESTAALASRSQEE-----TIEQVQYVSNFNSRGYNNNSTPTHYHPN
        MA N+YQW ++R   ++  AG+ EVD ++ L A+M ++     + +G+ S Q +   +        +     + EQVQY++N+N R   NN     Y+P 
Subjt:  MATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIESTAALASRSQEE-----TIEQVQYVSNFNSRGYNNNSTPTHYHPN

Query:  NRNHENFSYAKTKN---VLNPLGFAPQ--TQENKKLEDLIGAFIAESSN-RTTKLEEAVIAINTTVNGHSATI----KNIETQLGQLEEAEEEPESEDYD
         RNH NF +    N    +NP GF  +    E+K   +L    +A +SN +  KL  A       + G    +    +N+E QLGQ+  A       D  
Subjt:  NRNHENFSYAKTKN---VLNPLGFAPQ--TQENKKLEDLIGAFIAESSN-RTTKLEEAVIAINTTVNGHSATI----KNIETQLGQLEEAEEEPESEDYD

Query:  TPT
        + T
Subjt:  TPT

A0A803PT47 Uncharacterized protein (e value: 3.0e-46; identity: 35.45%)
Query:  EPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICG---------------------TDKAQDWLQSITPG
        +PR +RDYF PV                       LI M +   +    TEDPN HL  FL++C                       D+ + WLQS+ P 
Subjt:  EPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICG---------------------TDKAQDWLQSITPG

Query:  SITTWDALVQAFLKKFFPPAKMVQL------------------------------------------FYNGLTPSTETIVDAVASGTLLSKTVENARTLL
        SI+TWD + + F+ KFFPP+K  QL                                          FYNGL   T T++DA   G LLSK +  A  LL
Subjt:  SITTWDALVQAFLKKFFPPAKMVQL------------------------------------------FYNGLTPSTETIVDAVASGTLLSKTVENARTLL

Query:  EDMATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIES-TAALASRSQEETIEQVQYVS-NFNSRGYNNNSTPTHYHPNN
        E+MATNSY WP++R+  KK+ AG+ EVD ++ + AQ+++L+N     +   +  ++E+  AA  S+  E +IEQ QY++    +  Y  N  P +YHP  
Subjt:  EDMATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIES-TAALASRSQEETIEQVQYVS-NFNSRGYNNNSTPTHYHPNN

Query:  RNHENFSYAKTKNVLN-PLGFAPQTQENKK-LEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQL
        RNHEN SY  TKNVL  P GF  Q QE+KK LED++G F+ ES  R  K E  +  I T ++   A++KNIE Q+ +L
Subjt:  RNHENFSYAKTKNVLN-PLGFAPQTQENKK-LEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQL

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCTGACCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGACTACTTTCAGCCCGTGTTTCAGGGGCAACAATCTGGGATTGTCTATGCCCCGATCAATGCCAACAA
CTTTGAGCTGAAGACCGGTCTCATTCAGATGGCCCGAGACTGTGCATATAGAGGATCACCCACCGAGGATCCAAACTCTCATCTAAAATCATTTTTGGACATTTGTGGGA
CGGATAAAGCACAAGATTGGTTGCAGTCTATTACCCCTGGGAGCATCACCACCTGGGATGCTTTGGTTCAGGCCTTTTTGAAGAAATTTTTCCCTCCTGCAAAGATGGTT
CAATTGTTTTATAATGGTCTAACTCCTAGTACAGAAACGATTGTTGATGCAGTTGCAAGTGGGACTCTGTTGTCCAAGACCGTGGAAAATGCTCGCACACTTCTAGAGGA
TATGGCCACCAACAGCTATCAGTGGCCATCTAAGCGGTCTGCACCTAAAAAGATTGTTGCTGGAGTGTTTGAGGTTGACAAGGTAAGTGCACTCCAGGCCCAGATGACCT
CCCTTGCTAATGCTTTTATGAAATTTTTAGGTACAGGGAGTGCACAGTCAATTGAATCAACTGCTGCTTTAGCATCTAGATCTCAGGAGGAGACCATCGAGCAGGTTCAG
TATGTATCAAATTTTAATTCTAGGGGGTATAATAATAATTCTACACCTACTCATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCAAAGACTAAGAATGT
TCTTAACCCCCTTGGTTTTGCCCCTCAAACTCAAGAAAATAAAAAGCTAGAGGATCTTATTGGAGCTTTCATTGCAGAGTCGAGTAACAGGACAACCAAATTAGAGGAGG
CAGTCATTGCCATCAACACCACGGTGAATGGCCACAGTGCAACCATCAAGAACATTGAGACTCAGCTGGGACAGTTGGAGGAAGCTGAAGAGGAGCCTGAGTCTGAGGAT
TATGATACTCCTACTGGGGAAGCTGAGGAGGACACATCATATGATGAAGATGAAAAGCCGGAACCTGAGCCTCCTATTCCTTCTCCTCCCTTGATGGCGTTAGAGATGTC
ACAATATAACAGGTTCATGAGAATAACCCTAGGAGTGACCCCCTACGGATGGTGTTTGCATGGATCAATATCAAGGTGCCACTTGTCACGTTTGTTTTTCCTTTCAAGGA
ATAAGCTCATGGCAACAGTTGGAAATAGATCTTTTACTGGAAAAGAGCTAGAGAGCCAAAGAGCCCTTGAAGCTTGTACATTTGGATCCTTGTGGTCTACTACGGAAACA
TCAACAAGAGTTGTTGATGAATCAACAAAAGTTGTTGATAAGACTGGTTATTCAACAAGAGTTGTTGATAAGACTGGTTATTCAACAAGAGTTGTTGATAAGGCCAGTGC
TTTTGGTCAGTCACATCCATCTCAAGAGTTGTATCTTTGGACAATGAGGAACTATATGCTCGTGTATAGTGCTAAGGATTTGATCCTTACAGGTTACACTGTCATAGATG
TTTTAATTGCGATCGAGGCTATAAGACTTGGAAAGTTCGAGGCTGGTTTGGAAAGTGTTCCAAATATGACCCTTCCTAATGCAAGACAAGCATGCACATTGAATTGA
mRNA sequence
ATGGCTGACCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGACTACTTTCAGCCCGTGTTTCAGGGGCAACAATCTGGGATTGTCTATGCCCCGATCAATGCCAACAA
CTTTGAGCTGAAGACCGGTCTCATTCAGATGGCCCGAGACTGTGCATATAGAGGATCACCCACCGAGGATCCAAACTCTCATCTAAAATCATTTTTGGACATTTGTGGGA
CGGATAAAGCACAAGATTGGTTGCAGTCTATTACCCCTGGGAGCATCACCACCTGGGATGCTTTGGTTCAGGCCTTTTTGAAGAAATTTTTCCCTCCTGCAAAGATGGTT
CAATTGTTTTATAATGGTCTAACTCCTAGTACAGAAACGATTGTTGATGCAGTTGCAAGTGGGACTCTGTTGTCCAAGACCGTGGAAAATGCTCGCACACTTCTAGAGGA
TATGGCCACCAACAGCTATCAGTGGCCATCTAAGCGGTCTGCACCTAAAAAGATTGTTGCTGGAGTGTTTGAGGTTGACAAGGTAAGTGCACTCCAGGCCCAGATGACCT
CCCTTGCTAATGCTTTTATGAAATTTTTAGGTACAGGGAGTGCACAGTCAATTGAATCAACTGCTGCTTTAGCATCTAGATCTCAGGAGGAGACCATCGAGCAGGTTCAG
TATGTATCAAATTTTAATTCTAGGGGGTATAATAATAATTCTACACCTACTCATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCAAAGACTAAGAATGT
TCTTAACCCCCTTGGTTTTGCCCCTCAAACTCAAGAAAATAAAAAGCTAGAGGATCTTATTGGAGCTTTCATTGCAGAGTCGAGTAACAGGACAACCAAATTAGAGGAGG
CAGTCATTGCCATCAACACCACGGTGAATGGCCACAGTGCAACCATCAAGAACATTGAGACTCAGCTGGGACAGTTGGAGGAAGCTGAAGAGGAGCCTGAGTCTGAGGAT
TATGATACTCCTACTGGGGAAGCTGAGGAGGACACATCATATGATGAAGATGAAAAGCCGGAACCTGAGCCTCCTATTCCTTCTCCTCCCTTGATGGCGTTAGAGATGTC
ACAATATAACAGGTTCATGAGAATAACCCTAGGAGTGACCCCCTACGGATGGTGTTTGCATGGATCAATATCAAGGTGCCACTTGTCACGTTTGTTTTTCCTTTCAAGGA
ATAAGCTCATGGCAACAGTTGGAAATAGATCTTTTACTGGAAAAGAGCTAGAGAGCCAAAGAGCCCTTGAAGCTTGTACATTTGGATCCTTGTGGTCTACTACGGAAACA
TCAACAAGAGTTGTTGATGAATCAACAAAAGTTGTTGATAAGACTGGTTATTCAACAAGAGTTGTTGATAAGACTGGTTATTCAACAAGAGTTGTTGATAAGGCCAGTGC
TTTTGGTCAGTCACATCCATCTCAAGAGTTGTATCTTTGGACAATGAGGAACTATATGCTCGTGTATAGTGCTAAGGATTTGATCCTTACAGGTTACACTGTCATAGATG
TTTTAATTGCGATCGAGGCTATAAGACTTGGAAAGTTCGAGGCTGGTTTGGAAAGTGTTCCAAATATGACCCTTCCTAATGCAAGACAAGCATGCACATTGAATTGA
Protein sequence
MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTDKAQDWLQSITPGSITTWDALVQAFLKKFFPPAKMV
QLFYNGLTPSTETIVDAVASGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKIVAGVFEVDKVSALQAQMTSLANAFMKFLGTGSAQSIESTAALASRSQEETIEQVQ
YVSNFNSRGYNNNSTPTHYHPNNRNHENFSYAKTKNVLNPLGFAPQTQENKKLEDLIGAFIAESSNRTTKLEEAVIAINTTVNGHSATIKNIETQLGQLEEAEEEPESED
YDTPTGEAEEDTSYDEDEKPEPEPPIPSPPLMALEMSQYNRFMRITLGVTPYGWCLHGSISRCHLSRLFFLSRNKLMATVGNRSFTGKELESQRALEACTFGSLWSTTET
STRVVDESTKVVDKTGYSTRVVDKTGYSTRVVDKASAFGQSHPSQELYLWTMRNYMLVYSAKDLILTGYTVIDVLIAIEAIRLGKFEAGLESVPNMTLPNARQACTLN
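
The CDS and protein sequences listed for this gene can be cross-checked by translating the CDS. The Python below is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the `translate` helper is a hypothetical name rather than a CuGenDB function. It translates the first 24 and last 18 nucleotides of the CDS shown on this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position. Using this table is an assumption.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 18 nt of the CDS sequence listed above.
print(translate("ATGGCTGACCAAAATCCACCTGAG"))  # MADQNPPE
print(translate("GCATGCACATTGAATTGA"))        # ACTLN
```

Both fragments agree with the start (MADQNPPE) and end (ACTLN) of the protein sequence above. Passing the full CDS to `translate` with the line breaks removed would extend the same check to the whole protein.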