CuGenDBv2

Lag0005560 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005560
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr6:22024975..22033791
RNA-Seq Expression: Lag0005560
Synteny: Lag0005560
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
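
The genome location listed above (chr6:22024975..22033791) can be split into a sequence name and two coordinates for downstream use. The following is a minimal sketch, not part of the database record: the names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the page itself does not state.

```python
import re

# Assumed pattern for locations written as "seqid:start..end",
# e.g. "chr6:22024975..22033791".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Interval length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr6:22024975..22033791")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 8,817 bp.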


Homology
GenBank top hits (e value, % identity, alignment)
WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002] | e value: 8.5e-34 | % identity: 43.81
Query:  PINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTV
        PIN NNFELK GLIQMAR+ A+RG   EDP+ HL+SF                         + D+A+DWL++I P SITTW+ L QAFL K+FPPAK+ 
Subjt:  PINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTV

Query:  KLRTEIGHSNN-NMMSSCSSWERFKELLRKCLSMDTP----IGFRFN------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTP
        +LRTEIG            +WER+K+LLR+C     P    I   +N            +AGG++ SK  + A T+LED+AT SY WP ER++P
Subjt:  KLRTEIGHSNN-NMMSSCSSWERFKELLRKCLSMDTP----IGFRFN------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTP

XP_030443636.1 uncharacterized protein LOC115665966 [Syzygium oleosum] | e value: 4.3e-38 | % identity: 30.18
Query:  INANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTV
        I ANNFE+K  LIQM ++   + G P +DPN HL +F                         + DKA+ WL S+  GSITTW+ + Q FL K+FPPAK+ 
Subjt:  INANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTV

Query:  KLRTEI-GHSNNNMMSSCSSWERFKELLRKCLSMDTPIGFRFN----------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVLE
        K+R +I      +  S   +WERFKELLR+C     P+  + +                +AGGTL +K+ E A  LLE+MA NSYQWP ER + +K    
Subjt:  KLRTEI-GHSNNNMMSSCSSWERFKELLRKCLSMDTPIGFRFN----------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVLE

Query:  CTGSAQSIESAAALASRPQEETIEQVQGSEEDTSSDEAEKPEPEPPIPSPTLMVPKEK-------------------------------------KKKKK
              +  +A    + P   T      +  + S     +  P PP PS     P+EK                                     +K+K 
Subjt:  CTGSAQSIESAAALASRPQEETIEQVQGSEEDTSSDEAEKPEPEPPIPSPTLMVPKEK-------------------------------------KKKKK

Query:  KKN----NQALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSF----------DIG-----------------EIKS
          N    +   +MP Y +F+KE LA K K +  +TV L   CS  +Q K+P K+ DPGSF++PC+           D+G                 E K+
Subjt:  KKN----NQALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSF----------DIG-----------------EIKS

Query:  TPVKLQLADQSVVRPVGIVENVLI---------------------------RTIPRYWRVIIDIERRELTIRVKNEK---EIFKAVE
        T V LQLAD+S+  P GIVE+VL+                           R      R +ID+++ +L +RV++++   ++FKA++
Subjt:  TPVKLQLADQSVVRPVGIVENVLI---------------------------RTIPRYWRVIIDIERRELTIRVKNEK---EIFKAVE

XP_030443756.1 uncharacterized protein LOC115666104 [Syzygium oleosum] | e value: 2.0e-38 | % identity: 30.39
Query:  INANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTV
        I ANNFE+K  LIQM ++   + G P +DPN HL +F                         + DKA+ WL S+  GSITTW+ + Q FL K+FPPAK+ 
Subjt:  INANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTV

Query:  KLRTEI-GHSNNNMMSSCSSWERFKELLRKCLSMDTPIGFRFN----------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVLE
        K+R +I      +  S   +WERFKELLR+C     P+  + +                +AGGTL +K+ E A  LLE+MA NSYQWP ER + +K    
Subjt:  KLRTEI-GHSNNNMMSSCSSWERFKELLRKCLSMDTPIGFRFN----------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVLE

Query:  CTGSAQSIESAAALASRPQEETIEQVQGSEEDTSSDEAEKPEPEPPIPSPTLMVPKEK-------------------------------------KKKKK
              +  +A    + P   T      +  + S     +  P PP PS     P+EK                                     +K+K 
Subjt:  CTGSAQSIESAAALASRPQEETIEQVQGSEEDTSSDEAEKPEPEPPIPSPTLMVPKEK-------------------------------------KKKKK

Query:  KKN----NQALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS-----FD----------------------IGEIKS
          N    +   +MP Y +F+KE LA KRK +  +TV L   CS  +Q K+P K+ DPGSF++P +     FD                      +GE K+
Subjt:  KKN----NQALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS-----FD----------------------IGEIKS

Query:  TPVKLQLADQSVVRPVGIVENVLI---------------------------RTIPRYWRVIIDIERRELTIRVKNEK---EIFKAVE
        T V LQLAD+S+  P GIVE+VL+                           R      R +ID+++ +L +RV++++   ++FKA++
Subjt:  TPVKLQLADQSVVRPVGIVENVLI---------------------------RTIPRYWRVIIDIERRELTIRVKNEK---EIFKAVE

XP_030497486.1 uncharacterized protein LOC115713139 [Cannabis sativa] | e value: 5.3e-36 | % identity: 27.44
Query:  EAKGKIRKSTHYRDIKLVNLHENQLYSGEGNIGDCLCP--INANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKS------------------------
        +A G        RD  L NL  ++          C+ P  ++ANNFE+K  ++QM +    + G P+ED N HL +                        
Subjt:  EAKGKIRKSTHYRDIKLVNLHENQLYSGEGNIGDCLCP--INANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKS------------------------

Query:  FWIFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGH-SNNNMMSSCSSWERFKELLRKCLS----------------MDTPIGFRFNS
        F + ++A+ W  S+   SI TW+ L   FL KFFPPAK  KLR +I + S  +  S   +WERFK+LLRKC +                M+        +
Subjt:  FWIFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGH-SNNNMMSSCSSWERFKELLRKCLS----------------MDTPIGFRFNS

Query:  AGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKI--------VLECTGSAQSIESAAALASRPQEETIEQVQGSE-------------EDTSSDEAE
         GG  + K+   A  LLE+MA  + QW +ER   KK+        + + T   + + +  A  ++    +  +V+G E             +  S  + E
Subjt:  AGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKI--------VLECTGSAQSIESAAALASRPQEETIEQVQGSE-------------EDTSSDEAE

Query:  KPEPEPPIPSPT----------------------LMVPKEKKKKKKKKNNQ-------------------ALE-MPQYNRFMKEWLAKKRKEKKVDTVYL
          E E P+P+P                       + +P  ++ +K   + Q                   ALE MP Y +FMKE L+KKRK ++ + V L
Subjt:  KPEPEPPIPSPT----------------------LMVPKEKKKKKKKKNNQ-------------------ALE-MPQYNRFMKEWLAKKRKEKKVDTVYL

Query:  ASTCSTRVQQKVPEKVADPGSFSVPCS---------------------------FDIGEIKSTPVKLQLADQSVVRPVGIVENVLIR-------------
           CS  +Q+K+P K+ DPGSF++PCS                             +GE K T V LQ+AD+S+  P GI+E+VL++             
Subjt:  ASTCSTRVQQKVPEKVADPGSFSVPCS---------------------------FDIGEIKSTPVKLQLADQSVVRPVGIVENVLIR-------------

Query:  ------TIP--------RYWRVIIDIERRELTIRVKNEKEIFK
               IP           R +ID+++ EL +RV+ E+E FK
Subjt:  ------TIP--------RYWRVIIDIERRELTIRVKNEKEIFK

XP_038973113.1 uncharacterized protein LOC120105094 [Phoenix dactylifera] | e value: 9.7e-38 | % identity: 29.02
Query:  INANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVK
        +NANNFE+K GLIQM +   + G P EDP++HL +F                         + DKA+ WL S  P S T W+AL QAFL K+FPP KT K
Subjt:  INANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVK

Query:  LRTEI-GHSNNNMMSSCSSWERFKELLRKCLSMDTPIGFRFN---SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKI-----------------
        LR +I   +  +  S   +WERFK+L RKC     P   R     +AGGTL+SK++E A  LLE+MA+N+YQW +ER  PKK+                 
Subjt:  LRTEI-GHSNNNMMSSCSSWERFKELLRKCLSMDTPIGFRFN---SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKI-----------------

Query:  -VLECTGSAQS-----------------------------IESAAALAS-----------------RPQE------------------ETI-------EQ
         +++  GS+                               ++ +  L S                  P+E                  ETI       ++
Subjt:  -VLECTGSAQS-----------------------------IESAAALAS-----------------RPQE------------------ETI-------EQ

Query:  VQGSEEDTSSDEAEKPEPEPPIPSPTLMVPKEKKKKKKKKNNQ--------------------ALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRV
        V     +   D A+   P P +      +P  ++ K+ K + Q                      ++P Y +F+KE ++KKRK +  +T+ L   CS  +
Subjt:  VQGSEEDTSSDEAEKPEPEPPIPSPTLMVPKEKKKKKKKKNNQ--------------------ALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRV

Query:  QQKVPEKVADPGSFSVPCSF----------DIG-----------------EIKSTPVKLQLADQSVVRPVGIVENVLIRT----IPRYWRV---------
        Q K+P K+ DPGSFS+PC+           D+G                 E+K T + LQLAD+SV  P+G++ENVLI+     IP  + V         
Subjt:  QQKVPEKVADPGSFSVPCSF----------DIG-----------------EIKSTPVKLQLADQSVVRPVGIVENVLIRT----IPRYWRV---------

Query:  --------------IIDIERRELTIRVKNEKEIFKAVEDSK
                      IIDI+   LT++V  E+  F   E +K
Subjt:  --------------IIDIERRELTIRVKNEKEIFKAVEDSK

TrEMBL top hits (e value, % identity, alignment)
A0A2I4F4C8 uncharacterized protein LOC108995373 | e value: 4.0e-29 | % identity: 38.1
Query:  INANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVK
        INANNFELK  LI M +   +  SP +DPN HL  F +                         DKAR WLQS+  GSIT+W  + + FL KFFPPAKT +
Subjt:  INANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIF------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVK

Query:  LRTEIGH-SNNNMMSSCSSWERFKELLRKCLSMDTP----IGFRFN------------SAGGTLLSKTVENART-LLEDMATNSYQWPSERSTPKKI---
        LR+EI     N+  S   +WER+K L+R C     P    +   +N            +AGGTL+SKT+E A T LLE+M +N+YQWP+E++  KK+   
Subjt:  LRTEIGH-SNNNMMSSCSSWERFKELLRKCLSMDTP----IGFRFN------------SAGGTLLSKTVENART-LLEDMATNSYQWPSERSTPKKI---

Query:  -VLECTGSAQSIESAAALASRPQEETIEQVQ
         +   T       +A ++     E + EQVQ
Subjt:  -VLECTGSAQSIESAAALASRPQEETIEQVQ

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 | e value: 2.0e-28 | % identity: 25.39
Query:  INANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTV
        INANNFE+K   IQM +    + G P++DPNSHL +F                         + DKA+ WL S+  GSITTW+ L Q FL KFFPPAKT 
Subjt:  INANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTV

Query:  KLRTEI-GHSNNNMMSSCSSWERFKELLRKCLSMDTPIGFRFN----------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKK----
        K+R +I      +  S   +WERFKELLR+C     P   +                  +AGG L+SK   +A  LLE+MA+N+YQWPSERS  +K    
Subjt:  KLRTEI-GHSNNNMMSSCSSWERFKELLRKCLSMDTPIGFRFN----------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKK----

Query:  --------------------------------IVLECTGSAQSIES------------------------------------------------------
                                        +V E  G + S +                                                       
Subjt:  --------------------------------IVLECTGSAQSIES------------------------------------------------------

Query:  ------------------------------------------------AAALASRPQ------------------------------------EETIE--
                                                        A ++ +RPQ                                    E  IE  
Subjt:  ------------------------------------------------AAALASRPQ------------------------------------EETIE--

Query:  ----------QVQGSEEDTSSDEAEKPEPEPPIPSPTLMVPKEKKKKKKKKNN------------QALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTC
                  ++Q  ++D + ++       PP P P  +  ++ +K+ +K  N            +ALE MP Y +F+K+ L+KKRK  + +TV+L   C
Subjt:  ----------QVQGSEEDTSSDEAEKPEPEPPIPSPTLMVPKEKKKKKKKKNN------------QALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTC

Query:  STRVQQKVPEKVADPGSFSVPCS---------------------------FDIGEIKSTPVKLQLADQSVVRPVGIVENVLIR
        S  +Q K+P K+ DPGSF++PC+                             +GE K T V LQLAD+S V P GI+E+VL++
Subjt:  STRVQQKVPEKVADPGSFSVPCS---------------------------FDIGEIKSTPVKLQLADQSVVRPVGIVENVLIR

A0A6J0ZYV0 uncharacterized protein LOC110413413 | e value: 4.4e-28 | % identity: 38.53
Query:  INANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTV
        INANNFE+K   IQM +    + G P++DPNSHL +F                         + DKA+ WL S+  GSITTW+ L Q FL KFFPPAKT 
Subjt:  INANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTV

Query:  KLRTEI-GHSNNNMMSSCSSWERFKELLRKCLSMDTPIGFRFN----------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIV--
        K+R +I      +  S   +WERFKELLR+C     P   +                  +AGG L+SK   +A  LLE+MA+N+YQWPSERS  +K V  
Subjt:  KLRTEI-GHSNNNMMSSCSSWERFKELLRKCLSMDTPIGFRFN----------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIV--

Query:  LECTGSAQSIESAAALASRPQEETIEQVQGS
         E           AAL+ +     +  VQ S
Subjt:  LECTGSAQSIESAAALASRPQEETIEQVQGS

A0A6J1DU19 uncharacterized protein LOC111024361 | e value: 4.0e-29 | % identity: 30.04
Query:  NIGDCLCPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW-----------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEI
        ++G    PINANN ELK GLIQM R+  +RG+ TEDPN+HL  F            I D  R     + P S+   + +VQAFL  FFPPAKT +LRTEI
Subjt:  NIGDCLCPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW-----------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEI

Query:  -GHSNNNMMSSCSSWERFKELLRKCLSMDT----PIGFRFN------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKI--------V
              +       WER+KELLRKC          I   +N            +AGGTLLS+T ENA  LL+DMA NS+QWPSERS  KK+        +
Subjt:  -GHSNNNMMSSCSSWERFKELLRKCLSMDT----PIGFRFN------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKI--------V

Query:  LECTGSAQSIESAAALASRP-----------------QEETIEQVQ---------GSEED-------------------------------TSSDEAE--
               Q++ +A +  S P                  E TIEQ Q          S ED                               TS    E  
Subjt:  LECTGSAQSIESAAALASRP-----------------QEETIEQVQ---------GSEED-------------------------------TSSDEAE--

Query:  --------------------KPEPEPPIPSPTLMVPKEKKKKKKKKNNQAL---------------------------------------------EMPQ
                            + +P     + TL   KE ++ +KKK  + +                                             +MP 
Subjt:  --------------------KPEPEPPIPSPTLMVPKEKKKKKKKKNNQAL---------------------------------------------EMPQ

Query:  YNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFDIGEIKSTPVKLQLADQSV---VRPVGIVENVLIR
        Y RFMK+ +  KRK +  +TV L   CS  +Q+K+P+K+ DPGSF++PC+     I S+     L D      + P+G++E+VL++
Subjt:  YNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFDIGEIKSTPVKLQLADQSV---VRPVGIVENVLIR

A0A6P6XAQ1 Reverse transcriptase | e value: 5.2e-29 | % identity: 25.22
Query:  INANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVK
        +NANNFE+K  LIQM +   Y G+ TEDPNSHL +F                         + DKA+ WLQS  P + TTWD L +AFL KFFPP KT K
Subjt:  INANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFW------------------------IFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVK

Query:  LRTEI-GHSNNNMMSSCSSWERFKELLRKCLSMDTP----IGFRFN------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKK-----
        LR +I   S     +   +WER++EL R+C     P    +   +N            +AGG L+ KT E A+ L+E+MA N+YQW +ER   ++     
Subjt:  LRTEI-GHSNNNMMSSCSSWERFKELLRKCLSMDTP----IGFRFN------------SAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKK-----

Query:  -------------------------------IVLECT-----------GSAQSIESAAALASRPQEETIEQV---------------QGSEE--------
                                       +V  CT            S++ ++        PQ                      QG+++        
Subjt:  -------------------------------IVLECT-----------GSAQSIESAAALASRPQEETIEQV---------------QGSEE--------

Query:  ---------------------DTSSDEAEK--------------------------------------PEPEPPIPSPTLMVPKE---------------
                             + S+D+ EK                                         +  +PS T + P+E               
Subjt:  ---------------------DTSSDEAEK--------------------------------------PEPEPPIPSPTLMVPKE---------------

Query:  ---------------------KKKKKKKKNNQALE---------------MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSV
                             K+  K++K  + +E               +P Y +F+KE + KKRK    +T+ L   CS  +Q K+P K+ DPGSF+V
Subjt:  ---------------------KKKKKKKKNNQALE---------------MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSV

Query:  PCSF----------DIG-----------------EIKSTPVKLQLADQSVVRPVGIVENVLIR
        PC+           D+G                 E+K T + LQLAD+S+  P+GI+ENVLI+
Subjt:  PCSF----------DIG-----------------EIKSTPVKLQLADQSVVRPVGIVENVLIR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGACATTGGAAGAGAAACTGCCCAAAGTACCTTGCTGAGAAAAAGAAGGAAAATGAAGTTCCTTTTGACAGTTGAACGATTGGTGTCACAACGTCATCGTGACGCGGA
GCGGCCGACGTCCTTTCCTAAGGGTCTCTTTCGAGGAACCTCGTCGGAATCAACGGGCAAGCAGTGGCAGCAAGGTGCGGCGGTGAGCGACGGAAGCGGCGGCAGTGGTG
TGGCGGAAATGGTTGATTGGAGCTATAATGCAATAGTGGAGTTAATCGGGTGCTCGGGACGCGAAAAGATGCAAAGGAAGGAAAGGAATCAAAAGGGAAAAAAGTCAAAA
TTCGGTCAAAAGGTTTTTGGGGAGCCAATTTTGGGACTTCTTGGAGCCGTAAACAGAGCAAAATCAGAGGAATTCAAGGCTGAAGCAAAGGGGAAAATTCGGAAATCAAC
CCATTATAGGGATATCAAGTTGGTTAATTTGCATGAGAATCAATTATACTCGGGAGAGGGCAACATCGGGGATTGTCTATGCCCGATCAATGCCAACAACTTTGAGCTGA
AGACCGGTCTCATTCAGATGGCTCGAGACTGTGCATATAGAGGATCGCCCACCGAGGATCCAAATTCTCATCTTAAATCTTTTTGGATATTTGATAAAGCACGAGATTGG
TTGCAGTCTATTACCCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTGAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGG
ACATTCCAACAACAATATGATGAGCAGTTGTTCGAGCTGGGAGCGATTTAAAGAGTTGCTGAGGAAGTGCCTCAGCATGGATACCCCGATTGGCTTCAGGTTCAATTCTG
CAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAATGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACCTAAAAAGATT
GTGCTGGAGTGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTGGCATCTAGACCTCAGGAGGAGACCATTGAACAGGTTCAGGGAAGTGAGGAGGACACATC
ATCAGATGAGGCTGAAAAGCCTGAACCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGCATTAG
AGATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACTGTTTATCTTGCTTCCACATGCAGCACCAGAGTACAACAG
AAAGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGT
GGTTAGACCAGTTGGCATTGTAGAAAATGTTTTAATCAGAACCATTCCTCGCTACTGGCGAGTGATTATAGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAAGAACG
AAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTTCATGGGATATAGGAAAGGTGCAAGAAGAGCACCTCTGTTGGATTCACAGAACAAAAGCCCC
TTGAAGCACGATCAACACGTCGAGCTAATGACGTTAAACAAGCGCTTATGGGAGGCAACCCAAGAAATGGTTGATTGGAGCTATAATGCAATAGTGGAGTTAATCGGGTG
CTCGGGACGCGAAAAGATGCAAAGGAAGGAAAAGAATCAAAAGGGAAAAAAGTCAAAATTCGGTCAAAAGGTGACTAGCATCTCGACGCTAGCCCTTAGGCGTCTCGACG
CTAGCATTCCTTATTCGGATAGGCGCGAAATCGTCGCAGCGTCGAGACGCTGCGACCTAGTGTCGAGACGCTGA
mRNA sequence
ATGGACATTGGAAGAGAAACTGCCCAAAGTACCTTGCTGAGAAAAAGAAGGAAAATGAAGTTCCTTTTGACAGTTGAACGATTGGTGTCACAACGTCATCGTGACGCGGA
GCGGCCGACGTCCTTTCCTAAGGGTCTCTTTCGAGGAACCTCGTCGGAATCAACGGGCAAGCAGTGGCAGCAAGGTGCGGCGGTGAGCGACGGAAGCGGCGGCAGTGGTG
TGGCGGAAATGGTTGATTGGAGCTATAATGCAATAGTGGAGTTAATCGGGTGCTCGGGACGCGAAAAGATGCAAAGGAAGGAAAGGAATCAAAAGGGAAAAAAGTCAAAA
TTCGGTCAAAAGGTTTTTGGGGAGCCAATTTTGGGACTTCTTGGAGCCGTAAACAGAGCAAAATCAGAGGAATTCAAGGCTGAAGCAAAGGGGAAAATTCGGAAATCAAC
CCATTATAGGGATATCAAGTTGGTTAATTTGCATGAGAATCAATTATACTCGGGAGAGGGCAACATCGGGGATTGTCTATGCCCGATCAATGCCAACAACTTTGAGCTGA
AGACCGGTCTCATTCAGATGGCTCGAGACTGTGCATATAGAGGATCGCCCACCGAGGATCCAAATTCTCATCTTAAATCTTTTTGGATATTTGATAAAGCACGAGATTGG
TTGCAGTCTATTACCCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTGAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGG
ACATTCCAACAACAATATGATGAGCAGTTGTTCGAGCTGGGAGCGATTTAAAGAGTTGCTGAGGAAGTGCCTCAGCATGGATACCCCGATTGGCTTCAGGTTCAATTCTG
CAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAATGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACCTAAAAAGATT
GTGCTGGAGTGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTGGCATCTAGACCTCAGGAGGAGACCATTGAACAGGTTCAGGGAAGTGAGGAGGACACATC
ATCAGATGAGGCTGAAAAGCCTGAACCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGCATTAG
AGATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACTGTTTATCTTGCTTCCACATGCAGCACCAGAGTACAACAG
AAAGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGT
GGTTAGACCAGTTGGCATTGTAGAAAATGTTTTAATCAGAACCATTCCTCGCTACTGGCGAGTGATTATAGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAAGAACG
AAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTTCATGGGATATAGGAAAGGTGCAAGAAGAGCACCTCTGTTGGATTCACAGAACAAAAGCCCC
TTGAAGCACGATCAACACGTCGAGCTAATGACGTTAAACAAGCGCTTATGGGAGGCAACCCAAGAAATGGTTGATTGGAGCTATAATGCAATAGTGGAGTTAATCGGGTG
CTCGGGACGCGAAAAGATGCAAAGGAAGGAAAAGAATCAAAAGGGAAAAAAGTCAAAATTCGGTCAAAAGGTGACTAGCATCTCGACGCTAGCCCTTAGGCGTCTCGACG
CTAGCATTCCTTATTCGGATAGGCGCGAAATCGTCGCAGCGTCGAGACGCTGCGACCTAGTGTCGAGACGCTGA
Protein sequence
MDIGRETAQSTLLRKRRKMKFLLTVERLVSQRHRDAERPTSFPKGLFRGTSSESTGKQWQQGAAVSDGSGGSGVAEMVDWSYNAIVELIGCSGREKMQRKERNQKGKKSK
FGQKVFGEPILGLLGAVNRAKSEEFKAEAKGKIRKSTHYRDIKLVNLHENQLYSGEGNIGDCLCPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFWIFDKARDW
LQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGHSNNNMMSSCSSWERFKELLRKCLSMDTPIGFRFNSAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKI
VLECTGSAQSIESAAALASRPQEETIEQVQGSEEDTSSDEAEKPEPEPPIPSPTLMVPKEKKKKKKKKNNQALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQ
KVPEKVADPGSFSVPCSFDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRTIPRYWRVIIDIERRELTIRVKNEKEIFKAVEDSKDEVLFMGYRKGARRAPLLDSQNKSP
LKHDQHVELMTLNKRLWEATQEMVDWSYNAIVELIGCSGREKMQRKEKNQKGKKSKFGQKVTSISTLALRRLDASIPYSDRREIVAASRRCDLVSRR
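
The relationship between the CDS and the protein sequence can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the page does not say which table was used, and the helper name `translate` is hypothetical. The example input is the first 33 nucleotides of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
# ASSUMPTION: this is the table that applies to the record above.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons that are not in the
    table (for example ones containing N) are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS listed above.
print(translate("ATGGACATTGGAAGAGAAACTGCCCAAAGTACC"))
```

The printed peptide, MDIGRETAQST, matches the first eleven residues of the protein sequence above. Passing the full CDS, with its line breaks, to `translate` works the same way because whitespace is stripped first.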