; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000559 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000559
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr4:9912163..9915051
RNA-Seq ExpressionLag0000559
SyntenyLag0000559
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.8e-5045.54Show/hide
Query:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDD-------RVVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSNLATIKERSK----
        ERQKK+Y F DSD+ DMLEQL+E QL+ LP+CK+PE++ KVDD       RV+ H VE+CFVLK+LI KLA+E KIE D+DEVAQ+N   +   S     
Subjt:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDD-------RVVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSNLATIKERSK----

Query:  ----HQRK-------------KDPKKLQPKRKKSKKSTQPQQLVMLNETFSKTFHKKEKENF-------------ATSYYIDVEEVDNFENGEQRTSVFN
             QRK             +  +K      ++K+     +     E   ++F +   E                 + Y   EE+DN    +QRTSVF+
Subjt:  ----HQRK-------------KDPKKLQPKRKKSKKSTQPQQLVMLNETFSKTFHKKEKENF-------------ATSYYIDVEEVDNFENGEQRTSVFN

Query:  RIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAFQRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSV
         IKP TTR SVFQR+SM  K+E+NQC   T A+ SAF+RLS+S SKK R ST  FD LK+T+DQ +R+M  L+ K F E N+D K HS + SRMKRK SV
Subjt:  RIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAFQRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSV

Query:  LINTKGSLKVLTRF
         INT+GSL V  RF
Subjt:  LINTKGSLKVLTRF

KAA0040811.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.5e-5141.38Show/hide
Query:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDD-------RVVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSN-------------
        ERQKK+Y F DSD+ DMLEQL+E QL+ LPKCK+PE+  KVDD       RV+ H VE+CFVLK+LILKL +E KIE D+DEVAQ+N             
Subjt:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDD-------RVVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSN-------------

Query:  -----------------------------------------------------------------------LATIKERSKHQRKKDPKKLQPKRKKSKKS
                                                                                  I  + K +R K   K +P + K +  
Subjt:  -----------------------------------------------------------------------LATIKERSKHQRKKDPKKLQPKRKKSKKS

Query:  TQPQQLVMLNETFSKTF---HKKEKENFATSY----------YIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAF
         QP+Q + L E F ++F   H KE       +          Y   EEVDN    +QRTSVF+RIKP TTR SVFQR+S+  KEE+NQC  ST  R SAF
Subjt:  TQPQQLVMLNETFSKTF---HKKEKENFATSY----------YIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAF

Query:  QRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF
        + LS+STSKK R STS FD LK+ +DQ +R+M +L+VK F E N+D K HS + SRMKRK SV INT+GSL V  RF
Subjt:  QRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF

KAA0055462.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.8e-5040.58Show/hide
Query:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDDR-------VVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSN-------------
        ERQKK+Y F DSD+ DMLEQL++ QL+ L +CK+PE+  KVDD        V+ H VE+CFVLK+LILKLA+E KIE D+DEVAQ+N             
Subjt:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDDR-------VVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSN-------------

Query:  -----------------------------------------------------------------------LATIKERSKHQRKKDPKKLQPKRKKSKKS
                                                                                  I  + K +R K   K +P + K +  
Subjt:  -----------------------------------------------------------------------LATIKERSKHQRKKDPKKLQPKRKKSKKS

Query:  TQPQQLVMLNETFSKTFHKKEKENF-------------ATSYYIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAF
         QP++ + L E  S++F +   E                 + Y   EEVDN    +QRTSVF+RIKP TTR SVFQR+SM  KEEKNQC  ST AR SAF
Subjt:  TQPQQLVMLNETFSKTFHKKEKENF-------------ATSYYIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAF

Query:  QRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF
        +RLS+STSKK R STS FD LK+T+DQ +R+M +L+ K F E N+D K H+ + SRMKRK SV INT+GSL V  RF
Subjt:  QRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.0e-5249.65Show/hide
Query:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDD-------RVVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSNLATIKERSKHQRK
        ERQ+K+Y F DSD+ DMLEQL+E QL+ LP+CK+PE+  KVDD       RV+ HPVE+CFVLK+LILKLA+E KIE D+DEVAQ+N A I+  S   + 
Subjt:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDD-------RVVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSNLATIKERSKHQRK

Query:  KDPKKLQPKRK--------KSKKSTQPQQLVMLNETFSKTFHKKEKENFATSYYIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQC
        KD   LQ +R         +S     P++++ +    + +  + +  N+ +S     +EV+N     QRTSVF+RIKPSTTR SVFQR+S+  KEE+NQC
Subjt:  KDPKKLQPKRK--------KSKKSTQPQQLVMLNETFSKTFHKKEKENFATSYYIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQC

Query:  SVSTSARPSAFQRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF
              R S  +RLS+ST KK R STS FD LK+T+DQ +R+M + + K F E N+D K HS + SRMKRK  V INT+GSL V  RF
Subjt:  SVSTSARPSAFQRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF

TYK08944.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.8e-5040.58Show/hide
Query:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDDR-------VVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSN-------------
        ERQKK+Y F DSD+ DMLEQL++ QL+ L +CK+PE+  KVDD        V+ H VE+CFVLK+LILKLA+E KIE D+DEVAQ+N             
Subjt:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDDR-------VVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSN-------------

Query:  -----------------------------------------------------------------------LATIKERSKHQRKKDPKKLQPKRKKSKKS
                                                                                  I  + K +R K   K +P + K +  
Subjt:  -----------------------------------------------------------------------LATIKERSKHQRKKDPKKLQPKRKKSKKS

Query:  TQPQQLVMLNETFSKTFHKKEKENF-------------ATSYYIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAF
         QP++ + L E  S++F +   E                 + Y   EEVDN    +QRTSVF+RIKP TTR SVFQR+SM  KEEKNQC  ST AR SAF
Subjt:  TQPQQLVMLNETFSKTFHKKEKENF-------------ATSYYIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAF

Query:  QRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF
        +RLS+STSKK R STS FD LK+T+DQ +R+M +L+ K F E N+D K H+ + SRMKRK SV INT+GSL V  RF
Subjt:  QRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein1.4e-5045.54Show/hide
Query:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDD-------RVVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSNLATIKERSK----
        ERQKK+Y F DSD+ DMLEQL+E QL+ LP+CK+PE++ KVDD       RV+ H VE+CFVLK+LI KLA+E KIE D+DEVAQ+N   +   S     
Subjt:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDD-------RVVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSNLATIKERSK----

Query:  ----HQRK-------------KDPKKLQPKRKKSKKSTQPQQLVMLNETFSKTFHKKEKENF-------------ATSYYIDVEEVDNFENGEQRTSVFN
             QRK             +  +K      ++K+     +     E   ++F +   E                 + Y   EE+DN    +QRTSVF+
Subjt:  ----HQRK-------------KDPKKLQPKRKKSKKSTQPQQLVMLNETFSKTFHKKEKENF-------------ATSYYIDVEEVDNFENGEQRTSVFN

Query:  RIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAFQRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSV
         IKP TTR SVFQR+SM  K+E+NQC   T A+ SAF+RLS+S SKK R ST  FD LK+T+DQ +R+M  L+ K F E N+D K HS + SRMKRK SV
Subjt:  RIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAFQRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSV

Query:  LINTKGSLKVLTRF
         INT+GSL V  RF
Subjt:  LINTKGSLKVLTRF

A0A5A7TGM1 Retrotransposon gag protein7.3e-5241.38Show/hide
Query:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDD-------RVVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSN-------------
        ERQKK+Y F DSD+ DMLEQL+E QL+ LPKCK+PE+  KVDD       RV+ H VE+CFVLK+LILKL +E KIE D+DEVAQ+N             
Subjt:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDD-------RVVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSN-------------

Query:  -----------------------------------------------------------------------LATIKERSKHQRKKDPKKLQPKRKKSKKS
                                                                                  I  + K +R K   K +P + K +  
Subjt:  -----------------------------------------------------------------------LATIKERSKHQRKKDPKKLQPKRKKSKKS

Query:  TQPQQLVMLNETFSKTF---HKKEKENFATSY----------YIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAF
         QP+Q + L E F ++F   H KE       +          Y   EEVDN    +QRTSVF+RIKP TTR SVFQR+S+  KEE+NQC  ST  R SAF
Subjt:  TQPQQLVMLNETFSKTF---HKKEKENFATSY----------YIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAF

Query:  QRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF
        + LS+STSKK R STS FD LK+ +DQ +R+M +L+VK F E N+D K HS + SRMKRK SV INT+GSL V  RF
Subjt:  QRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF

A0A5A7UI09 Retrotransposon gag protein1.4e-5040.58Show/hide
Query:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDDR-------VVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSN-------------
        ERQKK+Y F DSD+ DMLEQL++ QL+ L +CK+PE+  KVDD        V+ H VE+CFVLK+LILKLA+E KIE D+DEVAQ+N             
Subjt:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDDR-------VVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSN-------------

Query:  -----------------------------------------------------------------------LATIKERSKHQRKKDPKKLQPKRKKSKKS
                                                                                  I  + K +R K   K +P + K +  
Subjt:  -----------------------------------------------------------------------LATIKERSKHQRKKDPKKLQPKRKKSKKS

Query:  TQPQQLVMLNETFSKTFHKKEKENF-------------ATSYYIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAF
         QP++ + L E  S++F +   E                 + Y   EEVDN    +QRTSVF+RIKP TTR SVFQR+SM  KEEKNQC  ST AR SAF
Subjt:  TQPQQLVMLNETFSKTFHKKEKENF-------------ATSYYIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAF

Query:  QRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF
        +RLS+STSKK R STS FD LK+T+DQ +R+M +L+ K F E N+D K H+ + SRMKRK SV INT+GSL V  RF
Subjt:  QRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF

A0A5A7URH1 Ty3-gypsy retrotransposon protein1.9e-5249.65Show/hide
Query:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDD-------RVVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSNLATIKERSKHQRK
        ERQ+K+Y F DSD+ DMLEQL+E QL+ LP+CK+PE+  KVDD       RV+ HPVE+CFVLK+LILKLA+E KIE D+DEVAQ+N A I+  S   + 
Subjt:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDD-------RVVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSNLATIKERSKHQRK

Query:  KDPKKLQPKRK--------KSKKSTQPQQLVMLNETFSKTFHKKEKENFATSYYIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQC
        KD   LQ +R         +S     P++++ +    + +  + +  N+ +S     +EV+N     QRTSVF+RIKPSTTR SVFQR+S+  KEE+NQC
Subjt:  KDPKKLQPKRK--------KSKKSTQPQQLVMLNETFSKTFHKKEKENFATSYYIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQC

Query:  SVSTSARPSAFQRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF
              R S  +RLS+ST KK R STS FD LK+T+DQ +R+M + + K F E N+D K HS + SRMKRK  V INT+GSL V  RF
Subjt:  SVSTSARPSAFQRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF

A0A5D3CCI8 Retrotransposon gag protein1.4e-5040.58Show/hide
Query:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDDR-------VVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSN-------------
        ERQKK+Y F DSD+ DMLEQL++ QL+ L +CK+PE+  KVDD        V+ H VE+CFVLK+LILKLA+E KIE D+DEVAQ+N             
Subjt:  ERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDDR-------VVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSN-------------

Query:  -----------------------------------------------------------------------LATIKERSKHQRKKDPKKLQPKRKKSKKS
                                                                                  I  + K +R K   K +P + K +  
Subjt:  -----------------------------------------------------------------------LATIKERSKHQRKKDPKKLQPKRKKSKKS

Query:  TQPQQLVMLNETFSKTFHKKEKENF-------------ATSYYIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAF
         QP++ + L E  S++F +   E                 + Y   EEVDN    +QRTSVF+RIKP TTR SVFQR+SM  KEEKNQC  ST AR SAF
Subjt:  TQPQQLVMLNETFSKTFHKKEKENF-------------ATSYYIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAF

Query:  QRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF
        +RLS+STSKK R STS FD LK+T+DQ +R+M +L+ K F E N+D K H+ + SRMKRK SV INT+GSL V  RF
Subjt:  QRLSVSTSKKSRLSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATGGAAAGACAGAAGAAAATCTATCATTTCCTTGATTCTGACATCCCTGACATGCTGGAACAACTAATGGAAGCACAACTGATGGGGCTTCCTAAGTGTAAACA
ACCCGAAGAGATGGAGAAAGTCGATGATCGAGTTGTTGGTCATCCAGTGGAAAGATGCTTCGTCCTAAAGGACTTAATTCTAAAGCTAGCTAAGGAAGGAAAAATTGAGT
TCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAGGAGCAAACATCAAAGAAAAAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAAAAGTAAA
AAGTCTACTCAACCTCAACAATTGGTGATGTTGAACGAAACCTTCTCCAAAACATTCCACAAAAAGGAAAAAGAAAACTTTGCAACTTCCTACTACATCGACGTAGAAGA
AGTTGACAATTTTGAGAATGGTGAACAAAGGACTTCTGTCTTCAATCGCATCAAGCCTTCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGTCGCGAAAGAAG
AAAAAAATCAATGTTCAGTGTCCACCTCCGCTCGACCTTCAGCTTTCCAAAGGTTAAGTGTTTCCACATCGAAGAAAAGTCGACTTTCAACATCTGTTTTTGATTGTCTC
AAAGTAACTAGTGATCAACCTAAAAGAAAGATGGACAACTTGGAGGTGAAACTTTTCGATGAAGTAAACAACGACAAGAAGTTTCATAGTAGCATCTCGTCACGTATGAA
AAGGAAGTTCTCTGTTCTCATAAACACAAAAGGTTCCTTGAAGGTTCTCACTCGCTTCGCTGCAGTTCCTTCTCTCCAAGTTCAAAGGTTCTCACGCGCTTTGCTACAGT
TTCCTTCTCTCCAAGTTCGAAGGTTCTCACACGCTTTGCTGCAATTCCTTTTCTCCAAGTTTGAAAGTTCTCATGCGTTTCGCTGCAGTTTCTTCTCTCCAAGTTCGAAA
GTTCTCCGCGTTTCGCTGCAGTTCCTTCTCCTCAAAAGTTTGAAGTTCGAAGGTTTTCACGCCGCTTCGTTGCAATTCCTTTTCCCCAAGTTCGAAGGTTCTCACGCGCT
TAACGCGCTTCGCAGCAATTCTTTCTCCCCAAGTTCAAAGGTTCTCACGCGCTTAAAGCGCTACGCTGCAGTTTCTCTCTCCAAGTTCGAAGGTTCTCATGTTGCTACGC
TGCTAGTCCCTTCTCCCTCCAAGTTTGAAGGTTCTCACGCTACTAGTCTCTTCTCCCTCCAAGTTCGAAGTTCCTTCTCTTCAAGTTCGAAGGTTCACGCGGCGTTGCTT
CATTGTTCCTTCTCCAAGTTCGTAGGTTCTCACACTACGTTTCTTCCTTCTCAAAATTTGAAGGATCTCACGTTGTTTCGTAGTTTTTCCTTTTCCAAGTTCGAAGGCGT
TGTACGCTGTTGTGCTGCTCATTCTCTAAGTTCGAAGGTTCTCACATTGCGCTGCTTCCTTCACCAAGTTCGAAAGTTTTCACACTACTGTGTTGTCCCTTCTCCAAGTT
CAAAGGTTCTTAGTTGTACATTGTTGTGTTGCTCCTTCTCCAAGTTCGAAGGTTTTCACGTTCTACGTTGCTACCCAATTTCTTCTCCAAGTTTGAAGGCTCTCACATGC
TTTCCTATTGTTCCTTCTCCAAGTTCGAAGAAACAGAAAAGTCTGAGTCTATCGCAGGTCGTTTTGAATGAGCTTTTCACTCAACTAATTGGTTGTTTTGACTCCGAAAC
TGACTGGGCTACCTACCAACACCATTTTCGTGACTCCTCAACCACTTCCCACACTATAAAAGGCCTCGATTCGAAGGAAAACATAAGCAAAATTCACTTGGAAGAAAAGG
GTGGTGACAGAACCCTGGCTCGACGCCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATGGAAAGACAGAAGAAAATCTATCATTTCCTTGATTCTGACATCCCTGACATGCTGGAACAACTAATGGAAGCACAACTGATGGGGCTTCCTAAGTGTAAACA
ACCCGAAGAGATGGAGAAAGTCGATGATCGAGTTGTTGGTCATCCAGTGGAAAGATGCTTCGTCCTAAAGGACTTAATTCTAAAGCTAGCTAAGGAAGGAAAAATTGAGT
TCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAGGAGCAAACATCAAAGAAAAAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAAAAGTAAA
AAGTCTACTCAACCTCAACAATTGGTGATGTTGAACGAAACCTTCTCCAAAACATTCCACAAAAAGGAAAAAGAAAACTTTGCAACTTCCTACTACATCGACGTAGAAGA
AGTTGACAATTTTGAGAATGGTGAACAAAGGACTTCTGTCTTCAATCGCATCAAGCCTTCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGTCGCGAAAGAAG
AAAAAAATCAATGTTCAGTGTCCACCTCCGCTCGACCTTCAGCTTTCCAAAGGTTAAGTGTTTCCACATCGAAGAAAAGTCGACTTTCAACATCTGTTTTTGATTGTCTC
AAAGTAACTAGTGATCAACCTAAAAGAAAGATGGACAACTTGGAGGTGAAACTTTTCGATGAAGTAAACAACGACAAGAAGTTTCATAGTAGCATCTCGTCACGTATGAA
AAGGAAGTTCTCTGTTCTCATAAACACAAAAGGTTCCTTGAAGGTTCTCACTCGCTTCGCTGCAGTTCCTTCTCTCCAAGTTCAAAGGTTCTCACGCGCTTTGCTACAGT
TTCCTTCTCTCCAAGTTCGAAGGTTCTCACACGCTTTGCTGCAATTCCTTTTCTCCAAGTTTGAAAGTTCTCATGCGTTTCGCTGCAGTTTCTTCTCTCCAAGTTCGAAA
GTTCTCCGCGTTTCGCTGCAGTTCCTTCTCCTCAAAAGTTTGAAGTTCGAAGGTTTTCACGCCGCTTCGTTGCAATTCCTTTTCCCCAAGTTCGAAGGTTCTCACGCGCT
TAACGCGCTTCGCAGCAATTCTTTCTCCCCAAGTTCAAAGGTTCTCACGCGCTTAAAGCGCTACGCTGCAGTTTCTCTCTCCAAGTTCGAAGGTTCTCATGTTGCTACGC
TGCTAGTCCCTTCTCCCTCCAAGTTTGAAGGTTCTCACGCTACTAGTCTCTTCTCCCTCCAAGTTCGAAGTTCCTTCTCTTCAAGTTCGAAGGTTCACGCGGCGTTGCTT
CATTGTTCCTTCTCCAAGTTCGTAGGTTCTCACACTACGTTTCTTCCTTCTCAAAATTTGAAGGATCTCACGTTGTTTCGTAGTTTTTCCTTTTCCAAGTTCGAAGGCGT
TGTACGCTGTTGTGCTGCTCATTCTCTAAGTTCGAAGGTTCTCACATTGCGCTGCTTCCTTCACCAAGTTCGAAAGTTTTCACACTACTGTGTTGTCCCTTCTCCAAGTT
CAAAGGTTCTTAGTTGTACATTGTTGTGTTGCTCCTTCTCCAAGTTCGAAGGTTTTCACGTTCTACGTTGCTACCCAATTTCTTCTCCAAGTTTGAAGGCTCTCACATGC
TTTCCTATTGTTCCTTCTCCAAGTTCGAAGAAACAGAAAAGTCTGAGTCTATCGCAGGTCGTTTTGAATGAGCTTTTCACTCAACTAATTGGTTGTTTTGACTCCGAAAC
TGACTGGGCTACCTACCAACACCATTTTCGTGACTCCTCAACCACTTCCCACACTATAAAAGGCCTCGATTCGAAGGAAAACATAAGCAAAATTCACTTGGAAGAAAAGG
GTGGTGACAGAACCCTGGCTCGACGCCAATGA
Protein sequenceShow/hide protein sequence
MKMERQKKIYHFLDSDIPDMLEQLMEAQLMGLPKCKQPEEMEKVDDRVVGHPVERCFVLKDLILKLAKEGKIEFDLDEVAQSNLATIKERSKHQRKKDPKKLQPKRKKSK
KSTQPQQLVMLNETFSKTFHKKEKENFATSYYIDVEEVDNFENGEQRTSVFNRIKPSTTRPSVFQRMSMVAKEEKNQCSVSTSARPSAFQRLSVSTSKKSRLSTSVFDCL
KVTSDQPKRKMDNLEVKLFDEVNNDKKFHSSISSRMKRKFSVLINTKGSLKVLTRFAAVPSLQVQRFSRALLQFPSLQVRRFSHALLQFLFSKFESSHAFRCSFFSPSSK
VLRVSLQFLLLKSLKFEGFHAASLQFLFPKFEGSHALNALRSNSFSPSSKVLTRLKRYAAVSLSKFEGSHVATLLVPSPSKFEGSHATSLFSLQVRSSFSSSSKVHAALL
HCSFSKFVGSHTTFLPSQNLKDLTLFRSFSFSKFEGVVRCCAAHSLSSKVLTLRCFLHQVRKFSHYCVVPSPSSKVLSCTLLCCSFSKFEGFHVLRCYPISSPSLKALTC
FPIVPSPSSKKQKSLSLSQVVLNELFTQLIGCFDSETDWATYQHHFRDSSTTSHTIKGLDSKENISKIHLEEKGGDRTLARRQ