; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032625 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032625
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr11:35403213..35409220
RNA-Seq ExpressionLag0032625
SyntenyLag0032625
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]5.1e-3942.96Show/hide
Query:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRKN-------
        MLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA+E KIELD+DEVAQ+N   +   S          QRK+       
Subjt:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRKN-------

Query:  DPKKLQPKRKRSKKFSQPRQ--------------PDLRLRSHQA-----------------SNYSSFSIPKN---------------------------E
        +P  ++ ++K     SQ ++              P   L  H                   +NY S+    N                            
Subjt:  DPKKLQPKRKRSKKFSQPRQ--------------PDLRLRSHQA-----------------SNYSSFSIPKN---------------------------E

Query:  YGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGS
            KEENQC   T  + SAF+RLS+S SKK R ST  FDRLK+TNDQ +R+M  L+ K F E N D K+HSR+PSRMKRK SV INTEGS
Subjt:  YGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGS

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]6.1e-4048.08Show/hide
Query:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKNDPKKLQPKRKRSKKF
        MLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  S   +  D   LQ +R  +   
Subjt:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKNDPKKLQPKRKRSKKF

Query:  SQPR-----QPDLRLRSHQASNYSSFSIPKNEYGRDK-------------------------------------EENQCSMSTSTRPSAFQRLSVSTSKK
          PR      P+  +    A + +S     N YG  K                                     EENQC     TR S  +RLS+ST KK
Subjt:  SQPR-----QPDLRLRSHQASNYSSFSIPKNEYGRDK-------------------------------------EENQCSMSTSTRPSAFQRLSVSTSKK

Query:  SRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGS
         R STS FDRLK+TNDQ +R+M + + K F E N D K+HS +PSRMKRK  V INTEGS
Subjt:  SRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGS

KAA0065608.1 retrotransposon gag protein [Cucumis melo var. makuwa]4.2e-4147.43Show/hide
Query:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKS----------KHQRKNDPKKL
        MLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI H +E+CFVLK+LILKLA + KIELD+DEVAQ+N A +   S          +   ++ P+++
Subjt:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKS----------KHQRKNDPKKL

Query:  -------------------------QPKRKRSKKFSQPRQPDLRLRSHQASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSV
                                      + + F   R   L  RS   S +   S+   E     EE QC  ST TR S F+RLS+STSKK R STS 
Subjt:  -------------------------QPKRKRSKKFSQPRQPDLRLRSHQASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSV

Query:  FDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGS
        FDRLK+TNDQ +++M +L+ K F E N D K+HSR+PSR KRK SV INTEGS
Subjt:  FDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGS

KAA0066166.1 Retrotransposon gag protein [Cucumis melo var. makuwa]3.3e-3845.53Show/hide
Query:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK-----GKSKHQRKNDPKK------
        MLEQLLE QLI+LPKCKRP++  KVDDP YCKYHRVI HPVE+CF+LK +ILKLA+E KIELD+ EVAQ+N   ++       S H   +  ++      
Subjt:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK-----GKSKHQRKNDPKK------

Query:  ------LQPKRKRSKKFSQPRQPDLRLRSH-----QASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKR
              ++     +         +++ R++     + S   S    +      +E+NQC MSTST+ SAF+RLS+STSK+ R  TS FDRLK+TNDQ +R
Subjt:  ------LQPKRKRSKKFSQPRQPDLRLRSH-----QASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKR

Query:  KMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSSLQFLL
        +M  L+ K F E N+D K+++R+PS MKRK  V INTE  S+   +
Subjt:  KMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSSLQFLL

TYK15207.1 Retrotransposon gag protein [Cucumis melo var. makuwa]1.5e-3845.53Show/hide
Query:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK-----GKSKHQRKNDPKK------
        MLEQLLE QLI+LPKCKRP++  KVDDP YCKYHRVI HPVE+CF+LK++ILKLA+E KIELD+ EVAQ+N   ++       S H   +  ++      
Subjt:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK-----GKSKHQRKNDPKK------

Query:  ------LQPKRKRSKKFSQPRQPDLRLRSH-----QASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKR
              ++     +         +++ R++     + S   S    +      +E+NQC MSTST+ SAF+RLS+STSK+ R  TS FDRLK+TNDQ +R
Subjt:  ------LQPKRKRSKKFSQPRQPDLRLRSH-----QASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKR

Query:  KMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSSLQFLL
        +M  L+ K F E N+D K+++R+PS MKRK  V INTE  S+   +
Subjt:  KMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSSLQFLL

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein2.5e-3942.96Show/hide
Query:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRKN-------
        MLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA+E KIELD+DEVAQ+N   +   S          QRK+       
Subjt:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRKN-------

Query:  DPKKLQPKRKRSKKFSQPRQ--------------PDLRLRSHQA-----------------SNYSSFSIPKN---------------------------E
        +P  ++ ++K     SQ ++              P   L  H                   +NY S+    N                            
Subjt:  DPKKLQPKRKRSKKFSQPRQ--------------PDLRLRSHQA-----------------SNYSSFSIPKN---------------------------E

Query:  YGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGS
            KEENQC   T  + SAF+RLS+S SKK R ST  FDRLK+TNDQ +R+M  L+ K F E N D K+HSR+PSRMKRK SV INTEGS
Subjt:  YGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGS

A0A5A7URH1 Ty3-gypsy retrotransposon protein2.9e-4048.08Show/hide
Query:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKNDPKKLQPKRKRSKKF
        MLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  S   +  D   LQ +R  +   
Subjt:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKNDPKKLQPKRKRSKKF

Query:  SQPR-----QPDLRLRSHQASNYSSFSIPKNEYGRDK-------------------------------------EENQCSMSTSTRPSAFQRLSVSTSKK
          PR      P+  +    A + +S     N YG  K                                     EENQC     TR S  +RLS+ST KK
Subjt:  SQPR-----QPDLRLRSHQASNYSSFSIPKNEYGRDK-------------------------------------EENQCSMSTSTRPSAFQRLSVSTSKK

Query:  SRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGS
         R STS FDRLK+TNDQ +R+M + + K F E N D K+HS +PSRMKRK  V INTEGS
Subjt:  SRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGS

A0A5A7VII4 Retrotransposon gag protein1.6e-3845.53Show/hide
Query:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK-----GKSKHQRKNDPKK------
        MLEQLLE QLI+LPKCKRP++  KVDDP YCKYHRVI HPVE+CF+LK +ILKLA+E KIELD+ EVAQ+N   ++       S H   +  ++      
Subjt:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK-----GKSKHQRKNDPKK------

Query:  ------LQPKRKRSKKFSQPRQPDLRLRSH-----QASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKR
              ++     +         +++ R++     + S   S    +      +E+NQC MSTST+ SAF+RLS+STSK+ R  TS FDRLK+TNDQ +R
Subjt:  ------LQPKRKRSKKFSQPRQPDLRLRSH-----QASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKR

Query:  KMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSSLQFLL
        +M  L+ K F E N+D K+++R+PS MKRK  V INTE  S+   +
Subjt:  KMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSSLQFLL

A0A5D3CA53 Retrotransposon gag protein2.0e-4147.43Show/hide
Query:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKS----------KHQRKNDPKKL
        MLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI H +E+CFVLK+LILKLA + KIELD+DEVAQ+N A +   S          +   ++ P+++
Subjt:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKS----------KHQRKNDPKKL

Query:  -------------------------QPKRKRSKKFSQPRQPDLRLRSHQASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSV
                                      + + F   R   L  RS   S +   S+   E     EE QC  ST TR S F+RLS+STSKK R STS 
Subjt:  -------------------------QPKRKRSKKFSQPRQPDLRLRSHQASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSV

Query:  FDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGS
        FDRLK+TNDQ +++M +L+ K F E N D K+HSR+PSR KRK SV INTEGS
Subjt:  FDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGS

A0A5D3CTF5 Retrotransposon gag protein7.2e-3945.53Show/hide
Query:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK-----GKSKHQRKNDPKK------
        MLEQLLE QLI+LPKCKRP++  KVDDP YCKYHRVI HPVE+CF+LK++ILKLA+E KIELD+ EVAQ+N   ++       S H   +  ++      
Subjt:  MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK-----GKSKHQRKNDPKK------

Query:  ------LQPKRKRSKKFSQPRQPDLRLRSH-----QASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKR
              ++     +         +++ R++     + S   S    +      +E+NQC MSTST+ SAF+RLS+STSK+ R  TS FDRLK+TNDQ +R
Subjt:  ------LQPKRKRSKKFSQPRQPDLRLRSH-----QASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKR

Query:  KMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSSLQFLL
        +M  L+ K F E N+D K+++R+PS MKRK  V INTE  S+   +
Subjt:  KMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSSLQFLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGAACAACTATTGGAAGCACAACTGATAGAGCTTCCTAAGTGTAAACGGCCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTAT
TGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTA
CAATCAAAGGAAAGAGCAAGCATCAAAGAAAGAATGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCGACAACCGGACCTCCGTCTT
CGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAAAGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGC
TTTCCAAAGGCTAAGTGTCTCCACATCAAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGG
AGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGAATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCTTCGCTG
CAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAA
GTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTC
TCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTCAAGGTCGAAGGTTCTCACTGCTGCGTTGCAGTTCTT
TCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCA
CGCGCTTCGCTGTAGTTCCTTCCCCCTAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCC
CCCAAGTTCGAAGGTTCTTACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTC
GCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCATATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGTTCCTTCCTCCAAGTT
CGAAGGTTCTCATGCGCTTCGTTGCTCCTTCCTCCAAATTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCG
CTCCTTCTCCAAGTTCGAAGGCGTTTCTCTCCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCT
CCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCC
TTCTCCAAGTTCGAAGGAGAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCT
CTACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTGAAGTTGACGGCGTCCGCTTCGCTTCATCTTCAAAAATTGACTGTTGATAACTTCACTTCA
TATTCAAAAGTTGACGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGAAGGCGAGTCGGGTCTGGT
GACCACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAA
TAAAATGGGGACTGGGTCTAGCAGGAGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTG
CATGAAGGCGAATCTGGTGACTACCCCTGCAGCCAGAGATCAGAGAACTCCGAGTCCAGAGAATTCTGCCAGAGTCCAGAGTCCAGAGTCACCAGAGTCCAGAGTCATCA
GAAGTCAGAGAGTCTAGAGAATTCAGAAGATCCAAGATTCAGAATTCAACCAACTCAAGACTTAGAAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCA
ACAGATCATCAAGCCAACAGGTCGATCCAAGAGATCATCAAGTCAGCAGGCTGATCATCCAAGAGGATCAACAAGCTAATAAGCCGATTCGACAGATCAACAAGCCAATC
GACCGATCAAGAAGATCAACAAACCGATCATCCAAGAGGATCAACAAGCTAACAAGTCGATCCAACAAATCAACAAGTCAACAGGCCGATCATCCAGGAAGATCAACAAG
CCAACAAGTCGATCCAAGAGATCACCACGCCAACAGGCCGATCATCCAAGAAGATCAACAAGCCAATAGGCCGATCCAAGAGATCATCAACCTAACAGACCGATCATCCA
GGAAGATCAACAAGCCAACAAGCCGATCCAAGAGATCATCACGCCAACAGGCCGATCATCCAAGAAGATCAACAAGGTTCAGAAATTCTACACTCTCACAAAGACAAGAG
TTCAGAGTTTCAAAGCTCTCAAGCAGAACCAGAGAATTCAGAGAGATTCCATCAAGTCTGAAGACCGAAGACTCTCTGCAATCCATAAGTCCAAGTGTTGAACACTTCTT
GAAGACCAAACACTCCTCAAGACTTCAACGCCTCTTGAAGACCAAACGCTCTTCAAGACCTCAACACTCCTTGAAGATCAAAAACTCTTCAGGACATCAACATTTCTTGA
AGACCAAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTCCAGGACATCAACACTTCTTGAAGACTGAAGACTCCTTCAAGACTAGAAGACTTCAAGCT
CCAAGAATCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGAACAACTATTGGAAGCACAACTGATAGAGCTTCCTAAGTGTAAACGGCCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTAT
TGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTA
CAATCAAAGGAAAGAGCAAGCATCAAAGAAAGAATGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCGACAACCGGACCTCCGTCTT
CGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAAAGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGC
TTTCCAAAGGCTAAGTGTCTCCACATCAAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGG
AGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGAATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCTTCGCTG
CAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAA
GTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTC
TCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTCAAGGTCGAAGGTTCTCACTGCTGCGTTGCAGTTCTT
TCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCA
CGCGCTTCGCTGTAGTTCCTTCCCCCTAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCC
CCCAAGTTCGAAGGTTCTTACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTC
GCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCATATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGTTCCTTCCTCCAAGTT
CGAAGGTTCTCATGCGCTTCGTTGCTCCTTCCTCCAAATTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCG
CTCCTTCTCCAAGTTCGAAGGCGTTTCTCTCCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCT
CCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCC
TTCTCCAAGTTCGAAGGAGAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCT
CTACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTGAAGTTGACGGCGTCCGCTTCGCTTCATCTTCAAAAATTGACTGTTGATAACTTCACTTCA
TATTCAAAAGTTGACGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGAAGGCGAGTCGGGTCTGGT
GACCACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAA
TAAAATGGGGACTGGGTCTAGCAGGAGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTG
CATGAAGGCGAATCTGGTGACTACCCCTGCAGCCAGAGATCAGAGAACTCCGAGTCCAGAGAATTCTGCCAGAGTCCAGAGTCCAGAGTCACCAGAGTCCAGAGTCATCA
GAAGTCAGAGAGTCTAGAGAATTCAGAAGATCCAAGATTCAGAATTCAACCAACTCAAGACTTAGAAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCA
ACAGATCATCAAGCCAACAGGTCGATCCAAGAGATCATCAAGTCAGCAGGCTGATCATCCAAGAGGATCAACAAGCTAATAAGCCGATTCGACAGATCAACAAGCCAATC
GACCGATCAAGAAGATCAACAAACCGATCATCCAAGAGGATCAACAAGCTAACAAGTCGATCCAACAAATCAACAAGTCAACAGGCCGATCATCCAGGAAGATCAACAAG
CCAACAAGTCGATCCAAGAGATCACCACGCCAACAGGCCGATCATCCAAGAAGATCAACAAGCCAATAGGCCGATCCAAGAGATCATCAACCTAACAGACCGATCATCCA
GGAAGATCAACAAGCCAACAAGCCGATCCAAGAGATCATCACGCCAACAGGCCGATCATCCAAGAAGATCAACAAGGTTCAGAAATTCTACACTCTCACAAAGACAAGAG
TTCAGAGTTTCAAAGCTCTCAAGCAGAACCAGAGAATTCAGAGAGATTCCATCAAGTCTGAAGACCGAAGACTCTCTGCAATCCATAAGTCCAAGTGTTGAACACTTCTT
GAAGACCAAACACTCCTCAAGACTTCAACGCCTCTTGAAGACCAAACGCTCTTCAAGACCTCAACACTCCTTGAAGATCAAAAACTCTTCAGGACATCAACATTTCTTGA
AGACCAAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTCCAGGACATCAACACTTCTTGAAGACTGAAGACTCCTTCAAGACTAGAAGACTTCAAGCT
CCAAGAATCCATTGA
Protein sequenceShow/hide protein sequence
MLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKNDPKKLQPKRKRSKKFSQPRQPDLRL
RSHQASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSSL
QFLLSKFEGPYTVRYCVVPSPSSKVLRCILLRCSFSKFEGSQLYDCYVVPPPSVKDLMWCVVALFPLLSSSMVLTQLCWSFFSPSSKVLTRSVAVPSFKVEGSHCCVAVL
SPQVRRFTHFAAVPSPKFEGSHALRSAIPSPKFEGSHALRCSSFPLSLKVLTRFAAVPSSKFKGSHALRCSSFPQVRRFLRALLQFLPHSSKVLTRFAAVPSPKFEGSHV
ASLQFLPPSLKVLISLRFALRFVAVPSSKFEVPSSKFEGSHALRCSFLQIRRFSHALLQLLPPSSKVPSRASLAPSPSSKAFLSAAPSPSSKALLSTAPSPSSKALLSTA
PSPSSKALLSTAPSPSSKALLSVATSPSSKALLSTAPSPSSKESSKALLSTAPSPSSKALLSTAPSPSSKALLSTAPSPSSKVLLSTPLLKLTASASLHLQKLTVDNFTS
YSKVDGNYSHQSDWSRQVVKSLQLNLMTTVEGESGLVTTPAGYSDHPIKWGLGLAGVHEANLVTTPAGYSDHPIKWGLGLAGECMKANLVTTPAGYSDHPIKWGLGLAGV
HEGESGDYPCSQRSENSESREFCQSPESRVTRVQSHQKSESLENSEDPRFRIQPTQDLEGRSSKRINKLTSRSNRSSSQQVDPRDHQVSRLIIQEDQQANKPIRQINKPI
DRSRRSTNRSSKRINKLTSRSNKSTSQQADHPGRSTSQQVDPRDHHANRPIIQEDQQANRPIQEIINLTDRSSRKINKPTSRSKRSSRQQADHPRRSTRFRNSTLSQRQE
FRVSKLSSRTREFREIPSSLKTEDSLQSISPSVEHFLKTKHSSRLQRLLKTKRSSRPQHSLKIKNSSGHQHFLKTKHSSRLQHSLKIKDSPGHQHFLKTEDSFKTRRLQA
PRIH