; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007788 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007788
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:4704891..4706793
RNA-Seq ExpressionLag0007788
SyntenyLag0007788
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043186.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]8.6e-5950.79Show/hide
Query:  QIIDENTTFSIWRKLQALYIKKDVPSKIYHRETLFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLG
        ++I+E   F+ W KL++LY KKD+P+K++ RE LF+FKMN +K+LDENLDEFKK T     T +KLG A EA ILINS+ + YK+VK ALKY R++IT+ 
Subjt:  QIIDENTTFSIWRKLQALYIKKDVPSKIYHRETLFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLG

Query:  SVVAAIRCKELELKSENKGSGGAESLFSKGKTHNKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR--------------------------RNGD
        SV+ A++ KELELK+ENK S  AESLFSKGK   +KN   K  ++ K K +LKC+ICHKEGHFKRNCP R                          +  D
Subjt:  SVVAAIRCKELELKSENKGSGGAESLFSKGKTHNKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR--------------------------RNGD

Query:  FRKGKEHGRGDVSIGENTFEYSEVLATTEGKAIKQGVGNKEDWVINSGCTYHMT
         R+G+EHGR    +G   FEY+EVL  T  KA++     +EDWV++SGCTYHMT
Subjt:  FRKGKEHGRGDVSIGENTFEYSEVLATTEGKAIKQGVGNKEDWVINSGCTYHMT

KAA0051442.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.8e-7252.54Show/hide
Query:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET
        EIEKF+G+GDF LW K+I AIL  QK LKALEDPK LPAT T  E++T+EEVAYSTLI+N++D+VLRQ+I+E T F+ W KL++LY KKD+P+K++ +E 
Subjt:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET

Query:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKELELKSENKGSGGAESLFSKGKTH
        LF+FK N +K+LDENLDEFKKLT     TG+KLG+ NEA ILINS+ + YK+VK  LKY R++IT+ SV+  ++ KELELK+ENK S  AE         
Subjt:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKELELKSENKGSGGAESLFSKGKTH

Query:  NKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQRRNGDFRKGKEHGRGDVSIGENTFEYSEVLATTEGKAIKQGVGNKEDWVINSGCTYHMT
                  KNF+ +   + Y  +    F RN   +R  D R+G+EHGR    +G   FEY+EVLA T  KA++     +ED V++SGCTYHMT
Subjt:  NKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQRRNGDFRKGKEHGRGDVSIGENTFEYSEVLATTEGKAIKQGVGNKEDWVINSGCTYHMT

KAA0054988.1 hypothetical protein E6C27_scaffold43052G001360 [Cucumis melo var. makuwa]7.8e-6044.24Show/hide
Query:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET
        EIE F+G+ DF  W K+I AIL  QK LKA EDPK LPAT T  E++T+EEVAY+TLI+N++D+VLRQ+I+E   F+                       
Subjt:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET

Query:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKELELKSENKGSGGAESLFSKGKTH
                        +EFKKLT  F  TG+KLG+ +EA ILINS+ + YK+VK ALKY R+ IT+  V+ A++ +ELELK+ENK S  AESLF KGK  
Subjt:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKELELKSENKGSGGAESLFSKGKTH

Query:  NKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR--------------------------RNGDFRKGKEHGRGDVSIGENTFEYSEVLATTEGKAI
         +KN   K  ++ + K +LKC+ICHK GHFKRNCP R                          +  D R+G+EHGR    +G   FEY+E+L TT  + +
Subjt:  NKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR--------------------------RNGDFRKGKEHGRGDVSIGENTFEYSEVLATTEGKAI

Query:  KQGVGNKEDWVINSGCTYHMT
        +     +EDWV++SGCTYHMT
Subjt:  KQGVGNKEDWVINSGCTYHMT

TYK12279.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]8.6e-5950.79Show/hide
Query:  QIIDENTTFSIWRKLQALYIKKDVPSKIYHRETLFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLG
        ++I+E   F+ W KL++LY KKD+P+K++ RE LF+FKMN +K+LDENLDEFKK T     T +KLG A EA ILINS+ + YK+VK ALKY R++IT+ 
Subjt:  QIIDENTTFSIWRKLQALYIKKDVPSKIYHRETLFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLG

Query:  SVVAAIRCKELELKSENKGSGGAESLFSKGKTHNKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR--------------------------RNGD
        SV+ A++ KELELK+ENK S  AESLFSKGK   +KN   K  ++ K K +LKC+ICHKEGHFKRNCP R                          +  D
Subjt:  SVVAAIRCKELELKSENKGSGGAESLFSKGKTHNKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR--------------------------RNGD

Query:  FRKGKEHGRGDVSIGENTFEYSEVLATTEGKAIKQGVGNKEDWVINSGCTYHMT
         R+G+EHGR    +G   FEY+EVL  T  KA++     +EDWV++SGCTYHMT
Subjt:  FRKGKEHGRGDVSIGENTFEYSEVLATTEGKAIKQGVGNKEDWVINSGCTYHMT

TYK27723.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.8e-8052.02Show/hide
Query:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET
        EIEKF+G+GDF L  K+I A L  QK LKALEDPK LPAT T  E++T+EEVAYSTLI+N++D+VLRQ+I+E T F+ W  L++LY KKD+ +K++ RE 
Subjt:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET

Query:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKELELKSENKGSGGAESLFSKGKTH
        LF+FKMN +K+LDENLDEFKKLT     T +KLG+ +EA ILIN + + YK+VK +LKY R++IT+ SV+ A++ KELELK+ENK S  AESLFSKG   
Subjt:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKELELKSENKGSGGAESLFSKGKTH

Query:  NKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR---------------------RNGDFR-----KGKEHGRGDVSIGENTFEYSEVLATTEGKAI
         +KN   K  ++ + K +LKC+ICHKEGHFKRNCP R                     RN +++     +G+EHG     +G   FEY++VLA T  +A+
Subjt:  NKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR---------------------RNGDFR-----KGKEHGRGDVSIGENTFEYSEVLATTEGKAI

Query:  KQGVGNKEDWVINSGCTYHMT
        +  +  +EDWV++SGCTY+MT
Subjt:  KQGVGNKEDWVINSGCTYHMT

TrEMBL top hitse value%identityAlignment
A0A5A7TMB4 Pentatricopeptide repeat-containing protein4.2e-5950.79Show/hide
Query:  QIIDENTTFSIWRKLQALYIKKDVPSKIYHRETLFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLG
        ++I+E   F+ W KL++LY KKD+P+K++ RE LF+FKMN +K+LDENLDEFKK T     T +KLG A EA ILINS+ + YK+VK ALKY R++IT+ 
Subjt:  QIIDENTTFSIWRKLQALYIKKDVPSKIYHRETLFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLG

Query:  SVVAAIRCKELELKSENKGSGGAESLFSKGKTHNKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR--------------------------RNGD
        SV+ A++ KELELK+ENK S  AESLFSKGK   +KN   K  ++ K K +LKC+ICHKEGHFKRNCP R                          +  D
Subjt:  SVVAAIRCKELELKSENKGSGGAESLFSKGKTHNKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR--------------------------RNGD

Query:  FRKGKEHGRGDVSIGENTFEYSEVLATTEGKAIKQGVGNKEDWVINSGCTYHMT
         R+G+EHGR    +G   FEY+EVL  T  KA++     +EDWV++SGCTYHMT
Subjt:  FRKGKEHGRGDVSIGENTFEYSEVLATTEGKAIKQGVGNKEDWVINSGCTYHMT

A0A5A7U6R2 Retrovirus-related Pol polyprotein from transposon TNT 1-948.6e-7352.54Show/hide
Query:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET
        EIEKF+G+GDF LW K+I AIL  QK LKALEDPK LPAT T  E++T+EEVAYSTLI+N++D+VLRQ+I+E T F+ W KL++LY KKD+P+K++ +E 
Subjt:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET

Query:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKELELKSENKGSGGAESLFSKGKTH
        LF+FK N +K+LDENLDEFKKLT     TG+KLG+ NEA ILINS+ + YK+VK  LKY R++IT+ SV+  ++ KELELK+ENK S  AE         
Subjt:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKELELKSENKGSGGAESLFSKGKTH

Query:  NKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQRRNGDFRKGKEHGRGDVSIGENTFEYSEVLATTEGKAIKQGVGNKEDWVINSGCTYHMT
                  KNF+ +   + Y  +    F RN   +R  D R+G+EHGR    +G   FEY+EVLA T  KA++     +ED V++SGCTYHMT
Subjt:  NKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQRRNGDFRKGKEHGRGDVSIGENTFEYSEVLATTEGKAIKQGVGNKEDWVINSGCTYHMT

A0A5A7UJ23 Integrase catalytic domain-containing protein3.8e-6044.24Show/hide
Query:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET
        EIE F+G+ DF  W K+I AIL  QK LKA EDPK LPAT T  E++T+EEVAY+TLI+N++D+VLRQ+I+E   F+                       
Subjt:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET

Query:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKELELKSENKGSGGAESLFSKGKTH
                        +EFKKLT  F  TG+KLG+ +EA ILINS+ + YK+VK ALKY R+ IT+  V+ A++ +ELELK+ENK S  AESLF KGK  
Subjt:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKELELKSENKGSGGAESLFSKGKTH

Query:  NKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR--------------------------RNGDFRKGKEHGRGDVSIGENTFEYSEVLATTEGKAI
         +KN   K  ++ + K +LKC+ICHK GHFKRNCP R                          +  D R+G+EHGR    +G   FEY+E+L TT  + +
Subjt:  NKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR--------------------------RNGDFRKGKEHGRGDVSIGENTFEYSEVLATTEGKAI

Query:  KQGVGNKEDWVINSGCTYHMT
        +     +EDWV++SGCTYHMT
Subjt:  KQGVGNKEDWVINSGCTYHMT

A0A5D3CPM8 Pentatricopeptide repeat-containing protein4.2e-5950.79Show/hide
Query:  QIIDENTTFSIWRKLQALYIKKDVPSKIYHRETLFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLG
        ++I+E   F+ W KL++LY KKD+P+K++ RE LF+FKMN +K+LDENLDEFKK T     T +KLG A EA ILINS+ + YK+VK ALKY R++IT+ 
Subjt:  QIIDENTTFSIWRKLQALYIKKDVPSKIYHRETLFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLG

Query:  SVVAAIRCKELELKSENKGSGGAESLFSKGKTHNKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR--------------------------RNGD
        SV+ A++ KELELK+ENK S  AESLFSKGK   +KN   K  ++ K K +LKC+ICHKEGHFKRNCP R                          +  D
Subjt:  SVVAAIRCKELELKSENKGSGGAESLFSKGKTHNKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR--------------------------RNGD

Query:  FRKGKEHGRGDVSIGENTFEYSEVLATTEGKAIKQGVGNKEDWVINSGCTYHMT
         R+G+EHGR    +G   FEY+EVL  T  KA++     +EDWV++SGCTYHMT
Subjt:  FRKGKEHGRGDVSIGENTFEYSEVLATTEGKAIKQGVGNKEDWVINSGCTYHMT

A0A5D3DVM0 Retrovirus-related Pol polyprotein from transposon TNT 1-948.6e-8152.02Show/hide
Query:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET
        EIEKF+G+GDF L  K+I A L  QK LKALEDPK LPAT T  E++T+EEVAYSTLI+N++D+VLRQ+I+E T F+ W  L++LY KKD+ +K++ RE 
Subjt:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET

Query:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKELELKSENKGSGGAESLFSKGKTH
        LF+FKMN +K+LDENLDEFKKLT     T +KLG+ +EA ILIN + + YK+VK +LKY R++IT+ SV+ A++ KELELK+ENK S  AESLFSKG   
Subjt:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKELELKSENKGSGGAESLFSKGKTH

Query:  NKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR---------------------RNGDFR-----KGKEHGRGDVSIGENTFEYSEVLATTEGKAI
         +KN   K  ++ + K +LKC+ICHKEGHFKRNCP R                     RN +++     +G+EHG     +G   FEY++VLA T  +A+
Subjt:  NKKNRFTKGHKNFKGKTSLKCYICHKEGHFKRNCPQR---------------------RNGDFR-----KGKEHGRGDVSIGENTFEYSEVLATTEGKAI

Query:  KQGVGNKEDWVINSGCTYHMT
        +  +  +EDWV++SGCTY+MT
Subjt:  KQGVGNKEDWVINSGCTYHMT

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.1e-1427.34Show/hide
Query:  IEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRETL
        I+ F+G   + +W  +I+A+L +Q  LK ++    L      +  +  E  A ST+I  +SDS L     + T   I   L A+Y +K + S++  R+ L
Subjt:  IEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRETL

Query:  FTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALK-YERDSITLGSVVAAIRCKELELKSENKGSGGA---ESLFSKG
         + K+++  SL  +   F +L +E  A G K+   ++ + L+ +LP  Y  +  A++    +++TL  V   +  +E+++K+++  +        + +  
Subjt:  FTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALK-YERDSITLGSVVAAIRCKELELKSENKGSGGA---ESLFSKG

Query:  KTHNK---KNRFTKGHKNFKG--KTSLKCYICHKEGHFKRNCPQRRNGDFRKGKEH
         T+     KNR TK  K FKG  K  +KC+ C +EGH K++C   +     K KE+
Subjt:  KTHNK---KNRFTKGHKNFKG--KTSLKCYICHKEGHFKRNCPQRRNGDFRKGKEH

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.9e-3129.13Show/hide
Query:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET
        E+ KFNG   F  W ++++ +L+QQ   K L+     P T   E+   ++E A S + L++SD V+  IIDE+T   IW +L++LY+ K + +K+Y ++ 
Subjt:  EIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRET

Query:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKE-LELKSENKGSGGAESLFSKGK-
        L+   M+   +   +L+ F  L T+ A  G K+   ++A +L+NSLP +Y ++   + + + +I L  V +A+   E +  K EN+G    ++L ++G+ 
Subjt:  LFTFKMNNSKSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKE-LELKSENKGSGGAESLFSKGK-

Query:  ------THNKKNRFTKGHKNFKGKTSLK-CYICHKEGHFKRNCPQRRNGDFRKGKEHGRGDVSIGENTFEYSE-----VLATTEGKAIKQGVGNKEDWVI
              ++N      +G    + K+ ++ CY C++ GHFKR+CP  R G   KG+  G+ +    +NT    +     VL   E +      G + +WV+
Subjt:  ------THNKKNRFTKGHKNFKGKTSLK-CYICHKEGHFKRNCPQRRNGDFRKGKEHGRGDVSIGENTFEYSE-----VLATTEGKAIKQGVGNKEDWVI

Query:  NSGCTYHMT
        ++  ++H T
Subjt:  NSGCTYHMT

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTACCAAGAGTTTGCTCTTATCTAGGACAACTCCTTGTCCAACAGAGCCCAACCACGGATCTCTCCATGACTACTTCTTTGAGGTTCAGTGGTACTTTGTTCGA
GGGGAGGATTGTTGGGAGTGCGACCAAAGTCCCACATTGGCTAGATAAGGGGATGATCATGGAGCCATCAAGATTCAAGAAAGGAAGAAAGAGAAAAATTGATAAATGGC
AACAACAAGGTANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNTGAAATCGAGAAGTTCAATGGTAGTGGAGACTTTGATTTATGGTGTAAAAAGATCAAGGCAATTCTATTGCAACAAAAGGAACTTAAGGCCTTGGAGGATCCCAAAAC
TTTACCGGCTACATTCACACTTGAAGAAAAACAAACAATGGAGGAAGTAGCTTACAGTACTCTTATATTGAATGTTTCTGACAGTGTTCTGAGACAAATTATAGATGAAA
ATACAACATTTAGCATTTGGAGAAAATTGCAAGCCTTATATATCAAGAAAGATGTACCTAGCAAAATCTATCACAGAGAAACATTGTTTACTTTCAAAATGAATAACTCC
AAATCACTTGATGAGAATCTTGATGAATTCAAGAAACTCACCACAGAATTTGCAGCAACAGGAGATAAATTGGGTAGCGCGAATGAAGCTACAATCCTTATAAATTCTTT
ACCTGAAGCCTATAAAGATGTGAAGGCAGCACTGAAGTACGAGAGAGATTCTATAACTTTAGGCTCTGTGGTGGCAGCAATTAGATGTAAGGAGCTTGAACTAAAATCAG
AGAATAAAGGGAGTGGTGGAGCTGAATCTCTTTTCTCAAAGGGAAAGACTCACAATAAGAAAAACAGATTTACAAAGGGACACAAGAATTTTAAAGGTAAAACTAGTTTG
AAGTGTTATATTTGTCACAAAGAAGGACACTTCAAACGTAATTGTCCTCAAAGAAGAAATGGAGATTTCAGAAAAGGGAAGGAACATGGAAGAGGAGATGTCTCTATTGG
AGAAAACACTTTTGAATACTCAGAAGTGCTAGCCACTACTGAGGGAAAGGCCATAAAACAGGGAGTGGGGAATAAGGAAGATTGGGTAATTAACTCAGGGTGTACCTACC
ACATGACTGAAGGGAATAAAAGTCCCCACGCAGCGGAAGCGCATCGATTGGACCTTACGCCGTATATTAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTACCAAGAGTTTGCTCTTATCTAGGACAACTCCTTGTCCAACAGAGCCCAACCACGGATCTCTCCATGACTACTTCTTTGAGGTTCAGTGGTACTTTGTTCGA
GGGGAGGATTGTTGGGAGTGCGACCAAAGTCCCACATTGGCTAGATAAGGGGATGATCATGGAGCCATCAAGATTCAAGAAAGGAAGAAAGAGAAAAATTGATAAATGGC
AACAACAAGGTANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNTGAAATCGAGAAGTTCAATGGTAGTGGAGACTTTGATTTATGGTGTAAAAAGATCAAGGCAATTCTATTGCAACAAAAGGAACTTAAGGCCTTGGAGGATCCCAAAAC
TTTACCGGCTACATTCACACTTGAAGAAAAACAAACAATGGAGGAAGTAGCTTACAGTACTCTTATATTGAATGTTTCTGACAGTGTTCTGAGACAAATTATAGATGAAA
ATACAACATTTAGCATTTGGAGAAAATTGCAAGCCTTATATATCAAGAAAGATGTACCTAGCAAAATCTATCACAGAGAAACATTGTTTACTTTCAAAATGAATAACTCC
AAATCACTTGATGAGAATCTTGATGAATTCAAGAAACTCACCACAGAATTTGCAGCAACAGGAGATAAATTGGGTAGCGCGAATGAAGCTACAATCCTTATAAATTCTTT
ACCTGAAGCCTATAAAGATGTGAAGGCAGCACTGAAGTACGAGAGAGATTCTATAACTTTAGGCTCTGTGGTGGCAGCAATTAGATGTAAGGAGCTTGAACTAAAATCAG
AGAATAAAGGGAGTGGTGGAGCTGAATCTCTTTTCTCAAAGGGAAAGACTCACAATAAGAAAAACAGATTTACAAAGGGACACAAGAATTTTAAAGGTAAAACTAGTTTG
AAGTGTTATATTTGTCACAAAGAAGGACACTTCAAACGTAATTGTCCTCAAAGAAGAAATGGAGATTTCAGAAAAGGGAAGGAACATGGAAGAGGAGATGTCTCTATTGG
AGAAAACACTTTTGAATACTCAGAAGTGCTAGCCACTACTGAGGGAAAGGCCATAAAACAGGGAGTGGGGAATAAGGAAGATTGGGTAATTAACTCAGGGTGTACCTACC
ACATGACTGAAGGGAATAAAAGTCCCCACGCAGCGGAAGCGCATCGATTGGACCTTACGCCGTATATTAATTAA
Protein sequenceShow/hide protein sequence
MALPRVCSYLGQLLVQQSPTTDLSMTTSLRFSGTLFEGRIVGSATKVPHWLDKGMIMEPSRFKKGRKRKIDKWQQQGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XEIEKFNGSGDFDLWCKKIKAILLQQKELKALEDPKTLPATFTLEEKQTMEEVAYSTLILNVSDSVLRQIIDENTTFSIWRKLQALYIKKDVPSKIYHRETLFTFKMNNS
KSLDENLDEFKKLTTEFAATGDKLGSANEATILINSLPEAYKDVKAALKYERDSITLGSVVAAIRCKELELKSENKGSGGAESLFSKGKTHNKKNRFTKGHKNFKGKTSL
KCYICHKEGHFKRNCPQRRNGDFRKGKEHGRGDVSIGENTFEYSEVLATTEGKAIKQGVGNKEDWVINSGCTYHMTEGNKSPHAAEAHRLDLTPYIN