; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018263 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018263
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr5:20790927..20792075
RNA-Seq ExpressionLag0018263
SyntenyLag0018263
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN80650.1 hypothetical protein VITISV_022906 [Vitis vinifera]5.0e-2633.59Show/hide
Query:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALTNNSETMKTVVSDSTENNTLK
        G+ KFDG +F YW+MQ+++YL  +K+H   L  KP+ M  E+W  LD++ +  IR+ LS  VA  V  E T   LM+AL   SE M+ VVS+ST    LK
Subjt:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALTNNSETMKTVVSDSTENNTLK

Query:  FSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSK--EDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFT
        +++ +D +  E+     + ++ + G   N+ T  +    R      K  ED         VQD +L+ V+S         DW+LDS TS H    R +  
Subjt:  FSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSK--EDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFT

Query:  SFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY
        ++  G  G V + +G      G+GDV +        +L  V  +P+++ NLISIG+L D+ +
Subjt:  SFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY

KAB5561215.1 hypothetical protein DKX38_006172 [Salix brachista]4.2e-2531.54Show/hide
Query:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALTNNSETMKTVVSDSTENNTLK
        G+ K DG  FGYWKMQ+++YL  KK+H   L +KP+ M DE+W +LD + +  IR+ L+  VA  V  E T   LM  L+   E M+T +S+S   + L 
Subjt:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALTNNSETMKTVVSDSTENNTLK

Query:  FSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSF
        + + +D +  E+     + +S   G   NV T      + + +    +D+     A    + +L+ V S          W+LDS  S H T  +++  ++
Subjt:  FSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSF

Query:  TGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY
          G +G+V + +G   +  GIGDV +K        L+ V  VP +K NLIS+G+L +  +
Subjt:  TGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY

TKR84597.1 Glycyl-tRNA synthetase family protein [Populus alba]7.2e-2534.07Show/hide
Query:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALT---------NNSETMKTVVS
        G+ KFDG  FGYWKMQ+++YL  KK+H   L  KP+ M  E+W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+         N    MK + +
Subjt:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALT---------NNSETMKTVVS

Query:  DSTENNTLKFSEGKD--KVDEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHI
             NT K  EGK   KV   ++ S+S   G   +  +                  E        + VQD +++ V+S         +WILDS  S H 
Subjt:  DSTENNTLKFSEGKD--KVDEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHI

Query:  TSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY
        T    +  ++ GG HG+V + +G   K  GIGDV +KT       L++V  VP +K  LIS+G+L D  +
Subjt:  TSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY

TKS02451.1 hypothetical protein D5086_0000163350 [Populus alba]1.0e-2332.13Show/hide
Query:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALT---------NNSETMKTVVS
        G+ KFDG  FGYWKMQ+++YL  KK+H   L  KP+ M  E+W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+         N    MK + +
Subjt:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALT---------NNSETMKTVVS

Query:  DSTENNTLKFSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKT-----------------------------KPEANI----
             NT  + + +D +  E+     S +S   G   N+ T  R+     S G SK   ++                             KPEA+     
Subjt:  DSTENNTLKFSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKT-----------------------------KPEANI----

Query:  ---VQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKL
           VQD +++ V+S         +WILDS  S H T    +  ++ GG HG+V + +G   K  GIGDV +KT       L++V  VP +K  LIS+G+L
Subjt:  ---VQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKL

Query:  ADDDY
         D  +
Subjt:  ADDDY

TKS15174.1 hypothetical protein D5086_0000036030 [Populus alba]5.1e-2330.03Show/hide
Query:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALT--------NNS---------
        G+ KFDG  FGYWKMQ+++YL  KK+H   L  KP+ M  E+W+ LD + +  I++ LS  VA  V  E +  KLMEAL+        NN+         
Subjt:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALT--------NNS---------

Query:  --------------------ETMKTVVSDSTENNTLKFSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKT-----------
                            E M+T VS+S   + LK+ + +D +  E+     S +S   G   N+ T  R+     S G SK   ++           
Subjt:  --------------------ETMKTVVSDSTENNTLKFSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKT-----------

Query:  ------------------KPEANI-------VQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLK
                          KP+A+        VQD +++ V+S         +WILDS  S H T    +  ++ GG HG+V + +G      GIGDV +K
Subjt:  ------------------KPEANI-------VQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLK

Query:  TEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY
        T       L++V  VP +K  LIS+G+L D  +
Subjt:  TEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY

TrEMBL top hitse value%identityAlignment
A0A438G9X7 Retrovirus-related Pol polyprotein from transposon TNT 1-947.2e-2330.96Show/hide
Query:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALT------------------NN
        G+ KFDG  F YW+MQ+++YL  +K+H   L  KP+ M  E+W  LD + +  IR+ LS  VA  V  E T   LM+AL+                  N+
Subjt:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALT------------------NN

Query:  SETMKTVVSDSTENNTLKFSEGKDKV--DEDNEPSSSRKSGKIGMRYNVFTVIRKVISR--FSVGNSKEDQKTKPEANIVQDVVLVCVESDTKYSNHSSD
         E M+  VS+ST     K+++ +D +  +E     +   SG  G   N+ T  R    R   S     ED         +QD +L+ V+S         D
Subjt:  SETMKTVVSDSTENNTLKFSEGKDKV--DEDNEPSSSRKSGKIGMRYNVFTVIRKVISR--FSVGNSKEDQKTKPEANIVQDVVLVCVESDTKYSNHSSD

Query:  WILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY
        W+LDS  S H T  R +  ++  G  G V + +G      G+GDV +        +L  V  +P+++ NLIS+G+L D+ +
Subjt:  WILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY

A0A4U5NLQ9 Diadenosine tetraphosphate synthetase3.5e-2534.07Show/hide
Query:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALT---------NNSETMKTVVS
        G+ KFDG  FGYWKMQ+++YL  KK+H   L  KP+ M  E+W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+         N    MK + +
Subjt:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALT---------NNSETMKTVVS

Query:  DSTENNTLKFSEGKD--KVDEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHI
             NT K  EGK   KV   ++ S+S   G   +  +                  E        + VQD +++ V+S         +WILDS  S H 
Subjt:  DSTENNTLKFSEGKD--KVDEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHI

Query:  TSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY
        T    +  ++ GG HG+V + +G   K  GIGDV +KT       L++V  VP +K  LIS+G+L D  +
Subjt:  TSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY

A0A4U5PZ61 Uncharacterized protein5.0e-2432.13Show/hide
Query:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALT---------NNSETMKTVVS
        G+ KFDG  FGYWKMQ+++YL  KK+H   L  KP+ M  E+W+ LD + +  IR+ LS  VA  V  E +  KLMEAL+         N    MK + +
Subjt:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALT---------NNSETMKTVVS

Query:  DSTENNTLKFSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKT-----------------------------KPEANI----
             NT  + + +D +  E+     S +S   G   N+ T  R+     S G SK   ++                             KPEA+     
Subjt:  DSTENNTLKFSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKT-----------------------------KPEANI----

Query:  ---VQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKL
           VQD +++ V+S         +WILDS  S H T    +  ++ GG HG+V + +G   K  GIGDV +KT       L++V  VP +K  LIS+G+L
Subjt:  ---VQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKL

Query:  ADDDY
         D  +
Subjt:  ADDDY

A0A5N5N166 Uncharacterized protein2.0e-2531.54Show/hide
Query:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALTNNSETMKTVVSDSTENNTLK
        G+ K DG  FGYWKMQ+++YL  KK+H   L +KP+ M DE+W +LD + +  IR+ L+  VA  V  E T   LM  L+   E M+T +S+S   + L 
Subjt:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALTNNSETMKTVVSDSTENNTLK

Query:  FSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSF
        + + +D +  E+     + +S   G   NV T      + + +    +D+     A    + +L+ V S          W+LDS  S H T  +++  ++
Subjt:  FSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSF

Query:  TGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY
          G +G+V + +G   +  GIGDV +K        L+ V  VP +K NLIS+G+L +  +
Subjt:  TGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY

A5B0E4 Integrase catalytic domain-containing protein2.4e-2633.59Show/hide
Query:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALTNNSETMKTVVSDSTENNTLK
        G+ KFDG +F YW+MQ+++YL  +K+H   L  KP+ M  E+W  LD++ +  IR+ LS  VA  V  E T   LM+AL   SE M+ VVS+ST    LK
Subjt:  GVMKFDGIFFGYWKMQVKNYLTCKKVH-KALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALTNNSETMKTVVSDSTENNTLK

Query:  FSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSK--EDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFT
        +++ +D +  E+     + ++ + G   N+ T  +    R      K  ED         VQD +L+ V+S         DW+LDS TS H    R +  
Subjt:  FSEGKDKV-DEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSK--EDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFT

Query:  SFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY
        ++  G  G V + +G      G+GDV +        +L  V  +P+++ NLISIG+L D+ +
Subjt:  SFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDY

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-1240.17Show/hide
Query:  KEDQKTKPEANIVQDVVLVCVESD--TKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPN
        K D  T        +VVL   E +     S   S+W++D+A S H T  R LF  +  G  G V+MGN    K  GIGD+ +KT     LVL+DV  VP+
Subjt:  KEDQKTKPEANIVQDVVLVCVESD--TKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPN

Query:  IKMNLISIGKLADDDYE
        ++MNLIS   L  D YE
Subjt:  IKMNLISIGKLADDDYE

P25601 Putative transposon Ty5-1 protein YCL075W3.5e-0635.23Show/hide
Query:  VCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISI
        +C+ S T  +  SS+WI D+  + H+  DRS+F+SFT         G G +    G G V++     G + L DV +VP++ +NLIS+
Subjt:  VCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.3e-0539.29Show/hide
Query:  SSDWILDSATSIHITSD---RSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLAD
        S++W+LDS  + HITSD    SL   +TGG    V + +G T      G  SL T+    L L ++++VPNI  NLIS+ +L +
Subjt:  SSDWILDSATSIHITSD---RSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLAD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.9e-0437.65Show/hide
Query:  HSSDWILDSATSIHITSD---RSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLAD
        ++++W+LDS  + HITSD    S    +TGG    V + +G T      G  SL T     L L  V++VPNI  NLIS+ +L +
Subjt:  HSSDWILDSATSIHITSD---RSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLAD

Arabidopsis top hitse value%identityAlignment
AT3G20980.1 Gag-Pol-related retrotransposon family protein2.7e-0630.91Show/hide
Query:  NIVQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKT-----RGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLIS
        N V D V  C    +KY+ H + W++ S  S H+T     FT+        V+  +G   +T      GIGDV+  T  EG   +++V++VP I+ N +S
Subjt:  NIVQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKT-----RGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLIS

Query:  IGKLADDDYE
        + +L  + +E
Subjt:  IGKLADDDYE

AT3G21000.1 Gag-Pol-related retrotransposon family protein6.7e-0526.45Show/hide
Query:  RFSVGNSKEDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDV
        +F +   KE++    E  IV D  L  V +    +     WI+     I++T     FT+        V   +G      G GDV ++ +   K  +R+V
Subjt:  RFSVGNSKEDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGNGRTFKTRGIGDVSLKTEYEGKLVLRDV

Query:  MFVPNIKMNLISIGKLADDDY
        +FVP +  N++S GK+    Y
Subjt:  MFVPNIKMNLISIGKLADDDY

AT3G29785.1 unknown protein1.0e-0529.81Show/hide
Query:  KFDGIFFGYWKMQVKNYLTCKKVHKALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALTN---NSETMKTVVSDSTENNTLKF
        K DG  + + +M++++YL  KK+H+ L +K + M+ +DW  L  + +  IR+ +S ++A  VA E +   LM+ L++      T  TV+S      T+  
Subjt:  KFDGIFFGYWKMQVKNYLTCKKVHKALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALTN---NSETMKTVVSDSTENNTLKF

Query:  SEGK
         +G+
Subjt:  SEGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATTTGTAGAGCCAAAAAGTTTTGATGGAGTCATGAAGTTCGATGGAATTTTTTTTGGATATTGGAAGATGCAAGTCAAGAATTATTTAACTTGCAAGAAAGTGCA
TAAGGCATTGAAGGAGAAACCGAAAGGGATGACAGATGAAGATTGGGAAGCTCTGGATGAAGAGACAGTTGCAACCATAAGGATGTGTTTGTCAATGGATGTGGCAAGTC
TAGTAGCCTATGAGACAACTGCAGTAAAATTGATGGAAGCACTTACAAACAATTCGGAAACGATGAAGACAGTAGTGTCTGATTCAACTGAAAATAATACTTTAAAATTT
TCAGAAGGTAAAGATAAGGTTGATGAAGATAATGAACCGAGCAGCAGTAGAAAAAGTGGAAAAATAGGAATGAGGTATAATGTTTTTACTGTCATAAGAAAGGTCATTTC
AAGATTCAGTGTAGGAAATTCAAAAGAGGATCAGAAAACAAAACCAGAGGCGAATATAGTGCAAGATGTCGTCTTAGTTTGTGTTGAGAGTGACACAAAGTATAGTAACC
ACTCTTCAGATTGGATATTAGACAGTGCAACTTCCATTCACATAACTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAAT
GGTAGAACCTTCAAGACTAGAGGGATTGGAGATGTTAGTCTAAAGACAGAATATGAAGGTAAATTGGTACTGCGAGATGTCATGTTCGTGCCTAATATCAAAATGAATCT
TATTTCTATTGGCAAGTTGGCAGATGATGATTATGAAGGAGATGAAAGCCCCGACGTAGCGGAAGCGCGTCGAGAGGATCTCACGTCGTATATTAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGATTTGTAGAGCCAAAAAGTTTTGATGGAGTCATGAAGTTCGATGGAATTTTTTTTGGATATTGGAAGATGCAAGTCAAGAATTATTTAACTTGCAAGAAAGTGCA
TAAGGCATTGAAGGAGAAACCGAAAGGGATGACAGATGAAGATTGGGAAGCTCTGGATGAAGAGACAGTTGCAACCATAAGGATGTGTTTGTCAATGGATGTGGCAAGTC
TAGTAGCCTATGAGACAACTGCAGTAAAATTGATGGAAGCACTTACAAACAATTCGGAAACGATGAAGACAGTAGTGTCTGATTCAACTGAAAATAATACTTTAAAATTT
TCAGAAGGTAAAGATAAGGTTGATGAAGATAATGAACCGAGCAGCAGTAGAAAAAGTGGAAAAATAGGAATGAGGTATAATGTTTTTACTGTCATAAGAAAGGTCATTTC
AAGATTCAGTGTAGGAAATTCAAAAGAGGATCAGAAAACAAAACCAGAGGCGAATATAGTGCAAGATGTCGTCTTAGTTTGTGTTGAGAGTGACACAAAGTATAGTAACC
ACTCTTCAGATTGGATATTAGACAGTGCAACTTCCATTCACATAACTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAAT
GGTAGAACCTTCAAGACTAGAGGGATTGGAGATGTTAGTCTAAAGACAGAATATGAAGGTAAATTGGTACTGCGAGATGTCATGTTCGTGCCTAATATCAAAATGAATCT
TATTTCTATTGGCAAGTTGGCAGATGATGATTATGAAGGAGATGAAAGCCCCGACGTAGCGGAAGCGCGTCGAGAGGATCTCACGTCGTATATTAATTAA
Protein sequenceShow/hide protein sequence
MGFVEPKSFDGVMKFDGIFFGYWKMQVKNYLTCKKVHKALKEKPKGMTDEDWEALDEETVATIRMCLSMDVASLVAYETTAVKLMEALTNNSETMKTVVSDSTENNTLKF
SEGKDKVDEDNEPSSSRKSGKIGMRYNVFTVIRKVISRFSVGNSKEDQKTKPEANIVQDVVLVCVESDTKYSNHSSDWILDSATSIHITSDRSLFTSFTGGHHGLVRMGN
GRTFKTRGIGDVSLKTEYEGKLVLRDVMFVPNIKMNLISIGKLADDDYEGDESPDVAEARREDLTSYIN