CuGenDBv2

Lag0018102 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018102
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon opus
Genome location: chr5:16184333..16188114
RNA-Seq Expression: Lag0018102
Synteny: Lag0018102
Gene Ontology terms: NA
InterPro domains: NA
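
The genome location field uses a `chromosome:start..end` layout. As a minimal sketch, the snippet below splits that string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; `GenomeLocation` and `parse_location` are hypothetical names introduced for illustration only.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'chr5:16184333..16188114'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.groups()
    return GenomeLocation(chrom, int(start), int(end))


loc = parse_location("chr5:16184333..16188114")
print(loc.chrom, loc.start, loc.end, loc.length)  # chr5 16184333 16188114 3782
```

Under the inclusive-coordinate assumption, the gene spans 3782 bp on chr5.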


Homology
GenBank top hits | e value | % identity | Alignment
XP_023521781.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785639, partial [Cucurbita pepo subsp. pepo] | e value: 1.8e-28 | % identity: 29.17
Query:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVRGF-SKKVKSVIEVDCVSTIRADIA----------------------------------------
        +VDA A G +L+KT+NE YEILERI++ +CQW++VR    +K + V+EVD +S+I A +A                                        
Subjt:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVRGF-SKKVKSVIEVDCVSTIRADIA----------------------------------------

Query:  ---------------------------------------------IWARKQSSKQQK------------VNQLGFAKSQTLPQQNKQALPQQNSKSSLEA
                                                      W  + S  QQ              NQL +   Q   Q    +  Q    +SLE+
Subjt:  ---------------------------------------------IWARKQSSKQQK------------VNQLGFAKSQTLPQQNKQALPQQNSKSSLEA

Query:  MMKEYMARTDVTIQSNQASLRALELQVGQLANELKAQPQG--NIPSYIEHL--IREGKKQMQ-----------AVTLRSGKPLEERKKPS--IPQEV---
        ++KEYMA+ D  IQS QASLR LE+QVGQLANEL+ +P     +P+Y++ L  +   +++ +           +  L++  PL+E+   S  IP  +   
Subjt:  MMKEYMARTDVTIQSNQASLRALELQVGQLANELKAQPQG--NIPSYIEHL--IREGKKQMQ-----------AVTLRSGKPLEERKKPS--IPQEV---

Query:  -------ENSSDSNVVEKEL--ESGIGEARPTTVTLQLADRSITYPEGKIEDVLVQ-----------------------------------------GEL
               +  S  N++   +  + GIGEARPTTVTLQLADRS TYPEGKIED+L+Q                                         G +
Subjt:  -------ENSSDSNVVEKEL--ESGIGEARPTTVTLQLADRSITYPEGKIEDVLVQ-----------------------------------------GEL

Query:  TMNVHDQEVKFNMFDAMKYPNDIEDCSCIQDL
        T+ + DQ+V+FN+ D+MKYP   ++CS + +L
Subjt:  TMNVHDQEVKFNMFDAMKYPNDIEDCSCIQDL

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo] | e value: 4.3e-27 | % identity: 26.79
Query:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVRGF-SKKVKSVIEVDCVSTIRADIAIWARKQSSKQQKVNQLGFAKSQTLPQQNKQALPQQNSKSS
        +VDA A G +L+KT+NE YEILERI++ +CQW++VR    +K + V+EVD +S+I A +A       S    +  L   +   +      A     + + 
Subjt:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVRGF-SKKVKSVIEVDCVSTIRADIAIWARKQSSKQQKVNQLGFAKSQTLPQQNKQALPQQNSKSS

Query:  LEAMMKEYMARTDVTIQSNQASLRALELQVGQLANELKAQPQGNIPSYIEHLIREGKKQMQAVTLRSGKPL---------------------EERKKPSI
          +++KEYMA+ D  IQS QASLR LE+QVGQLANEL+ +P   +P+  E   REG +Q QA+ LRSGK +                     ++RK+ ++
Subjt:  LEAMMKEYMARTDVTIQSNQASLRALELQVGQLANELKAQPQGNIPSYIEHLIREGKKQMQAVTLRSGKPL---------------------EERKKPSI

Query:  PQEVENSSDSN-VVEKE-----------------------------------------------------------------------------------
         QE  N +D+   V+KE                                                                                   
Subjt:  PQEVENSSDSN-VVEKE-----------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------LESGIGEARPTTVTLQLADRSITYPEGKIEDVLVQ--
                                                                        + GIGEARPTTVTLQLADRS TYPEGKIED+L+Q  
Subjt:  ---------------------------------------------------------------LESGIGEARPTTVTLQLADRSITYPEGKIEDVLVQ--

Query:  ---------------------------------------GELTMNVHDQEVKFNMFDAMKYPNDIEDCSCIQDL
                                               G +T+ + DQ+V+FN+ D+MKYP   E+CS + +L
Subjt:  ---------------------------------------GELTMNVHDQEVKFNMFDAMKYPNDIEDCSCIQDL

XP_024038239.1 uncharacterized protein LOC112097286 [Citrus clementina] | e value: 8.1e-26 | % identity: 29.74
Query:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVR----------------------------------GFSKKVKSVIEVDCVSTIR-----------
        MVDA A G LL+K++ E YEILERI+  + QW + R                                       VK V E+ CV               
Subjt:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVR----------------------------------GFSKKVKSVIEVDCVSTIR-----------

Query:  ADIAI--------------------WAR----KQSSKQQKVNQLGFAKSQTLPQQ---NKQALPQ----QNSKSSLEAMMKEYMARTDVTIQSNQASLRA
        A I                      W +      S++ +    L      T PQQ   ++Q+  Q    Q+  +SLE ++KEY+A+ +  +QS   SLR 
Subjt:  ADIAI--------------------WAR----KQSSKQQKVNQLGFAKSQTLPQQ---NKQALPQ----QNSKSSLEAMMKEYMARTDVTIQSNQASLRA

Query:  LELQVGQLANELKAQPQGNIPSYIEHLIREGKKQMQAVTLRSGK----PLEERK-------------KPSIPQEVENSS--------------DSNVVEK
        LE Q+GQLA  + ++ QG++PS  E+  RE K+  + ++LRSGK    P E  K             K S+ Q+  +                   + EK
Subjt:  LELQVGQLANELKAQPQGNIPSYIEHLIREGKKQMQAVTLRSGK----PLEERK-------------KPSIPQEVENSS--------------DSNVVEK

Query:  ELESGIGEARPTTVTLQLADRSITYPEGKIEDVLV-----------------------------------------QGELTMNVHDQEVKFNMFDAMKYP
        ELE  +GE RPTTVTL+LADRS TYPEG IEDVLV                                         +GELTM V+DQ+V FN+ +AM+ P
Subjt:  ELESGIGEARPTTVTLQLADRSITYPEGKIEDVLV-----------------------------------------QGELTMNVHDQEVKFNMFDAMKYP

Query:  NDIEDCSCIQ--DLEIGGL----------------EYEHKDVAHIETVKTPWYD
        ++IEDC+ +   DL + G                 E E +DVA    ++T W D
Subjt:  NDIEDCSCIQ--DLEIGGL----------------EYEHKDVAHIETVKTPWYD

XP_030497888.1 uncharacterized protein LOC115713544 [Cannabis sativa] | e value: 5.8e-32 | % identity: 29.44
Query:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWS-NVRGFSKKVKSVIEVDCVSTIRADIA----------------------------------------
        ++DA A G +L+K++NE +EILERI++ + QWS N    S+KV  V+EVD ++ + A +A                                        
Subjt:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWS-NVRGFSKKVKSVIEVDCVSTIRADIA----------------------------------------

Query:  -----------IWARK--QSSKQQKVNQLGFA---KSQTLPQQNKQALPQQNSKSSLEAMMKEYMARTDVTIQSNQASLRALELQVGQLANELKAQPQGN
                    W  +   SS  Q   +  F      Q  PQQ  Q  PQ +  SSLE++M++Y  + D  IQS  ASL+ LE+Q+GQLAN+LK++PQG 
Subjt:  -----------IWARK--QSSKQQKVNQLGFA---KSQTLPQQNKQALPQQNSKSSLEAMMKEYMARTDVTIQSNQASLRALELQVGQLANELKAQPQGN

Query:  IPSYIEHLIREGKKQMQAVTLRSGKPLEE-------------------RKKPS-----IPQEVENSSDSNVVEKELES----------------------
        +PS  ++  R+GK+  +AV LRSGK LE                    +KKP+     IP  V  S   +  EK L+                       
Subjt:  IPSYIEHLIREGKKQMQAVTLRSGKPLEE-------------------RKKPS-----IPQEVENSSDSNVVEKELES----------------------

Query:  -----------------------------------------------------GIGEARPTTVTLQLADRSITYPEGKIEDVLVQ---------------
                                                             GIGEARPTTVTLQL DRS+ +PEGKIEDV VQ               
Subjt:  -----------------------------------------------------GIGEARPTTVTLQLADRSITYPEGKIEDVLVQ---------------

Query:  --------------------------GELTMNVHDQEVKFNMFDAMKYPNDIEDCSCIQDLEIGGLEYEHKDVAHIETV
                                  GELTM V+DQ+V FN+F+AM++P++IE+CS +  ++    E  HK+V   E V
Subjt:  --------------------------GELTMNVHDQEVKFNMFDAMKYPNDIEDCSCIQDLEIGGLEYEHKDVAHIETV

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa] | e value: 1.3e-26 | % identity: 39.33
Query:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVRG-FSKKVKSVIEVDCVSTIRADIAIWARKQSSKQQKVNQLG-FAKSQTLPQQNKQALP------
        ++DA A G +L+K++NE +EILE I++ + QWSN R   S+KV  V+EVD ++ + A +A      ++    ++  G  A S T P Q +QA P      
Subjt:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVRG-FSKKVKSVIEVDCVSTIRADIAIWARKQSSKQQKVNQLG-FAKSQTLPQQNKQALP------

Query:  -------QQNSKSSLEAMMKEYMARTDVTIQSNQASLRALELQVGQLANELKAQPQGNIPSYIEHLIREGKKQMQAVTLRSGKPL----EERKKPSIPQE
               Q    SSLE++M++YMA+ D  IQS  ASLR LELQ+G LANELKA+PQG++PS  E+  R+GK+Q +++ LRSGK L    EE K    P  
Subjt:  -------QQNSKSSLEAMMKEYMARTDVTIQSNQASLRALELQVGQLANELKAQPQGNIPSYIEHLIREGKKQMQAVTLRSGKPL----EERKKPSIPQE

Query:  VENSSDSNVVEKELESGIGEARPT-TVTLQLADRSITYP
        ++N      + K+    I + RP  T + Q +D   + P
Subjt:  VENSSDSNVVEKELESGIGEARPT-TVTLQLADRSITYP

TrEMBL top hits | e value | % identity | Alignment
A0A061EW79 Retrotrans_gag domain-containing protein | e value: 2.7e-19 | % identity: 32.68
Query:  VDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVRGFSKKVKSVIEVDCVSTIRADIAIWARKQSSKQQKVNQLGF-------AKSQTLPQQNKQALPQQ
        +DA   G L++K+ ++ Y++LE I + + QW + R  ++K+  + E+D ++T+   +  +A+K         Q  F       AKS   P       P  
Subjt:  VDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVRGFSKKVKSVIEVDCVSTIRADIAIWARKQSSKQQKVNQLGF-------AKSQTLPQQNKQALPQQ

Query:  NSKSSLEAMMKEYMARTDVTIQSNQASLRALELQVGQLANELKAQPQGNIPSYIE-HLIREGKKQMQAVTLRSGKP------LEERKKPSIPQEVENSSD
          K S+E +  ++M +T+  IQ+   S+R LE+QVGQLA+ L  +PQG +PS  E +  REGK+   A+TL +GK       L++    SIP  + +   
Subjt:  NSKSSLEAMMKEYMARTDVTIQSNQASLRALELQVGQLANELKAQPQGNIPSYIE-HLIREGKKQMQAVTLRSGKP------LEERKKPSIPQEVENSSD

Query:  S----------NVVEKELESGIG--EARPTTVTLQLADRSITYPEGKIEDVLVQ
        S          +++   +   +G  E +PTTVTLQLADR+I Y    IEDVL++
Subjt:  S----------NVVEKELESGIG--EARPTTVTLQLADRSITYPEGKIEDVLVQ

A0A5B6UIR1 Retrovirus-related Pol polyprotein from transposon opus | e value: 8.8e-18 | % identity: 32.64
Query:  QNKQALPQQNSKSSLEAMMKEYMARTDVTIQSNQASLRALELQVGQLANELKAQPQGNIPSYIEHLIREGKKQMQAVTLRSGKPLEERKKP---------
        Q+ Q  PQ  S +SLE ++K YM + DV IQS  A+L+ LE Q+GQLA EL+++ Q  +PS  ++L   G +       +        K P         
Subjt:  QNKQALPQQNSKSSLEAMMKEYMARTDVTIQSNQASLRALELQVGQLANELKAQPQGNIPSYIEHLIREGKKQMQAVTLRSGKPLEERKKP---------

Query:  SIPQEVENS----------SDSNVVEKELES--GIGEARPTTVTLQLADRSITYPEGKIEDVLV------------------------------------
        +IP  +  S          +  N++ K +    GIGE RPTT+T QL ++S+ YPEGKIEDVLV                                    
Subjt:  SIPQEVENS----------SDSNVVEKELES--GIGEARPTTVTLQLADRSITYPEGKIEDVLV------------------------------------

Query:  -----QGELTMNVHDQEVKFNMFDAMKYPNDIEDCSCIQDLE
             + EL M V D +V FN+  AMK+PN +E+CS ++++E
Subjt:  -----QGELTMNVHDQEVKFNMFDAMKYPNDIEDCSCIQDLE

A0A5B6V914 Uncharacterized protein | e value: 9.4e-20 | % identity: 33.69
Query:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVR-GFSKKVKSVIEVDCVSTIRADIAIWARKQSSKQQKVNQLGFAKSQTLPQQNKQALPQQNSKSS
        +VDA   G LL K++NE YEILE+I+    Q+   R G   KV S +E+D ++++ A                                    Q +  SS
Subjt:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVR-GFSKKVKSVIEVDCVSTIRADIAIWARKQSSKQQKVNQLGFAKSQTLPQQNKQALPQQNSKSS

Query:  LEAMMKEYMARTDVTIQSNQASLRALELQVGQLANE---LKAQPQGNIPSYIEHLIREGKKQMQAVTLRSGKPLEERKKPSIPQEVENSSD---------
        +EA++KEYMA+ D        S   L+  V     E    K     N+ S  ++   E     Q   L    PL E  +  +P  V+   D         
Subjt:  LEAMMKEYMARTDVTIQSNQASLRALELQVGQLANE---LKAQPQGNIPSYIEHLIREGKKQMQAVTLRSGKPLEERKKPSIPQEVENSSD---------

Query:  --SNVVEKELESGIGEARPTTVTLQLADRSITYPEGKIEDVLV-QGELTMNVHDQEVKFNMFDAMKYPNDIEDCSCIQDLEI
            +   E + GIG+ARP TVTLQL DRS  +PEGKIEDVL  +GELT+ V+DQ + FN+FDA+KY  D ++C  I  +EI
Subjt:  --SNVVEKELESGIGEARPTTVTLQLADRSITYPEGKIEDVLV-QGELTMNVHDQEVKFNMFDAMKYPNDIEDCSCIQDLEI

A0A6A2WDK9 Mitogen-activated protein kinase 14 | e value: 1.5e-17 | % identity: 27.05
Query:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVR-GFSKKVKSVIEVDC-------VSTIRADIAIWARKQSSKQQK---------------------
        ++D  A G LL K+  E ++IL+RIS    Q+ + R G  +K     ++D        +STI   +    R    K+ K                     
Subjt:  MVDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVR-GFSKKVKSVIEVDC-------VSTIRADIAIWARKQSSKQQK---------------------

Query:  ---VNQLGFAKSQTLPQQNKQALPQQNSKSSLEAMMKEYMA---------------------RTDVTIQSNQASLRALELQVGQLANELKAQPQGNIPSY
           +N +G+  +      NK+A     S SSLEA ++E+++                          +QS+ +SLRALE QVGQ+A  L+ + QG +PS 
Subjt:  ---VNQLGFAKSQTLPQQNKQALPQQNSKSSLEAMMKEYMA---------------------RTDVTIQSNQASLRALELQVGQLANELKAQPQGNIPSY

Query:  IEHLIREGKKQMQAVTLRSGKPL--------------------------------EERKKPSIPQEVENSSDSN---------------VVEKELESGIG
         E     GK+    +TLRSG  +                                E R  P  P+ ++  +D                  VE      +G
Subjt:  IEHLIREGKKQMQAVTLRSGKPL--------------------------------EERKKPSIPQEVENSSDSN---------------VVEKELESGIG

Query:  EARPTTVTLQLADRSITYPEGKIEDVLV--------------QGELTMNVHDQEVKFNMFDAMKYPNDIEDCSCIQDLEIGGLEYEHKDVAHIETVKTPW
        +ARPT+V LQLADRS   PEG++EDV+V              +GELTM V DQ V  N+F  +KY +D E+C  I +L    +E E      I+  +  +
Subjt:  EARPTTVTLQLADRSITYPEGKIEDVLV--------------QGELTMNVHDQEVKFNMFDAMKYPNDIEDCSCIQDLEIGGLEYEHKDVAHIETVKTPW

Query:  YDDFSNQLDFGNLP
        + D+ + ++  N P
Subjt:  YDDFSNQLDFGNLP

A0A6J1DZC3 uncharacterized protein LOC111024449 | e value: 3.3e-17 | % identity: 28.95
Query:  KSQTLPQQNKQALPQQNSKSSLEAMMKEYMARTDVTIQS---------------NQASLRALELQVGQLANELKAQPQGNIPSYIEHLIREGKKQMQAVT
        K+ + P Q   +  +   K  + +MMKE+  R D  +Q                N A++R LE Q+GQLA+ELK +P+G +PS  E    EG++  + +T
Subjt:  KSQTLPQQNKQALPQQNSKSSLEAMMKEYMARTDVTIQS---------------NQASLRALELQVGQLANELKAQPQGNIPSYIEHLIREGKKQMQAVT

Query:  LRSGKPLEERK-----------------KPSIPQEVE--------------------------NSSDSNVVEKEL-------------------------
         RSG   EE K                 +P  P E+E                              SNV + ++                         
Subjt:  LRSGKPLEERK-----------------KPSIPQEVE--------------------------NSSDSNVVEKEL-------------------------

Query:  ----------------ESGIGEARPTTVTLQLADRSITYPEGKIEDVLV-----------------------------------------QGELTMNVHD
                        +  IG+A PTTVTLQLADRSIT PEGKIEDVLV                                         +GELTM+V D
Subjt:  ----------------ESGIGEARPTTVTLQLADRSITYPEGKIEDVLV-----------------------------------------QGELTMNVHD

Query:  QEVKFNMFDAMKYPNDIEDCSCIQDLEIGGLEYEHKDVAHIE
        Q+V FNM DAMKYP+D+E+C+ I  +  G   YE  D+ + E
Subjt:  QEVKFNMFDAMKYPNDIEDCSCIQDLEIGGLEYEHKDVAHIE

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGTTGATGCTTTTGCGGGAGGGGACCTTTTGGCAAAAACTTTTAATGAAGTTTATGAGATTTTAGAGAGAATATCAACCTACAGTTGTCAATGGTCAAATGTTAGAGG
CTTTAGTAAGAAAGTTAAGAGTGTGATAGAAGTTGATTGTGTGTCTACCATTAGGGCTGATATTGCAATATGGGCAAGGAAGCAATCATCAAAACAACAAAAGGTGAACC
AGCTAGGATTTGCTAAATCACAGACATTGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCAAAGAGTTCTCTTGAGGCGATGATGAAAGAATATATGGCTCGT
ACAGATGTCACAATTCAAAGTAATCAAGCTTCATTGAGAGCCCTAGAGTTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCACAACCTCAAGGGAACATTCCTTCATA
TATTGAACACCTTATAAGGGAAGGTAAGAAGCAGATGCAAGCAGTGACTTTAAGGAGTGGTAAGCCACTAGAAGAGAGGAAAAAGCCTAGTATACCCCAGGAAGTAGAAA
ATAGTAGTGATAGTAATGTTGTCGAAAAAGAGTTGGAGTCTGGAATAGGTGAGGCTAGGCCTACCACAGTCACACTTCAATTAGCAGATAGGTCTATCACATATCCTGAG
GGTAAAATTGAGGATGTTCTAGTCCAGGGGGAGCTTACAATGAATGTGCATGACCAAGAGGTGAAGTTTAATATGTTTGATGCAATGAAATATCCTAATGATATTGAGGA
TTGCTCGTGCATTCAGGATTTGGAAATAGGTGGATTGGAGTATGAGCATAAAGATGTAGCTCATATTGAGACAGTGAAAACACCTTGGTATGATGACTTTTCCAATCAAC
TTGATTTTGGAAATTTGCCTCCTGGTTTATCAAAAGAACAGATGAAAGAATTTTTCCATGGGGTGAAGTTTTATTTATCAAATGATGCATCAGTGGTTAGACAATGTGTT
GATGAAGGCTGTGAGTTTAAGAAAGTGCAGCGAATAAAACATTACTGGAGGGAGGAGTTTCATTCGAAATATTCTTCTTTGAAGTTGCTAGCAGATTTAGGGAACGCAGG
GAAAAAAGCACGAGCAAAGAAAGAAAGAGAGACGAAAGAAGAAGTTCAGCCGGATCCCACTGTTGATCAACTACAATGGGAGTTTTACGCCAACATCGATGAAAATGAAG
GATTCTTGGTTATTGTTCGTAGAATTGTTATCGACTGGAGCCATTGGAGGTTGTCTAGTAAAGGGGCAAGAAAGTTCCAGTCAGCCTACATGAAGTGTGAGGCTAACACT
TGGCTCAACTTTGTCAAGCAGAGATTATTGCCTACAACGTACGACTTCAATGTCCCCCATGATCGAGTGCACTTGAAGGGTCAGAGTCCCAAAGAGCTCCAATGA
mRNA sequence
ATGGTTGATGCTTTTGCGGGAGGGGACCTTTTGGCAAAAACTTTTAATGAAGTTTATGAGATTTTAGAGAGAATATCAACCTACAGTTGTCAATGGTCAAATGTTAGAGG
CTTTAGTAAGAAAGTTAAGAGTGTGATAGAAGTTGATTGTGTGTCTACCATTAGGGCTGATATTGCAATATGGGCAAGGAAGCAATCATCAAAACAACAAAAGGTGAACC
AGCTAGGATTTGCTAAATCACAGACATTGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCAAAGAGTTCTCTTGAGGCGATGATGAAAGAATATATGGCTCGT
ACAGATGTCACAATTCAAAGTAATCAAGCTTCATTGAGAGCCCTAGAGTTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCACAACCTCAAGGGAACATTCCTTCATA
TATTGAACACCTTATAAGGGAAGGTAAGAAGCAGATGCAAGCAGTGACTTTAAGGAGTGGTAAGCCACTAGAAGAGAGGAAAAAGCCTAGTATACCCCAGGAAGTAGAAA
ATAGTAGTGATAGTAATGTTGTCGAAAAAGAGTTGGAGTCTGGAATAGGTGAGGCTAGGCCTACCACAGTCACACTTCAATTAGCAGATAGGTCTATCACATATCCTGAG
GGTAAAATTGAGGATGTTCTAGTCCAGGGGGAGCTTACAATGAATGTGCATGACCAAGAGGTGAAGTTTAATATGTTTGATGCAATGAAATATCCTAATGATATTGAGGA
TTGCTCGTGCATTCAGGATTTGGAAATAGGTGGATTGGAGTATGAGCATAAAGATGTAGCTCATATTGAGACAGTGAAAACACCTTGGTATGATGACTTTTCCAATCAAC
TTGATTTTGGAAATTTGCCTCCTGGTTTATCAAAAGAACAGATGAAAGAATTTTTCCATGGGGTGAAGTTTTATTTATCAAATGATGCATCAGTGGTTAGACAATGTGTT
GATGAAGGCTGTGAGTTTAAGAAAGTGCAGCGAATAAAACATTACTGGAGGGAGGAGTTTCATTCGAAATATTCTTCTTTGAAGTTGCTAGCAGATTTAGGGAACGCAGG
GAAAAAAGCACGAGCAAAGAAAGAAAGAGAGACGAAAGAAGAAGTTCAGCCGGATCCCACTGTTGATCAACTACAATGGGAGTTTTACGCCAACATCGATGAAAATGAAG
GATTCTTGGTTATTGTTCGTAGAATTGTTATCGACTGGAGCCATTGGAGGTTGTCTAGTAAAGGGGCAAGAAAGTTCCAGTCAGCCTACATGAAGTGTGAGGCTAACACT
TGGCTCAACTTTGTCAAGCAGAGATTATTGCCTACAACGTACGACTTCAATGTCCCCCATGATCGAGTGCACTTGAAGGGTCAGAGTCCCAAAGAGCTCCAATGA
Protein sequence
MVDAFAGGDLLAKTFNEVYEILERISTYSCQWSNVRGFSKKVKSVIEVDCVSTIRADIAIWARKQSSKQQKVNQLGFAKSQTLPQQNKQALPQQNSKSSLEAMMKEYMAR
TDVTIQSNQASLRALELQVGQLANELKAQPQGNIPSYIEHLIREGKKQMQAVTLRSGKPLEERKKPSIPQEVENSSDSNVVEKELESGIGEARPTTVTLQLADRSITYPE
GKIEDVLVQGELTMNVHDQEVKFNMFDAMKYPNDIEDCSCIQDLEIGGLEYEHKDVAHIETVKTPWYDDFSNQLDFGNLPPGLSKEQMKEFFHGVKFYLSNDASVVRQCV
DEGCEFKKVQRIKHYWREEFHSKYSSLKLLADLGNAGKKARAKKERETKEEVQPDPTVDQLQWEFYANIDENEGFLVIVRRIVIDWSHWRLSSKGARKFQSAYMKCEANT
WLNFVKQRLLPTTYDFNVPHDRVHLKGQSPKELQ
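
The protein sequence is the translation of the CDS listed in this record. As a rough sketch, the snippet below translates a nucleotide string with the standard genetic code and checks the first ten codons of the CDS against the start of the protein. It assumes the standard nuclear code applies to this gene; `translate` and `CODON_TABLE` are hypothetical names, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest). Assumption: the standard code applies here.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence in this record.
cds_start = "ATGGTTGATGCTTTTGCGGGAGGGGACCTT"
print(translate(cds_start))  # MVDAFAGGDL
```

The same function can be applied to the full CDS, with line breaks removed, to compare against the complete protein sequence.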