; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041477 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041477
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr13:18665505..18668484
RNA-Seq ExpressionLag0041477
SyntenyLag0041477
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG9450307.1 hypothetical protein H6P81_010272 [Aristolochia fimbriata]9.9e-3033.13Show/hide
Query:  MASTSYQWPTERASSSKKADIYEIDESNSLKEQMASFTNALNKLTSSEIAIAVNTL--------------QKGKFPSDTEINPREQCKVIALRSGRQLEN
        M+S  YQ+P ER++  + A I  +D   +L+ Q+ S TN L+KL    +   +  +               +G  PS++E NPREQ K I LRSG+ LE 
Subjt:  MASTSYQWPTERASSSKKADIYEIDESNSLKEQMASFTNALNKLTSSEIAIAVNTL--------------QKGKFPSDTEINPREQCKVIALRSGRQLEN

Query:  RLNVEKQKKEEKRSLDEDKGTE---AQKNLHSQVPKSSLVSHQPAQEEGQVSTFDYRELPFSQRFKNVKLDEQFAR-------------------NIGLT
        + +   Q +E+     +DK  E    ++ +  Q  +    S   + +   V T     LP+  R K  KL+++F++                    + L 
Subjt:  RLNVEKQKKEEKRSLDEDKGTE---AQKNLHSQVPKSSLVSHQPAQEEGQVSTFDYRELPFSQRFKNVKLDEQFAR-------------------NIGLT

Query:  GLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGKVEISVHTGKLTLSVDDEKIVFNI-----FGHDESVCS
         LK+T+I LQ ADRS   P  ++ +VLV++ KFI+P DF+VLDM+ D  +S+ILGRPFLAT    I V  GKLTL ++DE+IVF+I        +   C 
Subjt:  GLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGKVEISVHTGKLTLSVDDEKIVFNI-----FGHDESVCS

Query:  IHTCFSIGLDLLLDEDEEEESNFGLELEG
        + +C     D L+ ++ E+     LE EG
Subjt:  IHTCFSIGLDLLLDEDEEEESNFGLELEG

OIT36514.1 hypothetical protein A4A49_13529 [Nicotiana attenuata]3.1e-3138.52Show/hide
Query:  EQMASFTNALNKL--TSSEIAIAVNTLQKGKFPSDTEINPREQCKVIALRSGRQLENRLNVEKQKKEEKRSLDEDKGTEAQKNLHSQVPKSSLVSHQPAQ
        E+M S  +AL  L    S++A  V+   +G  PS+TE NP+E  K I LRSG++L+   NV+++K + ++ +++ K  E      S+  +   V  +  +
Subjt:  EQMASFTNALNKL--TSSEIAIAVNTLQKGKFPSDTEINPREQCKVIALRSGRQLENRLNVEKQKKEEKRSLDEDKGTEAQKNLHSQVPKSSLVSHQPAQ

Query:  EEGQVSTFDYRELPFSQRFKNVKLDEQFARNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGKVE
           ++       +PF Q+ K  KLD +FA+ + L  +KD  ++LQ AD+    P  I+ NVLV+V+KF+FPVDFIVL+M+E  +  IILGRPFLATG+  
Subjt:  EEGQVSTFDYRELPFSQRFKNVKLDEQFARNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGKVE

Query:  ISVHTGKLTLSVDDEKIVFNI-----FGHDESVCSIHTCFSIGL
        I VH G+L L  D+E+++F++     F  DE   S ++CFSI +
Subjt:  ISVHTGKLTLSVDDEKIVFNI-----FGHDESVCSIHTCFSIGL

XP_022864267.1 uncharacterized protein LOC111384238 [Olea europaea var. sylvestris]1.9e-2831.14Show/hide
Query:  MASTSYQWPTERASSSKKADIYEIDESNSLKEQMASFTNALNKLTS------SEIAIAV------------NTLQ----------------KGKFPSDTE
        +A+ + QWP+ER S  K A ++E+D   +L  Q+ S TN +  LT+      +++ + +            NTLQ                +GKFPSDTE
Subjt:  MASTSYQWPTERASSSKKADIYEIDESNSLKEQMASFTNALNKLTS------SEIAIAV------------NTLQ----------------KGKFPSDTE

Query:  INPREQCKVIALRSGRQL-----ENRLNVEKQKKEEKRSLDEDKGTEAQKNLH------------------------SQVP------------KSSLVSH
        +NPREQCK I LR G+Q+         N    K+EE+  ++++K  +  K  H                         Q+P            K  L  +
Subjt:  INPREQCKVIALRSGRQL-----ENRLNVEKQKKEEKRSLDEDKGTEAQKNLH------------------------SQVP------------KSSLVSH

Query:  QPAQEEGQVSTFDYRELPFSQR----------FKNVKLDEQFA--------------RNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPV
        +  +   + S    ++LP  ++            +   D+                 R +GL  +K   +TLQLADRS+T+P  I+ +VLV V+KFIFP 
Subjt:  QPAQEEGQVSTFDYRELPFSQR----------FKNVKLDEQFA--------------RNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPV

Query:  DFIVLDMQEDREVSIILGRPFLATGKVEISVHTGKLTLSVDDEKIVFNIF
        DF+VLDM+ED+++ +ILGRPFLA G+  I V  G LTL V++E++ FNI+
Subjt:  DFIVLDMQEDREVSIILGRPFLATGKVEISVHTGKLTLSVDDEKIVFNIF

XP_023876781.1 uncharacterized protein LOC111989228 [Quercus suber]3.4e-3033.54Show/hide
Query:  MASTSYQWPTERASSSKKADIYEIDESNSLKEQMASFTNAL----------NKLTS--------SEIAIAVNTLQKGKFPSDTEINPREQCKVIALRSGR
        +AS++YQWPTERA   K A++ E+D   SL  QMA+ +  L          N L S        S+++  +     G  PS+T  N +E  K I+L+SGR
Subjt:  MASTSYQWPTERASSSKKADIYEIDESNSLKEQMASFTNAL----------NKLTS--------SEIAIAVNTLQKGKFPSDTEINPREQCKVIALRSGR

Query:  QLENRLNVEKQKKEEKRSLDEDKGTE----------AQKNLHSQVPKSSLVSHQPAQEEGQVSTFDYRELPFSQRFKNVKLDE-----------------
          E  L V   K++E    +E K  E          A+K +  +    S    +   EE +            Q+  + KL +                 
Subjt:  QLENRLNVEKQKKEEKRSLDEDKGTE----------AQKNLHSQVPKSSLVSHQPAQEEGQVSTFDYRELPFSQRFKNVKLDE-----------------

Query:  --------------QFARNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGKVEISVHTGKLTLSV
                       F R +GL  +K T I+LQLADRS+ +P  ++ NVLVKV+KFIF VDFIVLD+ ED E+ +IL RPFLA G+  I V  GKL L V
Subjt:  --------------QFARNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGKVEISVHTGKLTLSV

Query:  DDEKIVFNIFGHDESVCSIHTCFSI
         ++++ F++F   E     H+CF I
Subjt:  DDEKIVFNIFGHDESVCSIHTCFSI

XP_024022201.1 uncharacterized protein LOC112091842 [Morus notabilis]2.4e-2831.1Show/hide
Query:  MASTSYQWPTERASSSKKADIYEIDESNSLKEQMASFTNALNK--LTSSEIAIA-------------------------------------------VNT
        MA+ +YQWP+ER+   K A ++EID   +L  Q+AS +  L K  LT++ I I+                                            + 
Subjt:  MASTSYQWPTERASSSKKADIYEIDESNSLKEQMASFTNALNK--LTSSEIAIA-------------------------------------------VNT

Query:  LQKGKFPSDTEINP----REQCKVIALRSGRQLENRLNVEKQKKEEKR----------------------SLDEDKGTEA-----------QKNLHSQVP
         Q+G  PS +E+NP    +E CK + LRSG++LE  + +EK  K+E+R                      +L   K  EA           Q+ LH  +P
Subjt:  LQKGKFPSDTEINP----REQCKVIALRSGRQLENRLNVEKQKKEEKR----------------------SLDEDKGTEA-----------QKNLHSQVP

Query:  KSSLVSHQPAQEE-------GQVSTFDYRELPFSQRFKNVKLDEQFA--------------RNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKF
         +  +   P+  +        +    DY ++  ++ F N   +++                + +GL   + T +TLQLADRS+ HP  ++ +VLVKV+KF
Subjt:  KSSLVSHQPAQEE-------GQVSTFDYRELPFSQRFKNVKLDEQFA--------------RNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKF

Query:  IFPVDFIVLDMQEDREVSIILGRPFLATGKVEISVHTGKLTLSV
        IFP DFIVLDM+ED+E+ IILGRPFLATG+  I+V  G+L L V
Subjt:  IFPVDFIVLDMQEDREVSIILGRPFLATGKVEISVHTGKLTLSV

TrEMBL top hitse value%identityAlignment
A0A314L6J8 Uncharacterized protein1.5e-3138.52Show/hide
Query:  EQMASFTNALNKL--TSSEIAIAVNTLQKGKFPSDTEINPREQCKVIALRSGRQLENRLNVEKQKKEEKRSLDEDKGTEAQKNLHSQVPKSSLVSHQPAQ
        E+M S  +AL  L    S++A  V+   +G  PS+TE NP+E  K I LRSG++L+   NV+++K + ++ +++ K  E      S+  +   V  +  +
Subjt:  EQMASFTNALNKL--TSSEIAIAVNTLQKGKFPSDTEINPREQCKVIALRSGRQLENRLNVEKQKKEEKRSLDEDKGTEAQKNLHSQVPKSSLVSHQPAQ

Query:  EEGQVSTFDYRELPFSQRFKNVKLDEQFARNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGKVE
           ++       +PF Q+ K  KLD +FA+ + L  +KD  ++LQ AD+    P  I+ NVLV+V+KF+FPVDFIVL+M+E  +  IILGRPFLATG+  
Subjt:  EEGQVSTFDYRELPFSQRFKNVKLDEQFARNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGKVE

Query:  ISVHTGKLTLSVDDEKIVFNI-----FGHDESVCSIHTCFSIGL
        I VH G+L L  D+E+++F++     F  DE   S ++CFSI +
Subjt:  ISVHTGKLTLSVDDEKIVFNI-----FGHDESVCSIHTCFSIGL

A0A438G6F9 Retrovirus-related Pol polyprotein from transposon 17.61.5e-2834.63Show/hide
Query:  EIAIAVNTLQKGKFPSDTEINPREQCKVIALRSGRQLENRLNVEKQKKEEKRSLDEDKGTEAQKNLHSQVPKSSLVSHQPAQEEGQVSTFDYRE------
        ++A  +N  Q G F S+TE+NP+EQCK I LR+GR++E        K+ +   +D + G    K    ++   +L      ++ G +    + +      
Subjt:  EIAIAVNTLQKGKFPSDTEINPREQCKVIALRSGRQLENRLNVEKQKKEEKRSLDEDKGTEAQKNLHSQVPKSSLVSHQPAQEEGQVSTFDYRE------

Query:  --LPFSQR--------FKNVK-LDEQFARNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGKVEI
          LP+ Q          K +K +  +  R +GL  +K T I+LQLA +S+ +P  I+ ++LVKV+KFIFP+DF+VLDM+ED+EV +ILGRPFLA G+  I
Subjt:  --LPFSQR--------FKNVK-LDEQFARNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGKVEI

Query:  SVHTGKLTLSVDDEKIVFNIFGHDESVCSIHTCFSIGLDLLLDEDEEEESNFGLELE
         V  G+LTL V+ ++++FNI+          TCF I ++ L     E ++   L ++
Subjt:  SVHTGKLTLSVDDEKIVFNIFGHDESVCSIHTCFSIGLDLLLDEDEEEESNFGLELE

A0A6P6SU28 Reverse transcriptase7.6e-2834.92Show/hide
Query:  EIAIAVNTLQKGKFPSDTEINPREQCKVIALRSGRQLENRLNVEKQKKEEKRSLDEDKGTEA------QKNLHSQVPKSS------------------LV
        +IA ++N   +G+ PS TE+NP+EQ + I LRSGRQLE+ L VE +K E ++  ++ +  EA      ++N   + P SS                  ++
Subjt:  EIAIAVNTLQKGKFPSDTEINPREQCKVIALRSGRQLENRLNVEKQKKEEKRSLDEDKGTEA------QKNLHSQVPKSS------------------LV

Query:  SHQPAQEE-------GQVSTFDYRELP-------------------FSQRFKNVKLDEQF-----ARNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLV
        + +   E+       G+ S     +LP                   FS+   ++           AR +GL  LK T+ITLQLADRS+ +P+ ++ NVL+
Subjt:  SHQPAQEE-------GQVSTFDYRELP-------------------FSQRFKNVKLDEQF-----ARNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLV

Query:  KVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGKVEISVHTGKLTLSVDDEKIVFNIFGHDESVCSIHTCFSIG-LDLLLDEDEEEESNFGLEL
        KV KFI PVDF+VLDM+ED  + IILGRPFLAT    I V  GKL   V +E++ FN+   ++        +SIG +D L    E  + NF L+L
Subjt:  KVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGKVEISVHTGKLTLSVDDEKIVFNIFGHDESVCSIHTCFSIG-LDLLLDEDEEEESNFGLEL

A0A6P8D6H7 uncharacterized protein LOC1162042452.0e-2837.23Show/hide
Query:  LKEQMASFTNALNKLTSSEIAIAVNTLQKGKFPSDTEINPREQCKVIALRSGRQLENRLNVEKQKKEEKRSLDEDKGTEAQKNLHSQVPKSSLVSHQPAQ
        L+ Q A+  N   ++  S+I+  ++    G  P++ E NP      I LRSG++L+    + ++ + E+ S ++DK    Q   +++  K  L   +   
Subjt:  LKEQMASFTNALNKLTSSEIAIAVNTLQKGKFPSDTEINPREQCKVIALRSGRQLENRLNVEKQKKEEKRSLDEDKGTEAQKNLHSQVPKSSLVSHQPAQ

Query:  EEGQVSTFDYRE-LPFSQRFKNVKL-DEQFARNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGK
               FD RE +  ++   ++ L      + +GL   K+T ITLQLADRS+ +P  IV NVLVKV+KFIFPVDFIVL+++EDREV +ILGRPFL TGK
Subjt:  EEGQVSTFDYRE-LPFSQRFKNVKL-DEQFARNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATGK

Query:  VEISVHTGKLTLSVDDEKIVFNIFGHDESVCSIHTCFSIG-LDLLLDEDEEEESNFGLELEGLPMIDDIFYPDE
          I V  GKLTL V DE+I FN++   +      +C++I  +D L+ E  EE++        L  +DD    DE
Subjt:  VEISVHTGKLTLSVDDEKIVFNIFGHDESVCSIHTCFSIG-LDLLLDEDEEEESNFGLELEGLPMIDDIFYPDE

A0A6P9EWB3 uncharacterized protein LOC1183487293.8e-2737.78Show/hide
Query:  IAVNTLQKGKFPSDTEINPREQCKVIALRSGRQLE-------NRLNVEKQKKEEKRSLDEDKGTEAQKNL--------HSQVPKSSLVSHQPAQEEGQVS
        + +N  Q+G FP++TE+NP+EQ K I LRS R++E       N  +  +   + K  ++ED  T  + +         +  +  + L   Q A E+    
Subjt:  IAVNTLQKGKFPSDTEINPREQCKVIALRSGRQLE-------NRLNVEKQKKEEKRSLDEDKGTEAQKNL--------HSQVPKSSLVSHQPAQEEGQVS

Query:  TFDYRELPFS----QRFKNVKLDEQFA-----RNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATG
            +++ F     ++F+ VKL E+ +     + + L  +K T I+LQLADRS+ +   I+ +VLV V+KFIFP DF+VLDM+ED+EV +ILG+PFLATG
Subjt:  TFDYRELPFS----QRFKNVKLDEQFA-----RNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGRPFLATG

Query:  KVEISVHTGKLTLSVDDEKIVFNIF
        +  I V  G+LTL V+ E+I+FNI+
Subjt:  KVEISVHTGKLTLSVDDEKIVFNIF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAACTAGTTACCAGTGGCCCACGGAGAGGGCATCATCATCAAAGAAGGCTGACATTTATGAAATTGATGAGTCGAACTCACTAAAGGAGCAAATGGCTTCCTT
CACGAATGCATTAAACAAGTTGACTTCTTCTGAGATAGCAATAGCGGTGAACACCCTTCAAAAGGGTAAGTTTCCCAGTGATACTGAGATCAACCCTCGAGAGCAATGTA
AGGTGATCGCACTCCGAAGTGGGAGACAACTGGAGAACCGCTTGAATGTAGAGAAGCAGAAGAAGGAAGAAAAAAGGAGCCTAGATGAAGACAAAGGGACTGAGGCACAA
AAAAACCTCCATAGTCAGGTCCCAAAATCCTCCTTGGTCTCACATCAGCCCGCCCAGGAGGAGGGGCAAGTGTCCACCTTTGATTACAGGGAGTTGCCTTTTTCCCAAAG
ATTTAAAAATGTTAAATTAGATGAGCAGTTTGCTAGGAATATTGGACTCACTGGACTTAAAGACACAGACATCACCCTCCAGCTAGCGGATCGATCAGTCACTCACCCGA
TGGAGATAGTGATGAATGTGTTGGTGAAGGTAAACAAGTTCATCTTTCCGGTAGACTTCATTGTGTTGGATATGCAGGAAGATAGAGAAGTGTCCATCATTCTTGGTAGA
CCATTTCTAGCTACTGGTAAGGTTGAAATTAGTGTCCATACAGGTAAGTTAACTCTTAGCGTAGATGATGAGAAGATAGTCTTCAATATCTTTGGTCATGACGAGTCAGT
TTGTAGCATACATACTTGCTTTTCTATTGGCCTAGATTTACTACTTGATGAGGATGAAGAAGAAGAATCAAACTTTGGCCTAGAGTTAGAAGGTCTTCCCATGATTGATG
ATATTTTTTATCCTGATGAATTCTTTGACGATGCCATGTATGAGAATGAACTGTTGAATAATGTTGAGCAACCTATTGTAAATGTTTATGAGTATAATTTGCCTACTTTA
AATCTAGAAAAGCATGAGTTAGATGATATTTTTGGTTCTGATATTAATGAGATAGAGGATAGCATTGGACAGTTTAACTCTTATTTGGCCTTGCTCCCAAGAACCGTTAG
AAGTCTAAGCGAGTACCGTTATGCACATTTCAATCCATTCCACCTGGAAGTTAATATCACTACTTGGGATGCGTTAGTTCAGGCTTTCCTTGCCAAATATTTTCCACTTG
CAAAGACAACTAAGTTGAGAATTGAGATTGGGACTTTTAACAGCTGGAGGCACTTTACTTTCCAAAACTGTGTAGAGTATACCTTGCTGCAGGACATGGCCACAAACAAC
TATTTGTGGCCAACTAAACGTTCATCTTCCAAATCTGTAGTTGGTGTCTATGAAATAGATAATGTGACTTCCCTTAAGGCTCAAATGTTTTCTTTAGCTAATGTTTTTCT
TAATTTTTCTACAACAAGGAATTCACAGTCCATTGAGTCAGTTGCAACAGCTGAAGATCTCAGTTTTATGAGGAGCAAAACGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCAACTAGTTACCAGTGGCCCACGGAGAGGGCATCATCATCAAAGAAGGCTGACATTTATGAAATTGATGAGTCGAACTCACTAAAGGAGCAAATGGCTTCCTT
CACGAATGCATTAAACAAGTTGACTTCTTCTGAGATAGCAATAGCGGTGAACACCCTTCAAAAGGGTAAGTTTCCCAGTGATACTGAGATCAACCCTCGAGAGCAATGTA
AGGTGATCGCACTCCGAAGTGGGAGACAACTGGAGAACCGCTTGAATGTAGAGAAGCAGAAGAAGGAAGAAAAAAGGAGCCTAGATGAAGACAAAGGGACTGAGGCACAA
AAAAACCTCCATAGTCAGGTCCCAAAATCCTCCTTGGTCTCACATCAGCCCGCCCAGGAGGAGGGGCAAGTGTCCACCTTTGATTACAGGGAGTTGCCTTTTTCCCAAAG
ATTTAAAAATGTTAAATTAGATGAGCAGTTTGCTAGGAATATTGGACTCACTGGACTTAAAGACACAGACATCACCCTCCAGCTAGCGGATCGATCAGTCACTCACCCGA
TGGAGATAGTGATGAATGTGTTGGTGAAGGTAAACAAGTTCATCTTTCCGGTAGACTTCATTGTGTTGGATATGCAGGAAGATAGAGAAGTGTCCATCATTCTTGGTAGA
CCATTTCTAGCTACTGGTAAGGTTGAAATTAGTGTCCATACAGGTAAGTTAACTCTTAGCGTAGATGATGAGAAGATAGTCTTCAATATCTTTGGTCATGACGAGTCAGT
TTGTAGCATACATACTTGCTTTTCTATTGGCCTAGATTTACTACTTGATGAGGATGAAGAAGAAGAATCAAACTTTGGCCTAGAGTTAGAAGGTCTTCCCATGATTGATG
ATATTTTTTATCCTGATGAATTCTTTGACGATGCCATGTATGAGAATGAACTGTTGAATAATGTTGAGCAACCTATTGTAAATGTTTATGAGTATAATTTGCCTACTTTA
AATCTAGAAAAGCATGAGTTAGATGATATTTTTGGTTCTGATATTAATGAGATAGAGGATAGCATTGGACAGTTTAACTCTTATTTGGCCTTGCTCCCAAGAACCGTTAG
AAGTCTAAGCGAGTACCGTTATGCACATTTCAATCCATTCCACCTGGAAGTTAATATCACTACTTGGGATGCGTTAGTTCAGGCTTTCCTTGCCAAATATTTTCCACTTG
CAAAGACAACTAAGTTGAGAATTGAGATTGGGACTTTTAACAGCTGGAGGCACTTTACTTTCCAAAACTGTGTAGAGTATACCTTGCTGCAGGACATGGCCACAAACAAC
TATTTGTGGCCAACTAAACGTTCATCTTCCAAATCTGTAGTTGGTGTCTATGAAATAGATAATGTGACTTCCCTTAAGGCTCAAATGTTTTCTTTAGCTAATGTTTTTCT
TAATTTTTCTACAACAAGGAATTCACAGTCCATTGAGTCAGTTGCAACAGCTGAAGATCTCAGTTTTATGAGGAGCAAAACGTAG
Protein sequenceShow/hide protein sequence
MASTSYQWPTERASSSKKADIYEIDESNSLKEQMASFTNALNKLTSSEIAIAVNTLQKGKFPSDTEINPREQCKVIALRSGRQLENRLNVEKQKKEEKRSLDEDKGTEAQ
KNLHSQVPKSSLVSHQPAQEEGQVSTFDYRELPFSQRFKNVKLDEQFARNIGLTGLKDTDITLQLADRSVTHPMEIVMNVLVKVNKFIFPVDFIVLDMQEDREVSIILGR
PFLATGKVEISVHTGKLTLSVDDEKIVFNIFGHDESVCSIHTCFSIGLDLLLDEDEEEESNFGLELEGLPMIDDIFYPDEFFDDAMYENELLNNVEQPIVNVYEYNLPTL
NLEKHELDDIFGSDINEIEDSIGQFNSYLALLPRTVRSLSEYRYAHFNPFHLEVNITTWDALVQAFLAKYFPLAKTTKLRIEIGTFNSWRHFTFQNCVEYTLLQDMATNN
YLWPTKRSSSKSVVGVYEIDNVTSLKAQMFSLANVFLNFSTTRNSQSIESVATAEDLSFMRSKT