CuGenDBv2

Lag0021224 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021224
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr7:5686494..5688150
RNA-Seq Expression: Lag0021224
Synteny: Lag0021224
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
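The genome location string above can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, assuming the `chrom:start..end` layout shown on this page and 1-based inclusive coordinates (an assumption; the page does not state its coordinate convention). The names `parse_location` and `span_length` are illustrative and not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr7:5686494..5688150")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 1657 bp on chr7.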


Homology
GenBank top hits (e value, % identity, alignment)
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] | e value: 1.9e-24 | % identity: 26.61
Query:  LGIPKPRRLCRDELIWHYEKNGFYSVRSGYNLACVLREMTSSSNS----------EKKKGVKIQNG-----------------------------SPRCK
        L IP       D LIWHYE+NG YSV+SGY LAC+ ++  S   S          +K   +KI N                               P C 
Subjt:  LGIPKPRRLCRDELIWHYEKNGFYSVRSGYNLACVLREMTSSSNS----------EKKKGVKIQNG-----------------------------SPRCK

Query:  IKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEE--FVVFCWWIWNRRNKEVFGGRVELNVEEGGWEWVSEYLAQF---
         K E+V HA+W C++ K +W+N  +        + S  +L  W  L +SSS  E+  F   CW +WNRRN  +F G+ E   +      +++   +F   
Subjt:  IKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEE--FVVFCWWIWNRRNKEVFGGRVELNVEEGGWEWVSEYLAQF---

Query:  ----QAFQGKKSGGAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKPAAVVIR-----------------------------DGLSLAKEAGFTNLEVE
                G++S     +     WRPP   I+K+N D   +     +   VV+R                             +GL  A + GFT   +E
Subjt:  ----QAFQGKKSGGAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKPAAVVIR-----------------------------DGLSLAKEAGFTNLEVE

Query:  SDSARAIALLRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPK
         D+   I  + S        G L++E+ ++  +      +W  RS N VAH LA+ A        W+EE P+
Subjt:  SDSARAIALLRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPK

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] | e value: 1.4e-27 | % identity: 30.94
Query:  LGIPKPRRLCRDELIWHYEKNGFYSVRSGYNLA-----CVLREMTSSSNSEK----------------------------------KKGVKIQNGSPRCK
        L IP  R    D LIW+YEK G YSVRSGY +A     CV    +SSS   +                                  K+GV+I N    C 
Subjt:  LGIPKPRRLCRDELIWHYEKNGFYSVRSGYNLA-----CVLREMTSSSNSEK----------------------------------KKGVKIQNGSPRCK

Query:  IKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGG--WEWVSEYLAQFQAF
           E   H  W CK  + LW N  F        +R   + L       S + FEE  V  W +WN+RN   F    +   + G    EW ++Y  +F+  
Subjt:  IKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGG--WEWVSEYLAQFQAF

Query:  QGKKSGGAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKPAAVVIRDGLSLAKEAGFTNLE-VES-DSARAIAL-----LRSE------ASDISEVGAL
        +     G V    E +W+PP++ I+K+NTDA       +    ++I +       A    LE ++S D A AIA      L SE        D+SE G +
Subjt:  QGKKSGGAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKPAAVVIRDGLSLAKEAGFTNLE-VES-DSARAIAL-----LRSE------ASDISEVGAL

Query:  VKEIK-WMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPKEVEDFYLYEVLD
        V + K + +Q LH  SF + KR  N  AH+LAR AL     SIW+E+ P E++     E L+
Subjt:  VKEIK-WMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPKEVEDFYLYEVLD

XP_023915286.1 uncharacterized protein LOC112026812 [Quercus suber] | e value: 4.5e-26 | % identity: 28.3
Query:  IPKPRRLCRDELIWHYEKNGFYSVRSGYNLA-CVLREMT---SSSNSEKKKGV-------KIQNG-----------------------------SPRCKI
        IP  RR   D ++W + K+G YSVRSGY+    ++RE +   +SSN      V       +I N                               P CK+
Subjt:  IPKPRRLCRDELIWHYEKNGFYSVRSGYNLA-CVLREMT---SSSNSEKKKGV-------KIQNG-----------------------------SPRCKI

Query:  KEETVFHAIWECKSVKTLWQN-PPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGGWEWVSEYLAQFQAFQG
          +T+ HA+WECK+ + +W        + G +   S+  L   +W  M    FE F+V CW +W+RRN+ VFGG ++   E G    +         F  
Subjt:  KEETVFHAIWECKSVKTLWQN-PPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGGWEWVSEYLAQFQAFQG

Query:  KKSGGAV--GIRREEVWRPPNDSIFKLNTD----------ALHQLIR-------------------GNKPAAVVIRDGLSLAKEAGFTNLEVESDSARAI
         ++  A+   +   + W+PP  S +KLN D           +  +IR                   G++   +  R GL  A EAGF +L VE D+   +
Subjt:  KKSGGAV--GIRREEVWRPPNDSIFKLNTD----------ALHQLIR-------------------GNKPAAVVIRDGLSLAKEAGFTNLEVESDSARAI

Query:  ALLRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIP
          + S  SD S +G LV +++ ++  L L  F+  +RS N VAH LAR A     + +W+E+ P
Subjt:  ALLRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIP

XP_023928118.1 uncharacterized protein LOC112039474 [Quercus suber] | e value: 6.5e-25 | % identity: 29.35
Query:  CKIKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGGW--EWVSEYLAQFQ
        C  + E   HA+WEC   K +W       +      + V  L   +   +S + FE FV+  W IWN+RN  VFGG+    +++  W  +W  E+L +F 
Subjt:  CKIKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGGW--EWVSEYLAQFQ

Query:  AFQGKKSGGAVGIRREEVWRPPNDSIFKLNTDA----------LHQLIRGNK-------------------PAAVVIRDGLSLAKEAGFTNLEVESDSAR
          QG+     V    + VWRPP DS FKLN DA          +  +IR                         +  R  +  A +AGFT+L VE D+  
Subjt:  AFQGKKSGGAVGIRREEVWRPPNDSIFKLNTDA----------LHQLIRGNK-------------------PAAVVIRDGLSLAKEAGFTNLEVESDSAR

Query:  AIALLRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPKEVEDFYLYE
         +  L +  +D+S +G ++++IKW+++     SF + +R+ N+VA+ LAR A +   +  W+E+ P  V +   Y+
Subjt:  AIALLRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPKEVEDFYLYE

XP_024956542.1 uncharacterized protein LOC112498908 [Citrus sinensis] | e value: 1.1e-24 | % identity: 27.12
Query:  IPKPRRLCRDELIWHYEKNGFYSVRSGYNLACVLR--EMTSSSNSEK----------------------------------KKGVKIQNGSPRCKIKEET
        IP PRRL  DELIWH+ K+G Y+V+SGY  A  +R   M SSS S K                                  K+ +  +     CK+  E 
Subjt:  IPKPRRLCRDELIWHYEKNGFYSVRSGYNLACVLR--EMTSSSNSEK----------------------------------KKGVKIQNGSPRCKIKEET

Query:  VFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGGWEWVSEYLAQFQAFQGKKSGG
        VFHA+ +CK+ K +W+   FY     +  + +  LL  V    S++  + F V  W  WN RN+ +F G+      E     V++  A  +A++  +   
Subjt:  VFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGGWEWVSEYLAQFQAFQGKKSGG

Query:  AVGIRREEV-----WRPPNDSIFKLNTDALHQLIRGNKPAAVVIRD-----------------------------GLSLAKEAGFTNLEVESDSARAIAL
         V   +++      W PP +   K+NTDA     +       VIRD                             GL +AK+A   ++ +ESDS   ++L
Subjt:  AVGIRREEV-----WRPPNDSIFKLNTDALHQLIRGNKPAAVVIRD-----------------------------GLSLAKEAGFTNLEVESDSARAIAL

Query:  LRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPKEV
        + +     SE+  +V EI+ + +     S  +  RS N +AH L ++ALE+    +W    P +V
Subjt:  LRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPKEV

TrEMBL top hits (e value, % identity, alignment)
A0A1U8NYQ0 uncharacterized protein LOC107952360 | e value: 1.9e-22 | % identity: 25.27
Query:  IPKPRRLCRDELIWHYEKNGFYSVRSGY--------------NLACVLREMTSSSNSEKKK------------------GVKIQNGS--PRCKIKEETVF
        IP  +  C D  +W  EK G Y+VRSGY              ++  V +++ S S   K K                    +I+N +   RC +  E++ 
Subjt:  IPKPRRLCRDELIWHYEKNGFYSVRSGY--------------NLACVLREMTSSSNSEKKK------------------GVKIQNGS--PRCKIKEETVF

Query:  HAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWL--HMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGGWEWVSEYLAQFQAFQG-KKSG
        H + EC +VK +W           S +   T  LW++ L  H     +E+ V+  W IW  RNK+V  G     +       +++ L+  +  +   +  
Subjt:  HAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWL--HMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGGWEWVSEYLAQFQAFQG-KKSG

Query:  GAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKPAAVVIRD-----------------------------GLSLAKEAGFTNLEVESDSARAIALLRSE
          +     + WRPP     KLN DA ++       +  +IRD                             GL  AKE GFT +EVE DS   I  +  E
Subjt:  GAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKPAAVVIRD-----------------------------GLSLAKEAGFTNLEVESDSARAIALLRSE

Query:  ASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPKEVEDFYLYE
            +++ +++ +IK M +  H   F+  +R  N VAH +AR  + R   + W+E+ P  V D  + E
Subjt:  ASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPKEVEDFYLYE

A0A5C7IST2 RNase H domain-containing protein | e value: 6.6e-23 | % identity: 25.67
Query:  LGIPKPRRLCRDELIWHYEKNGFYSVRSGYNLACVLREMTSSSNSEKKKGVKIQNGSPRCKIKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTD
        L IP    L RD  +WH+ K+G ++V+S Y +A    +   +  S            P C+   ETV HA+W CKSVK  W + P +       I     
Subjt:  LGIPKPRRLCRDELIWHYEKNGFYSVRSGYNLACVLREMTSSSNSEKKKGVKIQNGSPRCKIKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTD

Query:  LLWWVWLHMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGGWEWVSEYLAQFQAFQGKKSGGAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKP
         + WV    +      F+   W +WN RN+ +F  R +       W   + +L+  +  + + +  A+     + W PP+    K+N DA   + R    
Subjt:  LLWWVWLHMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGGWEWVSEYLAQFQAFQGKKSGGAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKP

Query:  AAVVIRD-----------------------------GLSLAKEAGFTNLEVESDSARAIALLRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNN
          +VIRD                             GL LA  +G   L +ESDS   + L   E S  ++V  ++ +I+++       S  +  RS N 
Subjt:  AAVVIRD-----------------------------GLSLAKEAGFTNLEVESDSARAIALLRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNN

Query:  VAHVLARVALERRRESIWVEEIPKEVEDFYLYEVL
        VAH +AR A+      I  +  P  ++   L +VL
Subjt:  VAHVLARVALERRRESIWVEEIPKEVEDFYLYEVL

A0A5E4FZN9 PREDICTED: retrotransposon | e value: 9.2e-25 | % identity: 26.61
Query:  LGIPKPRRLCRDELIWHYEKNGFYSVRSGYNLACVLREMTSSSNS----------EKKKGVKIQNG-----------------------------SPRCK
        L IP       D LIWHYE+NG YSV+SGY LAC+ ++  S   S          +K   +KI N                               P C 
Subjt:  LGIPKPRRLCRDELIWHYEKNGFYSVRSGYNLACVLREMTSSSNS----------EKKKGVKIQNG-----------------------------SPRCK

Query:  IKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEE--FVVFCWWIWNRRNKEVFGGRVELNVEEGGWEWVSEYLAQF---
         K E+V HA+W C++ K +W+N  +        + S  +L  W  L +SSS  E+  F   CW +WNRRN  +F G+ E   +      +++   +F   
Subjt:  IKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEE--FVVFCWWIWNRRNKEVFGGRVELNVEEGGWEWVSEYLAQF---

Query:  ----QAFQGKKSGGAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKPAAVVIR-----------------------------DGLSLAKEAGFTNLEVE
                G++S     +     WRPP   I+K+N D   +     +   VV+R                             +GL  A + GFT   +E
Subjt:  ----QAFQGKKSGGAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKPAAVVIR-----------------------------DGLSLAKEAGFTNLEVE

Query:  SDSARAIALLRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPK
         D+   I  + S        G L++E+ ++  +      +W  RS N VAH LA+ A        W+EE P+
Subjt:  SDSARAIALLRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPK

A0A6J1DAR4 uncharacterized protein LOC111018954 | e value: 6.8e-28 | % identity: 30.94
Query:  LGIPKPRRLCRDELIWHYEKNGFYSVRSGYNLA-----CVLREMTSSSNSEK----------------------------------KKGVKIQNGSPRCK
        L IP  R    D LIW+YEK G YSVRSGY +A     CV    +SSS   +                                  K+GV+I N    C 
Subjt:  LGIPKPRRLCRDELIWHYEKNGFYSVRSGYNLA-----CVLREMTSSSNSEK----------------------------------KKGVKIQNGSPRCK

Query:  IKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGG--WEWVSEYLAQFQAF
           E   H  W CK  + LW N  F        +R   + L       S + FEE  V  W +WN+RN   F    +   + G    EW ++Y  +F+  
Subjt:  IKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIRSVTDLLWWVWLHMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGG--WEWVSEYLAQFQAF

Query:  QGKKSGGAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKPAAVVIRDGLSLAKEAGFTNLE-VES-DSARAIAL-----LRSE------ASDISEVGAL
        +     G V    E +W+PP++ I+K+NTDA       +    ++I +       A    LE ++S D A AIA      L SE        D+SE G +
Subjt:  QGKKSGGAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKPAAVVIRDGLSLAKEAGFTNLE-VES-DSARAIAL-----LRSE------ASDISEVGAL

Query:  VKEIK-WMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPKEVEDFYLYEVLD
        V + K + +Q LH  SF + KR  N  AH+LAR AL     SIW+E+ P E++     E L+
Subjt:  VKEIK-WMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPKEVEDFYLYEVLD

A0A803P9R9 Uncharacterized protein | e value: 2.1e-24 | % identity: 29
Query:  LGIPKPRRLCRDELIWHYEKNGFYSVRSGYNLACVLREMTSSSNSEK----KKGVKIQNGSPRCKIKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIR
        LGI   R    DEL+WH   NG Y V SGY L CV ++   +SN       K+G+KI+     C  K+ET+ HA+W C S+K +W+   F+    P+++ 
Subjt:  LGIPKPRRLCRDELIWHYEKNGFYSVRSGYNLACVLREMTSSSNSEK----KKGVKIQNGSPRCKIKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIR

Query:  SVTDLLWWVWL---HMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGGW-EWVSEYLAQFQAFQGKKSGGAVGIRREEVWRPPNDSIFKLNTDA--
         + DLL ++       S  +F+ F+   W +W++RN+ +F  +   N     W  W  +YL          S  +   +R  +W PP      +N DA  
Subjt:  SVTDLLWWVWL---HMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGGW-EWVSEYLAQFQAFQGKKSGGAVGIRREEVWRPPNDSIFKLNTDA--

Query:  -LHQL----------IRGNKPAAVV----------------IRDGLSLAKEAGFTNLEVESDSARAIALLRSEASDISEVGALVKEIKWMSQDLHLCSFR
          HQ             GN  AA V                I+ G+ L ++       + SDS   I  L S+ +  +E G  + EI   +  +    F+
Subjt:  -LHQL----------IRGNKPAAVV----------------IRDGLSLAKEAGFTNLEVESDSARAIALLRSEASDISEVGALVKEIKWMSQDLHLCSFR

Query:  WCKRSTNNVAHVLARVALERRRESIWVEEIP
        +  R  N VAH LA+ A+ +R  S W E +P
Subjt:  WCKRSTNNVAHVLARVALERRRESIWVEEIP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G09510.1 Ribonuclease H-like superfamily protein | e value: 3.6e-05 | % identity: 20.17
Query:  DELIWHYEKNGFYSVRSGY---------NLACV-----------------------------LREMTSSSNSEKKKGVKIQNGSPRCKIKEETVFHAIWE
        D++IW+Y   G Y+VRSGY         N+  +                             L +  +++     +G++I    PRC  + E++ HA++ 
Subjt:  DELIWHYEKNGFYSVRSGY---------NLACV-----------------------------LREMTSSSNSEKKKGVKIQNGSPRCKIKEETVFHAIWE

Query:  CKSVKTLWQNPPFYPRPGPSTIR----------SVTDLLWWVWLHMSSSRFEEF--VVFCWWIWNRRNKEVFG------GRVELNVEEGGWEWVSEYLAQ
        C      W+          S IR          +++++L +V    + S F +   V   W IW  RN  VF        +  L+ +    +W    L  
Subjt:  CKSVKTLWQNPPFYPRPGPSTIR----------SVTDLLWWVWLHMSSSRFEEF--VVFCWWIWNRRNKEVFG------GRVELNVEEGGWEWVSEYLAQ

Query:  FQAFQGKKSGGAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKPAAVVIRD---------GLSLAKEA--------------------GFTNLEVESDS
         Q+ +   S        +  WR P  +  K N DA   + +       +IR+          + LA  +                    G+T + +E D 
Subjt:  FQAFQGKKSGGAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKPAAVVIRD---------GLSLAKEA--------------------GFTNLEVESDS

Query:  ARAIALLRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLAR
           I L+    S  S +   +++I + +       F + +R  N +AHVLA+
Subjt:  ARAIALLRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLAR


Sequences
CDS sequence
ATGAATCTTTCTGTCAAGAGGATGCGAGGGCAAACACTTGGGATCCCAAAACCAAGAAGGTTGTGTAGAGATGAGTTGATATGGCACTACGAGAAGAACGGTTTTTACTC
TGTTAGGAGTGGTTATAACCTAGCTTGTGTATTGAGAGAGATGACTTCGAGCTCAAATTCGGAGAAGAAGAAGGGGGTGAAAATTCAAAATGGTAGCCCCAGATGTAAGA
TCAAAGAAGAAACAGTGTTTCATGCTATTTGGGAGTGCAAATCAGTAAAGACGTTGTGGCAAAATCCTCCTTTTTACCCAAGACCAGGCCCAAGTACTATCAGAAGTGTG
ACAGATCTATTATGGTGGGTTTGGCTTCATATGTCGAGCAGCAGGTTTGAGGAATTTGTGGTATTCTGCTGGTGGATCTGGAATAGAAGGAATAAGGAGGTATTCGGGGG
CAGGGTGGAGCTGAATGTGGAGGAAGGGGGATGGGAATGGGTGTCGGAATATTTGGCTCAGTTTCAGGCCTTTCAGGGTAAGAAGAGTGGAGGTGCAGTGGGGATTAGGA
GGGAGGAAGTTTGGAGACCACCGAATGATTCTATATTTAAACTTAACACAGATGCATTGCATCAATTGATCAGAGGCAACAAACCAGCAGCTGTGGTGATTCGAGATGGG
TTGAGCCTAGCAAAGGAGGCGGGATTCACGAACTTGGAGGTAGAGTCGGATTCTGCTCGAGCGATTGCTCTATTGAGATCTGAGGCGAGTGATATCTCGGAAGTTGGAGC
TCTGGTAAAGGAAATCAAATGGATGAGTCAAGATCTTCATCTCTGTTCTTTCCGATGGTGTAAAAGATCAACAAACAATGTGGCCCATGTACTGGCACGAGTGGCGCTTG
AAAGAAGGCGTGAGAGCATCTGGGTTGAAGAAATCCCAAAAGAAGTCGAGGATTTTTATCTCTATGAGGTTTTGGACAGACCCTCTGGTCTGTGGGATGAAGTGGGCGAC
GTTGAGAATGTGATAGCGATATCGGAAAGGAACGTGACGGACCAATGA
mRNA sequence
ATGAATCTTTCTGTCAAGAGGATGCGAGGGCAAACACTTGGGATCCCAAAACCAAGAAGGTTGTGTAGAGATGAGTTGATATGGCACTACGAGAAGAACGGTTTTTACTC
TGTTAGGAGTGGTTATAACCTAGCTTGTGTATTGAGAGAGATGACTTCGAGCTCAAATTCGGAGAAGAAGAAGGGGGTGAAAATTCAAAATGGTAGCCCCAGATGTAAGA
TCAAAGAAGAAACAGTGTTTCATGCTATTTGGGAGTGCAAATCAGTAAAGACGTTGTGGCAAAATCCTCCTTTTTACCCAAGACCAGGCCCAAGTACTATCAGAAGTGTG
ACAGATCTATTATGGTGGGTTTGGCTTCATATGTCGAGCAGCAGGTTTGAGGAATTTGTGGTATTCTGCTGGTGGATCTGGAATAGAAGGAATAAGGAGGTATTCGGGGG
CAGGGTGGAGCTGAATGTGGAGGAAGGGGGATGGGAATGGGTGTCGGAATATTTGGCTCAGTTTCAGGCCTTTCAGGGTAAGAAGAGTGGAGGTGCAGTGGGGATTAGGA
GGGAGGAAGTTTGGAGACCACCGAATGATTCTATATTTAAACTTAACACAGATGCATTGCATCAATTGATCAGAGGCAACAAACCAGCAGCTGTGGTGATTCGAGATGGG
TTGAGCCTAGCAAAGGAGGCGGGATTCACGAACTTGGAGGTAGAGTCGGATTCTGCTCGAGCGATTGCTCTATTGAGATCTGAGGCGAGTGATATCTCGGAAGTTGGAGC
TCTGGTAAAGGAAATCAAATGGATGAGTCAAGATCTTCATCTCTGTTCTTTCCGATGGTGTAAAAGATCAACAAACAATGTGGCCCATGTACTGGCACGAGTGGCGCTTG
AAAGAAGGCGTGAGAGCATCTGGGTTGAAGAAATCCCAAAAGAAGTCGAGGATTTTTATCTCTATGAGGTTTTGGACAGACCCTCTGGTCTGTGGGATGAAGTGGGCGAC
GTTGAGAATGTGATAGCGATATCGGAAAGGAACGTGACGGACCAATGA
Protein sequence
MNLSVKRMRGQTLGIPKPRRLCRDELIWHYEKNGFYSVRSGYNLACVLREMTSSSNSEKKKGVKIQNGSPRCKIKEETVFHAIWECKSVKTLWQNPPFYPRPGPSTIRSV
TDLLWWVWLHMSSSRFEEFVVFCWWIWNRRNKEVFGGRVELNVEEGGWEWVSEYLAQFQAFQGKKSGGAVGIRREEVWRPPNDSIFKLNTDALHQLIRGNKPAAVVIRDG
LSLAKEAGFTNLEVESDSARAIALLRSEASDISEVGALVKEIKWMSQDLHLCSFRWCKRSTNNVAHVLARVALERRRESIWVEEIPKEVEDFYLYEVLDRPSGLWDEVGD
VENVIAISERNVTDQ
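
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and, to stay short, translates only the first 42 nt and the final 48 nt of the listed CDS. `translate` is an illustrative helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 42 nt and final 48 nt of the CDS listed above.
print(translate("ATGAATCTTTCTGTCAAGAGGATGCGAGGGCAAACACTTGGG"))
# MNLSVKRMRGQTLG
print(translate("GTTGAGAATGTGATAGCGATATCGGAAAGGAACGTGACGGACCAATGA"))
# VENVIAISERNVTDQ
```

Both fragments match the start and end of the listed protein sequence. Passing the full CDS, with its line breaks, to `translate` would extend the check to the whole record.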