; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021610 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021610
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:9890688..9897696
RNA-Seq ExpressionLag0021610
SyntenyLag0021610
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO69928.1 reverse transcriptase [Corchorus capsularis]5.4e-3531.88Show/hide
Query:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------
        YKI++KV+VNR+K  L + ISENQSAFVPGR I DNI++ +E LHTL+S   GK+G+ ALKLDMSKAYDRVE  FLE + L+LG                
Subjt:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------

Query:  ---SIQNG----------GLRQGDSLSPYLFLFCSDALSALLSGHQ---------------------------------------------------FRK
            + NG          GLRQGD LS YLFLFC++AL A+LS  Q                                                   F K
Subjt:  ---SIQNG----------GLRQGDSLSPYLFLFCSDALSALLSGHQ---------------------------------------------------FRK

Query:  VAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLNSVFGKRCRVGRGDFSQWEGRRYAHDTSLLKAPI---------KSNCS--
         A+ F  NV  + R  L  + G+   D +  YLG+P+   RNRR  F  L     K+ +          GR       L   P          KS C+  
Subjt:  VAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLNSVFGKRCRVGRGDFSQWEGRRYAHDTSLLKAPI---------KSNCS--

Query:  ------FFWGAAFGFVVYYYRVCVNRFGGSGFHYSDKRVD-IDKLGEFVSVKEVKIIFVIPINSVNVKDKWIWHYTNSGEYTVKSGYKL
              F+W         Y+      F  + F  S   ++ +  L ++        I+    + + + D+ +WHY N G Y+V+SGY++
Subjt:  ------FFWGAAFGFVVYYYRVCVNRFGGSGFHYSDKRVD-IDKLGEFVSVKEVKIIFVIPINSVNVKDKWIWHYTNSGEYTVKSGYKL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.9e-4031.49Show/hide
Query:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------
        YKI++K I NR+K ++  +IS+ QSAFVP R+I DN+IIGHECLHT+ S  +G  G AALKLD+SKA+DRVE  +LE +  K+G                
Subjt:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------

Query:  --SIQ-NG----------GLRQGDSLSPYLFLFCSDALSALL------------------------------------------------------SGH-
          SI  NG          G+RQGD LSPYLFL C++ LSAL+                                                      SG  
Subjt:  --SIQ-NG----------GLRQGDSLSPYLFLFCSDALSALL------------------------------------------------------SGH-

Query:  -QFRKVAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLNSVFGKRC---RVGRGDFSQWEG----------------------
          F K A+ FSPNV    +  L+ IL +++V   G YLG+PS FTR R +  +     +G+ C     G  +F   EG                      
Subjt:  -QFRKVAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLNSVFGKRC---RVGRGDFSQWEG----------------------

Query:  ----RRYAHDTSLLKAPIKSNCSFFW-GAAFG--FVVYYYRVCVN----------------------RFGG-------SGFHYSDKRVDIDKLGEFVSVK
             +Y  DTSLL+A   S  S+FW G  +G   +V   R+ V                       RF         + F  +D   D+  +      +
Subjt:  ----RRYAHDTSLLKAPIKSNCSFFW-GAAFG--FVVYYYRVCVN----------------------RFGG-------SGFHYSDKRVDIDKLGEFVSVK

Query:  EVKIIFVIPINSVNVKDKWIWHYTNSGEYTVKSGYKLAMKTEESGGTSSLN
        +  +I  +PI+S N++D W+WHY   G Y+V+SGYKL M  + +  ++S N
Subjt:  EVKIIFVIPINSVNVKDKWIWHYTNSGEYTVKSGYKLAMKTEESGGTSSLN

XP_024642386.1 uncharacterized protein LOC112422877 [Medicago truncatula]5.0e-3330.92Show/hide
Query:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------
        YKIVAKV+ NR+K +L   IS+NQS FVPGRSI DN ++  E +H +KS+  G KG AA KLD+SKAYDR++  +L+ + LK+G                
Subjt:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------

Query:  -------------SIQNGGLRQGDSLSPYLFLFCSDALSALL------------------------------------------------------SGH-
                      I   GLRQGD LSPYLF+ C++ LS+L+                                                      SG  
Subjt:  -------------SIQNGGLRQGDSLSPYLFLFCSDALSALL------------------------------------------------------SGH-

Query:  -QFRKVAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLN-------SVFGKRC--RVGRGDFSQWEGRRYAHDTSLLKAPIKS
          F+K  ++FS NV  T +T +  IL +Q+V   G YLG+PS   R+++  F  +        + +G +C  +VGR  F+ + G  Y H        I  
Subjt:  -QFRKVAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLN-------SVFGKRC--RVGRGDFSQWEGRRYAHDTSLLKAPIKS

Query:  NCSFFWGAAFGFVVYYYRVCVNRFGGS--GFHYSDKRV-----------DIDKLGEFVSVKEVKIIFVIPINSVNVKDKWIWHYTNSGEYTVKSGYKLAM
          SF     F   +   +  V     S   FH S+ RV           D+  L      + V  I   P+ +   KD+ IW   N+G+YTV+S Y+L M
Subjt:  NCSFFWGAAFGFVVYYYRVCVNRFGGS--GFHYSDKRV-----------DIDKLGEFVSVKEVKIIFVIPINSVNVKDKWIWHYTNSGEYTVKSGYKLAM

Query:  K
        +
Subjt:  K

XP_030498076.1 uncharacterized protein LOC115713736 [Cannabis sativa]1.1e-3234.02Show/hide
Query:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG--------------SI
        YKI++KV+ +R+K +L  IIS NQSAF+PGR I DNI+IG E +H LK +  GK+G+ ALKLD+SKAYDRVE  FL  +  K+G              S+
Subjt:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG--------------SI

Query:  QNGGLRQGD-SLSPYLFLFCSDALSALLSGHQ--FRKVAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLNSVFGKRCRVGRG
             R  D   S  L LF    L  + SG Q  F +++++FS NV    R  L GIL M++ D   TYLG+P    RN+      L     KR +   G
Subjt:  QNGGLRQGD-SLSPYLFLFCSDALSALLSGHQ--FRKVAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLNSVFGKRCRVGRG

Query:  DFSQWEGRRYAHDT----------SLLKAPIKSNCS--------FFWGAAFGFVVYYY---RVC-VNRFGGSGFHYSDKRVDIDKLGEFVSVKEVKIIFV
         F    GR     T          S+   P+K+ CS        ++W ++    V ++   ++C     GG GF  S +  ++  LG+       ++I  
Subjt:  DFSQWEGRRYAHDT----------SLLKAPIKSNCS--------FFWGAAFGFVVYYY---RVC-VNRFGGSGFHYSDKRVDIDKLGEFVSVKEVKIIFV

Query:  IPINSVNVKDKWIWHYTNSGEYTVKSGYKLAMKTEESGGTS
        +P++   V D W W+   +G +TVKS Y        S  +S
Subjt:  IPINSVNVKDKWIWHYTNSGEYTVKSGYKLAMKTEESGGTS

XP_040999571.1 uncharacterized protein LOC121245775 [Juglans microcarpa x Juglans regia]9.5e-3230.14Show/hide
Query:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKL-----------------
        YK+V+K + NR+K IL  II+ NQSAFV GR I DN ++ +E LH++ SR  GKKG+ ALKLDMSKAYDRVE +F+E + +K+                 
Subjt:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKL-----------------

Query:  -------GSIQN-----GGLRQGDSLSPYLFLFCSDALSALLS------------------------------------------------------GHQ
               G  QN      GLRQGD LSPYLF+ C++ALS+L+                                                       GH 
Subjt:  -------GSIQN-----GGLRQGDSLSPYLFLFCSDALSALLS------------------------------------------------------GHQ

Query:  FR--KVAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLNSVFGKRCRVGRGDFSQWEGRR------------YAHDTSLLKAP
            K A+YFS N     +  +  + G+Q + S   YLG+P+   R +   F  L      R    R  F    G              YA    L+ A 
Subjt:  FR--KVAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLNSVFGKRCRVGRGDFSQWEGRR------------YAHDTSLLKAP

Query:  IKSNCS-----FFWG-----AAFGFVVYYYRVCVNRFGGSGF-----HYSDKRVDIDKLGEFVSVKEVKIIFVIPINSVNVKDKWIWHYTNSGEYTVKSG
        I S  +     F+WG     +   +V + Y       GG  F         K      L E  + +E++ I  IPI+    +DK  W +T +G +T+KSG
Subjt:  IKSNCS-----FFWG-----AAFGFVVYYYRVCVNRFGGSGF-----HYSDKRVDIDKLGEFVSVKEVKIIFVIPINSVNVKDKWIWHYTNSGEYTVKSG

Query:  YKLA--MKTEESGGTSSL
        Y +   ++ E+ G TS +
Subjt:  YKLA--MKTEESGGTSSL

TrEMBL top hitse value%identityAlignment
A0A1R3HHY9 Reverse transcriptase2.6e-3531.88Show/hide
Query:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------
        YKI++KV+VNR+K  L + ISENQSAFVPGR I DNI++ +E LHTL+S   GK+G+ ALKLDMSKAYDRVE  FLE + L+LG                
Subjt:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------

Query:  ---SIQNG----------GLRQGDSLSPYLFLFCSDALSALLSGHQ---------------------------------------------------FRK
            + NG          GLRQGD LS YLFLFC++AL A+LS  Q                                                   F K
Subjt:  ---SIQNG----------GLRQGDSLSPYLFLFCSDALSALLSGHQ---------------------------------------------------FRK

Query:  VAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLNSVFGKRCRVGRGDFSQWEGRRYAHDTSLLKAPI---------KSNCS--
         A+ F  NV  + R  L  + G+   D +  YLG+P+   RNRR  F  L     K+ +          GR       L   P          KS C+  
Subjt:  VAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLNSVFGKRCRVGRGDFSQWEGRRYAHDTSLLKAPI---------KSNCS--

Query:  ------FFWGAAFGFVVYYYRVCVNRFGGSGFHYSDKRVD-IDKLGEFVSVKEVKIIFVIPINSVNVKDKWIWHYTNSGEYTVKSGYKL
              F+W         Y+      F  + F  S   ++ +  L ++        I+    + + + D+ +WHY N G Y+V+SGY++
Subjt:  ------FFWGAAFGFVVYYYRVCVNRFGGSGFHYSDKRVD-IDKLGEFVSVKEVKIIFVIPINSVNVKDKWIWHYTNSGEYTVKSGYKL

A0A2N9H1U1 Reverse transcriptase domain-containing protein7.8e-3236.88Show/hide
Query:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------
        YKI++KV+ NR+K IL  IISE QSAFVPGR I DNI++  E LH +K+R TGK G+ ALKLDMSKAYDRVE  FL+ + LK+G                
Subjt:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------

Query:  ---SIQNG----------GLRQGDSLSPYLFLFCSDALSA----------------------LLSGHQFR--KVAMYFSPNVRTTDRTTLKGILGMQIVD
            + NG          GLRQGD LSPYLFL C++ L A                       +SG Q    K  ++FS +     ++ +  +LG+ +V 
Subjt:  ---SIQNG----------GLRQGDSLSPYLFLFCSDALSA----------------------LLSGHQFR--KVAMYFSPNVRTTDRTTLKGILGMQIVD

Query:  SLGTYLGVPSSFTRNRRDDFEGLNSVFGKRCRVGRGDFSQWEGRRYAHDTSLLKAPIKSNCSF
            YLG+PS   R+RR+ F  +     +R +  +       GR       L+KA +++  +F
Subjt:  SLGTYLGVPSSFTRNRRDDFEGLNSVFGKRCRVGRGDFSQWEGRRYAHDTSLLKAPIKSNCSF

A0A2N9HW04 Reverse transcriptase domain-containing protein2.7e-3237.26Show/hide
Query:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------
        YKI++KV+ NR+K IL  IISE QSAFVPGR I DNI++  E LH +K+R TGK G+ ALKLDMSKAYDRVE  FL+ + LK+G                
Subjt:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------

Query:  ---SIQNG----------GLRQGDSLSPYLFLFCSDALSA----------------------LLSGHQFR--KVAMYFSPNVRTTDRTTLKGILGMQIVD
            + NG          GLRQGD LSPYLFL C++ L A                      ++SG Q    K  ++FS +     ++ +  ILG+ +V 
Subjt:  ---SIQNG----------GLRQGDSLSPYLFLFCSDALSA----------------------LLSGHQFR--KVAMYFSPNVRTTDRTTLKGILGMQIVD

Query:  SLGTYLGVPSSFTRNRRDDFEGLNSVFGKRCRVGRGDFSQWEGRRYAHDTSLLKAPIKSNCSF
            YLG+PS   R+RR+ F  +     +R +  +       GR       L+KA +++  +F
Subjt:  SLGTYLGVPSSFTRNRRDDFEGLNSVFGKRCRVGRGDFSQWEGRRYAHDTSLLKAPIKSNCSF

A0A6J1DX30 uncharacterized protein LOC1110248749.2e-4131.49Show/hide
Query:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------
        YKI++K I NR+K ++  +IS+ QSAFVP R+I DN+IIGHECLHT+ S  +G  G AALKLD+SKA+DRVE  +LE +  K+G                
Subjt:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG----------------

Query:  --SIQ-NG----------GLRQGDSLSPYLFLFCSDALSALL------------------------------------------------------SGH-
          SI  NG          G+RQGD LSPYLFL C++ LSAL+                                                      SG  
Subjt:  --SIQ-NG----------GLRQGDSLSPYLFLFCSDALSALL------------------------------------------------------SGH-

Query:  -QFRKVAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLNSVFGKRC---RVGRGDFSQWEG----------------------
          F K A+ FSPNV    +  L+ IL +++V   G YLG+PS FTR R +  +     +G+ C     G  +F   EG                      
Subjt:  -QFRKVAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLNSVFGKRC---RVGRGDFSQWEG----------------------

Query:  ----RRYAHDTSLLKAPIKSNCSFFW-GAAFG--FVVYYYRVCVN----------------------RFGG-------SGFHYSDKRVDIDKLGEFVSVK
             +Y  DTSLL+A   S  S+FW G  +G   +V   R+ V                       RF         + F  +D   D+  +      +
Subjt:  ----RRYAHDTSLLKAPIKSNCSFFW-GAAFG--FVVYYYRVCVN----------------------RFGG-------SGFHYSDKRVDIDKLGEFVSVK

Query:  EVKIIFVIPINSVNVKDKWIWHYTNSGEYTVKSGYKLAMKTEESGGTSSLN
        +  +I  +PI+S N++D W+WHY   G Y+V+SGYKL M  + +  ++S N
Subjt:  EVKIIFVIPINSVNVKDKWIWHYTNSGEYTVKSGYKLAMKTEESGGTSSLN

A0A803PYQ3 Uncharacterized protein7.8e-3239.04Show/hide
Query:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLGSIQ-NGGLRQGDSLSP
        YKI+++V+VNR+K IL  IIS +QSAF+P R I DNIIIG E +H+L  R +G+ GW ALKLDM+KA+DRVE +FL ++   LG I+ + G+RQGD LSP
Subjt:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLGSIQ-NGGLRQGDSLSP

Query:  YLFLFCSDALSALL----------------------------------------------------------SGH--QFRKVAMYFSPNVRTTDRTTLKG
        YLFL CS+ LSAL+                                                          SG    F K A++FSPN     +T +  
Subjt:  YLFLFCSDALSALL----------------------------------------------------------SGH--QFRKVAMYFSPNVRTTDRTTLKG

Query:  ILGMQIVDSLGTYLGVPSSFTRNRRDDF
        ILG+ + D+   YLG+P +F R++++ F
Subjt:  ILGMQIVDSLGTYLGVPSSFTRNRRDDF

SwissProt top hitse value%identityAlignment
P14381 Transposon TX1 uncharacterized 149 kDa protein1.4e-0948.68Show/hide
Query:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFL
        YKIVAK I  R+K +L ++I  +QS  VPGR+IFDN+ +  + LH   +R TG    A L LD  KA+DRV+ Q+L
Subjt:  YKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFL

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.2e-1140.26Show/hide
Query:  IVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG
        +V R+K ++ ++I   Q++F+PGR   DNI+   E +H+++ R  G KGW  LKLD+ KAYDR+   +LE   +  G
Subjt:  IVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATGTGGCTACAAGATTGTCGCCAAGGTTATAGTCAACAGAATGAAGTGGATCCTTCAGGATATAATCTCTGAAAATCAATCGGCGTTTGTTCCTGGGAGGTCGAT
ATTTGATAATATCATTATTGGTCATGAATGTCTTCACACTCTCAAATCTAGGCATACAGGAAAAAAGGGTTGGGCGGCTCTCAAGTTGGATATGAGCAAAGCATACGATA
GAGTAGAATGTCAATTTCTGGAAAAGTTATGGTTAAAATTGGGTTCCATCCAAAATGGAGGTTTACGCCAGGGTGACTCTCTTTCACCATATCTATTTCTGTTTTGCTCT
GACGCTTTGTCGGCACTTCTATCTGGTCATCAATTTAGGAAAGTGGCAATGTATTTCTCTCCCAATGTTCGTACTACGGACCGCACAACTCTCAAAGGGATCTTGGGGAT
GCAAATTGTTGACTCATTGGGGACGTATTTGGGCGTTCCATCGTCATTCACTCGAAACCGAAGGGATGACTTCGAGGGGTTAAACAGCGTGTTTGGCAAACGCTGCAGGG
TTGGAAGGGGCGATTTTTCTCAATGGGAGGGAAGGAGGTATGCTCATGACACTTCCTTATTAAAAGCTCCTATCAAATCCAATTGTTCTTTCTTTTGGGGAGCTGCGTTT
GGGTTTGTAGTCTATTATTACAGGGTATGCGTAAACAGATTCGGTGGTAGCGGATTTCATTACTCCGACAAAAGGGTGGATATTGACAAACTAGGTGAATTTGTTTCCGT
CAAGGAAGTCAAGATCATATTTGTTATTCCCATCAATTCGGTCAATGTGAAAGATAAATGGATTTGGCATTATACAAATAGTGGAGAGTATACAGTTAAAAGTGGATACA
AACTGGCTATGAAGACAGAAGAGAGTGGTGGCACTTCCAGTTTAAATACACAGGAAGGAATGCTTTCGGAGGGAAGACCAGTCGTTGAGTGGTATCAGCAATGCGAATGG
ATTATTCAATACTGGAAAGAGACAACTCAAAGACGAATTATAGATGGCATCATGGACAAGCATACAGAGGCTCCTTTGAATCAGGAGGATGGGGGCTTTACAGTTTTCAC
TGATGCAACGGTTTCTCCTAGTAGTGCTGGGGCAGGATATGGGGTCGTCGTTGTTGGCGAGAATAATAACATCTATAGTGCTATGGAAATGGTAGAGTTTGCAGCATTAA
GCCCTTTGACTGCAGAAGTCCAAGCCATGTTACATGGGATAAGGTTGGCTCATAGAATGCAAATTAAAATTTCTTTTCTCTCTCCCTCTCACGCACTCTCCTCTCTCCCT
CACAACTCTCTCTCCGTCACGCTCCCTCTCTCCCTCCTCAGCCTTCAGCTCGCCGCCGCACGCCGCCTTCAGCTCACCGCCGCCGACCTCGCGCCTCCGCCTCTGCCGTT
CGCCACATCGCCGCCGCTCGCCGACAGTCTGCCGTCGCTTGTCGCCTGTCTCCCGCCGCTCGTTTCTCGCCGCCGTGAACCACTACCATCTCACCGTCGTGAACCGCCGC
CATTTGCCGTCTCAAGAAGCGCCGCCGCAGCCACTGCCACCCTCGGTTTCATCCTTCGTTTCAGATATGTTCGACGCCTTCAGCTACTTCCAATTTCAAGGTGTCCCGTA
GCGTTTGGAGGTCGAATTGTCGAAGGTGAGCGGTTTTGGACTGCTGGACAGCAAGCTGTGTTTCTGTTTTGGGTAGAGGTAAAGGTGTACAGAACGGGTGACGGAGTGCT
GCTGAGCCATGGATTTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAATGTGGCTACAAGATTGTCGCCAAGGTTATAGTCAACAGAATGAAGTGGATCCTTCAGGATATAATCTCTGAAAATCAATCGGCGTTTGTTCCTGGGAGGTCGAT
ATTTGATAATATCATTATTGGTCATGAATGTCTTCACACTCTCAAATCTAGGCATACAGGAAAAAAGGGTTGGGCGGCTCTCAAGTTGGATATGAGCAAAGCATACGATA
GAGTAGAATGTCAATTTCTGGAAAAGTTATGGTTAAAATTGGGTTCCATCCAAAATGGAGGTTTACGCCAGGGTGACTCTCTTTCACCATATCTATTTCTGTTTTGCTCT
GACGCTTTGTCGGCACTTCTATCTGGTCATCAATTTAGGAAAGTGGCAATGTATTTCTCTCCCAATGTTCGTACTACGGACCGCACAACTCTCAAAGGGATCTTGGGGAT
GCAAATTGTTGACTCATTGGGGACGTATTTGGGCGTTCCATCGTCATTCACTCGAAACCGAAGGGATGACTTCGAGGGGTTAAACAGCGTGTTTGGCAAACGCTGCAGGG
TTGGAAGGGGCGATTTTTCTCAATGGGAGGGAAGGAGGTATGCTCATGACACTTCCTTATTAAAAGCTCCTATCAAATCCAATTGTTCTTTCTTTTGGGGAGCTGCGTTT
GGGTTTGTAGTCTATTATTACAGGGTATGCGTAAACAGATTCGGTGGTAGCGGATTTCATTACTCCGACAAAAGGGTGGATATTGACAAACTAGGTGAATTTGTTTCCGT
CAAGGAAGTCAAGATCATATTTGTTATTCCCATCAATTCGGTCAATGTGAAAGATAAATGGATTTGGCATTATACAAATAGTGGAGAGTATACAGTTAAAAGTGGATACA
AACTGGCTATGAAGACAGAAGAGAGTGGTGGCACTTCCAGTTTAAATACACAGGAAGGAATGCTTTCGGAGGGAAGACCAGTCGTTGAGTGGTATCAGCAATGCGAATGG
ATTATTCAATACTGGAAAGAGACAACTCAAAGACGAATTATAGATGGCATCATGGACAAGCATACAGAGGCTCCTTTGAATCAGGAGGATGGGGGCTTTACAGTTTTCAC
TGATGCAACGGTTTCTCCTAGTAGTGCTGGGGCAGGATATGGGGTCGTCGTTGTTGGCGAGAATAATAACATCTATAGTGCTATGGAAATGGTAGAGTTTGCAGCATTAA
GCCCTTTGACTGCAGAAGTCCAAGCCATGTTACATGGGATAAGGTTGGCTCATAGAATGCAAATTAAAATTTCTTTTCTCTCTCCCTCTCACGCACTCTCCTCTCTCCCT
CACAACTCTCTCTCCGTCACGCTCCCTCTCTCCCTCCTCAGCCTTCAGCTCGCCGCCGCACGCCGCCTTCAGCTCACCGCCGCCGACCTCGCGCCTCCGCCTCTGCCGTT
CGCCACATCGCCGCCGCTCGCCGACAGTCTGCCGTCGCTTGTCGCCTGTCTCCCGCCGCTCGTTTCTCGCCGCCGTGAACCACTACCATCTCACCGTCGTGAACCGCCGC
CATTTGCCGTCTCAAGAAGCGCCGCCGCAGCCACTGCCACCCTCGGTTTCATCCTTCGTTTCAGATATGTTCGACGCCTTCAGCTACTTCCAATTTCAAGGTGTCCCGTA
GCGTTTGGAGGTCGAATTGTCGAAGGTGAGCGGTTTTGGACTGCTGGACAGCAAGCTGTGTTTCTGTTTTGGGTAGAGGTAAAGGTGTACAGAACGGGTGACGGAGTGCT
GCTGAGCCATGGATTTCCTTAG
Protein sequenceShow/hide protein sequence
MQCGYKIVAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTLKSRHTGKKGWAALKLDMSKAYDRVECQFLEKLWLKLGSIQNGGLRQGDSLSPYLFLFCS
DALSALLSGHQFRKVAMYFSPNVRTTDRTTLKGILGMQIVDSLGTYLGVPSSFTRNRRDDFEGLNSVFGKRCRVGRGDFSQWEGRRYAHDTSLLKAPIKSNCSFFWGAAF
GFVVYYYRVCVNRFGGSGFHYSDKRVDIDKLGEFVSVKEVKIIFVIPINSVNVKDKWIWHYTNSGEYTVKSGYKLAMKTEESGGTSSLNTQEGMLSEGRPVVEWYQQCEW
IIQYWKETTQRRIIDGIMDKHTEAPLNQEDGGFTVFTDATVSPSSAGAGYGVVVVGENNNIYSAMEMVEFAALSPLTAEVQAMLHGIRLAHRMQIKISFLSPSHALSSLP
HNSLSVTLPLSLLSLQLAAARRLQLTAADLAPPPLPFATSPPLADSLPSLVACLPPLVSRRREPLPSHRREPPPFAVSRSAAAATATLGFILRFRYVRRLQLLPISRCPV
AFGGRIVEGERFWTAGQQAVFLFWVEVKVYRTGDGVLLSHGFP