CuGenDBv2

Lag0035727 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035727
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: DNA-directed DNA polymerase
Genome location: chr3:28793845..28798942
RNA-Seq Expression: Lag0035727
Synteny: Lag0035727
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain; IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_022951570.1 uncharacterized protein LOC111454344 [Cucurbita moschata] (e value: 1.1e-64, % identity: 51.55)
Query:  QQNSESSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQ-----LESGRN--DGG------------------
        Q  S +S+E+L+KEYMA  DV IQS QAS++ LE+Q+G LA EL+ RP  KLPA+TE P+R+GKEQ     L SG+    GG                  
Subjt:  QQNSESSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQ-----LESGRN--DGG------------------

Query:  --NSINARASSFVIDVEPLYVPPP------------PYVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNKRLGEF
          N   A       D   +   P              Y P  PFPQR K K ++  F+KF++I K++HINIP VEA++QMPNY KFLKD+LT  ++  EF
Subjt:  --NSINARASSFVIDVEPLYVPPP------------PYVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNKRLGEF

Query:  EIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETFPIVV
        ++VSL E+CS ILKN  P K KDPGSFTI V+IGGKE+GRALCDLGASIN+MPL +Y+KLGIGE R TT TLQLA+RSITYPEG+   I++
Subjt:  EIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETFPIVV

XP_022960431.1 uncharacterized protein LOC111461167 [Cucurbita moschata] (e value: 3.6e-63, % identity: 48.03)
Query:  QQNSESSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQLESGRNDGGNSINARASSF---------------
        Q   E+SLE+L+KEYMA  DV IQS QAS+R LE+Q+G LANEL+ RP  KLP++TE P+R+G EQ ++     G  I++R                   
Subjt:  QQNSESSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQLESGRNDGGNSINARASSF---------------

Query:  ------VI-------DVEPLY----------VPPPP--------------YVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFL
              V+       D E L           +  PP              Y P  PFPQR K K ++  F+KF++IFK++HINIPLVEA++QM NY KFL
Subjt:  ------VI-------DVEPLY----------VPPPP--------------YVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFL

Query:  KDILTKNKRLGEFEIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETF
        KD+LT  ++  EF++V L E+CS ILKN  P K KDPGSFTI ++IGGK++GRALCDLG+SIN+MPL +Y+KLGIGE R TT TLQLA+RS T+PEG+  
Subjt:  KDILTKNKRLGEFEIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETF

Query:  PIVV
         I++
Subjt:  PIVV

XP_024028757.1 uncharacterized protein LOC112093792 [Morus notabilis] (e value: 2.8e-63, % identity: 49.66)
Query:  QQNKQALPQQNSESSLETLMKEYMAHTD-------VAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQ---------LESGRN-DGG
        +Q+ Q  P Q S + +E L+KEYMA  D         +QS  AS+R LE Q+G LAN L  RPQ  LP++T++PRRDGKE          L++GR  +  
Subjt:  QQNKQALPQQNSESSLETLMKEYMAHTD-------VAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQ---------LESGRN-DGG

Query:  NSINARASSFVIDVEPLYVPP-------------------PPYVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNK
            A      I  + +  PP                    P  PP PFPQR + + QD QF++FL++ KQLHINIPLVEA+EQMP+Y KF+KDILTK +
Subjt:  NSINARASSFVIDVEPLYVPP-------------------PPYVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNK

Query:  RLGEFEIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETFPIVV
        RLGEFE V+LTE+CS ILKN  P K KDPGSFTI  +IG + +G+ALCDLGASIN+MP+ ++RKLGIGEV  TT TLQLA+RS  +PEG+   ++V
Subjt:  RLGEFEIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETFPIVV

XP_030505532.1 uncharacterized protein LOC115720524 [Cannabis sativa] (e value: 8.1e-71, % identity: 56.88)
Query:  QQNKQALPQQNSE-SSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQLESGRNDGGNSINARASSFVIDVEP
        QQ + +   QNS+ SSLE+LM++YMA  D  IQS  AS+R LE+Q+GHLANELKARPQ  LP++T++PRRDGKEQ +S +   G           +D   
Subjt:  QQNKQALPQQNSE-SSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQLESGRNDGGNSINARASSFVIDVEP

Query:  LYVPPPPY--------VPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNKRLGEFEIVSLTEKCSVILKNGPPTKAK
           P             PPLPFPQR + + QDGQFKKFL++ KQLHINIPLVEA+EQMPNY KFLKDILTK +RLGEFE V+LTE C+ +LK+  P K K
Subjt:  LYVPPPPY--------VPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNKRLGEFEIVSLTEKCSVILKNGPPTKAK

Query:  DPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETFPIVV
        DPGSFTI  +I G++VGRAL DLGASIN+MP+ +++ LGIGE R TT TLQLA+RS+ +PEG+   ++V
Subjt:  DPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETFPIVV

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa] (e value: 3.9e-73, % identity: 54.08)
Query:  QQNKQALPQQNSE-SSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQ-----LESGRN--------------
        QQ + +   QN++ SSLE+LM++YMA  D  IQS  AS+R LE+Q+GHLANELKARPQ  LP++TE+PRRDGKEQ     L SG++              
Subjt:  QQNKQALPQQNSE-SSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQ-----LESGRN--------------

Query:  ---DGGNSINARASSFVIDVEPL-----------YVPPPPYVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNKRL
                ++ + +  + D  P+              P    PPLPFPQR + + QDGQFKKFL++ KQLHINIPLVEA+EQMPNY KFLKDILTK +RL
Subjt:  ---DGGNSINARASSFVIDVEPL-----------YVPPPPYVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNKRL

Query:  GEFEIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETFPIVV
        GEFE V+LTE CS +LK+  P K KDPGSFTI  +IGG++VGRALCDLGASIN+MP+ +++KLGIGE R TT TLQLA+RS+ +PEG+   ++V
Subjt:  GEFEIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETFPIVV

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DVZ9 uncharacterized protein LOC111024970 (e value: 6.3e-61, % identity: 42.12)
Query:  TWNELAERFLSKYFPPTRNAKLRSEIVEFRQLEDETFSEAWERVD--IAMLANALKNVTV---------------------VSHQQPPVVEPTAVALPQQ
        +WN+LAE+F ++ F P +  +  S+ V  R   +   +EA   ++  IA L N +KN+                        S  Q  V +   + +P Q
Subjt:  TWNELAERFLSKYFPPTRNAKLRSEIVEFRQLEDETFSEAWERVD--IAMLANALKNVTV---------------------VSHQQPPVVEPTAVALPQQ

Query:  NKQALPQQNSESSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARP-------QRKLPANTEHPRRDGKEQL---ESGRNDGGNSINARASS
         +   P  NS +S+ET+M+EYM   D  IQS  A  R LE+Q+G +AN+LK RP       +       E+   D  E +   E  R     +  A+  S
Subjt:  NKQALPQQNSESSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARP-------QRKLPANTEHPRRDGKEQL---ESGRNDGGNSINARASS

Query:  FVIDVEPLYVPPPPYVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNKRLGEFEIVSLTEKCSVILKNGPPTKAKD
            V      P  Y  P  +PQR + K QD QF +FLE+ KQLHINIPLVEA+EQMPNY KFLKDIL K +RL EFEIV+LT++C+ IL   P  K  D
Subjt:  FVIDVEPLYVPPPPYVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNKRLGEFEIVSLTEKCSVILKNGPPTKAKD

Query:  PGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETFPIVV
         GSF I V+I GK VG ALCDL ASIN+MPL + +KL IG+ R TT TLQLA+RSIT+PEG+   ++V
Subjt:  PGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETFPIVV

A0A6J1DY39 uncharacterized protein LOC111025653 (e value: 7.5e-06, % identity: 58.14)
Query:  TWNELAERFLSKYFPPTRNAKLRSEIVEFRQLEDETFSEAWER
        TW+++ ++FL KYFPPTRNA +R EI+ FRQ E+E  + AWER
Subjt:  TWNELAERFLSKYFPPTRNAKLRSEIVEFRQLEDETFSEAWER

A0A6J1EQ90 uncharacterized protein LOC111436411 (e value: 4.5e-59, % identity: 50)
Query:  QQNSESSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQLESGRNDGGNSINARASSFVIDVEPLYVPPPPYV
        Q  SE+S+E+L+KEYMA  D  IQS QAS+R LE+Q+G   N  +     +  A+T+    +   Q E  ++        +  +     +        Y 
Subjt:  QQNSESSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQLESGRNDGGNSINARASSFVIDVEPLYVPPPPYV

Query:  PPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNKRLGEFEIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVG
        P  PFPQR K K ++  F+KF++I K++HINIPLVEA++QMPNY KFLKD+L   ++  EF++VSL E+CS ILKN  P K KDPGSFTI V+IGGKE+G
Subjt:  PPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNKRLGEFEIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVG

Query:  RALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEG--ETFPIVVASYLMLEDEVAL
        RALCDLGA+IN+MPL +Y+KLGIGE R TT TLQLA+RSITYPEG  E   I V  ++ L D + L
Subjt:  RALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEG--ETFPIVVASYLMLEDEVAL

A0A6J1EQ90 uncharacterized protein LOC111436411 (e value: 3.1e-07, % identity: 69.77)
Query:  TWNELAERFLSKYFPPTRNAKLRSEIVEFRQLEDETFSEAWER
        +WN LAE FL KYFPPTRNA+ ++EIV F+Q EDET SEA ER
Subjt:  TWNELAERFLSKYFPPTRNAKLRSEIVEFRQLEDETFSEAWER

A0A6J1EQ90 uncharacterized protein LOC111436411 (e value: 2.9e-58, % identity: 47.28)
Query:  PVVEPTAVALPQQNKQALPQQNSESSLETL-----------MKEYMAHTDVAIQS----------NQASMRALEMQMGHLANELKARPQRKLPANTEHPR
        P   PT     QQ     P Q + S++E L           MKE M  TDV ++           N  ++R LEMQ+G L NE++ RPQ  LP++TE PR
Subjt:  PVVEPTAVALPQQNKQALPQQNSESSLETL-----------MKEYMAHTDVAIQS----------NQASMRALEMQMGHLANELKARPQRKLPANTEHPR

Query:  RDGKEQLESGRNDGG--------------NSINARASSFVID--VEP-LYVPPPPYV----PPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIE
        R GKE   S     G              +    + +  V D  VEP + VP  P V    PP PFPQR   K+QD  F+KFL+I KQLHINIP VEA+E
Subjt:  RDGKEQLESGRNDGG--------------NSINARASSFVID--VEP-LYVPPPPYV----PPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIE

Query:  QMPNYAKFLKDILTKNKRLGEFEIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRS
        QMP YAKF+KDI+T+ K+LGE+E V+LTE  S + K+  P K KDPGSFTI   IGGK+VGRALCDLGASIN+MPL +++K  IG+   TT TLQLA+RS
Subjt:  QMPNYAKFLKDILTKNKRLGEFEIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRS

Query:  ITYPEGETFPIVV
        IT PEG+   ++V
Subjt:  ITYPEGETFPIVV

A0A6J1GJ68 uncharacterized protein LOC111454344 (e value: 5.5e-65, % identity: 51.55)
Query:  QQNSESSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQ-----LESGRN--DGG------------------
        Q  S +S+E+L+KEYMA  DV IQS QAS++ LE+Q+G LA EL+ RP  KLPA+TE P+R+GKEQ     L SG+    GG                  
Subjt:  QQNSESSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQ-----LESGRN--DGG------------------

Query:  --NSINARASSFVIDVEPLYVPPP------------PYVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNKRLGEF
          N   A       D   +   P              Y P  PFPQR K K ++  F+KF++I K++HINIP VEA++QMPNY KFLKD+LT  ++  EF
Subjt:  --NSINARASSFVIDVEPLYVPPP------------PYVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNKRLGEF

Query:  EIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETFPIVV
        ++VSL E+CS ILKN  P K KDPGSFTI V+IGGKE+GRALCDLGASIN+MPL +Y+KLGIGE R TT TLQLA+RSITYPEG+   I++
Subjt:  EIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETFPIVV

A0A6J1H7K8 uncharacterized protein LOC111461167 (e value: 1.8e-63, % identity: 48.03)
Query:  QQNSESSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQLESGRNDGGNSINARASSF---------------
        Q   E+SLE+L+KEYMA  DV IQS QAS+R LE+Q+G LANEL+ RP  KLP++TE P+R+G EQ ++     G  I++R                   
Subjt:  QQNSESSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKARPQRKLPANTEHPRRDGKEQLESGRNDGGNSINARASSF---------------

Query:  ------VI-------DVEPLY----------VPPPP--------------YVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFL
              V+       D E L           +  PP              Y P  PFPQR K K ++  F+KF++IFK++HINIPLVEA++QM NY KFL
Subjt:  ------VI-------DVEPLY----------VPPPP--------------YVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFL

Query:  KDILTKNKRLGEFEIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETF
        KD+LT  ++  EF++V L E+CS ILKN  P K KDPGSFTI ++IGGK++GRALCDLG+SIN+MPL +Y+KLGIGE R TT TLQLA+RS T+PEG+  
Subjt:  KDILTKNKRLGEFEIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETF

Query:  PIVV
         I++
Subjt:  PIVV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
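
The % identity values listed for the hits can be cross-checked against the alignment midlines. The sketch below assumes the usual BLAST-style convention (a letter on the midline marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that identity is reported as identical positions divided by alignment length. The function name `percent_identity` is illustrative and not part of CuGenDB; the two strings are copied from the A0A6J1DY39 TrEMBL hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical positions / alignment length, as a percentage (2 decimals)."""
    # Pad the midline in case trailing spaces were stripped during copying.
    midline = midline.ljust(len(query))[:len(query)]
    # Letters on the midline mark identical residues; '+' and ' ' do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Query line and midline of the A0A6J1DY39 alignment (prefixes removed).
query = "TWNELAERFLSKYFPPTRNAKLRSEIVEFRQLEDETFSEAWER"
midline = "TW+++ ++FL KYFPPTRNA +R EI+ FRQ E+E  + AWER"

print(percent_identity(query, midline))  # 58.14
```

Here 25 of 43 aligned positions are identical, which reproduces the 58.14 reported for that hit. For alignments split over several rows, the rows would need to be concatenated first.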

Sequences
CDS sequence
ATGAGTGATCCTCCTGGGGTGAGATTCGAGCTTGATCTAGAAATTGAAAGAATATTCAGAAGAAGGAGAGAGCAGCGGAGGAACAACAATCCAATGGAGAACGTGTCGCG
TCTTCCTGAAATTCTTGAAGATCAAGTTGATCCCCAGCAAAATCGTGTGTTGCAGCCAAATCAGCCGCTGGAGCAAAATGGACAGCGAAATAATTGTTGCCAAATCAGCT
GCTGGAGCAAAATGGACAGCAAAATAATCAGGCTAAGAACCCTATTTTGGACATGGAATGAGTTAGCGGAAAGATTTCTTAGTAAGTATTTTCCACCAACTAGGAATGCC
AAGTTGAGGAGTGAGATAGTGGAATTTAGGCAACTTGAAGATGAAACTTTTAGTGAGGCTTGGGAAAGGGTTGATATTGCAATGTTAGCTAACGCTCTTAAAAATGTAAC
AGTGGTTAGTCATCAGCAGCCGCCAGTGGTGGAGCCTACTGCAGTGGCCTTGCCCCAACAAAATAAGCAGGCTTTGCCCCAACAAAATTCAGAGAGTTCTCTTGAGACAC
TGATGAAAGAATATATGGCTCATACAGATGTTGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTAGAAATGCAAATGGGTCATCTAGCTAATGAGCTGAAGGCACGA
CCTCAAAGAAAACTTCCTGCGAACACTGAGCACCCTAGAAGGGATGGTAAGGAGCAGTTGGAGTCTGGTAGAAATGATGGAGGCAACAGTATTAATGCTAGAGCATCTAG
TTTTGTTATAGATGTAGAACCACTTTATGTACCACCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAGAAGCCTAAGGATCAGGATGGTCAATTTAAGAAGT
TTTTAGAAATTTTTAAGCAATTACATATAAATATTCCTTTAGTAGAAGCTATAGAGCAAATGCCAAATTATGCTAAATTTCTAAAGGATATATTAACTAAAAATAAGAGG
TTAGGAGAATTTGAAATTGTATCTCTTACTGAGAAATGTAGTGTTATTCTCAAGAATGGGCCACCAACCAAGGCTAAGGATCCAGGGTCATTTACTATTCTTGTCACAAT
AGGTGGAAAAGAGGTGGGTAGAGCACTTTGCGATTTAGGCGCAAGCATTAACGTTATGCCTCTTTTGGTCTATCGAAAGCTAGGCATTGGTGAAGTTAGGTCAACCACAG
CCACACTCCAATTAGCTAATAGGTCTATCACATATCCAGAGGGTGAGACGTTTCCCATTGTTGTTGCATCATATTTAATGTTGGAGGATGAAGTGGCCTTAATAAAGTTG
CTGCAGCAATATCGCAACGAAATAGGTTGGACATTGACTGACATTCAGGGAATTGGCTCATCTTTCTATAACAATTGGGTAAGCCTTGTCCAATCTGTTCCAAAGGAAAT
AAGTGTTACGGTAGTGTCTAATAAGGACAATGAGTTGATTCCCACCAGGATAGTAACTGGCTGTAGGGTTTTGATAAGGGATGACCTAGGCAACTTTGCCTTAATCCTGA
GTGGATTATGGACTCCTGTCCATGAGGGATATTCCTTTGATTTGTACGGAAATATATCTGCAGTGAGAAGAGTGCAACTGTGGTTCTTTAGTGGAGTGAACCACAGTCCA
TTAGGTCCCACCGGTAGCTCATTCAGGGCGTTGAGAGTTTTTTTTCCAGAAGAAATCCTTAGAGTTTACGGTTCCCACAAGCTCTCAAAGTGTACCCTTTTGAGAATACT
GGTGAATCCTCGTGGTGGTGTTTGTGGCAATTTTTCCAGCGAAAACAAGGATTGCTAG
mRNA sequence
ATGAGTGATCCTCCTGGGGTGAGATTCGAGCTTGATCTAGAAATTGAAAGAATATTCAGAAGAAGGAGAGAGCAGCGGAGGAACAACAATCCAATGGAGAACGTGTCGCG
TCTTCCTGAAATTCTTGAAGATCAAGTTGATCCCCAGCAAAATCGTGTGTTGCAGCCAAATCAGCCGCTGGAGCAAAATGGACAGCGAAATAATTGTTGCCAAATCAGCT
GCTGGAGCAAAATGGACAGCAAAATAATCAGGCTAAGAACCCTATTTTGGACATGGAATGAGTTAGCGGAAAGATTTCTTAGTAAGTATTTTCCACCAACTAGGAATGCC
AAGTTGAGGAGTGAGATAGTGGAATTTAGGCAACTTGAAGATGAAACTTTTAGTGAGGCTTGGGAAAGGGTTGATATTGCAATGTTAGCTAACGCTCTTAAAAATGTAAC
AGTGGTTAGTCATCAGCAGCCGCCAGTGGTGGAGCCTACTGCAGTGGCCTTGCCCCAACAAAATAAGCAGGCTTTGCCCCAACAAAATTCAGAGAGTTCTCTTGAGACAC
TGATGAAAGAATATATGGCTCATACAGATGTTGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTAGAAATGCAAATGGGTCATCTAGCTAATGAGCTGAAGGCACGA
CCTCAAAGAAAACTTCCTGCGAACACTGAGCACCCTAGAAGGGATGGTAAGGAGCAGTTGGAGTCTGGTAGAAATGATGGAGGCAACAGTATTAATGCTAGAGCATCTAG
TTTTGTTATAGATGTAGAACCACTTTATGTACCACCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAGAAGCCTAAGGATCAGGATGGTCAATTTAAGAAGT
TTTTAGAAATTTTTAAGCAATTACATATAAATATTCCTTTAGTAGAAGCTATAGAGCAAATGCCAAATTATGCTAAATTTCTAAAGGATATATTAACTAAAAATAAGAGG
TTAGGAGAATTTGAAATTGTATCTCTTACTGAGAAATGTAGTGTTATTCTCAAGAATGGGCCACCAACCAAGGCTAAGGATCCAGGGTCATTTACTATTCTTGTCACAAT
AGGTGGAAAAGAGGTGGGTAGAGCACTTTGCGATTTAGGCGCAAGCATTAACGTTATGCCTCTTTTGGTCTATCGAAAGCTAGGCATTGGTGAAGTTAGGTCAACCACAG
CCACACTCCAATTAGCTAATAGGTCTATCACATATCCAGAGGGTGAGACGTTTCCCATTGTTGTTGCATCATATTTAATGTTGGAGGATGAAGTGGCCTTAATAAAGTTG
CTGCAGCAATATCGCAACGAAATAGGTTGGACATTGACTGACATTCAGGGAATTGGCTCATCTTTCTATAACAATTGGGTAAGCCTTGTCCAATCTGTTCCAAAGGAAAT
AAGTGTTACGGTAGTGTCTAATAAGGACAATGAGTTGATTCCCACCAGGATAGTAACTGGCTGTAGGGTTTTGATAAGGGATGACCTAGGCAACTTTGCCTTAATCCTGA
GTGGATTATGGACTCCTGTCCATGAGGGATATTCCTTTGATTTGTACGGAAATATATCTGCAGTGAGAAGAGTGCAACTGTGGTTCTTTAGTGGAGTGAACCACAGTCCA
TTAGGTCCCACCGGTAGCTCATTCAGGGCGTTGAGAGTTTTTTTTCCAGAAGAAATCCTTAGAGTTTACGGTTCCCACAAGCTCTCAAAGTGTACCCTTTTGAGAATACT
GGTGAATCCTCGTGGTGGTGTTTGTGGCAATTTTTCCAGCGAAAACAAGGATTGCTAG
Protein sequence
MSDPPGVRFELDLEIERIFRRRREQRRNNNPMENVSRLPEILEDQVDPQQNRVLQPNQPLEQNGQRNNCCQISCWSKMDSKIIRLRTLFWTWNELAERFLSKYFPPTRNA
KLRSEIVEFRQLEDETFSEAWERVDIAMLANALKNVTVVSHQQPPVVEPTAVALPQQNKQALPQQNSESSLETLMKEYMAHTDVAIQSNQASMRALEMQMGHLANELKAR
PQRKLPANTEHPRRDGKEQLESGRNDGGNSINARASSFVIDVEPLYVPPPPYVPPLPFPQRQKPKDQDGQFKKFLEIFKQLHINIPLVEAIEQMPNYAKFLKDILTKNKR
LGEFEIVSLTEKCSVILKNGPPTKAKDPGSFTILVTIGGKEVGRALCDLGASINVMPLLVYRKLGIGEVRSTTATLQLANRSITYPEGETFPIVVASYLMLEDEVALIKL
LQQYRNEIGWTLTDIQGIGSSFYNNWVSLVQSVPKEISVTVVSNKDNELIPTRIVTGCRVLIRDDLGNFALILSGLWTPVHEGYSFDLYGNISAVRRVQLWFFSGVNHSP
LGPTGSSFRALRVFFPEEILRVYGSHKLSKCTLLRILVNPRGGVCGNFSSENKDC
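
As a consistency check between the CDS and protein sequences above, the following sketch translates the start and end of the CDS with the standard genetic code. The helper name `translate` and the hard-coded codon table are illustrative assumptions, not a CuGenDB tool; only the first 30 and last 15 nucleotides of the CDS are copied here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGAGTGATCCTCCTGGGGTGAGATTCGAG"  # first 30 nt of the CDS
cds_end = "AACAAGGATTGCTAG"                   # last 15 nt of the CDS

print(translate(cds_start))  # MSDPPGVRFE
print(translate(cds_end))    # NKDC*
```

Both fragments agree with the listed protein, which begins with MSDPPGVRFE and ends with NKDC before the stop codon.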