; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010681 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010681
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr1:3720072..3724305
RNA-Seq ExpressionLag0010681
SyntenyLag0010681
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_019172468.1 PREDICTED: uncharacterized protein LOC109167852, partial [Ipomoea nil]3.8e-2632.2Show/hide
Query:  DRSMVDVASGGTLVNKMPTETRQLISSMTENSRTGAGESLWALLDDQPYYRWDLID----LQIRSCGLYSMTSHTKSEGGGT-------------TRISV
        DRSM+D ASGG LV+K P   R LI++M  NS+         +   +  YR         +Q  S G+ ++    KS    T             TR S+
Subjt:  DRSMVDVASGGTLVNKMPTETRQLISSMTENSRTGAGESLWALLDDQPYYRWDLID----LQIRSCGLYSMTSHTKSEGGGT-------------TRISV

Query:  METNSGRSCLNIIRLQVPPPQVVTQVSKMDNKSSGKLPAQPEVNPNANVS-VIWSGVAKLDYLLGFSSVSNKFS---SDFNDVIAFPYPRVDLTNPITMS
         ET +     +I  L+    Q+ T VS+++++SSGKLP+Q  VNP  N S ++   V +       +  S   S   +D+  V  FP     L+  I  +
Subjt:  METNSGRSCLNIIRLQVPPPQVVTQVSKMDNKSSGKLPAQPEVNPNANVS-VIWSGVAKLDYLLGFSSVSNKFS---SDFNDVIAFPYPRVDLTNPITMS

Query:  VQDDL-DVFKRVKVNIPLLEAILKILAYAEFLK--AWYEGRERSSGKEEV--SENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLDL--------
          ++L + FK  +VNIPLL+AI ++  YA+FLK     + ++RSS  E+V   +NVS ++   LP KC DP MFT+PCMIG  K++ AMLDL        
Subjt:  VQDDL-DVFKRVKVNIPLLEAILKILAYAEFLK--AWYEGRERSSGKEEV--SENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLDL--------

Query:  ----------------------DTPN--------------------PDF-------SPSSTTILLGRPFMKTAKTVIDVDFG
                              D  N                     DF       +  S  ILLGRPF+KTAKT ID+  G
Subjt:  ----------------------DTPN--------------------PDF-------SPSSTTILLGRPFMKTAKTVIDVDFG

XP_023912744.1 uncharacterized protein LOC112024310 [Quercus suber]3.4e-2728.94Show/hide
Query:  DRSMVDVASGGTLVNKMPTETRQLISSMTENSR----------TGAGESLWALLDDQPYYRWDLI------DLQ-IRSCGLYSMTSHTKSEGGGTTRISV
        D SM+D ASGG LV+K P   R LI++M  NS+              E   + L+ Q      L+      ++Q +++CG+ S+  H  +         +
Subjt:  DRSMVDVASGGTLVNKMPTETRQLISSMTENSR----------TGAGESLWALLDDQPYYRWDLI------DLQ-IRSCGLYSMTSHTKSEGGGTTRISV

Query:  METNSGRSCLNIIRLQVPPPQVVTQVSKMDNKSSGKLPAQPEVNPNANVSVIWSGVAKLDYLLGFSSVSN-------KFSSDFNDVIAFPYPRVDLTNPI
         + N+  +  NI  L+    Q+ T +S+++ +SSGKL +Q  VNP  N S I         ++   +V N       KF S F+     P+P+    +  
Subjt:  METNSGRSCLNIIRLQVPPPQVVTQVSKMDNKSSGKLPAQPEVNPNANVSVIWSGVAKLDYLLGFSSVSN-------KFSSDFNDVIAFPYPRVDLTNPI

Query:  TMSVQDDLDVFKRVKVNIPLLEAILKILAYAEFLK--AWYEGRERSSGKEE--VSENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLD-------
           ++D  + F+R +V+IPLL+AI ++  YA+FLK     + +++  G E+  V ENV  ++   LPAKC DP MFT+PC IG+ +++  MLD       
Subjt:  TMSVQDDLDVFKRVKVNIPLLEAILKILAYAEFLK--AWYEGRERSSGKEE--VSENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLD-------

Query:  -----------------------------------------------------LDTPNPDFSPSSTTILLGRPFMKTAKTVIDVDFG
                                                             LD  N D    +T ILLGRPF+KT+KT IDV  G
Subjt:  -----------------------------------------------------LDTPNPDFSPSSTTILLGRPFMKTAKTVIDVDFG

XP_024041424.1 uncharacterized protein LOC112098931 [Citrus clementina]5.0e-2630Show/hide
Query:  MDRSMVDVASGGTLVNKMPTETRQLISSMTENSRTGAGESLWALLDDQPYYRWDLIDLQIRSCGLYSMTSHTKSEGGGTTRISVMETNSGRSCLNIIRLQ
        MDRSM+D ASGG LVNK PT+ R+LIS+M  N++                                  TS T +E    T +S+    +  S        
Subjt:  MDRSMVDVASGGTLVNKMPTETRQLISSMTENSRTGAGESLWALLDDQPYYRWDLIDLQIRSCGLYSMTSHTKSEGGGTTRISVMETNSGRSCLNIIRLQ

Query:  VPPPQVVTQVSKMDNKSSGKLPAQPEVNPNANVS--VIWSGV------AKLDYLLGFSSVSNKFSSDFND-----------VIAFPYPRVDLTNPITMSV
            Q+ T VS+++++  G+LP+Q EVNP  NVS  ++ SG        K+   +      N+      D           VI  P+P     +      
Subjt:  VPPPQVVTQVSKMDNKSSGKLPAQPEVNPNANVS--VIWSGV------AKLDYLLGFSSVSNKFSSDFND-----------VIAFPYPRVDLTNPITMSV

Query:  QDDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEEV--SENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLDL------------
        +D L+ F++V+VNIPLL+AI +I  YA+ LK     + +  G E+V   ENVS +L   LP KC DP MFT+PC IG  +++ AMLDL            
Subjt:  QDDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEEV--SENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLDL------------

Query:  ------------------DTPN-----------------------------PDFSPSSTTILLGRPFMKTAKTVIDVDFG
                          D  N                              D S +S  ILLG+PF+KTA+T +DV  G
Subjt:  ------------------DTPN-----------------------------PDFSPSSTTILLGRPFMKTAKTVIDVDFG

XP_024042080.1 uncharacterized protein LOC112099192, partial [Citrus clementina]4.5e-2727.69Show/hide
Query:  MDRSMVDVASGGTLVNKMPTETRQLISSMTENSR--------------------------------TGAGESLWALLDDQPYYRWDLI----DLQIRSCG
        MDRSM+D ASGG LVNK PT+ R+LIS+M  N++                                 G    +   L ++P  + + +     +  R   
Subjt:  MDRSMVDVASGGTLVNKMPTETRQLISSMTENSR--------------------------------TGAGESLWALLDDQPYYRWDLI----DLQIRSCG

Query:  LYSMT------SHTKSEGGGTTRISVMETNSGRSCLNIIRLQVPPPQVVTQVSKMDNKSSGKLPAQPEVNP--NANVSVIWSG-----------------
         Y+ T       H     G T      +  +     +I  L+    Q+ T +S+++++ SGKLP+Q EVNP  NA+  ++ SG                 
Subjt:  LYSMT------SHTKSEGGGTTRISVMETNSGRSCLNIIRLQVPPPQVVTQVSKMDNKSSGKLPAQPEVNP--NANVSVIWSG-----------------

Query:  VAKLDYLLGFSSVSNKFSSDFNDVIAFPYPRVDLTNPITMSVQDDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEEV--SENVSVLLSD
        + K + +          +     VI  P+P     +      +D L+ F++V+VNIPLL+AI +I  YA+FLK     + +  G E V   ENVSV+L  
Subjt:  VAKLDYLLGFSSVSNKFSSDFNDVIAFPYPRVDLTNPITMSVQDDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEEV--SENVSVLLSD

Query:  DLPAKCSDPSMFTLPCMIGHYKIKYAMLDL------------------------------DTPN-----------------------------PDFSPSS
         LP KC DP MFT+PC IG  +++ AMLDL                              D  N                              D S +S
Subjt:  DLPAKCSDPSMFTLPCMIGHYKIKYAMLDL------------------------------DTPN-----------------------------PDFSPSS

Query:  TTILLGRPFMKTAKTVIDVDFGEWRKIVVDVLHQFFIY------SPKHSVLNLGI
          ILLGRPF+KTA+T +DV  G         + +F IY      S +HSV ++ +
Subjt:  TTILLGRPFMKTAKTVIDVDFGEWRKIVVDVLHQFFIY------SPKHSVLNLGI

XP_024046622.1 uncharacterized protein LOC112100976, partial [Citrus clementina]1.1e-2528.09Show/hide
Query:  MDRSMVDVASGGTLVNKMPTETRQLISSMTENSRTGAGESLWALLDDQPYYRWDLIDLQIRSCGLYSMTSHTKSEGGGTTRISVMETNSGRSCLNIIRLQ
        MD +M+D ASGG LVNK P + R+LIS+M  N++                                  TS   +E   T  I  +E              
Subjt:  MDRSMVDVASGGTLVNKMPTETRQLISSMTENSRTGAGESLWALLDDQPYYRWDLIDLQIRSCGLYSMTSHTKSEGGGTTRISVMETNSGRSCLNIIRLQ

Query:  VPPPQVVTQVSKMDNKSSGKLPAQPEVNP--NANVSVIWSG-----------------VAKLDYLLGFSSVSNKFSSDFNDVIAFPYPRVDLTNPITMSV
            Q+ T +S+++++ SGKLP+Q EVNP  NA+  ++ SG                 + K + +          +     VI  P+P     +      
Subjt:  VPPPQVVTQVSKMDNKSSGKLPAQPEVNP--NANVSVIWSG-----------------VAKLDYLLGFSSVSNKFSSDFNDVIAFPYPRVDLTNPITMSV

Query:  QDDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEEV--SENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLDL------------
        +D L+ F++V+VNIPLL+AI +I  YA+FLK     + +  G E V   ENVS +L   LP KC DP MFT+PC IG  +++ AMLDL            
Subjt:  QDDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEEV--SENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLDL------------

Query:  ------------------DTPN-----------------------------PDFSPSSTTILLGRPFMKTAKTVIDVDFGEWRKIVVDVLHQFFIY----
                          D  N                              D S +S  ILLGRPF+KTA+T +DV  G         + +F +Y    
Subjt:  ------------------DTPN-----------------------------PDFSPSSTTILLGRPFMKTAKTVIDVDFGEWRKIVVDVLHQFFIY----

Query:  --SPKHSVLNLGI
          S +HSV ++ +
Subjt:  --SPKHSVLNLGI

TrEMBL top hitse value%identityAlignment
A0A2G9HVP4 Retrotrans_gag domain-containing protein4.2e-2330.49Show/hide
Query:  MDRSMVDVASGGTLVNKMPTETRQLISSMTENSRT-GAGESLW--------ALLDDQPYYRWDLID-------LQIRSCGLYSMTSHTKSEGGGTTRISV
        M+R M+D ASGG +VNK P+E R LIS+M  NS+  G G   +        + L+ Q      L+         Q+++CG+ +   H  ++   T +   
Subjt:  MDRSMVDVASGGTLVNKMPTETRQLISSMTENSRT-GAGESLW--------ALLDDQPYYRWDLID-------LQIRSCGLYSMTSHTKSEGGGTTRISV

Query:  METNSGRSCLNIIRLQVPPPQVVTQVSKMDNKSSGKLPAQPEVNPNANVSVI-------------------WSGVAKLDYLLGFSSVSNKFSSDFNDVIA
         E  +  +  +I  L+    Q+ T V++++  S GKLP+Q  VNP  N+S I                       A+ +  +  S          + ++ 
Subjt:  METNSGRSCLNIIRLQVPPPQVVTQVSKMDNKSSGKLPAQPEVNPNANVSVI-------------------WSGVAKLDYLLGFSSVSNKFSSDFNDVIA

Query:  F-PYPRVDLTNPITMSVQDDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKE--EVSENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKY
          P+P     +      ++ L+ F++V+VNIPLL+AI +I  YA+FLK     +++  G E   V EN+SV+L   LP KC DP MF++PC IG  +I+ 
Subjt:  F-PYPRVDLTNPITMSVQDDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKE--EVSENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKY

Query:  AMLDL
         M DL
Subjt:  AMLDL

A0A6I9T1C3 uncharacterized protein LOC1051601773.2e-2327.94Show/hide
Query:  MDRSMVDVASGGTLVNKMPTETRQLISSMTENSRTGAGESLWALLDDQPYYRWDLIDLQIRSCGLYSMTSHTKSEGGGTTRISVMETNSGRSCLNIIRLQ
        MDR ++D ASGG L +K PTE R+LIS M  N+     +      DD P                                    ++N  ++  +I  L+
Subjt:  MDRSMVDVASGGTLVNKMPTETRQLISSMTENSRTGAGESLWALLDDQPYYRWDLIDLQIRSCGLYSMTSHTKSEGGGTTRISVMETNSGRSCLNIIRLQ

Query:  VPPPQVVTQVSKMDNKSSGKLPAQPEVNPNANVSVIWSGVAK-------LDYLLGFSSVSNKFSSDFNDVIAFPYPRVDLTNPITM--------------
            Q+ + V K++  S GKLP+Q  +NP  NVS I     K        D  +    V  K   +         P  +   P+                
Subjt:  VPPPQVVTQVSKMDNKSSGKLPAQPEVNPNANVSVIWSGVAK-------LDYLLGFSSVSNKFSSDFNDVIAFPYPRVDLTNPITM--------------

Query:  -SVQDDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEEVS--ENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLDLDTP------
           ++ L+ F++V+VNIPLL+AI +I  YA+F K     + +  G E VS  ENVS +L   LP KC+DP  F++PC IG   I+ AM DL         
Subjt:  -SVQDDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEEVS--ENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLDLDTP------

Query:  -----------------------------------------------------NPDFSPSSTTILLGRPFMKTAKTVIDVDFG
                                                               D  P+ST+ILLGRPF+KT+KT IDVD G
Subjt:  -----------------------------------------------------NPDFSPSSTTILLGRPFMKTAKTVIDVDFG

A0A6P6SC24 uncharacterized protein LOC1136894823.8e-2436.94Show/hide
Query:  QVVTQVSKMDNKSSGKLPAQPEVNP-NANVSVIWSG--VAKLDYLLGFSSVSNKFSSDFN-------DVIAFPYPRVDL-TNPITM-----------SVQ
        Q+ T ++++D ++ GKLP+QPE+NP N +   + SG  +   + ++       K  ++         D    P P + + TNP+               +
Subjt:  QVVTQVSKMDNKSSGKLPAQPEVNP-NANVSVIWSG--VAKLDYLLGFSSVSNKFSSDFN-------DVIAFPYPRVDL-TNPITM-----------SVQ

Query:  DDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEE--VSENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLDLDTPNPDFSPSS--
        + L+VF++V++NIPLL+AI ++  YA+FL+     R R  G E   V ENVS +L   LP KC DP MFT+PC IG+  I+ AMLDL   + +  P S  
Subjt:  DDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEE--VSENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLDLDTPNPDFSPSS--

Query:  TTILLGRPFMKTAKTVIDVDFG
        T++ LGRPFM TA+T IDV+ G
Subjt:  TTILLGRPFMKTAKTVIDVDFG

A0A6P6TG20 uncharacterized protein LOC1137007181.3e-2434.71Show/hide
Query:  QVVTQVSKMDNKSSGKLPAQPEVN-PNANVSVIWSG--VAKLDYLLGFSSVSNKFSSDFN-------DVIAFPYPRVDL-TNPITMS-----------VQ
        Q+ T ++++D+++ GKLP+QPE+N  N +   + SG  +   + ++       K  ++         D+   P P +   TNP   S            +
Subjt:  QVVTQVSKMDNKSSGKLPAQPEVN-PNANVSVIWSG--VAKLDYLLGFSSVSNKFSSDFN-------DVIAFPYPRVDL-TNPITMS-----------VQ

Query:  DDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEE--VSENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLDLDTP----------
        + L+VF++V++NIPLL+AI ++  YA+FL+  Y  R+R  G E   V ENVS +L   LP KC DPSMFT+PC IG+  I+  MLDL             
Subjt:  DDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEE--VSENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLDLDTP----------

Query:  ------------NPDFSPSSTTILLGRPFMKTAKTVIDVDFG
                    + D SP  + +LLGRPFM TA+T IDV+ G
Subjt:  ------------NPDFSPSSTTILLGRPFMKTAKTVIDVDFG

A0A6P6X2Y5 uncharacterized protein LOC1137388671.1e-2331.27Show/hide
Query:  DRSMVDVASGGTLVNKMPTETRQLISSMTENSRTGAGESLWALLDDQPYYRWDLIDLQIRSCGLYSMTSHTKSEGGGTTR-----------ISVMETNSG
        DRS++D ASGG L NK P E   LI +M ENS+       +   +  P  R +  +       L  +TS  + +G    R                +NSG
Subjt:  DRSMVDVASGGTLVNKMPTETRQLISSMTENSRTGAGESLWALLDDQPYYRWDLIDLQIRSCGLYSMTSHTKSEGGGTTR-----------ISVMETNSG

Query:  RSCLNIIR-------------------LQVPPPQVVTQVSKMDNKSSGKLPAQPEVNPNANVSVIWSGVAKLDYLLGFSSVSNKFSSD-------FNDVI
         S  ++++                    +    Q+ T ++++++   GKLP+QPEVN + NVS +     K   L G    ++K  SD         +  
Subjt:  RSCLNIIR-------------------LQVPPPQVVTQVSKMDNKSSGKLPAQPEVNPNANVSVIWSGVAKLDYLLGFSSVSNKFSSD-------FNDVI

Query:  AFPYPRVDLTNPITMS------------------VQDDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEEVS--ENVSVLLSDDLPAKCS
            P+V  T+P+T+S                   ++ LDVF++V++NIPLL+AI +I  YA+FLK     + +  G E V+  ENVS +L   LP KC 
Subjt:  AFPYPRVDLTNPITMS------------------VQDDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRERSSGKEEVS--ENVSVLLSDDLPAKCS

Query:  DPSMFTLPCMIGHYKIKYAMLDL
        DP MFT+PC IG  +I+ AMLDL
Subjt:  DPSMFTLPCMIGHYKIKYAMLDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCGGAGCATGGTGGATGTTGCCAGTGGGGGAACACTTGTGAACAAAATGCCAACTGAAACAAGGCAACTAATTTCTAGTATGACTGAGAATTCTCGGACAGGTGC
AGGCGAAAGCCTGTGGGCTCTGCTCGATGACCAGCCATACTACCGGTGGGACCTAATCGACCTACAAATCAGAAGTTGTGGGCTCTACTCGATGACCAGCCATACGAAGA
GTGAGGGTGGAGGGACCACCCGAATTTCAGTTATGGAAACTAACAGCGGTCGTTCGTGTTTAAACATAATCAGGCTTCAAGTTCCTCCACCTCAGGTGGTAACTCAAGTC
AGCAAGATGGACAATAAGAGCTCTGGAAAGCTTCCTGCTCAACCTGAGGTGAACCCGAATGCGAACGTGAGTGTTATATGGAGTGGAGTGGCCAAACTAGACTACCTTCT
AGGATTTTCTTCTGTTTCTAATAAATTTTCTTCTGATTTTAATGATGTGATTGCTTTTCCTTACCCTCGTGTGGATTTGACTAACCCTATTACAATGAGTGTGCAGGATG
ATCTTGATGTTTTTAAGAGGGTGAAGGTGAACATTCCCTTACTTGAGGCGATTCTAAAAATTCTAGCCTACGCTGAGTTCCTCAAAGCTTGGTATGAAGGTAGAGAAAGA
TCCTCAGGTAAAGAAGAGGTTAGTGAAAATGTTAGTGTTTTATTATCTGATGACTTGCCTGCTAAGTGTTCTGACCCTAGCATGTTTACATTACCGTGCATGATCGGACA
TTATAAGATAAAATATGCCATGCTTGATTTAGATACTCCTAATCCTGATTTCTCTCCTTCCTCTACGACGATACTTCTTGGTCGACCTTTTATGAAGACTGCCAAGACAG
TCATTGACGTTGATTTCGGGGAATGGAGGAAGATAGTTGTCGATGTCCTCCACCAGTTCTTCATATATTCTCCTAAGCACTCGGTCTTGAATCTAGGCATAGGGCAGGTA
CATCTGCACTGGTTCTGTAACTGCTCCAGGTTTTTCCTCAACTTCTCTAAATTATCTTCAGGTGGTTGCGGAGAAGCTAAGGATTGCACTGGATTTGCGGCCGCACTTGC
CGTAGTAAAAGTTGACTATGAGCTTTCTGATCAAGCTCGCATCTACGGAAGTCTTGGCAACCTCTCTTTCCTCTTGAGCCTCGGGCTCAACTTCTGCCATTCTGTAGAGC
TTCGTGATCAAACTGTGGAAGAAGAGCCTTCCCATATTCTTGTGCCCACAGATGAGGATCTCTGTCCATATGATCCGACCAAGGTTGAGATCCAAGGCTTTTTAAATGTT
TTACAACAACATGAGACGCTCTTGAGAAATGGTGGTGTCATGTCTTGTAGGCATTATTCTGTTCTTAATCACATAAAGTCATACTGCAGCTTCTACCTTCAAGTCTCTAG
AAAGTGGAGTCTTTATCCTCATCATGGACTTTTTCTACTGGTGCCTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACCGGAGCATGGTGGATGTTGCCAGTGGGGGAACACTTGTGAACAAAATGCCAACTGAAACAAGGCAACTAATTTCTAGTATGACTGAGAATTCTCGGACAGGTGC
AGGCGAAAGCCTGTGGGCTCTGCTCGATGACCAGCCATACTACCGGTGGGACCTAATCGACCTACAAATCAGAAGTTGTGGGCTCTACTCGATGACCAGCCATACGAAGA
GTGAGGGTGGAGGGACCACCCGAATTTCAGTTATGGAAACTAACAGCGGTCGTTCGTGTTTAAACATAATCAGGCTTCAAGTTCCTCCACCTCAGGTGGTAACTCAAGTC
AGCAAGATGGACAATAAGAGCTCTGGAAAGCTTCCTGCTCAACCTGAGGTGAACCCGAATGCGAACGTGAGTGTTATATGGAGTGGAGTGGCCAAACTAGACTACCTTCT
AGGATTTTCTTCTGTTTCTAATAAATTTTCTTCTGATTTTAATGATGTGATTGCTTTTCCTTACCCTCGTGTGGATTTGACTAACCCTATTACAATGAGTGTGCAGGATG
ATCTTGATGTTTTTAAGAGGGTGAAGGTGAACATTCCCTTACTTGAGGCGATTCTAAAAATTCTAGCCTACGCTGAGTTCCTCAAAGCTTGGTATGAAGGTAGAGAAAGA
TCCTCAGGTAAAGAAGAGGTTAGTGAAAATGTTAGTGTTTTATTATCTGATGACTTGCCTGCTAAGTGTTCTGACCCTAGCATGTTTACATTACCGTGCATGATCGGACA
TTATAAGATAAAATATGCCATGCTTGATTTAGATACTCCTAATCCTGATTTCTCTCCTTCCTCTACGACGATACTTCTTGGTCGACCTTTTATGAAGACTGCCAAGACAG
TCATTGACGTTGATTTCGGGGAATGGAGGAAGATAGTTGTCGATGTCCTCCACCAGTTCTTCATATATTCTCCTAAGCACTCGGTCTTGAATCTAGGCATAGGGCAGGTA
CATCTGCACTGGTTCTGTAACTGCTCCAGGTTTTTCCTCAACTTCTCTAAATTATCTTCAGGTGGTTGCGGAGAAGCTAAGGATTGCACTGGATTTGCGGCCGCACTTGC
CGTAGTAAAAGTTGACTATGAGCTTTCTGATCAAGCTCGCATCTACGGAAGTCTTGGCAACCTCTCTTTCCTCTTGAGCCTCGGGCTCAACTTCTGCCATTCTGTAGAGC
TTCGTGATCAAACTGTGGAAGAAGAGCCTTCCCATATTCTTGTGCCCACAGATGAGGATCTCTGTCCATATGATCCGACCAAGGTTGAGATCCAAGGCTTTTTAAATGTT
TTACAACAACATGAGACGCTCTTGAGAAATGGTGGTGTCATGTCTTGTAGGCATTATTCTGTTCTTAATCACATAAAGTCATACTGCAGCTTCTACCTTCAAGTCTCTAG
AAAGTGGAGTCTTTATCCTCATCATGGACTTTTTCTACTGGTGCCTTCATGA
Protein sequenceShow/hide protein sequence
MDRSMVDVASGGTLVNKMPTETRQLISSMTENSRTGAGESLWALLDDQPYYRWDLIDLQIRSCGLYSMTSHTKSEGGGTTRISVMETNSGRSCLNIIRLQVPPPQVVTQV
SKMDNKSSGKLPAQPEVNPNANVSVIWSGVAKLDYLLGFSSVSNKFSSDFNDVIAFPYPRVDLTNPITMSVQDDLDVFKRVKVNIPLLEAILKILAYAEFLKAWYEGRER
SSGKEEVSENVSVLLSDDLPAKCSDPSMFTLPCMIGHYKIKYAMLDLDTPNPDFSPSSTTILLGRPFMKTAKTVIDVDFGEWRKIVVDVLHQFFIYSPKHSVLNLGIGQV
HLHWFCNCSRFFLNFSKLSSGGCGEAKDCTGFAAALAVVKVDYELSDQARIYGSLGNLSFLLSLGLNFCHSVELRDQTVEEEPSHILVPTDEDLCPYDPTKVEIQGFLNV
LQQHETLLRNGGVMSCRHYSVLNHIKSYCSFYLQVSRKWSLYPHHGLFLLVPS