; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011572 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011572
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr1:27980878..27982302
RNA-Seq ExpressionLag0011572
SyntenyLag0011572
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]7.2e-3640.4Show/hide
Query:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA
        D+AL+T+INAT SP ALAY +G T+S+QVW  L K +SS SRS++V LKSDL++I   P E+I+A+++RIK+I +KLA V   I++ED+ IYALNGLP+ 
Subjt:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA

Query:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNE----------VSSQSL---AMTANFSFGVGRSNSRNNRENDRGRGRSS------GEGLFQQ-
        YN F+T++ TRSQ ++FEELHVL+ ++E AL +Q K +           SSQSL   A T N +F  G  + +N      G GR S      G GL Q+ 
Subjt:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNE----------VSSQSL---AMTANFSFGVGRSNSRNNRENDRGRGRSS------GEGLFQQ-

Query:  -------------------------LLNYSFPTSPNGSGGRGSGGSSQKIDSSGAGNSPKVLLSDSGYNAHLTSDLANMKISSEYNGEANVIVRNGQALP
                                  +NY+F              S      S   +S    L+DSG N H+TSD+  + ++ EYNGE  V V NGQ  P
Subjt:  -------------------------LLNYSFPTSPNGSGGRGSGGSSQKIDSSGAGNSPKVLLSDSGYNAHLTSDLANMKISSEYNGEANVIVRNGQALP

Query:  VT
        ++
Subjt:  VT

KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]9.4e-3641.64Show/hide
Query:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA
        D+AL+T+INAT SP ALAY +G TTS+QVW+ L K +SSSSRS++V LKSDL++IS    E+I+A+++RIK+I +KLA V  V++ ED+ IYALNGLP+ 
Subjt:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA

Query:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVK-NEVSSQSLAMTANF-SFGVGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSGGRGS
        YN F+T++ TRS  ++FEELHVL+ ++E AL +Q K +++  Q  A+ A+  S     S   NN    RGRGR  G G        SF T      GRG 
Subjt:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVK-NEVSSQSLAMTANF-SFGVGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSGGRGS

Query:  GGSSQKIDSSGAGN--SPKVLL----------SDSGYNAH----------------------LTSDLANMKISSEYNGEANVIVRNGQALPVT
        GGSSQ+     A N  S ++ L          +   YN H                      + SD+  + ++S Y+GE  V V +GQ+LP++
Subjt:  GGSSQKIDSSGAGN--SPKVLL----------SDSGYNAH----------------------LTSDLANMKISSEYNGEANVIVRNGQALPVT

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-3741.06Show/hide
Query:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA
        D+AL+T+INAT SP ALAY +G TTS+QVW+ L K +SSSSRS++V LKSDL++IS    E+I+A+++RIK+I +KLA V  V++ ED+ IYALNGLP+ 
Subjt:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA

Query:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVK-NEVSSQSLAMTANF-SFGVGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSGGRGS
        YN F+T++ TRS  ++FEELHVL+ ++E AL +Q K +++  Q  A+ A+  S     S   NN    RGRGR  G G        SF T      GRG 
Subjt:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVK-NEVSSQSLAMTANF-SFGVGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSGGRGS

Query:  GGSSQKIDSSGAGN--SPKVLL----------SDSGYNAH----------------------LTSDLANMKISSEYNGEANVIVRNGQALPVTDTSCSSI
        GGSSQ+     A N  S ++ L          +   YN H                      + SD+  + ++S Y+GE  V V +GQ+LP++ + C + 
Subjt:  GGSSQKIDSSGAGN--SPKVLL----------SDSGYNAH----------------------LTSDLANMKISSEYNGEANVIVRNGQALPVTDTSCSSI

Query:  HT
         T
Subjt:  HT

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]7.2e-3640.4Show/hide
Query:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA
        D+AL+T+INAT SP ALAY +G T+S+QVW  L K +SS SRS++V LKSDL++I   P E+I+A+++RIK+I +KLA V   I++ED+ IYALNGLP+ 
Subjt:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA

Query:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNE----------VSSQSL---AMTANFSFGVGRSNSRNNRENDRGRGRSS------GEGLFQQ-
        YN F+T++ TRSQ ++FEELHVL+ ++E AL +Q K +           SSQSL   A T N +F  G  + +N      G GR S      G GL Q+ 
Subjt:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNE----------VSSQSL---AMTANFSFGVGRSNSRNNRENDRGRGRSS------GEGLFQQ-

Query:  -------------------------LLNYSFPTSPNGSGGRGSGGSSQKIDSSGAGNSPKVLLSDSGYNAHLTSDLANMKISSEYNGEANVIVRNGQALP
                                  +NY+F              S      S   +S    L+DSG N H+TSD+  + ++ EYNGE  V V NGQ  P
Subjt:  -------------------------LLNYSFPTSPNGSGGRGSGGSSQKIDSSGAGNSPKVLLSDSGYNAHLTSDLANMKISSEYNGEANVIVRNGQALP

Query:  VT
        ++
Subjt:  VT

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]6.9e-3938.49Show/hide
Query:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA
        D+AL+TLINAT S  ALAY +   TS+QVW  LEKH+SS+SR+++V LKSDL+SI     E+I+A+V+RIK+I +K A V + I+ E + IYALNGL + 
Subjt:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA

Query:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNE-VSSQSLAMTANFSFGVGRSNS-RNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSGGRGS
        YN   T++ TR+Q++SFEELHV M S+E A+E+Q+K E + +Q  A+ A+      R+++   N+ +DRGRG+++G G      N++ PT  N   GR S
Subjt:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNE-VSSQSLAMTANFSFGVGRSNS-RNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSGGRGS

Query:  GG--SSQKIDSSG----------------------------------------------AGNSPKVLLSDSGYNAHLTSDLANM---KISSEYNGEANVI
        G   +S + D+                                                  +SP   L+DS  N H+T+DL+N+    I+S+YNGE N+ 
Subjt:  GG--SSQKIDSSG----------------------------------------------AGNSPKVLLSDSGYNAHLTSDLANM---KISSEYNGEANVI

Query:  VRNGQALPVTDTSCSSI
        V +GQ+ P+T   C  +
Subjt:  VRNGQALPVTDTSCSSI

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X21.9e-3436.16Show/hide
Query:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA
        D+AL+T+INAT SP ALAY +G T+S+QVW  L K +SS SRS++V LKSDL++I   P E+I+A+++RIK+I +KLA V   I++ED+ IYALNGLP+ 
Subjt:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA

Query:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNEVS--SQSLAMTANFSFGVGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSG----
        YN F+T++ TRSQ ++FEELHVL+ ++E AL +Q K + S    ++ ++++ S         NN     G G+  G G F      SF     G G    
Subjt:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNEVS--SQSLAMTANFSFGVGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSG----

Query:  -----------------GRGSGGSSQKIDSSGAGNSP--------------------KVLLSDSGYNAHLTSDLANMKISSEYNGEANVIVRNGQALPVT
                         G  +     +++ +  G  P                       L+DSG N  +TSD+  + ++ EYNGE  V + NGQ  P++
Subjt:  -----------------GRGSGGSSQKIDSSGAGNSP--------------------KVLLSDSGYNAHLTSDLANMKISSEYNGEANVIVRNGQALPVT

Query:  DTSCSSIHTGFSSFVLSN
               H+G   F L++
Subjt:  DTSCSSIHTGFSSFVLSN

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X11.9e-3436.16Show/hide
Query:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA
        D+AL+T+INAT SP ALAY +G T+S+QVW  L K +SS SRS++V LKSDL++I   P E+I+A+++RIK+I +KLA V   I++ED+ IYALNGLP+ 
Subjt:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA

Query:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNEVS--SQSLAMTANFSFGVGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSG----
        YN F+T++ TRSQ ++FEELHVL+ ++E AL +Q K + S    ++ ++++ S         NN     G G+  G G F      SF     G G    
Subjt:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNEVS--SQSLAMTANFSFGVGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSG----

Query:  -----------------GRGSGGSSQKIDSSGAGNSP--------------------KVLLSDSGYNAHLTSDLANMKISSEYNGEANVIVRNGQALPVT
                         G  +     +++ +  G  P                       L+DSG N  +TSD+  + ++ EYNGE  V + NGQ  P++
Subjt:  -----------------GRGSGGSSQKIDSSGAGNSP--------------------KVLLSDSGYNAHLTSDLANMKISSEYNGEANVIVRNGQALPVT

Query:  DTSCSSIHTGFSSFVLSN
               H+G   F L++
Subjt:  DTSCSSIHTGFSSFVLSN

A0A6J1D9L6 uncharacterized protein LOC1110188923.4e-3938.49Show/hide
Query:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA
        D+AL+TLINAT S  ALAY +   TS+QVW  LEKH+SS+SR+++V LKSDL+SI     E+I+A+V+RIK+I +K A V + I+ E + IYALNGL + 
Subjt:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA

Query:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNE-VSSQSLAMTANFSFGVGRSNS-RNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSGGRGS
        YN   T++ TR+Q++SFEELHV M S+E A+E+Q+K E + +Q  A+ A+      R+++   N+ +DRGRG+++G G      N++ PT  N   GR S
Subjt:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNE-VSSQSLAMTANFSFGVGRSNS-RNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSGGRGS

Query:  GG--SSQKIDSSG----------------------------------------------AGNSPKVLLSDSGYNAHLTSDLANM---KISSEYNGEANVI
        G   +S + D+                                                  +SP   L+DS  N H+T+DL+N+    I+S+YNGE N+ 
Subjt:  GG--SSQKIDSSG----------------------------------------------AGNSPKVLLSDSGYNAHLTSDLANM---KISSEYNGEANVI

Query:  VRNGQALPVTDTSCSSI
        V +GQ+ P+T   C  +
Subjt:  VRNGQALPVTDTSCSSI

A0A6J1DT57 uncharacterized protein LOC1110241491.1e-3453.45Show/hide
Query:  ITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSAYNVF
        +TLINAT SPSA AY +G T+S+++W  LEKH+SSSSR+++V LKSDL+SIS    E I+ +V+RIK++ +KL  V VV+D ED+ IY LNGLPSAYNVF
Subjt:  ITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSAYNVF

Query:  KTTVCTRSQNLSFEELHVLMSSKEHALEQQVK-NEVSSQSLAMTA--NFSFGVGRSNSRNNR-ENDRGRGRSSG
        +T++ TRSQ+++F+ELHVLM S+E AL++QVK +++ SQ+ A+ A  N S      NS ++R + ++G GR+ G
Subjt:  KTTVCTRSQNLSFEELHVLMSSKEHALEQQVK-NEVSSQSLAMTA--NFSFGVGRSNSRNNR-ENDRGRGRSSG

A0A6J1DYF1 uncharacterized protein LOC1110257091.0e-3549.03Show/hide
Query:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA
        D AL+TL+NAT SPSALAY +GC +SQQVW  L K++SSSSR+++V LKS+L+SIS  P E+I+ ++QRIK++ +KLA V V++D ED+ IY LNGLP  
Subjt:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA

Query:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVK-NEVSSQSLAMTANFSFGVGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSGGRGSG
        +N F T++CTRSQ++SFEEL+VL+  +E A+++Q K +EV  QS  + AN +      NS  N    RG     G+G F    N S  +S   +GGR  G
Subjt:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVK-NEVSSQSLAMTANFSFGVGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSGGRGSG

Query:  GSSQKI
         S   I
Subjt:  GSSQKI

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.5e-0723.51Show/hide
Query:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA
        D+ + + +    S S        TT+ Q+W  L K +++ S  H+  L++ L+  +   T+TI+ ++Q +    ++LA++   +D ++     L  LP  
Subjt:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA

Query:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQ--------VKNEVSSQSLAMTANFSFG--VGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSP
        Y      +  +    +  E+H  + + E  +             N VS ++   T N + G    R ++RNN  N +   +SS    F    N S P   
Subjt:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQ--------VKNEVSSQSLAMTANFSFG--VGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSP

Query:  N----GSGGRGSGGSSQ------KIDSS-----------------GAGNSPKVLLSDSGYNAHLTSDLANMKISSEYNGEANVIVRNGQALPVTDTSCSS
             G  G  +   SQ       ++S                  G+  S    L DSG   H+TSD  N+ +   Y G  +V+V +G  +P++ T  +S
Subjt:  N----GSGGRGSGGSSQ------KIDSS-----------------GAGNSPKVLLSDSGYNAHLTSDLANMKISSEYNGEANVIVRNGQALPVTDTSCSS

Query:  IHTGFSSFVLSNLLRVPHI
        + T      L N+L VP+I
Subjt:  IHTGFSSFVLSNLLRVPHI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.9e-0523.01Show/hide
Query:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA
        D+ + + I    S S        TT+ Q+W  L K +++ S  H+  L+                F+ R     ++LA++   +D ++     L  LP  
Subjt:  DRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSA

Query:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNEVSSQSLAMTANFSFGVGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSGGRGSG-
        Y      +  +    S  E+H  + ++E  L      EV    + +TAN      R+ + N  +N+RG  R+     +    N S    P+ SG R    
Subjt:  YNVFKTTVCTRSQNLSFEELHVLMSSKEHALEQQVKNEVSSQSLAMTANFSFGVGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSGGRGSG-

Query:  ------GSSQKIDSSG----------------------------------AGNSP---KVLLSDSGYNAHLTSDLANMKISSEYNGEANVIVRNGQALPV
              G  Q     G                                  A NSP      L DSG   H+TSD  N+     Y G  +V++ +G  +P+
Subjt:  ------GSSQKIDSSG----------------------------------AGNSP---KVLLSDSGYNAHLTSDLANMKISSEYNGEANVIVRNGQALPV

Query:  TDTSCSSIHTGFSSFVLSNLLRVPHI
        T T  +S+ T   S  L+ +L VP+I
Subjt:  TDTSCSSIHTGFSSFVLSNLLRVPHI

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCGGGCTTTAATTACTCTTATCAATGCAACATCATCACCATCTGCACTTGCGTACACCATTGGTTGCACAACTTCACAACAGGTTTGGTCTCGCCTTGAAAAACA
TTTTTCTTCTTCTTCGCGGTCGCATATTGTTGGATTAAAGTCAGATTTACGGAGTATTTCGTATTTACCTACTGAAACAATAAATGCTTTTGTTCAACGAATTAAAGACA
TTATGAACAAACTTGCTGTAGTTTTGGTTGTTATCGATCAAGAGGATGTTGCGATTTATGCTCTTAACGGACTTCCTTCAGCCTATAATGTGTTTAAAACTACGGTTTGT
ACGAGATCTCAGAATCTTTCTTTCGAGGAACTTCATGTGTTGATGAGTTCCAAAGAACATGCGCTTGAACAACAAGTTAAGAATGAGGTCTCTTCTCAGTCTCTTGCTAT
GACTGCAAATTTTTCTTTTGGTGTTGGTCGTAGTAATTCTAGAAATAATCGTGAAAATGATCGAGGAAGAGGTCGATCTAGTGGAGAAGGGTTGTTTCAACAATTACTTA
ACTATTCCTTTCCGACTTCTCCTAATGGATCTGGTGGTCGAGGATCTGGTGGATCTTCTCAAAAAATTGACTCGTCTGGTGCTGGTAATTCTCCGAAGGTTTTGCTTTCA
GATTCTGGATATAATGCACACTTAACATCTGATTTAGCAAATATGAAGATTTCATCCGAATATAATGGTGAGGCAAATGTCATTGTTAGAAATGGTCAAGCCCTGCCAGT
TACAGATACAAGTTGTTCCTCTATTCATACTGGTTTTTCCTCTTTTGTTCTTTCTAATCTTCTTCGAGTTCCACATATTGAAGGATTGGAATTCTCAATTCCGTTACAGC
GGAAGCAATTGGACCGTTCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATCGGGCTTTAATTACTCTTATCAATGCAACATCATCACCATCTGCACTTGCGTACACCATTGGTTGCACAACTTCACAACAGGTTTGGTCTCGCCTTGAAAAACA
TTTTTCTTCTTCTTCGCGGTCGCATATTGTTGGATTAAAGTCAGATTTACGGAGTATTTCGTATTTACCTACTGAAACAATAAATGCTTTTGTTCAACGAATTAAAGACA
TTATGAACAAACTTGCTGTAGTTTTGGTTGTTATCGATCAAGAGGATGTTGCGATTTATGCTCTTAACGGACTTCCTTCAGCCTATAATGTGTTTAAAACTACGGTTTGT
ACGAGATCTCAGAATCTTTCTTTCGAGGAACTTCATGTGTTGATGAGTTCCAAAGAACATGCGCTTGAACAACAAGTTAAGAATGAGGTCTCTTCTCAGTCTCTTGCTAT
GACTGCAAATTTTTCTTTTGGTGTTGGTCGTAGTAATTCTAGAAATAATCGTGAAAATGATCGAGGAAGAGGTCGATCTAGTGGAGAAGGGTTGTTTCAACAATTACTTA
ACTATTCCTTTCCGACTTCTCCTAATGGATCTGGTGGTCGAGGATCTGGTGGATCTTCTCAAAAAATTGACTCGTCTGGTGCTGGTAATTCTCCGAAGGTTTTGCTTTCA
GATTCTGGATATAATGCACACTTAACATCTGATTTAGCAAATATGAAGATTTCATCCGAATATAATGGTGAGGCAAATGTCATTGTTAGAAATGGTCAAGCCCTGCCAGT
TACAGATACAAGTTGTTCCTCTATTCATACTGGTTTTTCCTCTTTTGTTCTTTCTAATCTTCTTCGAGTTCCACATATTGAAGGATTGGAATTCTCAATTCCGTTACAGC
GGAAGCAATTGGACCGTTCGTAA
Protein sequenceShow/hide protein sequence
MDRALITLINATSSPSALAYTIGCTTSQQVWSRLEKHFSSSSRSHIVGLKSDLRSISYLPTETINAFVQRIKDIMNKLAVVLVVIDQEDVAIYALNGLPSAYNVFKTTVC
TRSQNLSFEELHVLMSSKEHALEQQVKNEVSSQSLAMTANFSFGVGRSNSRNNRENDRGRGRSSGEGLFQQLLNYSFPTSPNGSGGRGSGGSSQKIDSSGAGNSPKVLLS
DSGYNAHLTSDLANMKISSEYNGEANVIVRNGQALPVTDTSCSSIHTGFSSFVLSNLLRVPHIEGLEFSIPLQRKQLDRS