CuGenDBv2

Lag0021127 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021127
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotran_gag_3 domain-containing protein
Genome location: chr7:4873203..4875338
RNA-Seq Expression: Lag0021127
Synteny: Lag0021127
Gene Ontology terms: GO:0097159 - organic cyclic compound binding (molecular function)
                     GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains: NA
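
The genome location field above can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into its parts and a span length.

    Assumes 1-based, inclusive coordinates; this is an assumption,
    not something the record itself states.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    # Inclusive span: both the start and end positions are counted.
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr7:4873203..4875338")
print(chrom, start, end, span)
```

Under that assumption the gene spans 2136 bp on chr7.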


Homology
GenBank top hits (e value, % identity, alignment)
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus] (e value: 4.2e-43; % identity: 40.58)
Query:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED
        Q N  YE W  +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V LK+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ ED
Subjt:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED

Query:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD---FGGRGNWRGRGRGRFD-------
        L IY +NGL + YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N          S L+ A +F+     G G+ +  G GRF        
Subjt:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD---FGGRGNWRGRGRGRFD-------

Query:  ------------------------------GNRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG
                                       NR     +GRHPP +LAAM     S N A  S+ N +       L+D+GCN H+T ++  +S++  YNG
Subjt:  ------------------------------GNRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG

Query:  EENITVGN
        EE + VGN
Subjt:  EENITVGN

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo] (e value: 5.5e-43; % identity: 40.58)
Query:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED
        Q N  YE W  +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V LK+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ ED
Subjt:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED

Query:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD-----FGGRGNWRGRGRGRFDG----
        L IY +NGL + YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N          S L+ A +FD       G G   G GR  FD     
Subjt:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD-----FGGRGNWRGRGRGRFDG----

Query:  -------------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG
                                       NR     +GRHPP +LAAM     S N A  S+ N +       L+D+GCN  +T ++  +S++  YNG
Subjt:  -------------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG

Query:  EENITVGN
        EE + +GN
Subjt:  EENITVGN

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo] (e value: 5.5e-43; % identity: 40.58)
Query:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED
        Q N  YE W  +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V LK+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ ED
Subjt:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED

Query:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD-----FGGRGNWRGRGRGRFDG----
        L IY +NGL + YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N          S L+ A +FD       G G   G GR  FD     
Subjt:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD-----FGGRGNWRGRGRGRFDG----

Query:  -------------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG
                                       NR     +GRHPP +LAAM     S N A  S+ N +       L+D+GCN  +T ++  +S++  YNG
Subjt:  -------------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG

Query:  EENITVGN
        EE + +GN
Subjt:  EENITVGN

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus] (e value: 4.2e-43; % identity: 40.58)
Query:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED
        Q N  YE W  +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V LK+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ ED
Subjt:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED

Query:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD---FGGRGNWRGRGRGRFD-------
        L IY +NGL + YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N          S L+ A +F+     G G+ +  G GRF        
Subjt:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD---FGGRGNWRGRGRGRFD-------

Query:  ------------------------------GNRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG
                                       NR     +GRHPP +LAAM     S N A  S+ N +       L+D+GCN H+T ++  +S++  YNG
Subjt:  ------------------------------GNRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG

Query:  EENITVGN
        EE + VGN
Subjt:  EENITVGN

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia] (e value: 6.2e-47; % identity: 41.59)
Query:  VNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAEDL
        +N  +E W  +DQAL+TLINATL+  AL+YV+   TSK+VW+ LEKH+SS+++TN+V LK+ LQS+ KK+ ESID YV+RIKEI +K   VS+ I+ E L
Subjt:  VNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAEDL

Query:  NIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAA---------------SFDFG-GRGNWRGR----------
         IY +NGLS+ YN   TS+ TR+Q+++F ELH+ MK+EE+A+E+Q+K +++    +   A+               S D G G+ N RG+          
Subjt:  NIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAA---------------SFDFG-GRGNWRGR----------

Query:  GRGRFDG----------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSN
        GRGR  G                            NR     +GRHPP +LAAM   A  NN ++ +V N +P+    WL+D+ CN H+T +L+NLS+++
Subjt:  GRGRFDG----------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSN

Query:  A---YNGEENITVGN
            YNGEENI+VG+
Subjt:  A---YNGEENITVGN

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X2 (e value: 2.6e-43; % identity: 40.58)
Query:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED
        Q N  YE W  +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V LK+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ ED
Subjt:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED

Query:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD-----FGGRGNWRGRGRGRFDG----
        L IY +NGL + YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N          S L+ A +FD       G G   G GR  FD     
Subjt:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD-----FGGRGNWRGRGRGRFDG----

Query:  -------------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG
                                       NR     +GRHPP +LAAM     S N A  S+ N +       L+D+GCN  +T ++  +S++  YNG
Subjt:  -------------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG

Query:  EENITVGN
        EE + +GN
Subjt:  EENITVGN

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X3 (e value: 2.6e-43; % identity: 40.58)
Query:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED
        Q N  YE W  +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V LK+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ ED
Subjt:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED

Query:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD-----FGGRGNWRGRGRGRFDG----
        L IY +NGL + YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N          S L+ A +FD       G G   G GR  FD     
Subjt:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD-----FGGRGNWRGRGRGRFDG----

Query:  -------------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG
                                       NR     +GRHPP +LAAM     S N A  S+ N +       L+D+GCN  +T ++  +S++  YNG
Subjt:  -------------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG

Query:  EENITVGN
        EE + +GN
Subjt:  EENITVGN

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X1 (e value: 2.6e-43; % identity: 40.58)
Query:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED
        Q N  YE W  +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V LK+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ ED
Subjt:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED

Query:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD-----FGGRGNWRGRGRGRFDG----
        L IY +NGL + YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N          S L+ A +FD       G G   G GR  FD     
Subjt:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD-----FGGRGNWRGRGRGRFDG----

Query:  -------------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG
                                       NR     +GRHPP +LAAM     S N A  S+ N +       L+D+GCN  +T ++  +S++  YNG
Subjt:  -------------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG

Query:  EENITVGN
        EE + +GN
Subjt:  EENITVGN

A0A5D3CLI6 T4.5 (e value: 2.6e-43; % identity: 40.58)
Query:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED
        Q N  YE W  +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V LK+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ ED
Subjt:  QVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAED

Query:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD-----FGGRGNWRGRGRGRFDG----
        L IY +NGL + YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N          S L+ A +FD       G G   G GR  FD     
Subjt:  LNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPN---------NSHLAMAASFD-----FGGRGNWRGRGRGRFDG----

Query:  -------------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG
                                       NR     +GRHPP +LAAM     S N A  S+ N +       L+D+GCN  +T ++  +S++  YNG
Subjt:  -------------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNG

Query:  EENITVGN
        EE + +GN
Subjt:  EENITVGN

A0A6J1D9L6 uncharacterized protein LOC111018892 (e value: 3.0e-47; % identity: 41.59)
Query:  VNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAEDL
        +N  +E W  +DQAL+TLINATL+  AL+YV+   TSK+VW+ LEKH+SS+++TN+V LK+ LQS+ KK+ ESID YV+RIKEI +K   VS+ I+ E L
Subjt:  VNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDAEDL

Query:  NIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAA---------------SFDFG-GRGNWRGR----------
         IY +NGLS+ YN   TS+ TR+Q+++F ELH+ MK+EE+A+E+Q+K +++    +   A+               S D G G+ N RG+          
Subjt:  NIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAA---------------SFDFG-GRGNWRGR----------

Query:  GRGRFDG----------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSN
        GRGR  G                            NR     +GRHPP +LAAM   A  NN ++ +V N +P+    WL+D+ CN H+T +L+NLS+++
Subjt:  GRGRFDG----------------------------NRNGGRGRGRHPPAKLAAMATFAPSNNLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSN

Query:  A---YNGEENITVGN
            YNGEENI+VG+
Subjt:  A---YNGEENITVGN

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.3e-04; % identity: 25.71)
Query:  LMVQSRLHHRLQVN------LEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIK
        L++Q  LH  L V+      ++ E W + D+   + I   L+   ++ +I   T++ +W RLE  + S T TN + LK +L ++      +   ++    
Subjt:  LMVQSRLHHRLQVN------LEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIK

Query:  EIVNKLVAVSVVIDAEDLNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRGRGRFDGN
         ++ +L  + V I+ ED  I  +N L S+Y+   T++       T +EL    K   +AL    K  + P N   A+       GRG  R   R   +  
Subjt:  EIVNKLVAVSVVIDAEDLNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRGRGRFDGN

Query:  RNGGRGRGRH
        R+G RG+ ++
Subjt:  RNGGRGRGRH

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTTGATGGTACAATCAAGGCTCCACCACAGACTGCAAGTTAATCTAGAGTATGAATCTTGGTATGAACGAGATCAAGCATTAATTACACTGATCAATGCAACCTTGAC
GCAAACCGCCCTCTCTTATGTAATCGGTTGTCAAACCTCCAAGGAAGTTTGGGATCGACTTGAGAAGCACTTCTCCTCTTCGACTCAAACGAACATTGTAGGTCTGAAGA
CAAAGTTGCAAAGTGTTTCAAAGAAATCCAGTGAGTCAATTGATGTTTATGTTCGAAGAATCAAGGAAATTGTGAACAAATTGGTTGCGGTGTCGGTAGTTATTGATGCT
GAAGATCTCAATATTTACACTATTAATGGCCTTTCATCTGCTTACAATGTTTTCAAGACCTCTTTATGCACCAGATCTCAAGCCCTAACATTTGTCGAGTTACATATCCT
GATGAAGACTGAGGAGACTGCACTTGAACAACAAATAAAAGCTGATGAAATCCCGAATAATTCTCATTTGGCCATGGCGGCTAGTTTTGATTTTGGTGGCAGAGGAAATT
GGAGAGGTCGTGGTAGAGGTCGTTTTGATGGAAATCGAAATGGAGGTAGAGGTCGTGGCAGACATCCACCAGCTAAGCTTGCAGCAATGGCTACCTTTGCCCCGTCAAAC
AATTTGGCTCATAATTCTGTTTCTAATGTTGCACCCTCTGACTCACATGTTTGGCTTTCTGACACAGGCTGCAATGCACATTTGACTCATAATCTTGCCAATTTGAGTGT
GTCCAATGCATACAATGGGGAAGAGAACATCACTGTTGGTAATGAAATGAGATCTGCCTCCATTTTCCACTGCACCCGAGGTTGGGCATGTCAAAAGGACCCGTATTGGT
CTATGGGGACAATCTTTGTTCATAGAAGCAGTTCAAGACTATTCAACCCAAAAGGCCTCATACCAATGGAGATAGTTGTCCCTCCAATTGGCGGTCGGCGAGGTGAAGAG
ATCGAAATTGCAATTGTAAGCAAAAGACTTGGTGTTGTCTCTTGTATAGTATTCTTTCTTCACTTCCCCGTCTAG
mRNA sequence
ATGTTGATGGTACAATCAAGGCTCCACCACAGACTGCAAGTTAATCTAGAGTATGAATCTTGGTATGAACGAGATCAAGCATTAATTACACTGATCAATGCAACCTTGAC
GCAAACCGCCCTCTCTTATGTAATCGGTTGTCAAACCTCCAAGGAAGTTTGGGATCGACTTGAGAAGCACTTCTCCTCTTCGACTCAAACGAACATTGTAGGTCTGAAGA
CAAAGTTGCAAAGTGTTTCAAAGAAATCCAGTGAGTCAATTGATGTTTATGTTCGAAGAATCAAGGAAATTGTGAACAAATTGGTTGCGGTGTCGGTAGTTATTGATGCT
GAAGATCTCAATATTTACACTATTAATGGCCTTTCATCTGCTTACAATGTTTTCAAGACCTCTTTATGCACCAGATCTCAAGCCCTAACATTTGTCGAGTTACATATCCT
GATGAAGACTGAGGAGACTGCACTTGAACAACAAATAAAAGCTGATGAAATCCCGAATAATTCTCATTTGGCCATGGCGGCTAGTTTTGATTTTGGTGGCAGAGGAAATT
GGAGAGGTCGTGGTAGAGGTCGTTTTGATGGAAATCGAAATGGAGGTAGAGGTCGTGGCAGACATCCACCAGCTAAGCTTGCAGCAATGGCTACCTTTGCCCCGTCAAAC
AATTTGGCTCATAATTCTGTTTCTAATGTTGCACCCTCTGACTCACATGTTTGGCTTTCTGACACAGGCTGCAATGCACATTTGACTCATAATCTTGCCAATTTGAGTGT
GTCCAATGCATACAATGGGGAAGAGAACATCACTGTTGGTAATGAAATGAGATCTGCCTCCATTTTCCACTGCACCCGAGGTTGGGCATGTCAAAAGGACCCGTATTGGT
CTATGGGGACAATCTTTGTTCATAGAAGCAGTTCAAGACTATTCAACCCAAAAGGCCTCATACCAATGGAGATAGTTGTCCCTCCAATTGGCGGTCGGCGAGGTGAAGAG
ATCGAAATTGCAATTGTAAGCAAAAGACTTGGTGTTGTCTCTTGTATAGTATTCTTTCTTCACTTCCCCGTCTAG
Protein sequence
MLMVQSRLHHRLQVNLEYESWYERDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGLKTKLQSVSKKSSESIDVYVRRIKEIVNKLVAVSVVIDA
EDLNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRGRGRFDGNRNGGRGRGRHPPAKLAAMATFAPSN
NLAHNSVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSNAYNGEENITVGNEMRSASIFHCTRGWACQKDPYWSMGTIFVHRSSSRLFNPKGLIPMEIVVPPIGGRRGEE
IEIAIVSKRLGVVSCIVFFLHFPV
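
The protein sequence above appears to be a direct translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1) applies; the record does not say which table was used, and `translate` is a hypothetical helper rather than anything provided by CuGenDB. Only the first 30 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGTTGATGGTACAATCAAGGCTCCACCAC"))  # MLMVQSRLHH
```

The printed residues match the first ten residues of the listed protein sequence. Passing the full CDS, with its line breaks, to the same function would allow the whole protein to be compared.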