; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025180 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025180
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:9517183..9520021
RNA-Seq ExpressionLag0025180
SyntenyLag0025180
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONK66393.1 uncharacterized protein A4U43_C06F7380 [Asparagus officinalis]1.0e-3840.7Show/hide
Query:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLF
        +WGKE+L+MG+RW+VG+G QI++    WIP+    RV S  ++P +ATVA+L+  SR W+  L+ + F  +E N+ILSIP+ ++  VD ++WHY K+G +
Subjt:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLF

Query:  SVKSGYMLAQSAILVR-----GPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSI
        SVKSGY +A  A   R     G S  + +     WK  W + +P+K+KVFLWR C   +P  D L ++ V +   C  C    ES +H LW+CK  + +
Subjt:  SVKSGYMLAQSAILVR-----GPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSI

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]1.3e-3635.04Show/hide
Query:  WGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLFS
        WGKELL  G+RW+VGNG  I++Y   W+P  S  ++ S   LP    V +L T+S  W+  LL+  F   EV+  L IP+      D +IWHYE++G++S
Subjt:  WGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLFS

Query:  VKSGYMLAQSAILVRGPSSSSPNSIRD----WWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSILRE
        VKSGY L   A L +   S  P+   D    +WK  W + +P+K+K FLWR   D LP    L  R +    +C  C ++ ES +H +W C+ A+ + R 
Subjt:  VKSGYMLAQSAILVRGPSSSSPNSIRD----WWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSILRE

Query:  ASFGVILERIRAGCCLLLCRDIKEMIGGE--NLKSWWCYGGQCGQPEIKFVFKG
        +++G + E  R      L   ++    GE   L ++ C+G         F+F+G
Subjt:  ASFGVILERIRAGCCLLLCRDIKEMIGGE--NLKSWWCYGGQCGQPEIKFVFKG

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]6.5e-5733.33Show/hide
Query:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSR-GWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGL
        +WG++LL+ G+RW++GNG+ + IYG NW+P    L++ S+  LP  + V+ L+     GW   ++   F   E   ILSIPI +    D +IW+YEK+G+
Subjt:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSR-GWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGL

Query:  FSVKSGYMLA-QSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSILREA
        +SV+SGY +A  +   V+ PSSSS   +R WW G WKM +P+K+KVFLWR+CLDRLPT  NL +RGV++ + C+FCG+ GE SIH+ W CK+A ++   +
Subjt:  FSVKSGYMLA-QSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSILREA

Query:  SFGVILERIRAGCCLLLCRDIKEMIGGENLKS--------WWCYGGQCGQPEIKFVFK----------------------GLTGRSR-------------
         FG +          L+ R+  E +   + +         W     +      K VFK                       +TGR               
Subjt:  SFGVILERIRAGCCLLLCRDIKEMIGGENLKS--------WWCYGGQCGQPEIKFVFK----------------------GLTGRSR-------------

Query:  -----------LRDNCAG-----YYGSGDVV--DDLHWDNVRDVDMAEGYVAVKSLELVTDMGLAPAILETDSSRVFQLLKRERE------DVMFDFTYR
                     D  AG     +   G V+     + +N++ VDMAE   AV+ L+L +++G+ PA+   D S   +++ + +          F+F  R
Subjt:  -----------LRDNCAG-----YYGSGDVV--DDLHWDNVRDVDMAEGYVAVKSLELVTDMGLAPAILETDSSRVFQLLKRERE------DVMFDFTYR

Query:  EGNQAAHRLARLAL
        EGN+AAH LAR AL
Subjt:  EGNQAAHRLARLAL

XP_024035599.1 uncharacterized protein LOC112096407 [Citrus clementina]3.6e-3938.46Show/hide
Query:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLF
        +WG+++L+ G RW++GNGE+I+I  SNWIPR +  ++    SLP+ A V+EL+  ++ W+ +++ Q F   + ++I SI + + P  D +IWHY++ GL+
Subjt:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLF

Query:  SVKSGYMLAQSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSILREASF
        SVKSGY LA        P+SS+  S    W+  WK+ +P K+K+F+W+     LPT +NL RR +    +C  C +  E   H L ECK AR I R    
Subjt:  SVKSGYMLAQSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSILREASF

Query:  GVILERIR
           ++ IR
Subjt:  GVILERIR

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]1.2e-3940.82Show/hide
Query:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLF
        +WG+++L  G RW++GNG+ + +YG+NWIPR +  +  SA S+ +D TVAEL+   + W   L+ QHF   +   I+ IP+ + P  D +IWHY+K G +
Subjt:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLF

Query:  SVKSGYMLAQSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSILR
        SVKSGY +A        PS S  N  ++ W+  WK+ +P K+K+FLWR   D LPT +NL ++ V    +C  C    E+  H L EC  AR I R
Subjt:  SVKSGYMLAQSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSILR

TrEMBL top hitse value%identityAlignment
A0A2N9I609 Uncharacterized protein1.2e-3730.85Show/hide
Query:  KELLEMGVRWQVGNGEQIKIYGSNWIPRDS-NLRVNSAISLPSDATVAELMTTSRG-WDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLFS
        + ++  G RW+VGNG  I+I+   W+P  S  + V     LP DA V+ L+    G WD  L+E+ F  +E +LILS+ +    P D ++W  EKSG +S
Subjt:  KELLEMGVRWQVGNGEQIKIYGSNWIPRDS-NLRVNSAISLPSDATVAELMTTSRG-WDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLFS

Query:  VKSGYMLAQSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWEC----------KWA
        V+S Y +  +A     P SS+ +  R +WK  W + VP K++ FLWRVC + LPT+ NL RR +     C FC    E  +H LW C          K  
Subjt:  VKSGYMLAQSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWEC----------KWA

Query:  RSILR--EASFGVILE---RIRAGCCLLLCRDIKEMIGGENLK-SWWCYGGQCGQPEIKFVFKGLTGRSR------------LRDNCAGYYGSGDVVDDL
        R I R    SF  I +    I +   L     +  M G +  +        Q  QP  +F FK     +             +RD     +G   V    
Subjt:  RSILR--EASFGVILE---RIRAGCCLLLCRDIKEMIGGENLK-SWWCYGGQCGQPEIKFVFKGLTGRSR------------LRDNCAGYYGSGDVVDDL

Query:  HWDNVRDVDMAEGYVAVKSLELVTDMGLAPAILETDSSRVFQLLKRE--------------------REDVMFDFTYREGNQAAHRLARLALTRLSDEGS
        H+  + DVD AE     ++++L  D+GL    +E DS  +FQ L+++                     + V F    R  N+ AH LAR  LT  SD   
Subjt:  HWDNVRDVDMAEGYVAVKSLELVTDMGLAPAILETDSSRVFQLLKRE--------------------REDVMFDFTYREGNQAAHRLARLALTRLSDEGS

Query:  WV
        W+
Subjt:  WV

A0A5P1EL44 Uncharacterized protein5.0e-3940.7Show/hide
Query:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLF
        +WGKE+L+MG+RW+VG+G QI++    WIP+    RV S  ++P +ATVA+L+  SR W+  L+ + F  +E N+ILSIP+ ++  VD ++WHY K+G +
Subjt:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLF

Query:  SVKSGYMLAQSAILVR-----GPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSI
        SVKSGY +A  A   R     G S  + +     WK  W + +P+K+KVFLWR C   +P  D L ++ V +   C  C    ES +H LW+CK  + +
Subjt:  SVKSGYMLAQSAILVR-----GPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSI

A0A6J1DAR4 uncharacterized protein LOC1110189543.1e-5733.33Show/hide
Query:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSR-GWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGL
        +WG++LL+ G+RW++GNG+ + IYG NW+P    L++ S+  LP  + V+ L+     GW   ++   F   E   ILSIPI +    D +IW+YEK+G+
Subjt:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSR-GWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGL

Query:  FSVKSGYMLA-QSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSILREA
        +SV+SGY +A  +   V+ PSSSS   +R WW G WKM +P+K+KVFLWR+CLDRLPT  NL +RGV++ + C+FCG+ GE SIH+ W CK+A ++   +
Subjt:  FSVKSGYMLA-QSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSILREA

Query:  SFGVILERIRAGCCLLLCRDIKEMIGGENLKS--------WWCYGGQCGQPEIKFVFK----------------------GLTGRSR-------------
         FG +          L+ R+  E +   + +         W     +      K VFK                       +TGR               
Subjt:  SFGVILERIRAGCCLLLCRDIKEMIGGENLKS--------WWCYGGQCGQPEIKFVFK----------------------GLTGRSR-------------

Query:  -----------LRDNCAG-----YYGSGDVV--DDLHWDNVRDVDMAEGYVAVKSLELVTDMGLAPAILETDSSRVFQLLKRERE------DVMFDFTYR
                     D  AG     +   G V+     + +N++ VDMAE   AV+ L+L +++G+ PA+   D S   +++ + +          F+F  R
Subjt:  -----------LRDNCAG-----YYGSGDVV--DDLHWDNVRDVDMAEGYVAVKSLELVTDMGLAPAILETDSSRVFQLLKRERE------DVMFDFTYR

Query:  EGNQAAHRLARLAL
        EGN+AAH LAR AL
Subjt:  EGNQAAHRLARLAL

A0A803PQQ6 Uncharacterized protein2.8e-3733.63Show/hide
Query:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLF
        MWGK+LL  G RW++GNG  +++    W+PR    RV      P +  V +L     GWD   ++ HFN  +V+LIL++P       D ++WHY K+G +
Subjt:  MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLF

Query:  SVKSGYMLAQSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFC-GKRGESSIHILWECKWARSILREAS
        +V+SGY LA     V   S+        WW+  WK  VP K+K F W+VC   LPT   L +RG+ ++  C  C G   E   H+LW+C W++ + ++  
Subjt:  SVKSGYMLAQSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFC-GKRGESSIHILWECKWARSILREAS

Query:  FGVILERIRAGCCLLLCRDIKEM
            + ++R+   LL+ + ++++
Subjt:  FGVILERIRAGCCLLLCRDIKEM

M5XHI9 Reverse transcriptase domain-containing protein2.1e-3731.81Show/hide
Query:  KELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRV-NSAISLPSDATVAELMTT--SRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLF
        +++LEMG R+Q+G+G+ ++I+G  W+PR +   V  S +    +  V+EL+    S  WD   L   F   +V  I+ IP+    P D ++W+Y+K GLF
Subjt:  KELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRV-NSAISLPSDATVAELMTT--SRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLF

Query:  SVKSGYMLAQSAILVRGPSSSSPNSIRDW-WKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWEC-----KWARSI
        +VKS Y +A          SSS NS     W+  W   VP+K+K+F WRV  D LPT  NL+++GVD+ D+C FCG   ES++H+L  C      W  S+
Subjt:  SVKSGYMLAQSAILVRGPSSSSPNSIRDW-WKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWEC-----KWARSI

Query:  L-REASFGVILERIRAGCCLLLCRDIKEMIGGENLKS----------WWCYGGQCGQPEIKF--VFKGLTGRSRL----RDNCAGYYGS-----GDVVDD
        L R A  GV  +R          + + E I   +  S           W      G+ +  F   F   +GR  +    RD   G+  +     G+V+  
Subjt:  L-REASFGVILERIRAGCCLLLCRDIKEMIGGENLKS----------WWCYGGQCGQPEIKF--VFKGLTGRSRL----RDNCAGYYGS-----GDVVDD

Query:  LHWDNVRDVDMAEGYVAVKSLELVTDMGLAPAILETDSSRVFQLLKREREDV--------------------MFDFTYREGNQAAHRLARLAL
         H         AE  VA + + L   +G A  I E DS+ V   +KR  +D                     +F FT RE N  AHRLAR  L
Subjt:  LHWDNVRDVDMAEGYVAVKSLELVTDMGLAPAILETDSSRVFQLLKREREDV--------------------MFDFTYREGNQAAHRLARLAL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein1.2e-1625.48Show/hide
Query:  GKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRG---WDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGL
        G  LL+ G R  +G+G+ I+I   N +       +N+  +   + T+  L         WD + + Q  + ++   I  I + +    D +IW+Y  +G 
Subjt:  GKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRG---WDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGL

Query:  FSVKSGYMLAQSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSILREAS
        ++V+SGY L         P+ + P+   D     W + +  K+K FLWR     L T + L  RG+ +   C  C +  ES  H L+ C +A    R + 
Subjt:  FSVKSGYMLAQSAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSILREAS

Query:  FGVILERI
          +I  ++
Subjt:  FGVILERI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGGCAAGGAGCTTTTAGAGATGGGTGTGCGGTGGCAAGTTGGGAATGGGGAACAAATTAAGATATATGGATCTAACTGGATCCCGAGGGATTCTAACTTGAGAGT
TAACTCGGCAATATCCTTGCCATCAGATGCTACGGTGGCGGAGCTGATGACGACCTCGAGAGGGTGGGATCATACACTGCTGGAGCAGCATTTTAATGCTGCAGAGGTAA
ACCTCATCTTGTCTATTCCGATTAGACAACACCCACCAGTGGATTCAGTTATCTGGCACTATGAAAAGTCTGGGCTTTTCTCTGTCAAAAGTGGATACATGTTGGCCCAG
TCAGCTATACTGGTGCGGGGGCCTTCATCCTCCTCTCCGAACTCCATCCGTGACTGGTGGAAGGGATGTTGGAAGATGATGGTTCCAAGTAAGATGAAGGTGTTCCTTTG
GAGAGTGTGTCTGGATAGGCTGCCGACGATTGATAACCTGGTAAGAAGGGGAGTTGACTTGCTGGATGTATGCTTTTTCTGCGGGAAGCGGGGGGAATCTAGCATCCACA
TTTTATGGGAATGTAAGTGGGCCAGATCGATTCTGCGAGAAGCGAGTTTTGGGGTGATTTTGGAGAGGATACGAGCAGGGTGTTGTCTTCTGCTCTGTAGGGATATCAAG
GAGATGATAGGGGGTGAGAATTTGAAGAGCTGGTGGTGTTATGGTGGTCAATGTGGTCAGCCCGAAATAAAGTTCGTTTTCAAGGGGCTGACAGGCCGAAGCCGGCTTAG
GGATAATTGTGCGGGATACTATGGGTCAGGTGATGTTGTCGACGACCTTCACTGGGATAATGTGAGAGATGTTGACATGGCTGAAGGATATGTAGCTGTGAAAAGTCTGG
AGCTAGTGACGGATATGGGTTTGGCCCCAGCAATTCTTGAGACGGATTCGAGTAGAGTTTTTCAGCTTCTTAAGCGAGAACGTGAAGATGTAATGTTTGATTTCACCTAC
CGTGAGGGGAACCAGGCGGCGCACCGATTGGCGAGGCTGGCCTTAACTCGACTGAGTGATGAGGGGTCTTGGGTGTATTGCATGTTAGTGGAGTCTAGGCGGATGAGGGC
TGCAACCGAACATCAGGGTGATGGAGGTTTACTTGCTAGCTTAATTTTGAGGACAGTGTTAATATTTTCTGAATCTCTTGCACTTTCTGAGCACATGGTCTGTCCGTTAA
GTAGATTTGAAGGTCTAGATATACTTTATGGGGCTCATAAGCCTGTTGTCACTCCGATAGGGCGTGCTGAAGGCCATGGACCTGCTTGGATTAAATCATATGTATACAGG
AGACAAGAGGATAAAGATTTGAATGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGGGCAAGGAGCTTTTAGAGATGGGTGTGCGGTGGCAAGTTGGGAATGGGGAACAAATTAAGATATATGGATCTAACTGGATCCCGAGGGATTCTAACTTGAGAGT
TAACTCGGCAATATCCTTGCCATCAGATGCTACGGTGGCGGAGCTGATGACGACCTCGAGAGGGTGGGATCATACACTGCTGGAGCAGCATTTTAATGCTGCAGAGGTAA
ACCTCATCTTGTCTATTCCGATTAGACAACACCCACCAGTGGATTCAGTTATCTGGCACTATGAAAAGTCTGGGCTTTTCTCTGTCAAAAGTGGATACATGTTGGCCCAG
TCAGCTATACTGGTGCGGGGGCCTTCATCCTCCTCTCCGAACTCCATCCGTGACTGGTGGAAGGGATGTTGGAAGATGATGGTTCCAAGTAAGATGAAGGTGTTCCTTTG
GAGAGTGTGTCTGGATAGGCTGCCGACGATTGATAACCTGGTAAGAAGGGGAGTTGACTTGCTGGATGTATGCTTTTTCTGCGGGAAGCGGGGGGAATCTAGCATCCACA
TTTTATGGGAATGTAAGTGGGCCAGATCGATTCTGCGAGAAGCGAGTTTTGGGGTGATTTTGGAGAGGATACGAGCAGGGTGTTGTCTTCTGCTCTGTAGGGATATCAAG
GAGATGATAGGGGGTGAGAATTTGAAGAGCTGGTGGTGTTATGGTGGTCAATGTGGTCAGCCCGAAATAAAGTTCGTTTTCAAGGGGCTGACAGGCCGAAGCCGGCTTAG
GGATAATTGTGCGGGATACTATGGGTCAGGTGATGTTGTCGACGACCTTCACTGGGATAATGTGAGAGATGTTGACATGGCTGAAGGATATGTAGCTGTGAAAAGTCTGG
AGCTAGTGACGGATATGGGTTTGGCCCCAGCAATTCTTGAGACGGATTCGAGTAGAGTTTTTCAGCTTCTTAAGCGAGAACGTGAAGATGTAATGTTTGATTTCACCTAC
CGTGAGGGGAACCAGGCGGCGCACCGATTGGCGAGGCTGGCCTTAACTCGACTGAGTGATGAGGGGTCTTGGGTGTATTGCATGTTAGTGGAGTCTAGGCGGATGAGGGC
TGCAACCGAACATCAGGGTGATGGAGGTTTACTTGCTAGCTTAATTTTGAGGACAGTGTTAATATTTTCTGAATCTCTTGCACTTTCTGAGCACATGGTCTGTCCGTTAA
GTAGATTTGAAGGTCTAGATATACTTTATGGGGCTCATAAGCCTGTTGTCACTCCGATAGGGCGTGCTGAAGGCCATGGACCTGCTTGGATTAAATCATATGTATACAGG
AGACAAGAGGATAAAGATTTGAATGGCTAG
Protein sequenceShow/hide protein sequence
MWGKELLEMGVRWQVGNGEQIKIYGSNWIPRDSNLRVNSAISLPSDATVAELMTTSRGWDHTLLEQHFNAAEVNLILSIPIRQHPPVDSVIWHYEKSGLFSVKSGYMLAQ
SAILVRGPSSSSPNSIRDWWKGCWKMMVPSKMKVFLWRVCLDRLPTIDNLVRRGVDLLDVCFFCGKRGESSIHILWECKWARSILREASFGVILERIRAGCCLLLCRDIK
EMIGGENLKSWWCYGGQCGQPEIKFVFKGLTGRSRLRDNCAGYYGSGDVVDDLHWDNVRDVDMAEGYVAVKSLELVTDMGLAPAILETDSSRVFQLLKREREDVMFDFTY
REGNQAAHRLARLALTRLSDEGSWVYCMLVESRRMRAATEHQGDGGLLASLILRTVLIFSESLALSEHMVCPLSRFEGLDILYGAHKPVVTPIGRAEGHGPAWIKSYVYR
RQEDKDLNG