CuGenDBv2

Lag0038923 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038923
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr2:31014581..31016452
RNA-Seq Expression: Lag0038923
Synteny: Lag0038923
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR043502 - DNA/RNA polymerase superfamily
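
The genome location above can be split into its parts programmatically. The following is a minimal sketch, assuming the `chrom:start..end` string uses 1-based inclusive coordinates (an assumption; the page does not state its convention). The names `parse_location` and `span_length` are illustrative and not part of any CuGenDB tool.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end

def span_length(start: int, end: int) -> int:
    # Assumes 1-based inclusive coordinates, so both endpoints are counted.
    return end - start + 1

chrom, start, end = parse_location("chr2:31014581..31016452")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 1872 bp on chr2.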


Homology
GenBank top hits | e value | %identity | Alignment
KAA3490037.1 reverse transcriptase [Gossypium australe] | 7.1e-44 | 32.37
Query:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST
        M P KAP  +GF A F+Q+FW  +G   +  CL +LN    V D N T I LIPK + PK +S +RPISLCNV YKII KV+VNRM  + +E I+E Q  
Subjt:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST

Query:  FVPGRSIFDNIIVGHECN----------------------------------ITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCP
        F+  R I DN+++ +E                                     +P RGLRQGDPLSPYLFL+C+E  S+++     R ++ G   ++   
Subjt:  FVPGRSIFDNIIVGHECN----------------------------------ITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCP

Query:  EVSHLFFADDSLIFCRAPL-----------------------------------------NRLGYSDL--SFIGMSWLLD----KWS-------ILEDYV
         ++HLFFADD ++F  A                                           NR+   D+  S+  +S L+D     W+       + E+  
Subjt:  EVSHLFFADDSLIFCRAPL-----------------------------------------NRLGYSDL--SFIGMSWLLD----KWS-------ILEDYV

Query:  KLIATIPISASDEDDKWIWHYTLNGEYSVKSGYKLLMSTA--LNLESSSHNRQRTWWDRLWKTKIPSKIKLSFGRLIVNV
          I +IPI+ S  +D  +W +  +G Y+VKSGY++L ++    N+ SS+ +    ++  L    IP KIK+   RL  N+
Subjt:  KLIATIPISASDEDDKWIWHYTLNGEYSVKSGYKLLMSTA--LNLESSSHNRQRTWWDRLWKTKIPSKIKLSFGRLIVNV

KAF5445283.1 hypothetical protein F2P56_034346 [Juglans regia] | 2.6e-46 | 51.91
Query:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST
        M+P  +P PNGFP+ F QK W   GD      L+ LN  RS+++ N T+ITLIPK  SP +V +YRPISLCNV YK++ K + NR+K +  +II+ NQS 
Subjt:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST

Query:  FVPGRSIFDNIIVGHECNITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRA
        FVPGR I DN +V +E    P RGLRQGDPLSPYLF+LC+E LSS++        L  +   +   +V+HLFFADDSL+FC+A
Subjt:  FVPGRSIFDNIIVGHECNITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRA

XP_018823372.2 uncharacterized protein LOC108993057 [Juglans regia] | 5.4e-44 | 51.91
Query:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST
        M+P  +P P+GFPA FYQK W  +GD      L+ LN    + D N TYI+LIPK  SP KV++YRPISLCNV YKI+ K I NR+K +  +II+ NQS 
Subjt:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST

Query:  FVPGRSIFDNIIVGHECNITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRA
        FVP R I DN ++       P RGLRQGDPLSPYLF+LC++ L+S++N       L      +    V+HLFFADDSL+FCRA
Subjt:  FVPGRSIFDNIIVGHECNITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRA

XP_023874626.1 uncharacterized protein LOC111987155 [Quercus suber] | 3.5e-43 | 41.5
Query:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNC-LEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQS
        MHP+KAP P+G   +FYQK+W EI D  ++ C L +LN        N TYI LIPK +SP+K++ +RPISLCNV YKII KV+ NR+KGV +E+I E+QS
Subjt:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNC-LEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQS

Query:  TFVPGRSIFDNIIVGHEC---------------------------------------------------------------------NITPQRGLRQGDP
         FVPGRSI DN++V  E                                                                       I P RGLRQGDP
Subjt:  TFVPGRSIFDNIIVGHEC---------------------------------------------------------------------NITPQRGLRQGDP

Query:  LSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRA
        +SPYLFLLC+E LS+M+        L G+  ++  P VSHL FADDS+IFCRA
Subjt:  LSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRA

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber] | 4.1e-44 | 38.1
Query:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST
        MHP+KAP P+G  A+F+QK+W  +G+  +   L++LN   S+ + N T ITL+PK  +P K+S++RPISLCNV YK+I KV+ NR+K +  +IISENQS 
Subjt:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST

Query:  FVPGRSIFDNIIVGHEC---------------------------------------------------------------------NITPQRGLRQGDPL
        F+ GR I DN++V  E                                                                      +ITP RGLRQGDP+
Subjt:  FVPGRSIFDNIIVGHEC---------------------------------------------------------------------NITPQRGLRQGDPL

Query:  SPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRA
        SPY+FLLC++  SS++N V  + ++ G+   + CP+++HLFFADDSL+FC+A
Subjt:  SPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRA

TrEMBL top hits | e value | %identity | Alignment
A0A2I4EVF9 uncharacterized protein LOC108993057 | 2.6e-44 | 51.91
Query:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST
        M+P  +P P+GFPA FYQK W  +GD      L+ LN    + D N TYI+LIPK  SP KV++YRPISLCNV YKI+ K I NR+K +  +II+ NQS 
Subjt:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST

Query:  FVPGRSIFDNIIVGHECNITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRA
        FVP R I DN ++       P RGLRQGDPLSPYLF+LC++ L+S++N       L      +    V+HLFFADDSL+FCRA
Subjt:  FVPGRSIFDNIIVGHECNITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRA

A0A2N9GC56 Reverse transcriptase domain-containing protein | 2.2e-43 | 40.31
Query:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST
        M P+KAP P+G P +FYQ FW +IG T     LE LNQ  S+   N T++ LIPK  SP  V+ YRPISLCNV YK++ K + NR+KGV   IIS+NQS 
Subjt:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST

Query:  FVPGRSIFDNIIVGHEC---------------------------------------------------------------------NITPQRGLRQGDPL
        FVPGR I DNI+V  E                                                                       I P RGLRQGDPL
Subjt:  FVPGRSIFDNIIVGHEC---------------------------------------------------------------------NITPQRGLRQGDPL

Query:  SPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRAPLNRLG
        SPYLFLLC E LS++++      +L G+  ++  P++SHLFFADDSL+FC A +   G
Subjt:  SPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRAPLNRLG

A0A5B6XAZ1 Reverse transcriptase | 3.4e-44 | 32.37
Query:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST
        M P KAP  +GF A F+Q+FW  +G   +  CL +LN    V D N T I LIPK + PK +S +RPISLCNV YKII KV+VNRM  + +E I+E Q  
Subjt:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST

Query:  FVPGRSIFDNIIVGHECN----------------------------------ITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCP
        F+  R I DN+++ +E                                     +P RGLRQGDPLSPYLFL+C+E  S+++     R ++ G   ++   
Subjt:  FVPGRSIFDNIIVGHECN----------------------------------ITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCP

Query:  EVSHLFFADDSLIFCRAPL-----------------------------------------NRLGYSDL--SFIGMSWLLD----KWS-------ILEDYV
         ++HLFFADD ++F  A                                           NR+   D+  S+  +S L+D     W+       + E+  
Subjt:  EVSHLFFADDSLIFCRAPL-----------------------------------------NRLGYSDL--SFIGMSWLLD----KWS-------ILEDYV

Query:  KLIATIPISASDEDDKWIWHYTLNGEYSVKSGYKLLMSTA--LNLESSSHNRQRTWWDRLWKTKIPSKIKLSFGRLIVNV
          I +IPI+ S  +D  +W +  +G Y+VKSGY++L ++    N+ SS+ +    ++  L    IP KIK+   RL  N+
Subjt:  KLIATIPISASDEDDKWIWHYTLNGEYSVKSGYKLLMSTA--LNLESSSHNRQRTWWDRLWKTKIPSKIKLSFGRLIVNV

A0A803P3Q4 Uncharacterized protein | 9.6e-47 | 34.22
Query:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST
        M   K+P  +G   +FY   W  +GD      L +LN+      +N T +T IPK   P  + ++RPISLCNV YK+I K+IV R K V   ++S+ QS 
Subjt:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST

Query:  FVPGRSIFDNIIVGHEC--------------------------------------NITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKAN
        F+P R I DN+++  E                                       +ITPQRGLRQGDPLSPYLFL+CSE LS ++       +L G   +
Subjt:  FVPGRSIFDNIIVGHEC--------------------------------------NITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKAN

Query:  KHCPEVSHLFFADDSLIFCRAPLNRLGY---------------SDLSFIGMSWLLDKWSILEDYVKLIATIPISASDE-----DDKWIWHYTLNGEYSVK
        +  P +SHLFFADDSL+FC+A     G                 +L    MS+  +  + ++   + I  +PIS   E       +WIWH+T   EYSV+
Subjt:  KHCPEVSHLFFADDSLIFCRAPLNRLGY---------------SDLSFIGMSWLLDKWSILEDYVKLIATIPISASDE-----DDKWIWHYTLNGEYSVK

Query:  SGYKLLMSTALNLESSSHNRQRTWWDRLWKTKIPSKIKL
        + Y        +  S+    Q TWW   W  K+PSK+K+
Subjt:  SGYKLLMSTALNLESSSHNRQRTWWDRLWKTKIPSKIKL

Q94H40 Putative reverse transcriptase | 2.9e-43 | 45.63
Query:  KAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQSTFVPG
        KAP P+G  A+FY++FW  +    +   L+ +N ++    WN T + +IPK + P +V+ +RPISLCNV YKII K++ NR+K +  EIIS+ QS FVPG
Subjt:  KAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQSTFVPG

Query:  RSIFDNIIVGHEC------------------------NITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIF
        R I DN++V +EC                         I P RGLRQGDPLSPYLFLL +E LSSM+ G   R  L+GI+  +  P +SHL FADDSLI 
Subjt:  RSIFDNIIVGHEC------------------------NITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIF

Query:  CRAPLN
         +A  N
Subjt:  CRAPLN

SwissProt top hits | e value | %identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | 2.5e-07 | 33.33
Query:  KAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPK-SNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQSTFVP
        K+P P+GF A FYQ++  E+    +     I  +      +    I LIPK      K  N+RPISL N+  KI+ K++ NR++   +++I  +Q  F+P
Subjt:  KAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPK-SNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQSTFVP

Query:  GRSIFDNI
        G   + NI
Subjt:  GRSIFDNI

P08548 LINE-1 reverse transcriptase homolog | 3.4e-09 | 37.84
Query:  KAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTY---ITLIPK-SNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST
        K+P P+GF + FYQ F  E+    +LN  + + +   +   N  Y   ITLIPK    P +  NYRPISL N+  KI+ K++ NR++   ++II  +Q  
Subjt:  KAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTY---ITLIPK-SNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST

Query:  FVPGRSIFDNI
        F+PG   + NI
Subjt:  FVPGRSIFDNI

P11369 LINE-1 retrotransposable element ORF2 protein | 1.1e-12 | 28.85
Query:  KAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTY---ITLIPK-SNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST
        K+P P+GF A FYQ F  ++    IL+ L    +V      N  Y   ITLIPK    P K+ N+RPISL N+  KI+ K++ NR++   + II  +Q  
Subjt:  KAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTY---ITLIPK-SNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST

Query:  FVPG--------------------------------RSIFDNI----------------------------------IVGHECNITP-QRGLRQGDPLSP
        F+PG                                   FD I                                  + G +    P + G RQG PLSP
Subjt:  FVPG--------------------------------RSIFDNI----------------------------------IVGHECNITP-QRGLRQGDPLSP

Query:  YLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRAPLN
        YLF +  EVL+  I     +K++ GI+  K   EV     ADD +++   P N
Subjt:  YLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRAPLN

P14381 Transposon TX1 uncharacterized 149 kDa protein | 7.4e-12 | 33.63
Query:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST
        M  +K+P  +G    F+Q FW  +G        E   +           ++L+PK    + + N+RP+SL +  YKI+ K I  R+K V  E+I  +QS 
Subjt:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQST

Query:  FVPGRSIFDNIIV
         VPGR+IFDN+ +
Subjt:  FVPGRSIFDNIIV

P92555 Uncharacterized mitochondrial protein AtMg01250 | 6.7e-13 | 54.24
Query:  ITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDS
        +TP RGLRQGDPLSPYLF+LC+EVLS +      + +L GI+ + + P ++HL FADD+
Subjt:  ITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDS

Arabidopsis top hits | e value | %identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 2.1e-09 | 39.74
Query:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKII
        M  +KAP P+ F A F+ + W  + D+TI    E       +  +N T ITLIPK     ++S +RP+S C V YKII
Subjt:  MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKII

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 4.8e-14 | 54.24
Query:  ITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDS
        +TP RGLRQGDPLSPYLF+LC+EVLS +      + +L GI+ + + P ++HL FADD+
Subjt:  ITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDS
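
The %identity values listed with each hit can be cross-checked against the alignment text. The sketch below assumes the unlabeled middle line of each block follows the usual BLAST convention (a letter for an identical residue, `+` for a conservative substitution, a space for a mismatch or gap); the page itself does not document this. Because the `Subjt:` lines here appear to repeat the query, the middle line is the only place identities can be read from. The function name `percent_identity` is illustrative.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity = identical positions / alignment length * 100."""
    # Letters in the midline mark identical residues; '+' marks
    # conservative substitutions and spaces mark mismatches or gaps.
    identities = sum(1 for ch in midline if ch.isalpha())
    # The query line (including any '-' gap characters) spans the full alignment.
    return 100.0 * identities / len(query)

# Query and middle line copied from the ATMG01250.1 alignment above.
query = "ITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDS"
midline = "+TP RGLRQGDPLSPYLF+LC+EVLS +      + +L GI+ + + P ++HL FADD+"
print(round(percent_identity(query, midline), 2))
```

For the ATMG01250.1 alignment this counts 32 identical positions over 59 columns, about 54.24%, which agrees with the listed value.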


Sequences
CDS sequence
ATGCATCCATCTAAGGCTCCAAGACCCAATGGTTTTCCGGCACTATTCTATCAGAAATTTTGGGCTGAGATTGGTGATACCACAATATTGAATTGCTTGGAAATTCTAAA
CCAAGTACGGTCTGTGGCGGATTGGAATTGCACTTATATTACCCTTATTCCCAAATCAAACTCTCCCAAGAAAGTTTCGAATTATCGCCCCATTAGCTTATGTAATGTGG
CATATAAAATCATTGTGAAGGTGATCGTTAATCGTATGAAGGGAGTGTTTCAAGAAATCATTTCTGAAAATCAATCTACTTTTGTTCCTGGGCGATCCATCTTTGATAAT
ATAATTGTGGGACATGAATGTAATATAACACCTCAGCGGGGTTTGCGACAGGGTGACCCACTATCCCCATACTTATTCTTGCTCTGTTCTGAGGTGTTGTCTTCTATGAT
AAATGGAGTTGTGCTGAGAAAACAATTAATGGGAATAAAGGCCAATAAACATTGCCCAGAGGTTTCTCACCTATTTTTTGCAGACGACAGCCTCATCTTTTGTAGAGCAC
CATTGAACAGACTTGGTTATTCCGATCTATCCTTCATCGGTATGAGTTGGCTTCTGGACAAATGGTCAATACTAGAAGATTATGTAAAGTTGATAGCCACAATTCCAATC
AGTGCTAGTGATGAGGATGATAAGTGGATATGGCACTACACTCTTAATGGGGAATACTCAGTGAAGAGCGGGTACAAACTACTTATGAGTACTGCTTTGAATTTGGAGTC
ATCTAGTCATAATAGGCAAAGAACTTGGTGGGACAGACTGTGGAAGACTAAAATTCCATCCAAAATCAAACTATCATTTGGAAGGCTTATAGTGAATGTTTGCCTGTTAA
TTGGTGCTTAG
mRNA sequence
ATGCATCCATCTAAGGCTCCAAGACCCAATGGTTTTCCGGCACTATTCTATCAGAAATTTTGGGCTGAGATTGGTGATACCACAATATTGAATTGCTTGGAAATTCTAAA
CCAAGTACGGTCTGTGGCGGATTGGAATTGCACTTATATTACCCTTATTCCCAAATCAAACTCTCCCAAGAAAGTTTCGAATTATCGCCCCATTAGCTTATGTAATGTGG
CATATAAAATCATTGTGAAGGTGATCGTTAATCGTATGAAGGGAGTGTTTCAAGAAATCATTTCTGAAAATCAATCTACTTTTGTTCCTGGGCGATCCATCTTTGATAAT
ATAATTGTGGGACATGAATGTAATATAACACCTCAGCGGGGTTTGCGACAGGGTGACCCACTATCCCCATACTTATTCTTGCTCTGTTCTGAGGTGTTGTCTTCTATGAT
AAATGGAGTTGTGCTGAGAAAACAATTAATGGGAATAAAGGCCAATAAACATTGCCCAGAGGTTTCTCACCTATTTTTTGCAGACGACAGCCTCATCTTTTGTAGAGCAC
CATTGAACAGACTTGGTTATTCCGATCTATCCTTCATCGGTATGAGTTGGCTTCTGGACAAATGGTCAATACTAGAAGATTATGTAAAGTTGATAGCCACAATTCCAATC
AGTGCTAGTGATGAGGATGATAAGTGGATATGGCACTACACTCTTAATGGGGAATACTCAGTGAAGAGCGGGTACAAACTACTTATGAGTACTGCTTTGAATTTGGAGTC
ATCTAGTCATAATAGGCAAAGAACTTGGTGGGACAGACTGTGGAAGACTAAAATTCCATCCAAAATCAAACTATCATTTGGAAGGCTTATAGTGAATGTTTGCCTGTTAA
TTGGTGCTTAG
Protein sequence
MHPSKAPRPNGFPALFYQKFWAEIGDTTILNCLEILNQVRSVADWNCTYITLIPKSNSPKKVSNYRPISLCNVAYKIIVKVIVNRMKGVFQEIISENQSTFVPGRSIFDN
IIVGHECNITPQRGLRQGDPLSPYLFLLCSEVLSSMINGVVLRKQLMGIKANKHCPEVSHLFFADDSLIFCRAPLNRLGYSDLSFIGMSWLLDKWSILEDYVKLIATIPI
SASDEDDKWIWHYTLNGEYSVKSGYKLLMSTALNLESSSHNRQRTWWDRLWKTKIPSKIKLSFGRLIVNVCLLIGA
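
The CDS and protein sequences can be checked against each other by translating the CDS. This is a minimal sketch, assuming the standard nuclear genetic code and reading frame 0; the names `CODON_TABLE` and `translate` are illustrative. Only the first and last 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

first_30 = "ATGCATCCATCTAAGGCTCCAAGACCCAAT"  # start of the CDS above
last_30 = "GTGAATGTTTGCCTGTTAATTGGTGCTTAG"   # end of the CDS above
print(translate(first_30))
print(translate(last_30))
```

The two fragments translate to `MHPSKAPRPN` and `VNVCLLIGA`, which match the start and end of the protein sequence listed above.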